Academic Commons

Articles

Homology-based inference sets the bar high for protein function prediction

Rost, Burkhard; Roos, Manfred; Mahlich, Yannick; Landerer, Cedric; Krompass, Denis; Kiening, Michael; Kaufman, Stefanie; Hönigschmid, Peter; Hopf, Thomas A.; Hamp, Tobias; Kassner, Rebecca; Seemayer, Sefan; Vicedo, Esmeralda; Schaefer, Christian; Achten, Dominik; Auer, Florian; Boehm, Ariane; Braun, Tatjana; Hecht, Maximilian; Heron, Mark

Background: Any method that de novo predicts protein function should do better than random. More challenging, it also ought to outperform simple homology-based inference. Methods: Here, we describe a few methods that predict protein function exclusively through homology. Together, they set the bar or lower limit for future improvements. Results and conclusions: During the development of these methods, we faced two surprises. Firstly, our most successful implementation for the baseline ranked very high at CAFA1. In fact, our best combination of homology-based methods fared only slightly worse than the top-of-the-line prediction method from the Jones group. Secondly, although the concept of homology-based inference is simple, this work revealed that the precise details of the implementation are crucial: not only did the methods span from top to bottom performers at CAFA, but also the reasons for these differences were unexpected. In this work, we also propose a new rigorous measure to compare predicted and experimental annotations. It puts more emphasis on the details of protein function than the other measures employed by CAFA and may best reflect the expectations of users. Clearly, the definition of proper goals remains one major objective for CAFA.

Subjects

Files

  • thumnail for 1471-2105-14-S3-S7.xml 1471-2105-14-S3-S7.xml binary/octet-stream 80.3 KB Download File
  • thumnail for 1471-2105-14-S3-S7.pdf 1471-2105-14-S3-S7.pdf binary/octet-stream 1.77 MB Download File
  • thumnail for d3929a0bc8f7e25e31d5d21720da439f.zip d3929a0bc8f7e25e31d5d21720da439f.zip binary/octet-stream 1.64 MB Download File

Also Published In

Title
BMC Bioinformatics
DOI
https://doi.org/10.1186/1471-2105-14-S3-S7

More About This Work

Academic Units
Biochemistry and Molecular Biophysics
Publisher
BioMed Central
Published Here
September 8, 2014
Academic Commons provides global access to research and scholarship produced at Columbia University, Barnard College, Teachers College, Union Theological Seminary and Jewish Theological Seminary. Academic Commons is managed by the Columbia University Libraries.