Log on / register
BioMed Central home | Journals A-Z | Feedback | Support | My details
.refereed research
 |  |  |  |  | 


Open AccessHighly AccessResearch

Benchmarking ortholog identification methods using functional genomics data

Tim Hulsen1 email, Martijn A Huynen1 email, Jacob de Vlieg1,2 email and Peter MA Groenen2 email

Centre for Molecular and Biomolecular Informatics, Nijmegen Centre for Molecular Life Sciences, Radboud University Nijmegen, Toernooiveld 1, Nijmegen, 6500 GL, The Netherlands

NV Organon, Molenstraat 110, Oss, 5340 BH, The Netherlands

author email corresponding author email

Genome Biology 2006, 7:R31doi:10.1186/gb-2006-7-4-r31

Published: 13 April 2006

Subject areas: Bioinformatics, Genome studies, Evolution

Abstract

Background

The transfer of functional annotations from model organism proteins to human proteins is one of the main applications of comparative genomics. Various methods are used to analyze cross-species orthologous relationships according to an operational definition of orthology. Often the definition of orthology is incorrectly interpreted as a prediction of proteins that are functionally equivalent across species, while in fact it only defines the existence of a common ancestor for a gene in different species. However, it has been demonstrated that orthologs often reveal significant functional similarity. Therefore, the quality of the orthology prediction is an important factor in the transfer of functional annotations (and other related information). To identify protein pairs with the highest possible functional similarity, it is important to qualify ortholog identification methods.

Results

To measure the similarity in function of proteins from different species we used functional genomics data, such as expression data and protein interaction data. We tested several of the most popular ortholog identification methods. In general, we observed a sensitivity/selectivity trade-off: the functional similarity scores per orthologous pair of sequences become higher when the number of proteins included in the ortholog groups decreases.

Conclusion

By combining the sensitivity and the selectivity into an overall score, we show that the InParanoid program is the best ortholog identification method in terms of identifying functionally equivalent proteins.


© 1999-2010 BioMed Central Ltd unless otherwise stated. Part of Springer Science+Business Media.