Log on / register
BioMed Central home | Journals A-Z | Feedback | Support | My details
.refereed research
 |  |  |  |  | 


This article is part of the supplement: Quantitative inference of gene function from diverse large-scale datasets

Open AccessMethod

Combining guilt-by-association and guilt-by-profiling to predict Saccharomyces cerevisiae gene function

Weidong Tian1, Lan V Zhang1,4, Murat Taşan1, Francis D Gibbons1,5, Oliver D King1,6, Julie Park2, Zeba Wunderlich1,7, J Michael Cherry2 and Frederick P Roth1,3 email

1Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, Longwood Avenue, Boston, Massachusetts 02115, USA

2Department of Genetics, School of Medicine, Stanford University, Stanford, California 94305-5120, USA

3Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Jimmy Fund Way, Boston, Massachusetts 02115, USA

4McKinsey and Company, Hansen Way, Palo Alto, California 94304, USA

5Merrimack Pharmaceuticals, Kendall Square, Cambridge, Massachusetts 02139, USA

6Boston Biomedical Research Institute (BBRI), Grove St., Watertown, Massachusetts 02472, USA

7Massachusetts Institute of Technology, Massachusetts Ave, Cambridge, Massachusetts 02139, USA

author email corresponding author email

Genome Biology 2008, 9(Suppl 1):S7doi:10.1186/gb-2008-9-s1-s7

Published: 27 June 2008

Abstract

Background:

Learning the function of genes is a major goal of computational genomics. Methods for inferring gene function have typically fallen into two categories: 'guilt-by-profiling', which exploits correlation between function and other gene characteristics; and 'guilt-by-association', which transfers function from one gene to another via biological relationships.

Results:

We have developed a strategy ('Funckenstein') that performs guilt-by-profiling and guilt-by-association and combines the results. Using a benchmark set of functional categories and input data for protein-coding genes in Saccharomyces cerevisiae, Funckenstein was compared with a previous combined strategy. Subsequently, we applied Funckenstein to 2,455 Gene Ontology terms. In the process, we developed 2,455 guilt-by-profiling classifiers based on 8,848 gene characteristics and 12 functional linkage graphs based on 23 biological relationships.

Conclusion:

Funckenstein outperforms a previous combined strategy using a common benchmark dataset. The combination of 'guilt-by-profiling' and 'guilt-by-association' gave significant improvement over the component classifiers, showing the greatest synergy for the most specific functions. Performance was evaluated by cross-validation and by literature examination of the top-scoring novel predictions. These quantitative predictions should help prioritize experimental study of yeast gene functions.


© 1999-2009 BioMed Central Ltd unless otherwise stated. Part of Springer Science+Business Media.