Abstract

Background:Learning the function of genes is a major goal of computational genomics. Methods for inferring gene function have typically fallen into two categories: 'guilt-by-profiling', which exploits correlation between function and other gene characteristics; and 'guilt-by-association', which transfers function from one gene to another via biological relationships.Results:We have developed a strategy ('Funckenstein') that performs guilt-by-profiling and guilt-by-association and combines the results. Using a benchmark set of functional categories and input data for protein-coding genes in Saccharomyces cerevisiae, Funckenstein was compared with a previous combined strategy. Subsequently, we applied Funckenstein to 2,455 Gene Ontology terms. In the process, we developed 2,455 guilt-by-profiling classifiers based on 8,848 gene characteristics and 12 functional linkage graphs based on 23 biological relationships.Conclusion:Funckenstein outperforms a previous combined strategy using a common benchmark dataset. The combination of 'guilt-by-profiling' and 'guilt-by-association' gave significant improvement over the component classifiers, showing the greatest synergy for the most specific functions. Performance was evaluated by cross-validation and by literature examination of the top-scoring novel predictions. These quantitative predictions should help prioritize experimental study of yeast gene functions.

Keywords

Gene ontologyProfiling (computer programming)Gene knockoutComputational biologyGeneComputer scienceArtificial intelligenceData miningMachine learningBiologyGeneticsGene expression

MeSH Terms

AlgorithmsComputational BiologyGenesFungalMetabolic Networks and PathwaysSaccharomyces cerevisiaeSaccharomyces cerevisiae ProteinsSoftware

Affiliated Institutions

Related Publications

Publication Info

Year
2008
Type
article
Volume
9
Issue
S1
Pages
S7-S7
Citations
90
Access
Closed

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

90
OpenAlex
4
Influential

Cite This

Weidong Tian, Lan V. Zhang, Murat Taşan et al. (2008). Combining guilt-by-association and guilt-by-profiling to predict Saccharomyces cerevisiaegene function. Genome biology , 9 (S1) , S7-S7. https://doi.org/10.1186/gb-2008-9-s1-s7

Identifiers

DOI
10.1186/gb-2008-9-s1-s7
PMID
18613951
PMCID
PMC2447541

Data Quality

Data completeness: 86%