Abstract

Abstract Motivation: A number of available program packages determine the significant enrichments and/or depletions of GO categories among a class of genes of interest. Whereas a correct formulation of the problem leads to a single exact null distribution, these GO tools use a large variety of statistical tests whose denominations often do not clarify the underlying P-value computations. Summary: We review the different formulations of the problem and the tests they lead to: the binomial, χ2, equality of two probabilities, Fisher's exact and hypergeometric tests. We clarify the relationships existing between these tests, in particular the equivalence between the hypergeometric test and Fisher's exact test. We recall that the other tests are valid only for large samples, the test of equality of two probabilities and the χ2-test being equivalent. We discuss the appropriateness of one- and two-sided P-values, as well as some discreteness and conservatism issues. Contact: isabelle.rivals@espci.fr Supplementary information: Supplementary data are available at Bioinformatics online.

Keywords

Hypergeometric distributionExact testBinomial (polynomial)Hypergeometric functionEquivalence (formal languages)p-valueMathematicsClass (philosophy)Test (biology)Null hypothesisStatistical hypothesis testingStatisticsComputer scienceDiscrete mathematicsPure mathematicsArtificial intelligenceBiology

Affiliated Institutions

Related Publications

Publication Info

Year
2006
Type
article
Volume
23
Issue
4
Pages
401-407
Citations
762
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

762
OpenAlex

Cite This

Isabelle Rivals, L. Personnaz, Lieng Taing et al. (2006). Enrichment or depletion of a GO category within a class of genes: which test?. Bioinformatics , 23 (4) , 401-407. https://doi.org/10.1093/bioinformatics/btl633

Identifiers

DOI
10.1093/bioinformatics/btl633