Enrichment or depletion of a GO category within a class of genes: which test?

Abstract : Motivation: A number of available program packages determine the significant enrichments and/or depletions of GO categories among a class of genes of interest. Whereas a correct formulation of the prob-lem leads to a single exact null distribution, these GO tools use a large variety of statistical tests whose denominations often do not clarify the underlying p-value computations. Summary: We review the different formulations of the problem and the tests they lead to: the binomial, χ2, equality of two probabilities, Fisher's exact, and hypergeometric tests. We clarify the relation-ships existing between these tests, in particular the equivalence between the hypergeometric test and Fisher's exact test. We recall that the other tests are valid only for large samples, the test of equal-ity of two probabilities and the χ2 test being equivalent. We discuss the appropriateness of one- and two-sided p-values, as well as some discreteness and conservatism issues.
Complete list of metadatas

Cited literature [33 references]  Display  Hide  Download

https://hal-espci.archives-ouvertes.fr/hal-00801557
Contributor : Isabelle Rivals <>
Submitted on : Sunday, March 17, 2013 - 2:16:30 PM
Last modification on : Wednesday, July 10, 2019 - 7:14:02 PM
Long-term archiving on : Tuesday, June 18, 2013 - 3:57:45 AM

File

Bioinformatics-2007-Rivals-401...
Publisher files allowed on an open archive

Identifiers

Collections

Citation

Isabelle Rivals, Léon Personnaz, Lieng Taing, Potier Marie-Claude. Enrichment or depletion of a GO category within a class of genes: which test?. Bioinformatics, Oxford University Press (OUP), 2007, 23 (4), pp.401-407. ⟨10.1093/bioinformatics/btl633⟩. ⟨hal-00801557⟩

Share

Metrics

Record views

398

Files downloads

733