BMC Bioinformatics (Sep 2004)

Handling multiple testing while interpreting microarrays with the Gene Ontology Database

  • Zhao Hongyu,
  • Osier Michael V,
  • Cheung Kei-Hoi

DOI
https://doi.org/10.1186/1471-2105-5-124
Journal volume & issue
Vol. 5, no. 1
p. 124

Abstract

Read online

Abstract Background The development of software tools that analyze microarray data in the context of genetic knowledgebases is being pursued by multiple research groups using different methods. A common problem for many of these tools is how to correct for multiple statistical testing since simple corrections are overly conservative and more sophisticated corrections are currently impractical. A careful study of the nature of the distribution one would expect by chance, such as by a simulation study, may be able to guide the development of an appropriate correction that is not overly time consuming computationally. Results We present the results from a preliminary study of the distribution one would expect for analyzing sets of genes extracted from Drosophila, S. cerevisiae, Wormbase, and Gramene databases using the Gene Ontology Database. Conclusions We found that the estimated distribution is not regular and is not predictable outside of a particular set of genes. Permutation-based simulations may be necessary to determine the confidence in results of such analyses.