Abstract

Microarray technology is rapidly emerging for genome-wide screening of differentially expressed genes between clinical subtypes or different conditions of human diseases. Traditional statistical testing approaches, such as the two-sample t-test or Wilcoxon test, are frequently used for evaluating statistical significance of informative expressions but require adjustment for large-scale multiplicity. Due to its simplicity, Bonferroni adjustment has been widely used to circumvent this problem. It is well known, however, that the standard Bonferroni test is often very conservative. In the present paper, we compare three multiple testing procedures in the microarray context: the original Bonferroni method, a Bonferroni-type improved single-step method and a step-down method. The latter two methods are based on nonparametric resampling, by which the null distribution can be derived with the dependency structure among gene expressions preserved and the family-wise error rate accurately controlled at the desired level. We also present a sample size calculation method for designing microarray studies. Through simulations and data analyses, we find that the proposed methods for testing and sample size calculation are computationally fast and control error and power precisely.

Keywords

Bonferroni correctionFalse discovery rateResamplingMultiple comparisons problemSample size determinationType I and type II errorsStatistical hypothesis testingStatistical powerComputer scienceWilcoxon signed-rank testStatisticsNonparametric statisticsMathematicsNominal levelData miningGeneticsMann–Whitney U testBiology

Affiliated Institutions

Related Publications

A Direct Approach to False Discovery Rates

Summary Multiple-hypothesis testing involves guarding against much more complicated errors than single-hypothesis testing. Whereas we typically control the type I error rate for...

2002 Journal of the Royal Statistical Soci... 5607 citations

Publication Info

Year
2004
Type
article
Volume
6
Issue
1
Pages
157-169
Citations
80
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

80
OpenAlex

Cite This

Sin‐Ho Jung, Heejung Bang, S. Stanley Young (2004). Sample size calculation for multiple testing in microarray data analysis. Biostatistics , 6 (1) , 157-169. https://doi.org/10.1093/biostatistics/kxh026

Identifiers

DOI
10.1093/biostatistics/kxh026