Abstract

Abstract We introduce a statistical model for microarray gene expression data that comprises data calibration, the quantification of differential expression, and the quantification of measurement error. In particular, we derive a transformation h for intensity measurements, and a difference statistic Δh whose variance is approximately constant along the whole intensity range. This forms a basis for statistical inference from microarray data, and provides a rational data pre-processing strategy for multivariate analyses. For the transformation h, the parametric form h(x)=arsinh(a+bx) is derived from a model of the variance-versus-mean dependence for microarray intensity data, using the method of variance stabilizing transformations. For large intensities, h coincides with the logarithmic transformation, and Δh with the log-ratio. The parameters of h together with those of the calibration between experiments are estimated with a robust variant of maximum-likelihood estimation. We demonstrate our approach on data sets from different experimental platforms, including two-colour cDNA arrays and a series of Affymetrix oligonucleotide arrays. Availability: Software is freely available for academic use as an R package at http://www.dkfz.de/abt0840/whuber Contact: w.huber@dkfz.de

Keywords

CalibrationTransformation (genetics)StatisticsMathematicsParametric statisticsLogarithmComputer scienceBiologyGenetics

MeSH Terms

AlgorithmsAnalysis of VarianceCalibrationData InterpretationStatisticalGene Expression ProfilingLikelihood FunctionsModelsGeneticModelsStatisticalOligonucleotide Array Sequence AnalysisReference StandardsReproducibility of ResultsSensitivity and SpecificitySoftware

Affiliated Institutions

Related Publications

Publication Info

Year
2002
Type
article
Volume
18
Issue
suppl_1
Pages
S96-S104
Citations
2492
Access
Closed

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

2492
OpenAlex
120
Influential
1997
CrossRef

Cite This

Wolfgang Huber, Anja von Heydebreck, Holger Sültmann et al. (2002). Variance stabilization applied to microarray data calibration and to the quantification of differential expression. Bioinformatics , 18 (suppl_1) , S96-S104. https://doi.org/10.1093/bioinformatics/18.suppl_1.s96

Identifiers

DOI
10.1093/bioinformatics/18.suppl_1.s96
PMID
12169536

Data Quality

Data completeness: 86%