Abstract

We derive and validate an estimator for the parameters of a transformation for the joint calibration (normalization) and variance stabilization of microarray intensity data. With this, the variances of the transformed intensities become approximately independent of their expected values. The transformation is similar to the logarithm in the high intensity range, but has a smaller slope for intensities close to zero. Applications have shown better sensitivity and specificity for the detection of differentially expressed genes. In this paper, we describe the theoretical aspects of the method. We incorporate calibration and variance-mean dependence into a statistical model and use a robust variant of the maximum-likelihood method to estimate the transformation parameters. Using simulations, we investigate the size of the estimation error and its dependence on sample size and the presence of outliers. We find that the error decreases with the square root of the number of probes per array and that the estimation is robust against the presence of differentially expressed genes. Software is publicly available as an R package through the Bioconductor project (http://www.bioconductor.org).

Keywords

BioconductorStatisticsEstimatorOutlierCalibrationMathematicsNormalization (sociology)LogarithmTransformation (genetics)Variance (accounting)Sensitivity (control systems)Computer scienceAlgorithmBiology

Affiliated Institutions

Related Publications

Publication Info

Year
2003
Type
article
Volume
2
Issue
1
Pages
Article3-Article3
Citations
181
Access
Closed

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

181
OpenAlex
121
CrossRef

Cite This

Wolfgang Huber, Anja von Heydebreck, Holger Sueltmann et al. (2003). Parameter estimation for the calibration and variance stabilization of microarray data. Statistical Applications in Genetics and Molecular Biology , 2 (1) , Article3-Article3. https://doi.org/10.2202/1544-6115.1008

Identifiers

DOI
10.2202/1544-6115.1008
PMID
16646781

Data Quality

Data completeness: 77%