Incorporating Biological Information as a Prior in an Empirical Bayes Approach to Analyzing Microarray Data

Wei Pan Wei Pan
2005 Statistical Applications in Genetics and Molecular Biology 23 citations

Abstract

Currently the practice of using existing biological knowledge in analyzing high throughput genomic and proteomic data is mainly for the purpose of validations. Here we take a different approach of incorporating biological knowledge into statistical analysis to improve statistical power and efficiency. Specifically, we consider how to fuse biological information into a mixture model to analyze microarray data. In contrast to a standard mixture model where it is assumed that all the genes come from the same (marginal) distribution, including an equal prior probability of having an event, such as having differential expression or being bound by a transcription factor (TF), our proposed mixture model allows the genes in different groups to have different distributions while the grouping of the genes reflects biological information. Using a list of about 800 putative cell cycle-regulated genes as prior biological knowledge, we analyze a genome-wide location data to detect binding sites of TF Fkh1. We find that our proposal improves over the standard approach, resulting in reduced false discovery rates (FDR), and hence it is a useful alternative to the current practice.

Keywords

False discovery rateBayes' theoremComputer scienceData miningBayes factorPrior probabilityBiological dataStatistical modelComputational biologyMicroarray analysis techniquesBayesian probabilityMachine learningBiologyGeneArtificial intelligenceBioinformaticsGene expressionGenetics

Related Publications

Publication Info

Year
2005
Type
article
Volume
4
Issue
1
Pages
Article12-Article12
Citations
23
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

23
OpenAlex

Cite This

Wei Pan (2005). Incorporating Biological Information as a Prior in an Empirical Bayes Approach to Analyzing Microarray Data. Statistical Applications in Genetics and Molecular Biology , 4 (1) , Article12-Article12. https://doi.org/10.2202/1544-6115.1124

Identifiers

DOI
10.2202/1544-6115.1124