Abstract
Currently, existing biological knowledge is used in the analysis of high-throughput genomic and proteomic data mainly for validation. Here we take a different approach: incorporating biological knowledge into the statistical analysis itself to improve statistical power and efficiency. Specifically, we consider how to fuse biological information into a mixture model for analyzing microarray data. A standard mixture model assumes that all genes come from the same (marginal) distribution, including an equal prior probability of having an event, such as being differentially expressed or being bound by a transcription factor (TF). In contrast, our proposed mixture model allows genes in different groups to have different distributions, with the grouping of the genes reflecting biological information. Using a list of about 800 putative cell cycle-regulated genes as prior biological knowledge, we analyze genome-wide location data to detect binding sites of the TF Fkh1. We find that our proposal improves on the standard approach, yielding reduced false discovery rates (FDR), and hence is a useful alternative to current practice.
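The contrast between a shared prior and group-specific priors can be illustrated with a minimal sketch. It is not the paper's actual model: the normal component densities, the fixed alternative mean `alt_mean`, and the function names `posterior_event_prob` and `fit_group_priors` are all assumptions chosen for illustration. Only the idea that each gene group gets its own prior probability of an event comes from the abstract.

```python
import math


def normal_pdf(x, mean, sd):
    """Density of a normal distribution at x."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def posterior_event_prob(score, prior, alt_mean=2.0):
    """Posterior probability of an event (e.g. TF binding) given a score.

    Assumed two-component mixture: null N(0, 1), alternative N(alt_mean, 1).
    `prior` is the prior event probability for the gene's group.
    """
    f0 = normal_pdf(score, 0.0, 1.0)
    f1 = normal_pdf(score, alt_mean, 1.0)
    return prior * f1 / (prior * f1 + (1.0 - prior) * f0)


def fit_group_priors(scores, groups, alt_mean=2.0, n_iter=200):
    """Estimate one prior per group by EM, keeping the densities fixed.

    A standard mixture model would correspond to passing the same group
    label for every gene, which yields a single shared prior.
    """
    labels = sorted(set(groups))
    priors = {g: 0.5 for g in labels}
    for _ in range(n_iter):
        sums = {g: 0.0 for g in labels}
        counts = {g: 0 for g in labels}
        # E-step: posterior event probability under the current priors.
        for score, g in zip(scores, groups):
            sums[g] += posterior_event_prob(score, priors[g], alt_mean)
            counts[g] += 1
        # M-step: each group's prior is its mean posterior probability.
        priors = {g: sums[g] / counts[g] for g in labels}
    return priors


# Hypothetical scores: group "a" stands in for genes on the prior list.
scores = [2.5, 1.8, 2.2, 0.1, -0.3, 0.2, 0.1, 2.0]
groups = ["a", "a", "a", "a", "b", "b", "b", "b"]
priors = fit_group_priors(scores, groups)
```

Under these assumptions, a gene with the same score receives a higher posterior probability of an event when it belongs to the group with the larger estimated prior, which is how the grouping can shift the ranking of genes.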
Related Publications
Detecting Differentially Expressed Genes in Microarrays Using Bayesian Model Selection
DNA microarrays open up a broad new horizon for investigators interested in studying the genetic determinants of disease. The high throughput nature of these arrays, whe...
Empirical Bayes Analysis of a Microarray Experiment
Microarrays are a novel technology that facilitates the simultaneous measurement of thousands of gene expression levels. A typical microarray experiment can produce mill...
Significance analysis of microarrays applied to the ionizing radiation response
Microarrays can measure the expression of thousands of genes to identify changes in expression between different biological states. Methods are needed to determine the significa...
Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments
The problem of identifying differentially expressed genes in designed microarray experiments is considered. Lonnstedt and Speed (2002) derived an expression for the posterior od...
Deep sequencing-based expression analysis shows major advances in robustness, resolution and inter-lab portability over five microarray platforms
The hippocampal expression profiles of wild-type mice and mice transgenic for deltaC-doublecortin-like kinase were compared with Solexa/Illumina deep sequencing technology and f...
Publication Info
- Year: 2005
- Type: article
- Volume: 4
- Issue: 1
- Pages: Article12-Article12
- Citations: 23
- Access: Closed
Identifiers
- DOI: 10.2202/1544-6115.1124