Abstract
Motivation: Digital gene expression (DGE) technologies measure gene expression by counting sequence tags. They are sensitive technologies for measuring gene expression on a genomic scale, without the need for prior knowledge of the genome sequence. As the cost of sequencing DNA decreases, the number of DGE datasets is expected to grow dramatically. Various tests of differential expression have been proposed for replicated DGE data using binomial, Poisson, negative binomial or pseudo-likelihood (PL) models for the counts, but none of these are usable when the number of replicates is very small.

Results: We develop tests using the negative binomial distribution to model overdispersion relative to the Poisson, and use conditional weighted likelihood to moderate the level of overdispersion across genes. Not only is our strategy applicable even with the smallest number of libraries, but it also proves to be more powerful than previous strategies when more libraries are available. The methodology is equally applicable to other counting technologies, such as proteomic spectral counts.

Availability: An R package can be accessed from http://bioinf.wehi.edu.au/resources/

Contact: smyth@wehi.edu.au

Supplementary information: http://bioinf.wehi.edu.au/resources/
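To make "overdispersion relative to the Poisson" concrete, here is a minimal Python sketch. It is not the authors' R package. It assumes the common mean-dispersion parameterization of the negative binomial, in which a count with mean mu and dispersion phi has variance mu + phi * mu^2; the abstract does not state this parameterization. The function names and the example values of mu and phi are hypothetical and chosen only for illustration.

```python
import math


def nb_log_pmf(y, mu, phi):
    """Log-probability of count y under a negative binomial with mean mu
    and dispersion phi (assumed parameterization, see lead-in)."""
    r = 1.0 / phi        # "size" parameter of the negative binomial
    p = r / (r + mu)     # success probability implied by mean and size
    return (math.lgamma(y + r) - math.lgamma(r) - math.lgamma(y + 1)
            + r * math.log(p) + y * math.log1p(-p))


def nb_moments(mu, phi, upper=2000):
    """Mean and variance obtained by summing the pmf over 0..upper-1.
    The truncation point is an illustrative choice, ample for small mu."""
    probs = [math.exp(nb_log_pmf(y, mu, phi)) for y in range(upper)]
    mean = sum(y * q for y, q in enumerate(probs))
    var = sum((y - mean) ** 2 * q for y, q in enumerate(probs))
    return mean, var


mu, phi = 20.0, 0.1                  # hypothetical tag mean and dispersion
mean, var = nb_moments(mu, phi)
poisson_var = mu                     # a Poisson count has variance equal to its mean
print("NB mean:", round(mean, 6))
print("NB variance:", round(var, 6), "vs Poisson variance:", poisson_var)
```

Under this assumption the negative binomial variance exceeds the Poisson variance whenever phi > 0, and the two coincide as phi approaches 0. The sketch covers only the variance model; the conditional weighted likelihood step that moderates dispersion across genes is not shown.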
Related Publications
Fitting Discrete Probability Distributions to Evolutionary Events
The assumptions underlying the use of the Poisson distribution are essentially that the probability of an event is small but nearly identical for all occurrences and that the oc...
Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments
The problem of identifying differentially expressed genes in designed microarray experiments is considered. Lonnstedt and Speed (2002) derived an expression for the posterior od...
Some Applications of Radial Plots
A radial plot is a graphical display for comparing estimates that have differing precisions. It is a scatter plot of standardized estimates against reciprocals of stand...
GEPIA: a web server for cancer and normal gene expression profiling and interactive analyses
Tremendous amount of RNA sequencing data have been produced by large consortium projects such as TCGA and GTEx, creating new opportunities for data mining and deeper understandi...
Expression Atlas update—an integrated database of gene and protein expression in humans, animals and plants
Expression Atlas (http://www.ebi.ac.uk/gxa) provides information about gene and protein expression in animal and plant samples of different cell types, organism parts, developme...
Publication Info
- Year: 2007
- Type: article
- Volume: 23
- Issue: 21
- Pages: 2881-2887
- Citations: 906
- Access: Closed
Identifiers
- DOI: 10.1093/bioinformatics/btm453
- PMID: 17881408