Abstract
Elimination of the data processing bottleneck in high-throughput sequencing will require both improved accuracy of data processing software and reliable measures of that accuracy. We have developed and implemented in our base-calling program phred the ability to estimate a probability of error for each base-call, as a function of certain parameters computed from the trace data. These error probabilities are shown here to be valid (correspond to actual error rates) and to have high power to discriminate correct base-calls from incorrect ones, for read data collected under several different chemistries and electrophoretic conditions. They play a critical role in our assembly program phrap and our finishing program consed.
Keywords
MeSH Terms
Affiliated Institutions
Related Publications
Fragment assembly with short reads
Abstract Motivation: Current DNA sequencing technology produces reads of about 500–750 bp, with typical coverage under 10×. New sequencing technologies are emerging that produce...
SHARCGS, a fast and highly accurate short-read assembly algorithm for de novo genomic sequencing
The latest revolution in the DNA sequencing field has been brought about by the development of automated sequencers that are capable of generating giga base pair data sets quick...
Auto-encoder bottleneck features using deep belief networks
Neural network (NN) bottleneck (BN) features are typically created by training a NN with a middle bottleneck layer. Recently, an alternative structure was proposed which trains ...
Density-functional thermochemistry. I. The effect of the exchange-only gradient correction
Previous work by the author on diatomic molecules and by others on polyatomic systems has revealed that Kohn–Sham density-functional theory with ‘‘gradient corrected’’ exchange-...
<i>CrystalExplorer</i>: a program for Hirshfeld surface analysis, visualization and quantitative analysis of molecular crystals
CrystalExplorer is a native cross-platform program supported on Windows, MacOS and Linux with the primary function of visualization and investigation of molecular crystal struct...
Publication Info
- Year
- 1998
- Type
- article
- Volume
- 8
- Issue
- 3
- Pages
- 186-194
- Citations
- 5469
- Access
- Closed
External Links
Social Impact
Social media, news, blog, policy document mentions
Citation Metrics
Cite This
Identifiers
- DOI
- 10.1101/gr.8.3.186
- PMID
- 9521922