Abstract
New DNA sequencing technologies can sequence up to one billion bases in a single day at low cost, putting large-scale sequencing within the reach of many scientists. Many researchers are forging ahead with projects to sequence a range of species using the new technologies. However, these new technologies produce read lengths as short as 35-40 nucleotides, posing challenges for genome assembly and annotation. Here we review the challenges and describe some of the bioinformatics systems that are being proposed to solve them. We specifically address issues arising from using these technologies in assembly projects, both de novo and for resequencing purposes, as well as efforts to improve genome annotation in the fragmented assemblies produced by short read lengths.
Keywords
MeSH Terms
Affiliated Institutions
Related Publications
Whole-Genome Sequencing and Assembly with High-Throughput, Short-Read Technologies
While recently developed short-read sequencing technologies may dramatically reduce the sequencing cost and eventually achieve the $1000 goal for re-sequencing, their limitation...
Unicycler: Resolving bacterial genome assemblies from short and long sequencing reads
The Illumina DNA sequencing platform generates accurate but short reads, which can be used to produce accurate but fragmented genome assemblies. Pacific Biosciences and Oxford N...
Transcriptome assembly from long-read RNA-seq alignments with StringTie2
Abstract RNA sequencing using the latest single-molecule sequencing instruments produces reads that are thousands of nucleotides long. The ability to assemble these long reads c...
ABySS: A parallel assembler for short read sequence data
Widespread adoption of massively parallel deoxyribonucleic acid (DNA) sequencing instruments has prompted the recent development of de novo short read assembly algorithms. A com...
<scp>hybrid</scp>SPA<scp>des</scp>: an algorithm for hybrid assembly of short and long reads
Abstract Motivation: Recent advances in single molecule real-time (SMRT) and nanopore sequencing technologies have enabled high-quality assemblies from long and inaccurate reads...
Publication Info
- Year
- 2008
- Type
- review
- Volume
- 24
- Issue
- 3
- Pages
- 142-149
- Citations
- 509
- Access
- Closed
External Links
Social Impact
Social media, news, blog, policy document mentions
Citation Metrics
Cite This
Identifiers
- DOI
- 10.1016/j.tig.2007.12.006
- PMID
- 18262676
- PMCID
- PMC2680276