Abstract

Existing long-read assemblers require thousands of central processing unit hours to assemble a human genome and are being outpaced by sequencing technologies in terms of both throughput and cost. We developed a long-read assembler wtdbg2 (https://github.com/ruanjue/wtdbg2) that is 2–17 times as fast as published tools while achieving comparable contiguity and accuracy. It paves the way for population-scale long-read assembly in future. Wtdbg2 assembles genomes with comparable contiguity and accuracy to existing tools using long-read sequencing data, and is several times faster, especially for large genomes.

Keywords

ContiguityComputer scienceGenomeThroughputComputational biologyPopulationSequence assemblyHuman genomeBiologyGeneticsGeneOperating system

MeSH Terms

AlgorithmsDatasets as TopicGenomeHumanHigh-Throughput Nucleotide SequencingHumans

Affiliated Institutions

Related Publications

Publication Info

Year
2019
Type
article
Volume
17
Issue
2
Pages
155-158
Citations
1466
Access
Closed

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

1466
OpenAlex
119
Influential

Cite This

Jue Ruan, Heng Li (2019). Fast and accurate long-read assembly with wtdbg2. Nature Methods , 17 (2) , 155-158. https://doi.org/10.1038/s41592-019-0669-3

Identifiers

DOI
10.1038/s41592-019-0669-3
PMID
31819265
PMCID
PMC7004874

Data Quality

Data completeness: 90%