The Release 6 reference sequence of the Drosophila melanogaster genome

2015 Genome Research 514 citations

Abstract

Drosophila melanogaster plays an important role in molecular, genetic, and genomic studies of heredity, development, metabolism, behavior, and human disease. The initial reference genome sequence reported more than a decade ago had a profound impact on progress in Drosophila research, and improving the accuracy and completeness of this sequence continues to be important to further progress. We previously described improvement of the 117-Mb sequence in the euchromatic portion of the genome and 21 Mb in the heterochromatic portion, using a whole-genome shotgun assembly, BAC physical mapping, and clone-based finishing. Here, we report an improved reference sequence of the single-copy and middle-repetitive regions of the genome, produced using cytogenetic mapping to mitotic and polytene chromosomes, clone-based finishing and BAC fingerprint verification, ordering of scaffolds by alignment to cDNA sequences, incorporation of other map and sequence data, and validation by whole-genome optical restriction mapping. These data substantially improve the accuracy and completeness of the reference sequence and the order and orientation of sequence scaffolds into chromosome arm assemblies. Representation of the Y chromosome and other heterochromatic regions is particularly improved. The new 143.9-Mb reference sequence, designated Release 6, effectively exhausts clone-based technologies for mapping and sequencing. Highly repeat-rich regions, including large satellite blocks and functional elements such as the ribosomal RNA genes and the centromeres, are largely inaccessible to current sequencing and assembly methods and remain poorly represented. Further significant improvements will require sequencing technologies that do not depend on molecular cloning and that produce very long reads.

Keywords

Biology; Reference genome; Genetics; Contig; Genome; Sequence assembly; Genome project; Computational biology; Positional cloning; Bacterial artificial chromosome; Gene; Locus (genetics)

MeSH Terms

Animals; Chromosome Mapping; Chromosomes, Artificial, Bacterial; Computational Biology; Contig Mapping; Drosophila melanogaster; Genome; High-Throughput Nucleotide Sequencing; In Situ Hybridization, Fluorescence; Molecular Sequence Data; Polytene Chromosomes; Restriction Mapping

Publication Info

Year: 2015
Type: article
Volume: 25
Issue: 3
Pages: 445-458
Citations: 514
Access: Closed

Citation Metrics

OpenAlex: 514
Influential: 33
CrossRef: 476

Cite This

Roger A. Hoskins, Joseph W. Carlson, Kenneth H. Wan et al. (2015). The Release 6 reference sequence of the Drosophila melanogaster genome. Genome Research, 25(3), 445-458. https://doi.org/10.1101/gr.185579.114

Identifiers

DOI: 10.1101/gr.185579.114
PMID: 25589440
PMCID: PMC4352887
