Abstract

The DNA sequence of 91.4 kilobases of the Escherichia coli K-12 genome, spanning the region between rrnC at 84.5 minutes and rrnA at 86.5 minutes on the genetic map (85 to 87 percent on the physical map), is described. Analysis of this sequence identified 82 potential coding regions (open reading frames) covering 84 percent of the sequenced interval. The arrangement of these open reading frames, together with the consensus promoter sequences and terminator-like sequences found by computer searches, made it possible to assign them to proposed transcriptional units. More than half the open reading frames correlated with known genes or functions suggested by similarity to other sequences. Those remaining encode still unidentified proteins. The sequenced region also contains several RNA genes and two types of repeated sequence elements. Intergenic regions include three "gray holes," 0.6 to 0.8 kilobases, with no recognizable functions.

Keywords

Genetics, Biology, Open reading frame, Intergenic region, Gene, Terminator (solar), Genome, ENCODE, Sequence logo, Coding region, DNA, Reading frame, DNA sequencing, Nucleic acid sequence, Consensus sequence, Peptide sequence

Publication Info

Year: 1992
Type: article
Volume: 257
Issue: 5071
Pages: 771-778
Citations: 247
Access: Closed

Citation Metrics

247 (OpenAlex)

Cite This

Donna L. Daniels, Guy Plunkett, Valerie Burland et al. (1992). Analysis of the Escherichia coli Genome: DNA Sequence of the Region from 84.5 to 86.5 Minutes. Science, 257(5071), 771-778. https://doi.org/10.1126/science.1379743

Identifiers

DOI
10.1126/science.1379743