Abstract

Summary Using in silico analysis we studied a novel family of repetitive DNA sequences that is present among both domains of the prokaryotes (Archaea and Bacteria), but absent from eukaryotes or viruses. This family is characterized by direct repeats, varying in size from 21 to 37 bp, interspaced by similarly sized non‐repetitive sequences. To appreciate their characteri‐stic structure, we will refer to this family as the clustered regularly interspaced short palindromic repeats (CRISPR). In most species with two or more CRISPR loci, these loci were flanked on one side by a common leader sequence of 300–500 b. The direct repeats and the leader sequences were conserved within a species, but dissimilar between species. The presence of multiple chromosomal CRISPR loci suggests that CRISPRs are mobile elements. Four CRISPR‐associated ( cas ) genes were identified in CRISPR‐containing prokaryotes that were absent from CRISPR‐negative prokaryotes. The cas genes were invariably located adjacent to a CRISPR locus, indicating that the cas genes and CRISPR loci have a functional relationship. The cas3 gene showed motifs characteristic for helicases of the superfamily 2, and the cas 4 gene showed motifs of the RecB family of exonucleases, suggesting that these genes are involved in DNA metabolism or gene expression. The spatial coherence of CRISPR and cas genes may stimulate new research on the genesis and biological role of these repeats and genes.

Keywords

CRISPRBiologyGeneticsGeneCRISPR interferenceLocus (genetics)Direct repeatPalindromeDNAGenomeCas9

MeSH Terms

Amino Acid SequenceArchaeal ProteinsBacterial ProteinsBase SequenceComputational BiologyGenesArchaealGenesBacterialMolecular Sequence DataProkaryotic CellsRepetitive SequencesNucleic Acid

Affiliated Institutions

Related Publications

Publication Info

Year
2002
Type
article
Volume
43
Issue
6
Pages
1565-1575
Citations
1920
Access
Closed

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

1920
OpenAlex
82
Influential
1618
CrossRef

Cite This

Ruud Jansen, Jan D. A. van Embden, Wim Gaastra et al. (2002). Identification of genes that are associated with DNA repeats in prokaryotes. Molecular Microbiology , 43 (6) , 1565-1575. https://doi.org/10.1046/j.1365-2958.2002.02839.x

Identifiers

DOI
10.1046/j.1365-2958.2002.02839.x
PMID
11952905

Data Quality

Data completeness: 86%