Abstract
I estimate per nucleotide rates of spontaneous mutations of different kinds in humans directly from the data on per locus mutation rates and on sequences of de novo nonsense nucleotide substitutions, deletions, insertions, and complex events at eight loci causing autosomal dominant diseases and 12 loci causing X-linked diseases. The results are in good agreement with indirect estimates obtained by comparing orthologous human and chimpanzee pseudogenes. The average direct estimate of the combined rate of all mutations is 1.8 × 10⁻⁸ per nucleotide per generation, and the coefficient of variation of this rate across the 20 loci is 0.53. Single nucleotide substitutions are approximately 25 times more common than all other mutations, deletions are approximately three times more common than insertions, complex mutations are very rare, and CpG context increases substitution rates by an order of magnitude. There is only a moderate tendency for loci with high per locus mutation rates to also have higher per nucleotide substitution rates, and per nucleotide rates of deletions and insertions are statistically independent of the per locus mutation rate. Rates of different kinds of mutations are strongly correlated across loci. Mutational hot spots with per nucleotide rates above 5 × 10⁻⁷ make only a minor contribution to human mutation. In the next decade, direct measurements will produce a rather precise, quantitative description of human spontaneous mutation at the DNA level.
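The ratios quoted in the abstract imply an approximate breakdown of the combined rate by mutation type. The sketch below works through that arithmetic in Python. It is an illustration, not the paper's method: the function name `partition_rate` is hypothetical, complex events are assumed to be negligible (the abstract calls them "very rare"), and the "approximately 25 times" and "approximately three times" figures are treated as exact ratios.

```python
# Figures quoted in the abstract.
TOTAL_RATE = 1.8e-8     # all mutations, per nucleotide per generation
SUBST_TO_OTHER = 25.0   # substitutions : all other mutations (approximate)
DEL_TO_INS = 3.0        # deletions : insertions (approximate)
CV = 0.53               # coefficient of variation across the 20 loci


def partition_rate(total, subst_to_other, del_to_ins):
    """Split a combined rate into substitution, deletion and insertion rates.

    Assumption (not from the paper): complex events contribute nothing,
    so every non-substitution event is either a deletion or an insertion.
    """
    other = total / (1.0 + subst_to_other)   # all non-substitution events
    substitution = total - other
    insertion = other / (1.0 + del_to_ins)
    deletion = other - insertion
    return {
        "substitution": substitution,
        "deletion": deletion,
        "insertion": insertion,
    }


rates = partition_rate(TOTAL_RATE, SUBST_TO_OTHER, DEL_TO_INS)

# CV = standard deviation / mean, so the implied across-locus SD is CV * mean.
implied_sd = CV * TOTAL_RATE

for kind, rate in rates.items():
    print(f"{kind:>12}: {rate:.2e} per nucleotide per generation "
          f"({rate / TOTAL_RATE:.1%} of all mutations)")
print(f"  implied SD: {implied_sd:.2e}")
```

Under these assumptions substitutions account for 25/26, or about 96%, of all events, and the remainder splits three to one between deletions and insertions. Because the input ratios are approximate, the derived per-type rates are rough guides rather than estimates reported by the paper.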
Publication Info
- Year: 2002
- Type: article
- Volume: 21
- Issue: 1
- Pages: 12-27
- Citations: 350
- Access: Closed
Identifiers
- DOI: 10.1002/humu.10147