Abstract
Opinions on the hypothesis that ancient genome duplications contributed to the vertebrate genome range from strong skepticism to strong credence. Previous studies concentrated on small numbers of gene families or chromosomal regions that might not have been representative of the whole genome, or used subjective methods to identify paralogous genes and regions. Here we report a systematic and objective analysis of the draft human genome sequence to identify paralogous chromosomal regions (paralogons) formed during chordate evolution and to estimate the ages of duplicate genes. We found that the human genome contains many more paralogons than would be expected by chance. Molecular clock analysis of all protein families in humans that have orthologs in the fly and nematode indicated that a burst of gene duplication activity took place in the period 350 650 Myr ago and that many of the duplicate genes formed at this time are located within paralogons. Our results support the contention that many of the gene families in vertebrates were formed or expanded by large-scale DNA duplications in an early chordate. Considering the incompleteness of the sequence data and the antiquity of the event, the results are compatible with at least one round of polyploidy.
Keywords
MeSH Terms
Affiliated Institutions
Related Publications
The Complete Genome Sequence of <i>Escherichia coli</i> K-12
The 4,639,221–base pair sequence of Escherichia coli K-12 is presented. Of 4288 protein-coding genes annotated, 38 percent have no attributed function. Comparison with five othe...
Genome-wide identification of chromosomal regions of increased tumor expression by transcriptome analysis.
Genes up-regulated in tumor cells provide attractive anticancer therapeutic targets. Although the general underlying mechanism for the increased expression in tumors is unknown,...
The Sequence of the Human Genome
A 2.91-billion base pair (bp) consensus sequence of the euchromatic portion of the human genome was generated by the whole-genome shotgun sequencing method. The 14.8-billion bp ...
Misexpression of <i>Cwnt8C</i> in the mouse induces an ectopic embryonic axis and causes a truncation of the anterior neuroectoderm
ABSTRACT Transgenic embryos expressing Cwnt8C under the control of the human β-actin promoter exhibit duplicated axes or a severely dorsalised phenotype. Although the transgene ...
An integrated encyclopedia of DNA elements in the human genome
The human genome encodes the blueprint of life, but the function of the vast majority of its nearly three billion bases is unknown. The Encyclopedia of DNA Elements (ENCODE) pro...
Publication Info
- Year
- 2002
- Type
- article
- Volume
- 31
- Issue
- 2
- Pages
- 200-204
- Citations
- 521
- Access
- Closed
External Links
Social Impact
Social media, news, blog, policy document mentions
Citation Metrics
Cite This
Identifiers
- DOI
- 10.1038/ng884
- PMID
- 12032567