Abstract
We find that the degree of impairment of protein function by missense variants is predictable by comparative sequence analysis alone. The applicable range of impairment is not confined to binary predictions that distinguish normal from deleterious variants, but extends continuously from mild to severe effects. The accuracy of predictions is strongly dependent on sequence variation and is highest when diverse orthologs are available. High predictive accuracy is achieved by quantification of the physicochemical characteristics in each position of the protein, based on observed evolutionary variation. The strong relationship between physicochemical characteristics of a missense variant and impairment of protein function extends to human disease. By using four diverse proteins for which sufficient comparative sequence data are available, we show that grades of disease, or likelihood of developing cancer, correlate strongly with physicochemical constraint violation by causative amino acid variants.
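To make the idea of per-position physicochemical constraint concrete, the following is a minimal sketch and not the authors' published method. It assumes one alignment column of ortholog residues, two illustrative properties (Kyte-Doolittle hydropathy and approximate residue volume), and a simple z-score distance. The names `PROPERTIES`, `SIGMA_FLOOR`, and `constraint_violation` are hypothetical, and the floor values are arbitrary choices that keep fully conserved columns from dividing by zero.

```python
import math
from statistics import mean, pstdev

# Per-residue (Kyte-Doolittle hydropathy, approximate volume in cubic angstroms).
# Illustrative property set only; a real analysis would likely use more properties.
PROPERTIES = {
    "A": (1.8, 88.6),   "R": (-4.5, 173.4), "N": (-3.5, 114.1), "D": (-3.5, 111.1),
    "C": (2.5, 108.5),  "Q": (-3.5, 143.8), "E": (-3.5, 138.4), "G": (-0.4, 60.1),
    "H": (-3.2, 153.2), "I": (4.5, 166.7),  "L": (3.8, 166.7),  "K": (-3.9, 168.6),
    "M": (1.9, 162.9),  "F": (2.8, 189.9),  "P": (-1.6, 112.7), "S": (-0.8, 89.0),
    "T": (-0.7, 116.1), "W": (-0.9, 227.8), "Y": (-1.3, 193.6), "V": (4.2, 140.0),
}

# Assumed minimum spread per property, so a fully conserved column still
# yields a finite score. These values are arbitrary, not taken from the paper.
SIGMA_FLOOR = (0.5, 10.0)


def constraint_violation(column, variant):
    """Score how far `variant` falls outside the property range seen in `column`.

    column  -- residues observed at one alignment position across orthologs
    variant -- the substituted amino acid (one-letter code)
    Returns a non-negative, unitless score; larger means a stronger violation.
    """
    total = 0.0
    for i, floor in enumerate(SIGMA_FLOOR):
        observed = [PROPERTIES[aa][i] for aa in column]
        mu = mean(observed)
        sigma = max(pstdev(observed), floor)
        z = (PROPERTIES[variant][i] - mu) / sigma
        total += z * z
    return math.sqrt(total)


if __name__ == "__main__":
    hydrophobic_column = ["L", "I", "L", "V", "L", "M"]
    for aa in ("I", "D"):
        print(aa, round(constraint_violation(hydrophobic_column, aa), 2))
```

Because the score is continuous, it can rank variants from mild to severe rather than only splitting them into two classes. It also shrinks when the column itself is variable, which is one way to see why diverse orthologs matter for accuracy.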
Related Publications
SIFT web server: predicting effects of amino acid substitutions on proteins
The Sorting Intolerant from Tolerant (SIFT) algorithm predicts the effect of coding variants on protein function. It was first introduced in 2001, with a corresponding website t...
Computational approaches for predicting the biological effect of p53 missense mutations: a comparison of three sequence analysis based methods
Prediction of the biological effect of missense substitutions has become important because they are often observed in known or candidate disease susceptibility genes. In this pa...
Synucleinopathies
The synucleinopathies are a diverse group of neurodegenerative disorders that share a common pathologic lesion composed of aggregates of insoluble alpha-synuclein protein in sel...
Deleterious- and Disease-Allele Prevalence in Healthy Individuals: Insights from Current Predictions, Mutation Databases, and Population-Scale Resequencing
We have assessed the numbers of potentially deleterious variants in the genomes of apparently healthy humans by using (1) low-coverage whole-genome sequence data from 179 indivi...
Accurate proteome-wide missense variant effect prediction with AlphaMissense
The vast majority of missense variants observed in the human genome are of unknown clinical significance. We present AlphaMissense, an adaptation of AlphaFold fine-tuned on huma...
Publication Info
- Year: 2005
- Type: article
- Volume: 15
- Issue: 7
- Pages: 978-986
- Citations: 420
- Access: Closed
Identifiers
- DOI: 10.1101/gr.3804205