Abstract

Abstract For template‐based modeling in the CASP8 Critical Assessment of Techniques for Protein Structure Prediction, this work develops and applies six new full‐model metrics. They are designed to complement and add value to the traditional template‐based assessment by the global distance test (GDT) and related scores (based on multiple superpositions of Cα atoms between target structure and predictions labeled “Model 1”). The new metrics evaluate each predictor group on each target, using all atoms of their best model with above‐average GDT. Two metrics evaluate how “protein‐like” the predicted model is: the MolProbity score used for validating experimental structures, and a mainchain reality score using all‐atom steric clashes, bond length and angle outliers, and backbone dihedrals. Four other new metrics evaluate match of model to target for mainchain and sidechain hydrogen bonds, sidechain end positioning, and sidechain rotamers. Group‐average Z‐score across the six full‐model measures is averaged with group‐average GDT Z‐score to produce the overall ranking for full‐model, high‐accuracy performance. Separate assessments are reported for specific aspects of predictor‐group performance, such as robustness of approximately correct template or fold identification, and self‐scoring ability at identifying the best of their models. Fold identification is distinct from but correlated with group‐average GDT Z‐score if target difficulty is taken into account, whereas self‐scoring is done best by servers and is uncorrelated with GDT performance. Outstanding individual models on specific targets are identified and discussed. Predictor groups excelled at different aspects, highlighting the diversity of current methodologies. However, good full‐model scores correlate robustly with high Cα accuracy. Proteins 2009. © 2009 Wiley‐Liss, Inc.

Keywords

Robustness (evolution)Artificial intelligenceProtein structure predictionOutlierRanking (information retrieval)Computer scienceTemplateMachine learningMathematicsStatisticsAlgorithmData miningChemistryProtein structure

Affiliated Institutions

Related Publications

Publication Info

Year
2009
Type
article
Volume
77
Issue
S9
Pages
29-49
Citations
81
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

81
OpenAlex

Cite This

D.A. Keedy, Christopher J. Williams, Jeffrey J. Headd et al. (2009). The other 90% of the protein: Assessment beyond the Cαs for CASP8 template‐based and high‐accuracy models. Proteins Structure Function and Bioinformatics , 77 (S9) , 29-49. https://doi.org/10.1002/prot.22551

Identifiers

DOI
10.1002/prot.22551