1
|
Manalastas-Cantos K, Adoni KR, Pfeifer M, Märtens B, Grünewald K, Thalassinos K, Topf M. Modeling Flexible Protein Structure With AlphaFold2 and Crosslinking Mass Spectrometry. Mol Cell Proteomics 2024; 23:100724. [PMID: 38266916 PMCID: PMC10884514 DOI: 10.1016/j.mcpro.2024.100724] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Revised: 12/23/2023] [Accepted: 12/27/2023] [Indexed: 01/26/2024] Open
Abstract
We propose a pipeline that combines AlphaFold2 (AF2) and crosslinking mass spectrometry (XL-MS) to model the structure of proteins with multiple conformations. The pipeline consists of two main steps: ensemble generation using AF2 and conformer selection using XL-MS data. For conformer selection, we developed two scores-the monolink probability score (MP) and the crosslink probability score (XLP)-both of which are based on residue depth from the protein surface. We benchmarked MP and XLP on a large dataset of decoy protein structures and showed that our scores outperform previously developed scores. We then tested our methodology on three proteins having an open and closed conformation in the Protein Data Bank: Complement component 3 (C3), luciferase, and glutamine-binding periplasmic protein, first generating ensembles using AF2, which were then screened for the open and closed conformations using experimental XL-MS data. In five out of six cases, the most accurate model within the AF2 ensembles-or a conformation within 1 Å of this model-was identified using crosslinks, as assessed through the XLP score. In the remaining case, only the monolinks (assessed through the MP score) successfully identified the open conformation of glutamine-binding periplasmic protein, and these results were further improved by including the "occupancy" of the monolinks. This serves as a compelling proof-of-concept for the effectiveness of monolinks. In contrast, the AF2 assessment score was only able to identify the most accurate conformation in two out of six cases. Our results highlight the complementarity of AF2 with experimental methods like XL-MS, with the MP and XLP scores providing reliable metrics to assess the quality of the predicted models. The MP and XLP scoring functions mentioned above are available at https://gitlab.com/topf-lab/xlms-tools.
Collapse
Affiliation(s)
- Karen Manalastas-Cantos
- Center for Data and Computing in Natural Sciences, Universität Hamburg, Hamburg, Germany; Department of Integrative Virology, Leibniz-Institut für Virologie (LIV), Centre for Structural Systems Biology (CSSB), Hamburg, Germany
| | - Kish R Adoni
- Institute of Structural and Molecular Biology, Division of Biosciences, University College London, London, UK; Institute of Structural and Molecular Biology, Birkbeck College, University of London, London, United Kingdom
| | - Matthias Pfeifer
- Department of Integrative Virology, Leibniz-Institut für Virologie (LIV), Centre for Structural Systems Biology (CSSB), Hamburg, Germany; Universitätsklinikum Hamburg Eppendorf (UKE), Hamburg, Germany
| | - Birgit Märtens
- Department of Integrative Virology, Leibniz-Institut für Virologie (LIV), Centre for Structural Systems Biology (CSSB), Hamburg, Germany; Universitätsklinikum Hamburg Eppendorf (UKE), Hamburg, Germany
| | - Kay Grünewald
- Department of Integrative Virology, Leibniz-Institut für Virologie (LIV), Centre for Structural Systems Biology (CSSB), Hamburg, Germany; Department of Chemistry, Universität Hamburg, Hamburg, Germany
| | - Konstantinos Thalassinos
- Institute of Structural and Molecular Biology, Division of Biosciences, University College London, London, UK; Institute of Structural and Molecular Biology, Birkbeck College, University of London, London, United Kingdom
| | - Maya Topf
- Department of Integrative Virology, Leibniz-Institut für Virologie (LIV), Centre for Structural Systems Biology (CSSB), Hamburg, Germany; Universitätsklinikum Hamburg Eppendorf (UKE), Hamburg, Germany.
| |
Collapse
|
2
|
Durairaj J, de Ridder D, van Dijk AD. Beyond sequence: Structure-based machine learning. Comput Struct Biotechnol J 2022; 21:630-643. [PMID: 36659927 PMCID: PMC9826903 DOI: 10.1016/j.csbj.2022.12.039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Revised: 12/21/2022] [Accepted: 12/21/2022] [Indexed: 12/31/2022] Open
Abstract
Recent breakthroughs in protein structure prediction demarcate the start of a new era in structural bioinformatics. Combined with various advances in experimental structure determination and the uninterrupted pace at which new structures are published, this promises an age in which protein structure information is as prevalent and ubiquitous as sequence. Machine learning in protein bioinformatics has been dominated by sequence-based methods, but this is now changing to make use of the deluge of rich structural information as input. Machine learning methods making use of structures are scattered across literature and cover a number of different applications and scopes; while some try to address questions and tasks within a single protein family, others aim to capture characteristics across all available proteins. In this review, we look at the variety of structure-based machine learning approaches, how structures can be used as input, and typical applications of these approaches in protein biology. We also discuss current challenges and opportunities in this all-important and increasingly popular field.
Collapse
Affiliation(s)
- Janani Durairaj
- Biozentrum, University of Basel, Basel, Switzerland
- Bioinformatics Group, Department of Plant Sciences, Wageningen University and Research, Wageningen, the Netherlands
| | - Dick de Ridder
- Bioinformatics Group, Department of Plant Sciences, Wageningen University and Research, Wageningen, the Netherlands
| | - Aalt D.J. van Dijk
- Bioinformatics Group, Department of Plant Sciences, Wageningen University and Research, Wageningen, the Netherlands
| |
Collapse
|
3
|
Jernigan RL, Sankar K, Jia K, Faraggi E, Kloczkowski A. Computational Ways to Enhance Protein Inhibitor Design. Front Mol Biosci 2021; 7:607323. [PMID: 33614705 PMCID: PMC7886686 DOI: 10.3389/fmolb.2020.607323] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2020] [Accepted: 12/08/2020] [Indexed: 11/22/2022] Open
Abstract
Two new computational approaches are described to aid in the design of new peptide-based drugs by evaluating ensembles of protein structures from their dynamics and through the assessing of structures using empirical contact potential. These approaches build on the concept that conformational variability can aid in the binding process and, for disordered proteins, can even facilitate the binding of more diverse ligands. This latter consideration indicates that such a design process should be less restrictive so that multiple inhibitors might be effective. The example chosen here focuses on proteins/peptides that bind to hemagglutinin (HA) to block the large-scale conformational change for activation. Variability in the conformations is considered from sets of experimental structures, or as an alternative, from their simple computed dynamics; the set of designe peptides/small proteins from the David Baker lab designed to bind to hemagglutinin, is the large set considered and is assessed with the new empirical contact potentials.
Collapse
Affiliation(s)
- Robert L. Jernigan
- Roy J. Carver Department of Biochemistry, Biophysics and Molecular Biology, Iowa State University, Ames, IA, United States
| | - Kannan Sankar
- Roy J. Carver Department of Biochemistry, Biophysics and Molecular Biology, Iowa State University, Ames, IA, United States
| | - Kejue Jia
- Roy J. Carver Department of Biochemistry, Biophysics and Molecular Biology, Iowa State University, Ames, IA, United States
| | - Eshel Faraggi
- Research and Information Systems, LLC, Indianapolis, IN, United States
- Department of Physics, Indiana University Purdue University Indianapolis, Indianapolis, IN, United States
| | - Andrzej Kloczkowski
- Battelle Center for Mathematical Medicine, Nationwide Children's Hospital, Columbus, OH, United States
- Department of Pediatrics, The Ohio State University, Columbus, OH, United States
| |
Collapse
|
4
|
Modeling of Protein Structural Flexibility and Large-Scale Dynamics: Coarse-Grained Simulations and Elastic Network Models. Int J Mol Sci 2018; 19:ijms19113496. [PMID: 30404229 PMCID: PMC6274762 DOI: 10.3390/ijms19113496] [Citation(s) in RCA: 42] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2018] [Revised: 10/29/2018] [Accepted: 10/31/2018] [Indexed: 12/13/2022] Open
Abstract
Fluctuations of protein three-dimensional structures and large-scale conformational transitions are crucial for the biological function of proteins and their complexes. Experimental studies of such phenomena remain very challenging and therefore molecular modeling can be a good alternative or a valuable supporting tool for the investigation of large molecular systems and long-time events. In this minireview, we present two alternative approaches to the coarse-grained (CG) modeling of dynamic properties of protein systems. We discuss two CG representations of polypeptide chains used for Monte Carlo dynamics simulations of protein local dynamics and conformational transitions, and highly simplified structure-based elastic network models of protein flexibility. In contrast to classical all-atom molecular dynamics, the modeling strategies discussed here allow the quite accurate modeling of much larger systems and longer-time dynamic phenomena. We briefly describe the main features of these models and outline some of their applications, including modeling of near-native structure fluctuations, sampling of large regions of the protein conformational space, or possible support for the structure prediction of large proteins and their complexes.
Collapse
|
5
|
Álvarez Ó, Fernández-Martínez JL, Fernández-Brillet C, Cernea A, Fernández-Muñiz Z, Kloczkowski A. Principal component analysis in protein tertiary structure prediction. J Bioinform Comput Biol 2018; 16:1850005. [PMID: 29566640 DOI: 10.1142/s0219720018500051] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
We discuss applicability of principal component analysis (PCA) for protein tertiary structure prediction from amino acid sequence. The algorithm presented in this paper belongs to the category of protein refinement models and involves establishing a low-dimensional space where the sampling (and optimization) is carried out via particle swarm optimizer (PSO). The reduced space is found via PCA performed for a set of low-energy protein models previously found using different optimization techniques. A high frequency term is added into this expansion by projecting the best decoy into the PCA basis set and calculating the residual model. This term is aimed at providing high frequency details in the energy optimization. The goal of this research is to analyze how the dimensionality reduction affects the prediction capability of the PSO procedure. For that purpose, different proteins from the Critical Assessment of Techniques for Protein Structure Prediction experiments were modeled. In all the cases, both the energy of the best decoy and the distance to the native structure have decreased. Our analysis also shows how the predicted backbone structure of native conformation and of alternative low energy states varies with respect to the PCA dimensionality. Generally speaking, the reconstruction can be successfully achieved with 10 principal components and the high frequency term. We also provide a computational analysis of protein energy landscape for the inverse problem of reconstructing structure from the reduced number of principal components, showing that the dimensionality reduction alleviates the ill-posed character of this high-dimensional energy optimization problem. The procedure explained in this paper is very fast and allows testing different PCA expansions. Our results show that PSO improves the energy of the best decoy used in the PCA when the adequate number of PCA terms is considered.
Collapse
Affiliation(s)
- Óscar Álvarez
- * Group of Inverse Problems, Optimization and Machine Learning, Department of Mathematics, University of Oviedo, C. Federico García Lorca, 18, 33007 Oviedo, Spain
| | - Juan Luis Fernández-Martínez
- * Group of Inverse Problems, Optimization and Machine Learning, Department of Mathematics, University of Oviedo, C. Federico García Lorca, 18, 33007 Oviedo, Spain
| | - Celia Fernández-Brillet
- * Group of Inverse Problems, Optimization and Machine Learning, Department of Mathematics, University of Oviedo, C. Federico García Lorca, 18, 33007 Oviedo, Spain
| | - Ana Cernea
- * Group of Inverse Problems, Optimization and Machine Learning, Department of Mathematics, University of Oviedo, C. Federico García Lorca, 18, 33007 Oviedo, Spain
| | - Zulima Fernández-Muñiz
- * Group of Inverse Problems, Optimization and Machine Learning, Department of Mathematics, University of Oviedo, C. Federico García Lorca, 18, 33007 Oviedo, Spain
| | - Andrzej Kloczkowski
- † Batelle Center for Mathematical Medicine, Nationwide Children's Hospital, Columbus, OH, USA.,‡ Department of Pediatrics, The Ohio State University, Columbus, OH, USA
| |
Collapse
|
6
|
Cheng Q, Joung I, Lee J. A Simple and Efficient Protein Structure Refinement Method. J Chem Theory Comput 2017; 13:5146-5162. [PMID: 28800396 DOI: 10.1021/acs.jctc.7b00470] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Improving the quality of a given protein structure can serve as the ultimate solution for accurate protein structure prediction, and seeking such a method is currently a challenge in computational structural biology. In order to promote and encourage much needed such efforts, CASP (Critical Assessment of Structure Prediction) has been providing an ideal computational experimental platform, where it was reported only recently (since CASP10) that systematic protein structure refinement is possible by carrying out extensive (approximately millisecond) MD simulations with proper restraints generated from the given structure. Using an explicit solvent model and much reduced positional and distance restraints than previously exercised, we propose a refinement protocol that combines a series of short (5 ns) MD simulations with energy minimization procedures. Testing and benchmarking on 54 CASP8-10 refinement targets and 34 CASP11 refinement targets shows quite promising results. Using only a small fraction of MD simulation steps (nanosecond versus millisecond), systematic protein structure refinement was demonstrated in this work, indicating that refinement of a given model can be achieved using a few hours of desktop computing.
Collapse
Affiliation(s)
- Qianyi Cheng
- Center for In Silico Protein Science and School of Computational Sciences, Korea Institute for Advanced Study , Seoul 02455, Korea
| | - InSuk Joung
- Center for In Silico Protein Science and School of Computational Sciences, Korea Institute for Advanced Study , Seoul 02455, Korea
| | - Jooyoung Lee
- Center for In Silico Protein Science and School of Computational Sciences, Korea Institute for Advanced Study , Seoul 02455, Korea
| |
Collapse
|
7
|
Putz I, Brock O. Elastic network model of learned maintained contacts to predict protein motion. PLoS One 2017; 12:e0183889. [PMID: 28854238 PMCID: PMC5576689 DOI: 10.1371/journal.pone.0183889] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2016] [Accepted: 08/14/2017] [Indexed: 12/21/2022] Open
Abstract
We present a novel elastic network model, lmcENM, to determine protein motion even for localized functional motions that involve substantial changes in the protein's contact topology. Existing elastic network models assume that the contact topology remains unchanged throughout the motion and are thus most appropriate to simulate highly collective function-related movements. lmcENM uses machine learning to differentiate breaking from maintained contacts. We show that lmcENM accurately captures functional transitions unexplained by the classical ENM and three reference ENM variants, while preserving the simplicity of classical ENM. We demonstrate the effectiveness of our approach on a large set of proteins covering different motion types. Our results suggest that accurately predicting a "deformation-invariant" contact topology offers a promising route to increase the general applicability of ENMs. We also find that to correctly predict this contact topology a combination of several features seems to be relevant which may vary slightly depending on the protein. Additionally, we present case studies of two biologically interesting systems, Ferric Citrate membrane transporter FecA and Arachidonate 15-Lipoxygenase.
Collapse
Affiliation(s)
- Ines Putz
- Robotics and Biology Laboratory, Department of Computer Science and Electrical Engineering, Technische Universität Berlin, Berlin, Berlin, Germany
| | - Oliver Brock
- Robotics and Biology Laboratory, Department of Computer Science and Electrical Engineering, Technische Universität Berlin, Berlin, Berlin, Germany
| |
Collapse
|
8
|
Ozgur B, Ozdemir ES, Gursoy A, Keskin O. Relation between Protein Intrinsic Normal Mode Weights and Pre-Existing Conformer Populations. J Phys Chem B 2017; 121:3686-3700. [DOI: 10.1021/acs.jpcb.6b10401] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Beytullah Ozgur
- Center for Computational Biology and Bioinformatics, ‡Chemical and Biological
Engineering, and §Computer Engineering,
College of Engineering, Koc University, 34450 Istanbul, Turkey
| | - E. Sila Ozdemir
- Center for Computational Biology and Bioinformatics, ‡Chemical and Biological
Engineering, and §Computer Engineering,
College of Engineering, Koc University, 34450 Istanbul, Turkey
| | - Attila Gursoy
- Center for Computational Biology and Bioinformatics, ‡Chemical and Biological
Engineering, and §Computer Engineering,
College of Engineering, Koc University, 34450 Istanbul, Turkey
| | - Ozlem Keskin
- Center for Computational Biology and Bioinformatics, ‡Chemical and Biological
Engineering, and §Computer Engineering,
College of Engineering, Koc University, 34450 Istanbul, Turkey
| |
Collapse
|
9
|
DeWolf T, Gordon R. Theory of Acoustic Raman Modes in Proteins. PHYSICAL REVIEW LETTERS 2016; 117:138101. [PMID: 27715080 DOI: 10.1103/physrevlett.117.138101] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/07/2016] [Indexed: 05/23/2023]
Abstract
We present a theoretical analysis that associates the resonances of extraordinary acoustic Raman (EAR) spectroscopy [Wheaton et al., Nat. Photonics 9, 68 (2015)] with the collective modes of proteins. The theory uses the anisotropic elastic network model to find the protein acoustic modes, and calculates Raman intensity by treating the protein as a polarizable ellipsoid. Reasonable agreement is found between EAR spectra and our theory. Protein acoustic modes have been extensively studied theoretically to assess the role they play in protein function; this result suggests EAR spectroscopy as a new experimental tool for studies of protein acoustic modes.
Collapse
Affiliation(s)
- Timothy DeWolf
- Department of Electrical and Computer Engineering, University of Victoria, Victoria British Columbia V8P 5C2, Canada
| | - Reuven Gordon
- Department of Electrical and Computer Engineering, University of Victoria, Victoria British Columbia V8P 5C2, Canada
| |
Collapse
|
10
|
Measuring and modeling diffuse scattering in protein X-ray crystallography. Proc Natl Acad Sci U S A 2016; 113:4069-74. [PMID: 27035972 DOI: 10.1073/pnas.1524048113] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
X-ray diffraction has the potential to provide rich information about the structural dynamics of macromolecules. To realize this potential, both Bragg scattering, which is currently used to derive macromolecular structures, and diffuse scattering, which reports on correlations in charge density variations, must be measured. Until now, measurement of diffuse scattering from protein crystals has been scarce because of the extra effort of collecting diffuse data. Here, we present 3D measurements of diffuse intensity collected from crystals of the enzymes cyclophilin A and trypsin. The measurements were obtained from the same X-ray diffraction images as the Bragg data, using best practices for standard data collection. To model the underlying dynamics in a practical way that could be used during structure refinement, we tested translation-libration-screw (TLS), liquid-like motions (LLM), and coarse-grained normal-modes (NM) models of protein motions. The LLM model provides a global picture of motions and was refined against the diffuse data, whereas the TLS and NM models provide more detailed and distinct descriptions of atom displacements, and only used information from the Bragg data. Whereas different TLS groupings yielded similar Bragg intensities, they yielded different diffuse intensities, none of which agreed well with the data. In contrast, both the LLM and NM models agreed substantially with the diffuse data. These results demonstrate a realistic path to increase the number of diffuse datasets available to the wider biosciences community and indicate that dynamics-inspired NM structural models can simultaneously agree with both Bragg and diffuse scattering.
Collapse
|
11
|
Özer N, Özen A, Schiffer CA, Haliloğlu T. Drug-resistant HIV-1 protease regains functional dynamics through cleavage site coevolution. Evol Appl 2015; 8:185-98. [PMID: 25685193 PMCID: PMC4319865 DOI: 10.1111/eva.12241] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2014] [Accepted: 12/08/2014] [Indexed: 12/20/2022] Open
Abstract
Drug resistance is caused by mutations that change the balance of recognition favoring substrate cleavage over inhibitor binding. Here, a structural dynamics perspective of the regained wild-type functioning in mutant HIV-1 proteases with coevolution of the natural substrates is provided. The collective dynamics of mutant structures of the protease bound to p1-p6 and NC-p1 substrates are assessed using the Anisotropic Network Model (ANM). The drug-induced protease mutations perturb the mechanistically crucial hinge axes that involve key sites for substrate binding and dimerization and mainly coordinate the intrinsic dynamics. Yet with substrate coevolution, while the wild-type dynamic behavior is restored in both p1-p6 ((LP) (1'F)p1-p6D30N/N88D) and NC-p1 ((AP) (2) (V)NC-p1V82A) bound proteases, the dynamic behavior of the NC-p1 bound protease variants (NC-p1V82A and (AP) (2) (V)NC-p1V82A) rather resemble those of the proteases bound to the other substrates, which is consistent with experimental studies. The orientational variations of residue fluctuations along the hinge axes in mutant structures justify the existence of coevolution in p1-p6 and NC-p1 substrates, that is, the dynamic behavior of hinge residues should contribute to the interdependent nature of substrate recognition. Overall, this study aids in the understanding of the structural dynamics basis of drug resistance and evolutionary optimization in the HIV-1 protease system.
Collapse
Affiliation(s)
- Nevra Özer
- Polymer Research Center and Chemical Engineering Department, Bogazici UniversityBebek, Istanbul, Turkey
| | - Ayşegül Özen
- Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical SchoolWorcester, MA, USA
| | - Celia A Schiffer
- Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical SchoolWorcester, MA, USA
| | - Türkan Haliloğlu
- Polymer Research Center and Chemical Engineering Department, Bogazici UniversityBebek, Istanbul, Turkey
| |
Collapse
|
12
|
Bastolla U. Computing protein dynamics from protein structure with elastic network models. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2014. [DOI: 10.1002/wcms.1186] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Ugo Bastolla
- Centro de Biologa Molecular Severo Ochoa (CSIC‐UAM)Universidad Autónoma de MadridMadridSpain
| |
Collapse
|
13
|
Nugent T, Cozzetto D, Jones DT. Evaluation of predictions in the CASP10 model refinement category. Proteins 2014; 82 Suppl 2:98-111. [PMID: 23900810 PMCID: PMC4282348 DOI: 10.1002/prot.24377] [Citation(s) in RCA: 88] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2013] [Revised: 06/19/2013] [Accepted: 06/28/2013] [Indexed: 12/24/2022]
Abstract
Here we report on the assessment results of the third experiment to evaluate the state of the art in protein model refinement, where participants were invited to improve the accuracy of initial protein models for 27 targets. Using an array of complementary evaluation measures, we find that five groups performed better than the naïve (null) method—a marked improvement over CASP9, although only three were significantly better. The leading groups also demonstrated the ability to consistently improve both backbone and side chain positioning, while other groups reliably enhanced other aspects of protein physicality. The top-ranked group succeeded in improving the backbone conformation in almost 90% of targets, suggesting a strategy that for the first time in CASP refinement is successful in a clear majority of cases. A number of issues remain unsolved: the majority of groups still fail to improve the quality of the starting models; even successful groups are only able to make modest improvements; and no prediction is more similar to the native structure than to the starting model. Successful refinement attempts also often go unrecognized, as suggested by the relatively larger improvements when predictions not submitted as model 1 are also considered. Proteins 2014; 82(Suppl 2):98–111.
Collapse
Affiliation(s)
- Timothy Nugent
- Department of Computer Science Bioinformatics Group, University College London, London, WC1E 6BT, United Kingdom
| | | | | |
Collapse
|