51
|
Huang SY, Zou X. Statistical mechanics-based method to extract atomic distance-dependent potentials from protein structures. Proteins 2011; 79:2648-61. [PMID: 21732421 PMCID: PMC11108592 DOI: 10.1002/prot.23086] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2011] [Revised: 04/21/2011] [Accepted: 05/09/2011] [Indexed: 12/25/2022]
Abstract
In this study, we have developed a statistical mechanics-based iterative method to extract statistical atomic interaction potentials from known, nonredundant protein structures. Our method circumvents the long-standing reference state problem in deriving traditional knowledge-based scoring functions, by using rapid iterations through a physical, global convergence function. The rapid convergence of this physics-based method, unlike other parameter optimization methods, warrants the feasibility of deriving distance-dependent, all-atom statistical potentials to keep the scoring accuracy. The derived potentials, referred to as ITScore/Pro, have been validated using three diverse benchmarks: the high-resolution decoy set, the AMBER benchmark decoy set, and the CASP8 decoy set. Significant improvement in performance has been achieved. Finally, comparisons between the potentials of our model and potentials of a knowledge-based scoring function with a randomized reference state have revealed the reason for the better performance of our scoring function, which could provide useful insight into the development of other physical scoring functions. The potentials developed in this study are generally applicable for structural selection in protein structure prediction.
Collapse
Affiliation(s)
- Sheng-You Huang
- Department of Physics and Astronomy, Department of Biochemistry, Dalton Cardiovascular Research Center, and Informatics Institute, University of Missouri, Columbia, MO 65211
| | - Xiaoqin Zou
- Department of Physics and Astronomy, Department of Biochemistry, Dalton Cardiovascular Research Center, and Informatics Institute, University of Missouri, Columbia, MO 65211
| |
Collapse
|
52
|
MacCallum JL, Pérez A, Schnieders MJ, Hua L, Jacobson MP, Dill KA. Assessment of protein structure refinement in CASP9. Proteins 2011; 79 Suppl 10:74-90. [PMID: 22069034 DOI: 10.1002/prot.23131] [Citation(s) in RCA: 78] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2011] [Revised: 06/15/2011] [Accepted: 07/03/2011] [Indexed: 11/06/2022]
Abstract
We assess performance in the structure refinement category in CASP9. Two years after CASP8, the performance of the best groups has not improved. There are few groups that improve any of our assessment scores with statistical significance. Some predictors, however, are able to consistently improve the physicality of the models. Although we cannot identify any clear bottleneck in improving refinement, several points arise: (1) The refinement portion of CASP has too few targets to make many statistically meaningful conclusions. (2) Predictors are usually very conservative, limiting the possibility of large improvements in models. (3) No group is actually able to correctly rank their five submissions-indicating that potentially better models may be discarded. (4) Different sampling strategies work better for different refinement problems; there is no single strategy that works on all targets. In general, conservative strategies do better, while the greatest improvements come from more adventurous sampling-at the cost of consistency. Comparison with experimental data reveals aspects not captured by comparison to a single structure. In particular, we show that improvement in backbone geometry does not always mean better agreement with experimental data. Finally, we demonstrate that even given the current challenges facing refinement, the refined models are useful for solving the crystallographic phase problem through molecular replacement. Proteins 2011;. © 2011 Wiley-Liss, Inc.
Collapse
Affiliation(s)
- Justin L MacCallum
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York, USA.
| | | | | | | | | | | |
Collapse
|
53
|
Olson MA, Chaudhury S, Lee MS. Comparison between self-guided Langevin dynamics and molecular dynamics simulations for structure refinement of protein loop conformations. J Comput Chem 2011; 32:3014-22. [PMID: 21793008 DOI: 10.1002/jcc.21883] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2011] [Accepted: 06/19/2011] [Indexed: 11/09/2022]
Abstract
This article presents a comparative analysis of two replica-exchange simulation methods for the structure refinement of protein loop conformations, starting from low-resolution predictions. The methods are self-guided Langevin dynamics (SGLD) and molecular dynamics (MD) with a Nosé-Hoover thermostat. We investigated a small dataset of 8- and 12-residue loops, with the shorter loops placed initially from a coarse-grained lattice model and the longer loops from an enumeration assembly method (the Loopy program). The CHARMM22 + CMAP force field with a generalized Born implicit solvent model (molecular-surface parameterized GBSW2) was used to explore conformational space. We also assessed two empirical scoring methods to detect nativelike conformations from decoys: the all-atom distance-scaled ideal-gas reference state (DFIRE-AA) statistical potential and the Rosetta energy function. Among the eight-residue loop targets, SGLD out performed MD in all cases, with a median of 0.48 Å reduction in global root-mean-square deviation (RMSD) of the loop backbone coordinates from the native structure. Among the more challenging 12-residue loop targets, SGLD improved the prediction accuracy over MD by a median of 1.31 Å, representing a substantial improvement. The overall median RMSD for SGLD simulations of 12-residue loops was 0.91 Å, yielding refinement of a median 2.70 Å from initial loop placement. Results from DFIRE-AA and the Rosetta model applied to rescoring conformations failed to improve the overall detection calculated from the CHARMM force field. We illustrate the advantage of SGLD over the MD simulation model by presenting potential-energy landscapes for several loop predictions. Our results demonstrate that SGLD significantly outperforms traditional MD in the generation and populating of nativelike loop conformations and that the CHARMM force field performs comparably to other empirical force fields in identifying these conformations from the resulting ensembles.
Collapse
Affiliation(s)
- Mark A Olson
- Department of Cell Biology and Biochemistry, US Army Medical Research Institute of Infectious Diseases, Fredrick, Maryland 21702, USA.
| | | | | |
Collapse
|
54
|
Structural model of the TRPP2/PKD1 C-terminal coiled-coil complex produced by a combined computational and experimental approach. Proc Natl Acad Sci U S A 2011; 108:10133-8. [PMID: 21642537 DOI: 10.1073/pnas.1017669108] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
Autosomal dominant polycystic kidney disease (ADPKD) is caused by mutations in TRPP2 and PKD1, which form an ion channel/receptor complex containing three TRPP2 and one PKD1. A TRPP2 C-terminal coiled-coil trimer, critical for the assembly of this complex, associates with a single PKD1 C-terminal coiled-coil. Many ADPKD pathogenic mutations result in the abolishment of the TRPP2/PKD1 coiled-coil complex. To gain molecular and functional insights into this heterotetrameric complex, we computationally constructed a structural model by using a two-step docking strategy, based on a known crystal structure of the TRPP2 coiled-coil trimer. The model shows that this tetrameric complex has a novel di-trimer configuration: An upstream trimer made of three TRPP2 helices and a downstream trimer made of two TRPP2 helices and one PKD1 helix. Mutagenesis and biochemical analysis identified critical TRPP2/PKD1 interface contacts essential for the heteromeric coiled-coil complex. Mutation of these interface positions in the full-length proteins showed that these interactions were critical for the assembly of the full-length complex in cells. Our results provide a means to specifically weaken the TRPP2 and PKD1 association, thus facilitating future in vitro and in vivo studies on the functional importance of this association.
Collapse
|
55
|
di Luccio E, Koehl P. A quality metric for homology modeling: the H-factor. BMC Bioinformatics 2011; 12:48. [PMID: 21291572 PMCID: PMC3213331 DOI: 10.1186/1471-2105-12-48] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2010] [Accepted: 02/04/2011] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The analysis of protein structures provides fundamental insight into most biochemical functions and consequently into the cause and possible treatment of diseases. As the structures of most known proteins cannot be solved experimentally for technical or sometimes simply for time constraints, in silico protein structure prediction is expected to step in and generate a more complete picture of the protein structure universe. Molecular modeling of protein structures is a fast growing field and tremendous works have been done since the publication of the very first model. The growth of modeling techniques and more specifically of those that rely on the existing experimental knowledge of protein structures is intimately linked to the developments of high resolution, experimental techniques such as NMR, X-ray crystallography and electron microscopy. This strong connection between experimental and in silico methods is however not devoid of criticisms and concerns among modelers as well as among experimentalists. RESULTS In this paper, we focus on homology-modeling and more specifically, we review how it is perceived by the structural biology community and what can be done to impress on the experimentalists that it can be a valuable resource to them. We review the common practices and provide a set of guidelines for building better models. For that purpose, we introduce the H-factor, a new indicator for assessing the quality of homology models, mimicking the R-factor in X-ray crystallography. The methods for computing the H-factor is fully described and validated on a series of test cases. CONCLUSIONS We have developed a web service for computing the H-factor for models of a protein structure. This service is freely accessible at http://koehllab.genomecenter.ucdavis.edu/toolkit/h-factor.
Collapse
Affiliation(s)
- Eric di Luccio
- Computer Science Department, Room 4337, Genome Center, GBSF University of California Davis 451 East Health Sciences Drive Davis, CA 95616, USA.
| | | |
Collapse
|
56
|
Nurisso A, Daina A, Walker RC. A practical introduction to molecular dynamics simulations: applications to homology modeling. Methods Mol Biol 2011; 857:137-73. [PMID: 22323220 DOI: 10.1007/978-1-61779-588-6_6] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
In this chapter, practical concepts and guidelines are provided for the use of molecular dynamics (MD) simulation for the refinement of homology models. First, an overview of the history and a theoretical background of MD are given. Literature examples of successful MD refinement of homology models are reviewed before selecting the Cytochrome P450 2J2 structure as a case study. We describe the setup of a system for classical MD simulation in a detailed stepwise fashion and how to perform the refinement described in the publication of Li et al. (Proteins 71:938-949, 2008). This tutorial is based on version 11 of the AMBER Molecular Dynamics software package (http://ambermd.org/). However, the approach discussed is equally applicable to any condensed phase MD simulation environment.
Collapse
Affiliation(s)
- Alessandra Nurisso
- School of Pharmaceutical Sciences, University of Geneva, University of Lausanne, Geneva, Switzerland
| | | | | |
Collapse
|
57
|
Application of biasing-potential replica-exchange simulations for loop modeling and refinement of proteins in explicit solvent. Proteins 2010; 78:2809-19. [DOI: 10.1002/prot.22796] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
|
58
|
Bjelkmar P, Larsson P, Cuendet MA, Hess B, Lindahl E. Implementation of the CHARMM Force Field in GROMACS: Analysis of Protein Stability Effects from Correction Maps, Virtual Interaction Sites, and Water Models. J Chem Theory Comput 2010; 6:459-66. [PMID: 26617301 DOI: 10.1021/ct900549r] [Citation(s) in RCA: 732] [Impact Index Per Article: 52.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
CHARMM27 is a widespread and popular force field for biomolecular simulation, and several recent algorithms such as implicit solvent models have been developed specifically for it. We have here implemented the CHARMM force field and all necessary extended functional forms in the GROMACS molecular simulation package, to make CHARMM-specific features available and to test them in combination with techniques for extended time steps, to make all major force fields available for comparison studies in GROMACS, and to test various solvent model optimizations, in particular the effect of Lennard-Jones interactions on hydrogens. The implementation has full support both for CHARMM-specific features such as multiple potentials over the same dihedral angle and the grid-based energy correction map on the ϕ, ψ protein backbone dihedrals, as well as all GROMACS features such as virtual hydrogen interaction sites that enable 5 fs time steps. The medium-to-long time effects of both the correction maps and virtual sites have been tested by performing a series of 100 ns simulations using different models for water representation, including comparisons between CHARMM and traditional TIP3P. Including the correction maps improves sampling of near native-state conformations in our systems, and to some extent it is even able to refine distorted protein conformations. Finally, we show that this accuracy is largely maintained with a new implicit solvent implementation that works with virtual interaction sites, which enables performance in excess of 250 ns/day for a 900-atom protein on a quad-core desktop computer.
Collapse
Affiliation(s)
- Pär Bjelkmar
- Center for Biomembrane Research, Department of Biochemistry & Biophysics, Stockholm University, SE-106 91 Stockholm, Sweden, and Molecular Modeling Group, Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
| | - Per Larsson
- Center for Biomembrane Research, Department of Biochemistry & Biophysics, Stockholm University, SE-106 91 Stockholm, Sweden, and Molecular Modeling Group, Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
| | - Michel A Cuendet
- Center for Biomembrane Research, Department of Biochemistry & Biophysics, Stockholm University, SE-106 91 Stockholm, Sweden, and Molecular Modeling Group, Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
| | - Berk Hess
- Center for Biomembrane Research, Department of Biochemistry & Biophysics, Stockholm University, SE-106 91 Stockholm, Sweden, and Molecular Modeling Group, Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
| | - Erik Lindahl
- Center for Biomembrane Research, Department of Biochemistry & Biophysics, Stockholm University, SE-106 91 Stockholm, Sweden, and Molecular Modeling Group, Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
| |
Collapse
|
59
|
MacCallum JL, Hua L, Schnieders MJ, Pande VS, Jacobson MP, Dill KA. Assessment of the protein-structure refinement category in CASP8. Proteins 2010; 77 Suppl 9:66-80. [PMID: 19714776 DOI: 10.1002/prot.22538] [Citation(s) in RCA: 64] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
Here, we summarize the assessment of protein structure refinement in CASP8. Twenty-four groups refined a total of 12 target proteins. Averaging over all groups and all proteins, there was no net improvement over the original starting models. However, there are now some individual research groups who consistently do improve protein structures relative to a starting starting model. We compare various measures of quality assessment, including (i) standard backbone-based methods, (ii) new methods from the Richardson group, and (iii) ensemble-based methods for comparing experimental structures, such as NMR NOE violations and the suitability of the predicted models to serve as templates for molecular replacement. On the whole, there is a general correlation among various measures. However, there are interesting differences. Sometimes a structure that is in better agreement with the experimental data is judged to be slightly worse by GDT-TS. This suggests that for comparing protein structures that are already quite close to the native, it may be preferable to use ensemble-based experimentally derived measures of quality, in addition to single-structure-based methods such as GDT-TS.
Collapse
Affiliation(s)
- Justin L MacCallum
- Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, California 94158, USA
| | | | | | | | | | | |
Collapse
|
60
|
Loksha IV, Maiolo JR, Hong CW, Ng A, Snow CD. SHARPEN-Systematic Hierarchical Algorithms for Rotamers and Proteins on an Extended Network. J Comput Chem 2009; 30:999-1005. [DOI: 10.1002/jcc.21204] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
|
61
|
Zhang Y. Protein structure prediction: when is it useful? Curr Opin Struct Biol 2009; 19:145-55. [PMID: 19327982 PMCID: PMC2673339 DOI: 10.1016/j.sbi.2009.02.005] [Citation(s) in RCA: 191] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2008] [Revised: 02/18/2009] [Accepted: 02/19/2009] [Indexed: 10/21/2022]
Abstract
Computationally predicted three-dimensional structure of protein molecules has demonstrated the usefulness in many areas of biomedicine, ranging from approximate family assignments to precise drug screening. For nearly 40 years, however, the accuracy of the predicted models has been dictated by the availability of close structural templates. Progress has recently been achieved in refining low-resolution models closer to the native ones; this has been made possible by combining knowledge-based information from multiple sources of structural templates as well as by improving the energy funnel of physics-based force fields. Unfortunately, there has been no essential progress in the development of techniques for detecting remotely homologous templates and for predicting novel protein structures.
Collapse
Affiliation(s)
- Yang Zhang
- Center for Bioinformatics and Department of Molecular Biosciences, University of Kansas, 2030 Becker Drive, Lawrence, KS 66047, USA.
| |
Collapse
|
62
|
Abstract
Current methods for predicting protein structure depend on two interrelated components: (i) an energy function that should have a low value near the correct structure and (ii) a method for searching through different conformations of the polypeptide chain. Identification of the most efficient search methods is essential if we are to be able to apply such methods broadly and with confidence. In addition, efficient search methods provide a rigorous test of existing energy functions, which are generally knowledge-based and contain different terms added together with arbitrary weights. Here, we test different search methods with one of the most accurate and predictive energy functions, namely Rosetta the knowledge-based force-field from Baker's group [Simons K, Kooperberg C, Huang E, Baker D (1997) J Mol Biol 268:209-225]. We use an implementation of a generalized ensemble search method to scale relevant parts of the energy function. This method, known as Hamiltonian Replica Exchange Monte Carlo, outperforms the original Monte Carlo Simulated Annealing used in the Rosetta package in terms of sampling low-energy states. It also outperforms another widely used generalized ensemble search method known as Temperature Replica Exchange Monte Carlo. Our results reveal clear deficiencies in the low-resolution Rosetta energy function in that the lowest energy structures are not necessarily the most native-like. By using a set of nonnative low-energy structures found by our extensive sampling, we discovered that the long-range and short-range backbone hydrogen-bonding energy terms of the Rosetta energy discriminate between the nonnative and native-like structures significantly better than the low-resolution score used in Rosetta.
Collapse
|