1
|
Kunze T, Dreßler C, Lauer C, Paul W, Sebastiani D. Reverse Mapping of Coarse Grained Polyglutamine Conformations from PRIME20 Sampling. Chemphyschem 2024; 25:e202300521. [PMID: 38314956 DOI: 10.1002/cphc.202300521] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2023] [Revised: 02/01/2024] [Accepted: 02/02/2024] [Indexed: 02/07/2024]
Abstract
An inverse coarse-graining protocol is presented for generating and validating atomistic structures of large (bio-) molecules from conformations obtained via a coarse-grained sampling method. Specifically, the protocol is implemented and tested based on the (coarse-grained) PRIME20 protein model (P20/SAMC), and the resulting all-atom conformations are simulated using conventional biomolecular force fields. The phase space sampling at the coarse-grained level is performed with a stochastical approximation Monte Carlo approach. The method is applied to a series of polypeptides, specifically dimers of polyglutamine with varying chain length in aqueous solution. The majority (>70 %) of the conformations obtained from the coarse-grained peptide model can successfully be mapped back to atomistic structures that remain conformationally stable during 10 ns of molecular dynamics simulations. This work can be seen as the first step towards the overarching goal of improving our understanding of protein aggregation phenomena through simulation methods.
Collapse
Affiliation(s)
- Thomas Kunze
- Faculty of Natural Sciences II, Martin-Luther University Halle-Wittenberg, Von-Danckelmann-Platz 4, 06120, Halle, Germany
| | - Christian Dreßler
- Institut für Physik, Ilmenau University of Technology, Weimarer Straße 32, 98693, Ilmenau, Germany
| | - Christian Lauer
- Faculty of Natural Sciences II, Martin-Luther University Halle-Wittenberg, Von-Danckelmann-Platz 4, 06120, Halle, Germany
| | - Wolfgang Paul
- Faculty of Natural Sciences II, Martin-Luther University Halle-Wittenberg, Von-Danckelmann-Platz 4, 06120, Halle, Germany
| | - Daniel Sebastiani
- Faculty of Natural Sciences II, Martin-Luther University Halle-Wittenberg, Von-Danckelmann-Platz 4, 06120, Halle, Germany
| |
Collapse
|
2
|
Hassan M, Coutsias EA. Kinematic Reconstruction of Cyclic Peptides and Protein Backbones from Partial Data. J Chem Inf Model 2021; 61:4975-5000. [PMID: 34570494 PMCID: PMC10129052 DOI: 10.1021/acs.jcim.1c00453] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
We present an algorithm, QBKR (Quaternary Backbone Kinematic Reconstruction), a fast analytical method for an all-atom backbone reconstruction of proteins and linear or cyclic peptide chains from Cα coordinate traces. Unlike previous analytical methods for deriving all-atom representations from coarse-grained models that rely on canonical geometry with planar peptides in the trans conformation, our de novo kinematic model incorporates noncanonical, cis-trans, geometry naturally. Perturbations to this geometry can be effected with ease in our formulation, for example, to account for a continuous change from cis to trans geometry. A simple optimization of a spring-based objective function is employed for Cα-Cα distance variations that extend beyond the cis-trans limit. The kinematic construction produces a linked chain of peptide units, Cα-C-N-Cα, hinged at the Cα atoms spanning all possible planar and nonplanar peptide conformations. We have combined our method with a ring closure algorithm for the case of ring peptides and missing loops in a protein structure. Here, the reconstruction proceeding from both the N and C termini of the protein backbone (or in both directions from a starting position for rings) requires freedom in the position of one Cα atom (a capstone) to achieve a successful loop or ring closure. A salient feature of our reconstruction method is the ability to enrich conformational ensembles to produce alternative feasible conformations in which H-bond forming C-O or N-H pairs in the backbone can reverse orientations, thus addressing a well-known shortcoming in Cα-based RMSD structure comparison, wherein very close structures may lead to significantly different overall H-bond behavior. We apply the fixed Cα-based design to the reverse reconstruction from noisy Cryo-EM data, a posteriori to the optimization. Our method can be applied to speed up the process of an all-atom description from voluminous experimental data or subpar electron density maps.
Collapse
Affiliation(s)
- Mosavverul Hassan
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York 11794, United States
| | - Evangelos A Coutsias
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York 11794, United States.,Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York 11794-5252, United States
| |
Collapse
|
3
|
Olson MA. Disorder-Order Transitions in Conformational Selection of a Peptide by Ebola Virus Nucleoprotein. ACS OMEGA 2020; 5:5691-5697. [PMID: 32226846 PMCID: PMC7097898 DOI: 10.1021/acsomega.9b03581] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/24/2019] [Accepted: 02/21/2020] [Indexed: 06/10/2023]
Abstract
This study presents parallel-tempering lattice Monte Carlo simulations based on the side-chain-only (SICHO) model for calculating the conformational landscape of a 28-residue intrinsically disordered peptide extracted from the Ebola virus protein VP35. The central issue is the applicability of the SICHO potential energy function and in general coarse-grained (CG) representations of intermediate resolution for modeling large-scale conformational heterogeneity that includes both folded and unstructured peptide states. Crystallographic data shows that the peptide folds in a 410-helix-turn-310-helix topology upon complex formation with the Ebola virus nucleoprotein, whereas in isolation, the peptide transitions to a disordered conformational ensemble as observed in circular dichroism experiments. The simulation reveals a potential of mean force that displays conformational diversity along the helix-forming reaction coordinate consistent with disorder-order transitions, yet unexpectedly the bound topology is poorly sampled, and a population shift to an unstructured state incurs a significant free-energy penalty. Applying an elastic network interpolation model suggests a hybrid binding mechanism through conformational selection of the 410-helix followed by an induced fit of the 310-helix. A comparison of the CG model with previously reported all-atom CHARMM-based simulations highlights a lattice-based approach that is computationally fast and with the correct parameterization yields good resolution to modeling conformational plasticity.
Collapse
|
4
|
Glagolev MK, Vasilevskaya VV. Coarse-grained simulation of molecular ordering in polylactic blends under uniaxial strain. POLYMER 2020. [DOI: 10.1016/j.polymer.2020.122232] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
|
5
|
Li M, Teng B, Lu W, Zhang JZ. Atomic-level reconstruction of biomolecules by a rigid-fragment- and local-frame-based (RF-LF) strategy. J Mol Model 2020; 26:31. [PMID: 31965325 DOI: 10.1007/s00894-020-4298-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2019] [Accepted: 01/14/2020] [Indexed: 11/29/2022]
Abstract
Coarse-grained (CG) model has been a powerful tool in bridging the gap between theoretical studies and experimental phenomena in biological computing field. The reconstruction from a CG model to an atomic-detail structure is especially important in CG studies of biological systems. In this work, a rigid-fragment- and local-frame-based (RF-LF) backmapping method was proposed to achieve reverse mapping from CG models to atomic-level structures. The initial atomic-level structures were further refined to yield the final backmapping ones. With the popular Martini force field, the performance of the RF-LF method was extensively examined in the CG → AA (CG to AA) backmapping of protein/DNA/RNA systems. Besides, the RF-LF method was also extended to the backmapping of the TMFF model. Numerical results illustrate that the RF-LF backmapping method is generic and parameter-free and can provide a promising way to tackle atomic-level studies in CG models.
Collapse
Affiliation(s)
- Min Li
- College of Physics, Qingdao University, Qingdao, 266071, Shandong, People's Republic of China.
| | - Bing Teng
- College of Physics, Qingdao University, Qingdao, 266071, Shandong, People's Republic of China
| | - WenCai Lu
- College of Physics, Qingdao University, Qingdao, 266071, Shandong, People's Republic of China
| | - John ZengHui Zhang
- School of Chemistry and Molecular Engineering, East China Normal University, Shanghai, 200062, People's Republic of China.
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai, 200062, People's Republic of China.
| |
Collapse
|
6
|
Badaczewska-Dawid AE, Kolinski A, Kmiecik S. Computational reconstruction of atomistic protein structures from coarse-grained models. Comput Struct Biotechnol J 2019; 18:162-176. [PMID: 31969975 PMCID: PMC6961067 DOI: 10.1016/j.csbj.2019.12.007] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2019] [Accepted: 12/10/2019] [Indexed: 01/02/2023] Open
Abstract
Three-dimensional protein structures, whether determined experimentally or theoretically, are often too low resolution. In this mini-review, we outline the computational methods for protein structure reconstruction from incomplete coarse-grained to all atomistic models. Typical reconstruction schemes can be divided into four major steps. Usually, the first step is reconstruction of the protein backbone chain starting from the C-alpha trace. This is followed by side-chains rebuilding based on protein backbone geometry. Subsequently, hydrogen atoms can be reconstructed. Finally, the resulting all-atom models may require structure optimization. Many methods are available to perform each of these tasks. We discuss the available tools and their potential applications in integrative modeling pipelines that can transfer coarse-grained information from computational predictions, or experiment, to all atomistic structures.
Collapse
Affiliation(s)
| | | | - Sebastian Kmiecik
- Faculty of Chemistry, Biological and Chemical Research Center, University of Warsaw, Pasteura 1, 02-093 Warsaw, Poland
| |
Collapse
|
7
|
Kuo AT, Miyazaki Y, Jang C, Miyajima T, Urata S, Nielsen SO, Okazaki S, Shinoda W. Large-scale molecular dynamics simulation of perfluorosulfonic acid membranes: Remapping coarse-grained to all-atomistic simulations. POLYMER 2019. [DOI: 10.1016/j.polymer.2019.121766] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
8
|
Peng J, Yuan C, Ma R, Zhang Z. Backmapping from Multiresolution Coarse-Grained Models to Atomic Structures of Large Biomolecules by Restrained Molecular Dynamics Simulations Using Bayesian Inference. J Chem Theory Comput 2019; 15:3344-3353. [PMID: 30908042 DOI: 10.1021/acs.jctc.9b00062] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Coarse-grained (CG) simulations have allowed access to larger length scales and longer time scales in the study of the dynamic processes of large biomolecules than all-atom (AA) molecular dynamics (MD) simulations. Backmapping from CG models to AA structures has long been studied because it enables us to gain detailed structure insights from CG simulations. Many methods first construct an AA structure from the CG model by fragments, random placement, or geometrical rules and subsequently optimize the solution via energy minimization, simulated annealing or position-restrained simulations. However, such methods may only work well on residue-level CG models and cannot consider the deviations of CG models. In this work, we describe, to the best of our knowledge, a new backmapping method based on Bayesian inference and restrained MD simulations. Restraints with log harmonic energy terms are defined according to the target CG model using the Bayesian inference in which the CG deviations can be estimated. From an initial AA structure obtained from either high-resolution experiments or homology modeling, a MD simulation with the aforementioned restraints is performed to obtain a final AA structure that is a backmapping of the target CG model. The method was validated using multiresolution CG models of the soluble extracellular region of the human epidermal growth factor receptor and was further applied to construct AA structures from CG simulations of the nucleosome core particle. The results demonstrate that our method can generate accurate AA structures of different types of biomolecules from multiple CG models with either residue-level resolution or much lower resolution than one-site-per-residue.
Collapse
Affiliation(s)
- Junhui Peng
- Hefei National Laboratory for Physical Science at Microscale and School of Life Sciences , University of Science and Technology of China , Hefei , Anhui 230026 , People's Republic of China
| | - Chuang Yuan
- Hefei National Laboratory for Physical Science at Microscale and School of Life Sciences , University of Science and Technology of China , Hefei , Anhui 230026 , People's Republic of China
| | - Rongsheng Ma
- Hefei National Laboratory for Physical Science at Microscale and School of Life Sciences , University of Science and Technology of China , Hefei , Anhui 230026 , People's Republic of China
| | - Zhiyong Zhang
- Hefei National Laboratory for Physical Science at Microscale and School of Life Sciences , University of Science and Technology of China , Hefei , Anhui 230026 , People's Republic of China
| |
Collapse
|
9
|
Jumper JM, Faruk NF, Freed KF, Sosnick TR. Accurate calculation of side chain packing and free energy with applications to protein molecular dynamics. PLoS Comput Biol 2018; 14:e1006342. [PMID: 30589846 PMCID: PMC6307715 DOI: 10.1371/journal.pcbi.1006342] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2017] [Accepted: 06/21/2018] [Indexed: 12/02/2022] Open
Abstract
To address the large gap between time scales that can be easily reached by molecular simulations and those required to understand protein dynamics, we present a rapid self-consistent approximation of the side chain free energy at every integration step. In analogy with the adiabatic Born-Oppenheimer approximation for electronic structure, the protein backbone dynamics are simulated as preceding according to the dictates of the free energy of an instantaneously-equilibrated side chain potential. The side chain free energy is computed on the fly, allowing the protein backbone dynamics to traverse a greatly smoothed energetic landscape. This computation results in extremely rapid equilibration and sampling of the Boltzmann distribution. Our method, termed Upside, employs a reduced model involving the three backbone atoms, along with the carbonyl oxygen and amide proton, and a single (oriented) side chain bead having multiple locations reflecting the conformational diversity of the side chain's rotameric states. We also introduce a novel, maximum-likelihood method to parameterize the side chain interactions using protein structures. We demonstrate state-of-the-art accuracy for predicting χ1 rotamer states while consuming only milliseconds of CPU time. Our method enables rapidly equilibrating coarse-grained simulations that can nonetheless contain significant molecular detail. We also show that the resulting free energies of the side chains are sufficiently accurate for de novo folding of some proteins.
Collapse
Affiliation(s)
- John M. Jumper
- Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, Illinois, United States of America
- Department of Chemistry, and The James Franck Institute, University of Chicago, Chicago, Illinois, United States of America
| | - Nabil F. Faruk
- Graduate Program in Biophysical Sciences, University of Chicago, Chicago, Illinois, United States of America
| | - Karl F. Freed
- Department of Chemistry, and The James Franck Institute, University of Chicago, Chicago, Illinois, United States of America
| | - Tobin R. Sosnick
- Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, Illinois, United States of America
- Institute for Biophysical Dynamics, University of Chicago, Chicago, Illinois, United States of America
| |
Collapse
|
10
|
Modeling of Protein Structural Flexibility and Large-Scale Dynamics: Coarse-Grained Simulations and Elastic Network Models. Int J Mol Sci 2018; 19:ijms19113496. [PMID: 30404229 PMCID: PMC6274762 DOI: 10.3390/ijms19113496] [Citation(s) in RCA: 42] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2018] [Revised: 10/29/2018] [Accepted: 10/31/2018] [Indexed: 12/13/2022] Open
Abstract
Fluctuations of protein three-dimensional structures and large-scale conformational transitions are crucial for the biological function of proteins and their complexes. Experimental studies of such phenomena remain very challenging and therefore molecular modeling can be a good alternative or a valuable supporting tool for the investigation of large molecular systems and long-time events. In this minireview, we present two alternative approaches to the coarse-grained (CG) modeling of dynamic properties of protein systems. We discuss two CG representations of polypeptide chains used for Monte Carlo dynamics simulations of protein local dynamics and conformational transitions, and highly simplified structure-based elastic network models of protein flexibility. In contrast to classical all-atom molecular dynamics, the modeling strategies discussed here allow the quite accurate modeling of much larger systems and longer-time dynamic phenomena. We briefly describe the main features of these models and outline some of their applications, including modeling of near-native structure fluctuations, sampling of large regions of the protein conformational space, or possible support for the structure prediction of large proteins and their complexes.
Collapse
|
11
|
Dawid AE, Gront D, Kolinski A. Coarse-Grained Modeling of the Interplay between Secondary Structure Propensities and Protein Fold Assembly. J Chem Theory Comput 2018; 14:2277-2287. [PMID: 29486120 DOI: 10.1021/acs.jctc.7b01242] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
We recently developed a new coarse-grained model of protein structure and dynamics [ Dawid et al. J. Chem. Theory Comput. 2017 , 13 ( 11 ), 5766 - 5779 ]. The model assumed a single bead representation of amino acid residues, where positions of such united residues were defined by centers of mass of four amino acid fragments. Replica exchange Monte Carlo sampling of the model chains provided good pictures of modeled structures and their dynamics. In its generic form the statistical knowledge-based force field of the model has been dedicated for single-domain globular proteins. Sequence-specific interactions are defined by three-letter secondary structure data. In the present work we demonstrate that different assignments and/or predictions of secondary structures are usually sufficient for enforcing cooperative formation of native-like folds of SURPASS chains for the majority of single-domain globular proteins. Simulations of native-like structure assembly for a representative set of globular proteins have shown that the accuracy of secondary structure data is usually not crucial for model performance, although some specific errors can strongly distort the obtained three-dimensional structures.
Collapse
Affiliation(s)
- Aleksandra E Dawid
- Faculty of Chemistry, Biological and Chemical Research Center , University of Warsaw , Pasteura 1 , 02-093 Warsaw , Poland
| | - Dominik Gront
- Faculty of Chemistry, Biological and Chemical Research Center , University of Warsaw , Pasteura 1 , 02-093 Warsaw , Poland
| | - Andrzej Kolinski
- Faculty of Chemistry, Biological and Chemical Research Center , University of Warsaw , Pasteura 1 , 02-093 Warsaw , Poland
| |
Collapse
|
12
|
Effects and limitations of a nucleobase-driven backmapping procedure for nucleic acids using steered molecular dynamics. Biochem Biophys Res Commun 2018; 498:352-358. [DOI: 10.1016/j.bbrc.2017.12.057] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2017] [Revised: 11/13/2017] [Accepted: 12/11/2017] [Indexed: 11/19/2022]
|
13
|
Watkins AM, Craven TW, Renfrew PD, Arora PS, Bonneau R. Rotamer Libraries for the High-Resolution Design of β-Amino Acid Foldamers. Structure 2017; 25:1771-1780.e3. [PMID: 29033287 PMCID: PMC5845441 DOI: 10.1016/j.str.2017.09.005] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2016] [Revised: 06/21/2017] [Accepted: 09/14/2017] [Indexed: 01/28/2023]
Abstract
β-Amino acids offer attractive opportunities to develop biologically active peptidomimetics, either employed alone or in conjunction with natural α-amino acids. Owing to their potential for unique conformational preferences that deviate considerably from α-peptide geometries, β-amino acids greatly expand the possible chemistries and physical properties available to polyamide foldamers. Complete in silico support for designing new molecules incorporating non-natural amino acids typically requires representing their side-chain conformations as sets of discrete rotamers for model refinement and sequence optimization. Such rotamer libraries are key components of several state-of-the-art design frameworks. Here we report the development, incorporation in to the Rosetta macromolecular modeling suite, and validation of rotamer libraries for β3-amino acids.
Collapse
Affiliation(s)
- Andrew M Watkins
- Department of Chemistry, New York University, New York, NY 10003, USA
| | - Timothy W Craven
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY 10009, USA; Institute for Protein Design, University of Washington, Seattle, WA 98102, USA
| | - P Douglas Renfrew
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY 10009, USA; Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY 10010, USA
| | - Paramjit S Arora
- Department of Chemistry, New York University, New York, NY 10003, USA
| | - Richard Bonneau
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY 10009, USA; Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY 10010, USA; Courant Institute of Mathematical Sciences, Computer Science Department, New York University, New York, NY 10009, USA.
| |
Collapse
|
14
|
Dawid AE, Gront D, Kolinski A. SURPASS Low-Resolution Coarse-Grained Protein Modeling. J Chem Theory Comput 2017; 13:5766-5779. [PMID: 28992694 DOI: 10.1021/acs.jctc.7b00642] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Coarse-grained modeling of biomolecules has a very important role in molecular biology. In this work we present a novel SURPASS (Single United Residue per Pre-Averaged Secondary Structure fragment) model of proteins that can be an interesting alternative for existing coarse-grained models. The design of the model is unique and strongly supported by the statistical analysis of structural regularities characteristic for protein systems. Coarse-graining of protein chain structures assumes a single center of interactions per residue and accounts for preaveraged effects of four adjacent residue fragments. Knowledge-based statistical potentials encode complex interaction patterns of these fragments. Using the Replica Exchange Monte Carlo sampling scheme and a generic version of the SURPASS force field we performed test simulations of a representative set of single-domain globular proteins. The method samples a significant part of conformational space and reproduces protein structures, including native-like, with surprisingly good accuracy. Future extension of the SURPASS model on large biomacromolecular systems is briefly discussed.
Collapse
Affiliation(s)
- Aleksandra E Dawid
- Faculty of Chemistry, Biological and Chemical Research Center, University of Warsaw , Pasteura 1, 02-093 Warsaw, Poland
| | - Dominik Gront
- Faculty of Chemistry, Biological and Chemical Research Center, University of Warsaw , Pasteura 1, 02-093 Warsaw, Poland
| | - Andrzej Kolinski
- Faculty of Chemistry, Biological and Chemical Research Center, University of Warsaw , Pasteura 1, 02-093 Warsaw, Poland
| |
Collapse
|
15
|
Musiani F, Giorgetti A. Protein Aggregation and Molecular Crowding: Perspectives From Multiscale Simulations. INTERNATIONAL REVIEW OF CELL AND MOLECULAR BIOLOGY 2016; 329:49-77. [PMID: 28109331 DOI: 10.1016/bs.ircmb.2016.08.009] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
Cells are extremely crowded environments, thus the use of diluted salted aqueous solutions containing a single protein is too simplistic to mimic the real situation. Macromolecular crowding might affect protein structure, folding, shape, conformational stability, binding of small molecules, enzymatic activity, interactions with cognate biomolecules, and pathological aggregation. The latter phenomenon typically leads to the formation of amyloid fibrils that are linked to several lethal neurodegenerative diseases, but that can also play a functional role in certain organisms. The majority of molecular simulations performed before the last few years were conducted in diluted solutions and were restricted both in the timescales and in the system dimensions by the available computational resources. In recent years, several computational solutions were developed to get close to physiological conditions. In this review we summarize the main computational techniques used to tackle the issue of protein aggregation both in a diluted and in a crowded environment.
Collapse
Affiliation(s)
- F Musiani
- Laboratory of Bioinorganic Chemistry, University of Bologna, Bologna, Italy.
| | - A Giorgetti
- Applied Bioinformatics Group, University of Verona, Verona, Italy.
| |
Collapse
|
16
|
Abstract
![]()
A method for the local refinement
of protein structures that targets
improvements in local stereochemistry while preserving the overall
fold is presented. The method uses force field-based minimization
and sampling via molecular dynamics simulations with a modified force
field to bring bonds, angles, and torsion angles into an acceptable
range for high-resolution protein structures. The method is implemented
in the locPREFMD web server and was tested on computational models
submitted to CASP11. Using MolProbity scores as the main assessment
criterion, the locPREFMD method significantly improves the stereochemical
quality of given input models close to the quality expected for experimental
structures while maintaining the Cα coordinates of the initial
model.
Collapse
Affiliation(s)
- Michael Feig
- Department of Biochemistry and Molecular Biology and ‡Department of Chemistry, Michigan State University , 603 Wilson Road, Room BCH 218, East Lansing, Michigan 48824, United States
| |
Collapse
|
17
|
Pluhackova K, Böckmann RA. Biomembranes in atomistic and coarse-grained simulations. JOURNAL OF PHYSICS. CONDENSED MATTER : AN INSTITUTE OF PHYSICS JOURNAL 2015. [PMID: 26194872 DOI: 10.1088/0953-8984/27/32/323103] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/10/2023]
Abstract
The architecture of biological membranes is tightly coupled to the localization, organization, and function of membrane proteins. The organelle-specific distribution of lipids allows for the formation of functional microdomains (also called rafts) that facilitate the segregation and aggregation of membrane proteins and thus shape their function. Molecular dynamics simulations enable to directly access the formation, structure, and dynamics of membrane microdomains at the molecular scale and the specific interactions among lipids and proteins on timescales from picoseconds to microseconds. This review focuses on the latest developments of biomembrane force fields for both atomistic and coarse-grained molecular dynamics (MD) simulations, and the different levels of coarsening of biomolecular structures. It also briefly introduces scale-bridging methods applicable to biomembrane studies, and highlights selected recent applications.
Collapse
Affiliation(s)
- Kristyna Pluhackova
- Computational Biology, Department of Biology, Friedrich-Alexander Universität Erlangen-Nürnberg, Staudtstr. 5, 91058 Erlangen, Germany
| | | |
Collapse
|
18
|
Feig M, Mirjalili V. Protein structure refinement via molecular-dynamics simulations: What works and what does not? Proteins 2015; 84 Suppl 1:282-92. [PMID: 26234208 DOI: 10.1002/prot.24871] [Citation(s) in RCA: 49] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2015] [Revised: 07/15/2015] [Accepted: 07/29/2015] [Indexed: 12/26/2022]
Abstract
Protein structure refinement during CASP11 by the Feig group was described. Molecular dynamics simulations were used in combination with an improved selection and averaging protocol. On average, modest refinement was achieved with some targets improved significantly. Analysis of the CASP submission from our group focused on refinement success versus amount of sampling, refinement of different secondary structure elements and whether refinement varied as a function of which group provided initial models. The refinement of local stereochemical features was examined via the MolProbity score and an updated protocol was developed that can generate high-quality structures with very low MolProbity scores for most starting structures with modest computational effort. Proteins 2016; 84(Suppl 1):282-292. © 2015 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Michael Feig
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan, 48824. .,Department of Chemistry, Michigan State University, East Lansing, Michigan, 48824.
| | - Vahid Mirjalili
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan, 48824.,Department of Mechanical Engineering, Michigan State University, East Lansing, Michigan, 48824
| |
Collapse
|
19
|
Modeling of protein-peptide interactions using the CABS-dock web server for binding site search and flexible docking. Methods 2015; 93:72-83. [PMID: 26165956 DOI: 10.1016/j.ymeth.2015.07.004] [Citation(s) in RCA: 114] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2015] [Revised: 07/06/2015] [Accepted: 07/08/2015] [Indexed: 11/22/2022] Open
Abstract
Protein-peptide interactions play essential functional roles in living organisms and their structural characterization is a hot subject of current experimental and theoretical research. Computational modeling of the structure of protein-peptide interactions is usually divided into two stages: prediction of the binding site at a protein receptor surface, and then docking (and modeling) the peptide structure into the known binding site. This paper presents a comprehensive CABS-dock method for the simultaneous search of binding sites and flexible protein-peptide docking, available as a user's friendly web server. We present example CABS-dock results obtained in the default CABS-dock mode and using its advanced options that enable the user to increase the range of flexibility for chosen receptor fragments or to exclude user-selected binding modes from docking search. Furthermore, we demonstrate a strategy to improve CABS-dock performance by assessing the quality of models with classical molecular dynamics. Finally, we discuss the promising extensions and applications of the CABS-dock method and provide a tutorial appendix for the convenient analysis and visualization of CABS-dock results. The CABS-dock web server is freely available at http://biocomp.chem.uw.edu.pl/CABSdock/.
Collapse
|
20
|
Three-dimensional protein structure prediction: Methods and computational strategies. Comput Biol Chem 2014; 53PB:251-276. [DOI: 10.1016/j.compbiolchem.2014.10.001] [Citation(s) in RCA: 121] [Impact Index Per Article: 12.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2014] [Revised: 10/03/2014] [Accepted: 10/07/2014] [Indexed: 01/01/2023]
|
21
|
A. Rohrdanz M, Zheng W, Lambeth B, Vreede J, Clementi C. Multiscale approach to the determination of the photoactive yellow protein signaling state ensemble. PLoS Comput Biol 2014; 10:e1003797. [PMID: 25356903 PMCID: PMC4214557 DOI: 10.1371/journal.pcbi.1003797] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2013] [Accepted: 07/08/2014] [Indexed: 02/04/2023] Open
Abstract
The nature of the optical cycle of photoactive yellow protein (PYP) makes its elucidation challenging for both experiment and theory. The long transition times render conventional simulation methods ineffective, and yet the short signaling-state lifetime makes experimental data difficult to obtain and interpret. Here, through an innovative combination of computational methods, a prediction and analysis of the biological signaling state of PYP is presented. Coarse-grained modeling and locally scaled diffusion map are first used to obtain a rough bird's-eye view of the free energy landscape of photo-activated PYP. Then all-atom reconstruction, followed by an enhanced sampling scheme; diffusion map-directed-molecular dynamics are used to focus in on the signaling-state region of configuration space and obtain an ensemble of signaling state structures. To the best of our knowledge, this is the first time an all-atom reconstruction from a coarse grained model has been performed in a relatively unexplored region of molecular configuration space. We compare our signaling state prediction with previous computational and more recent experimental results, and the comparison is favorable, which validates the method presented. This approach provides additional insight to understand the PYP photo cycle, and can be applied to other systems for which more direct methods are impractical. Many protein systems of biological interest undergo dynamical changes on a time scale too long to be modeled using standard computational methods. One example is photoactive yellow protein (PYP), found in several bacterial species. Blue light, potentially harmful for DNA, triggers several structural changes in PYP, eventually resulting in a conformation that changes the swimming behavior of bacteria. This conformation is difficult to investigate, as it is too short lived. In addition, understanding this “signaling state” is computationally difficult because of the long timescale of the transition. We overcome this by constructing a coarse-grained model to rapidly induce transitions to the signaling state. We then reconstruct and further sample the all-atom configurations from these coarse-grained representations. Our results are consistent with all available experimental and computational evidence.
Collapse
Affiliation(s)
- Mary A. Rohrdanz
- Center for Theoretical Biological Physics, Rice University, Houston, Texas, United States of America
- Chemistry Department, Rice University, Houston, Texas, United States of America
| | - Wenwei Zheng
- Center for Theoretical Biological Physics, Rice University, Houston, Texas, United States of America
- Chemistry Department, Rice University, Houston, Texas, United States of America
| | - Bradley Lambeth
- Center for Theoretical Biological Physics, Rice University, Houston, Texas, United States of America
| | - Jocelyne Vreede
- van't Hoff Institute for Molecular Sciences, University of Amsterdam, Amsterdam, The Netherlands
| | - Cecilia Clementi
- Center for Theoretical Biological Physics, Rice University, Houston, Texas, United States of America
- Chemistry Department, Rice University, Houston, Texas, United States of America
- * E-mail:
| |
Collapse
|
22
|
Wassenaar TA, Pluhackova K, Böckmann RA, Marrink SJ, Tieleman DP. Going Backward: A Flexible Geometric Approach to Reverse Transformation from Coarse Grained to Atomistic Models. J Chem Theory Comput 2014; 10:676-90. [PMID: 26580045 DOI: 10.1021/ct400617g] [Citation(s) in RCA: 471] [Impact Index Per Article: 47.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
The conversion of coarse-grained to atomistic models is an important step in obtaining insight about atomistic scale processes from coarse-grained simulations. For this process, called backmapping or reverse transformation, several tools are available, but these commonly require libraries of molecule fragments or they are linked to a specific software package. In addition, the methods are usually restricted to specific molecules and to a specific force field. Here, we present an alternative method, consisting of geometric projection and subsequent force-field based relaxation. This method is designed to be simple and flexible, and offers a generic solution for resolution transformation. For simple systems, the conversion only requires a list of particle correspondences on the two levels of resolution. For special cases, such as nondefault protonation states of amino acids and virtual sites, a target particle list can be specified. The mapping uses simple building blocks, which list the particles on the different levels of resolution. For conversion to higher resolution, the initial model is relaxed with several short cycles of energy minimization and position-restrained MD. The reconstruction of an atomistic backbone from a coarse-grained model is done using a new dedicated algorithm. The method is generic and can be used to map between any two particle based representations, provided that a mapping can be written. The focus of this work is on the coarse-grained MARTINI force field, for which mapping definitions are written to allow conversion to and from the higher-resolution force fields GROMOS, CHARMM, and AMBER, and to and from a simplified three-bead lipid model. Together, these offer the possibility to simulate mesoscopic membrane structures, to be transformed to MARTINI and subsequently to an atomistic model for investigation of detailed interactions. The method was tested on a set of systems ranging from a simple, single-component bilayer to a large protein-membrane-solvent complex. The results demonstrate the efficiency and the efficacy of the new approach.
Collapse
Affiliation(s)
- Tsjerk A Wassenaar
- Department of Biological Sciences, University of Calgary, 2500 University Drive NW , Calgary, Alberta, Canada T2N 1N4.,Groningen Biomolecular Sciences and Biotechnology Institute and Zernike Institute for Advanced Materials, University of Groningen , Nijenborgh 7, 9747 AG Groningen, The Netherlands.,Computational Biology, Department of Biology, University of Erlangen-Nürnberg , Staudtstr. 5, 91058 Erlangen, Germany
| | - Kristyna Pluhackova
- Computational Biology, Department of Biology, University of Erlangen-Nürnberg , Staudtstr. 5, 91058 Erlangen, Germany
| | - Rainer A Böckmann
- Computational Biology, Department of Biology, University of Erlangen-Nürnberg , Staudtstr. 5, 91058 Erlangen, Germany
| | - Siewert J Marrink
- Groningen Biomolecular Sciences and Biotechnology Institute and Zernike Institute for Advanced Materials, University of Groningen , Nijenborgh 7, 9747 AG Groningen, The Netherlands
| | - D Peter Tieleman
- Department of Biological Sciences, University of Calgary, 2500 University Drive NW , Calgary, Alberta, Canada T2N 1N4
| |
Collapse
|
23
|
Moore BL, Kelley LA, Barber J, Murray JW, MacDonald JT. High-quality protein backbone reconstruction from alpha carbons using Gaussian mixture models. J Comput Chem 2013; 34:1881-9. [PMID: 23703289 DOI: 10.1002/jcc.23330] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2013] [Revised: 04/01/2013] [Accepted: 04/21/2013] [Indexed: 11/08/2022]
Abstract
Coarse-grained protein structure models offer increased efficiency in structural modeling, but these must be coupled with fast and accurate methods to revert to a full-atom structure. Here, we present a novel algorithm to reconstruct mainchain models from C traces. This has been parameterized by fitting Gaussian mixture models (GMMs) to short backbone fragments centered on idealized peptide bonds. The method we have developed is statistically significantly more accurate than several competing methods, both in terms of RMSD values and dihedral angle differences. The method produced Ramachandran dihedral angle distributions that are closer to that observed in real proteins and better Phaser molecular replacement log-likelihood gains. Amino acid residue sidechain reconstruction accuracy using SCWRL4 was found to be statistically significantly correlated to backbone reconstruction accuracy. Finally, the PD2 method was found to produce significantly lower energy full-atom models using Rosetta which has implications for multiscale protein modeling using coarse-grained models. A webserver and C++ source code is freely available for noncommercial use from: http://www.sbg.bio.ic.ac.uk/phyre2/PD2_ca2main/.
Collapse
Affiliation(s)
- Benjamin L Moore
- Division of Molecular Biosciences, Imperial College, South Kensington Campus, London, United Kingdom
| | | | | | | | | |
Collapse
|
24
|
Producing high-accuracy lattice models from protein atomic coordinates including side chains. Adv Bioinformatics 2012; 2012:148045. [PMID: 22934109 PMCID: PMC3426164 DOI: 10.1155/2012/148045] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2012] [Accepted: 06/18/2012] [Indexed: 02/08/2023] Open
Abstract
Lattice models are a common abstraction used in the study of protein structure, folding, and refinement. They are advantageous because the discretisation of space can make extensive protein evaluations computationally feasible. Various approaches to the protein chain lattice fitting problem have been suggested but only a single backbone-only tool is available currently. We introduce LatFit, a new tool to produce high-accuracy lattice protein models. It generates both backbone-only and backbone-side-chain models in any user defined lattice. LatFit implements a new distance RMSD-optimisation fitting procedure in addition to the known coordinate RMSD method. We tested LatFit's accuracy and speed using a large nonredundant set of high resolution proteins (SCOP database) on three commonly used lattices: 3D cubic, face-centred cubic, and knight's walk. Fitting speed compared favourably to other methods and both backbone-only and backbone-side-chain models show low deviation from the original data (~1.5 Å RMSD in the FCC lattice). To our knowledge this represents the first comprehensive study of lattice quality for on-lattice protein models including side chains while LatFit is the only available tool for such models.
Collapse
|
25
|
Abstract
Functional characterization of proteins being one of the major issues in molecular biology is still unsolved due to several resource and technical limitations of experimental structure determination methods. A suitable methodology for accurate prediction of protein confirmations simply from sequence is therefore emerging as the primary modeling goal of researchers today. Global blind protein structure prediction summit, entitled Critical Assessment of Structure Prediction (CASP), critically assesses the modeling methodologies, to track our algorithmic path development. But our success is still impeded by incompetent modeling methodologies and several key technical lacunas. There is still a great need to focus some key issues for bridging the major though considered trivial gaps, in the upcoming CASP to pave and demarcate our correct way of developing a consistently accurate prediction methodology in the near future. Major problems resulting in divergence of our predicted models from their actual native states are thus highlighted with suggested more stringent and reliable assessment considerations in the CASP test.
Collapse
Affiliation(s)
- Ashish Runthala
- Biological Sciences, Faculty Division III, Birla Institute of Technology & Science, Pilani, Rajasthan, India.
| |
Collapse
|
26
|
Zhang J, He Z, Wang Q, Barz B, Kosztin I, Shang Y, Xu D. Prediction of protein tertiary structures using MUFOLD. Methods Mol Biol 2012; 815:3-13. [PMID: 22130979 DOI: 10.1007/978-1-61779-424-7_1] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/20/2023]
Abstract
There have been steady improvements in protein structure prediction during the past two decades. However, current methods are still far from consistently predicting structural models accurately with computing power accessible to common users. To address this challenge, we developed MUFOLD, a hybrid method of using whole and partial template information along with new computational techniques for protein tertiary structure prediction. MUFOLD covers both template-based and ab initio predictions using the same framework and aims to achieve high accuracy and fast computing. Two major novel contributions of MUFOLD are graph-based model generation and molecular dynamics ranking (MDR). By formulating a prediction as a graph realization problem, we apply an efficient optimization approach of Multidimensional Scaling (MDS) to speed up the prediction dramatically. In addition, under this framework, we enhance the predictions consistently by iteratively using the information from generated models. MDR, in contrast to widely used static scoring functions, exploits dynamics properties of structures to evaluate their qualities, which can often identify best structures from a pool more effectively.
Collapse
Affiliation(s)
- Jingfen Zhang
- Department of Computer Science, University of Missouri, Columbia, MO, USA
| | | | | | | | | | | | | |
Collapse
|
27
|
Cheng YM, Gopal SM, Law SM, Feig M. Molecular dynamics trajectory compression with a coarse-grained model. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2011; 9:476-486. [PMID: 22025759 PMCID: PMC3505254 DOI: 10.1109/tcbb.2011.141] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Molecular dynamics trajectories are very data-intensive thereby limiting sharing and archival of such data. One possible solution is compression of trajectory data. Here, trajectory compression based on conversion to the coarse-grained model PRIMO is proposed. The compressed data is about one third of the original data and fast decompression is possible with an analytical reconstruction procedure from PRIMO to all-atom representations. This protocol largely preserves structural features and to a more limited extent also energetic features of the original trajectory.
Collapse
Affiliation(s)
- Yi-Ming Cheng
- The Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI, 48824
| | - Srinivasa Murthy Gopal
- The Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI, 48824
| | - Sean M. Law
- The Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI, 48824
| | - Michael Feig
- The Departments of Biochemistry and Molecular Biology, Chemistry, and Computer Science and Engineering, Michigan State University, East Lansing, MI 48824
| |
Collapse
|
28
|
Park H, Ko J, Joo K, Lee J, Seok C, Lee J. Refinement of protein termini in template-based modeling using conformational space annealing. Proteins 2011; 79:2725-34. [PMID: 21755541 DOI: 10.1002/prot.23101] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2011] [Revised: 05/17/2011] [Accepted: 05/27/2011] [Indexed: 02/05/2023]
Abstract
The rapid increase in the number of experimentally determined protein structures in recent years enables us to obtain more reliable protein tertiary structure models than ever by template-based modeling. However, refinement of template-based models beyond the limit available from the best templates is still needed for understanding protein function in atomic detail. In this work, we develop a new method for protein terminus modeling that can be applied to refinement of models with unreliable terminus structures. The energy function for terminus modeling consists of both physics-based and knowledge-based potential terms with carefully optimized relative weights. Effective sampling of both the framework and terminus is performed using the conformational space annealing technique. This method has been tested on a set of termini derived from a nonredundant structure database and two sets of termini from the CASP8 targets. The performance of the terminus modeling method is significantly improved over our previous method that does not employ terminus refinement. It is also comparable or superior to the best server methods tested in CASP8. The success of the current approach suggests that similar strategy may be applied to other types of refinement problems such as loop modeling or secondary structure rearrangement.
Collapse
Affiliation(s)
- Hahnbeom Park
- Department of Chemistry, Seoul National University, Seoul 151-747, Republic of Korea
| | | | | | | | | | | |
Collapse
|
29
|
Samiotakis A, Homouz D, Cheung MS. Multiscale investigation of chemical interference in proteins. J Chem Phys 2010; 132:175101. [PMID: 20459186 DOI: 10.1063/1.3404401] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
We developed a multiscale approach (MultiSCAAL) that integrates the potential of mean force obtained from all-atomistic molecular dynamics simulations with a knowledge-based energy function for coarse-grained molecular simulations in better exploring the energy landscape of a small protein under chemical interference such as chemical denaturation. An excessive amount of water molecules in all-atomistic molecular dynamics simulations often negatively impacts the sampling efficiency of some advanced sampling techniques such as the replica exchange method and it makes the investigation of chemical interferences on protein dynamics difficult. Thus, there is a need to develop an effective strategy that focuses on sampling structural changes in protein conformations rather than solvent molecule fluctuations. In this work, we address this issue by devising a multiscale simulation scheme (MultiSCAAL) that bridges the gap between all-atomistic molecular dynamics simulation and coarse-grained molecular simulation. The two key features of this scheme are the Boltzmann inversion and a protein atomistic reconstruction method we previously developed (SCAAL). Using MultiSCAAL, we were able to enhance the sampling efficiency of proteins solvated by explicit water molecules. Our method has been tested on the folding energy landscape of a small protein Trp-cage with explicit solvent under 8M urea using both the all-atomistic replica exchange molecular dynamics and MultiSCAAL. We compared computational analyses on ensemble conformations of Trp-cage with its available experimental NOE distances. The analysis demonstrated that conformations explored by MultiSCAAL better agree with the ones probed in the experiments because it can effectively capture the changes in side-chain orientations that can flip out of the hydrophobic pocket in the presence of urea and water molecules. In this regard, MultiSCAAL is a promising and effective sampling scheme for investigating chemical interference which presents a great challenge when modeling protein interactions in vivo.
Collapse
|
30
|
Zhang J, Wang Q, Barz B, He Z, Kosztin I, Shang Y, Xu D. MUFOLD: A new solution for protein 3D structure prediction. Proteins 2010; 78:1137-52. [PMID: 19927325 DOI: 10.1002/prot.22634] [Citation(s) in RCA: 61] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
There have been steady improvements in protein structure prediction during the past 2 decades. However, current methods are still far from consistently predicting structural models accurately with computing power accessible to common users. Toward achieving more accurate and efficient structure prediction, we developed a number of novel methods and integrated them into a software package, MUFOLD. First, a systematic protocol was developed to identify useful templates and fragments from Protein Data Bank for a given target protein. Then, an efficient process was applied for iterative coarse-grain model generation and evaluation at the Calpha or backbone level. In this process, we construct models using interresidue spatial restraints derived from alignments by multidimensional scaling, evaluate and select models through clustering and static scoring functions, and iteratively improve the selected models by integrating spatial restraints and previous models. Finally, the full-atom models were evaluated using molecular dynamics simulations based on structural changes under simulated heating. We have continuously improved the performance of MUFOLD by using a benchmark of 200 proteins from the Astral database, where no template with >25% sequence identity to any target protein is included. The average root-mean-square deviation of the best models from the native structures is 4.28 A, which shows significant and systematic improvement over our previous methods. The computing time of MUFOLD is much shorter than many other tools, such as Rosetta. MUFOLD demonstrated some success in the 2008 community-wide experiment for protein structure prediction CASP8.
Collapse
Affiliation(s)
- Jingfen Zhang
- Department of Computer Science, University of Missouri, Columbia, Missouri 65211, USA
| | | | | | | | | | | | | |
Collapse
|
31
|
Gopal SM, Mukherjee S, Cheng YM, Feig M. PRIMO/PRIMONA: a coarse-grained model for proteins and nucleic acids that preserves near-atomistic accuracy. Proteins 2010; 78:1266-81. [PMID: 19967787 DOI: 10.1002/prot.22645] [Citation(s) in RCA: 82] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
The new coarse graining model PRIMO/PRIMONA for proteins and nucleic acids is proposed. This model combines one to several heavy atoms into coarse-grained sites that are chosen to allow an analytical, high-resolution reconstruction of all-atom models based on molecular bonding geometry constraints. The accuracy of proposed reconstruction method in terms of structure and energetics is tested and compared with other popular reconstruction methods for a variety of protein and nucleic acid test sets.
Collapse
Affiliation(s)
- Srinivasa M Gopal
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan, USA
| | | | | | | |
Collapse
|
32
|
Rotkiewicz P, Skolnick J. Fast procedure for reconstruction of full-atom protein models from reduced representations. J Comput Chem 2008; 29:1460-5. [PMID: 18196502 DOI: 10.1002/jcc.20906] [Citation(s) in RCA: 266] [Impact Index Per Article: 16.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
We introduce PULCHRA, a fast and robust method for the reconstruction of full-atom protein models starting from a reduced protein representation. The algorithm is particularly suitable as an intermediate step between coarse-grained model-based structure prediction and applications requiring an all-atom structure, such as molecular dynamics, protein-ligand docking, structure-based function prediction, or assessment of quality of the predicted structure. The accuracy of the method was tested on a set of high-resolution crystallographic structures as well as on a set of low-resolution protein decoys generated by a protein structure prediction algorithm TASSER. The method is implemented as a standalone program that is available for download from http://cssb.biology.gatech.edu/skolnick/files/PULCHRA.
Collapse
Affiliation(s)
- Piotr Rotkiewicz
- Burnham Institute for Medical Research, 10901 N. Torrey Pines Road, La Jolla, California 92037, USA
| | | |
Collapse
|
33
|
Olson MA, Feig M, Brooks CL. Prediction of protein loop conformations using multiscale modeling methods with physical energy scoring functions. J Comput Chem 2008; 29:820-31. [PMID: 17876760 DOI: 10.1002/jcc.20827] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
This article examines ab initio methods for the prediction of protein loops by a computational strategy of multiscale conformational sampling and physical energy scoring functions. Our approach consists of initial sampling of loop conformations from lattice-based low-resolution models followed by refinement using all-atom simulations. To allow enhanced conformational sampling, the replica exchange method was implemented. Physical energy functions based on CHARMM19 and CHARMM22 parameterizations with generalized Born (GB) solvent models were applied in scoring loop conformations extracted from the lattice simulations and, in the case of all-atom simulations, the ensemble of conformations were generated and scored with these models. Predictions are reported for 25 loop segments, each eight residues long and taken from a diverse set of 22 protein structures. We find that the simulations generally sampled conformations with low global root-mean-square-deviation (RMSD) for loop backbone coordinates from the known structures, whereas clustering conformations in RMSD space and scoring detected less favorable loop structures. Specifically, the lattice simulations sampled basins that exhibited an average global RMSD of 2.21 +/- 1.42 A, whereas clustering and scoring the loop conformations determined an RMSD of 3.72 +/- 1.91 A. Using CHARMM19/GB to refine the lattice conformations improved the sampling RMSD to 1.57 +/- 0.98 A and detection to 2.58 +/- 1.48 A. We found that further improvement could be gained from extending the upper temperature in the all-atom refinement from 400 to 800 K, where the results typically yield a reduction of approximately 1 A or greater in the RMSD of the detected loop. Overall, CHARMM19 with a simple pairwise GB solvent model is more efficient at sampling low-RMSD loop basins than CHARMM22 with a higher-resolution modified analytical GB model; however, the latter simulation method provides a more accurate description of the all-atom energy surface, yet demands a much greater computational cost.
Collapse
Affiliation(s)
- Mark A Olson
- Department of Cell Biology and Biochemistry, U.S. Army Medical Research Institute of Infectious Diseases, Frederick, Maryland 21702, USA.
| | | | | |
Collapse
|
34
|
Olson MA, Yeh IC, Lee MS. Coarse-grained lattice model simulations of sequence-structure fitness of a ribosome-inactivating protein. Biopolymers 2008; 89:153-9. [PMID: 17985366 DOI: 10.1002/bip.20880] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Many realistic protein-engineering design problems extend beyond the computational limits of what is considered practical when applying all-atom molecular-dynamics simulation methods. Lattice models provide computationally robust alternatives, yet most are regarded as too simplistic to accurately capture the details of complex designs. We revisit a coarse-grained lattice simulation model and demonstrate that a multiresolution modeling approach of reconstructing all-atom structures from lattice chains is of sufficient accuracy to resolve the comparability of sequence-structure modifications of the ricin A-chain (RTA) protein fold. For a modeled structure, the unfolding-folding transition temperature was calculated from the heat capacity using either the potential energy from the lattice model or the all-atom CHARMM19 force-field plus a generalized Born solvent approximation. We found, that despite the low-resolution modeling of conformational states, the potential energy functions were capable of detecting the relative change in the thermodynamic transition temperature that distinguishes between a protein design and the native RTA fold in excellent accord with reported experimental studies of thermal denaturation. A discussion is provided of different sequences fitted to the RTA fold and a possible unfolding model.
Collapse
Affiliation(s)
- Mark A Olson
- Department of Cell Biology and Biochemistry, U.S. Army Medical Research Institute of Infectious Diseases, Frederick, MD 21702, USA.
| | | | | |
Collapse
|
35
|
Abstract
We developed and tested the I-TASSER protein structure prediction algorithm in the CASP7 experiment, where targets are first threaded through the PDB library and continuous fragments in the threading alignments are exploited to assemble the global structure. The final models are obtained from the progressive refinements started from the last round structure clusters. A majority of the targets in the template-based modeling (TBM) category have the templates drawn closer to the native structure by more than 1 A within the aligned regions. For the free-modeling (FM) targets, I-TASSER builds correct topology for 7/19 cases with sequence up to 155 residues long. For the first time, the automated server prediction generates models as good as the human-expert does in all the categories, which shows the robustness of the method and the potential of the application to genome-wide structure prediction. Despite the success, the accuracy of I-TASSER modeling is still dominated by the similarity of the template and target structures with a strong correlation coefficient ( approximately 0.9) between the root-mean-squared deviation (RMSD) to native of the templates and the final models. Especially, there is no high-resolution model below 2 A for the FM targets. These problems highlight the issues that need to be addressed in the next generation of atomic-level I-TASSER development especially for the FM target modeling.
Collapse
Affiliation(s)
- Yang Zhang
- Center for Bioinformatics, Department of Molecular Biosciences, University of Kansas, Lawrence, Kansas 66047, USA.
| |
Collapse
|
36
|
Zhang Y. I-TASSER server for protein 3D structure prediction. BMC Bioinformatics 2008; 9:40. [PMID: 18215316 PMCID: PMC2245901 DOI: 10.1186/1471-2105-9-40] [Citation(s) in RCA: 3810] [Impact Index Per Article: 238.1] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2007] [Accepted: 01/23/2008] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Prediction of 3-dimensional protein structures from amino acid sequences represents one of the most important problems in computational structural biology. The community-wide Critical Assessment of Structure Prediction (CASP) experiments have been designed to obtain an objective assessment of the state-of-the-art of the field, where I-TASSER was ranked as the best method in the server section of the recent 7th CASP experiment. Our laboratory has since then received numerous requests about the public availability of the I-TASSER algorithm and the usage of the I-TASSER predictions. RESULTS An on-line version of I-TASSER is developed at the KU Center for Bioinformatics which has generated protein structure predictions for thousands of modeling requests from more than 35 countries. A scoring function (C-score) based on the relative clustering structural density and the consensus significance score of multiple threading templates is introduced to estimate the accuracy of the I-TASSER predictions. A large-scale benchmark test demonstrates a strong correlation between the C-score and the TM-score (a structural similarity measurement with values in [0, 1]) of the first models with a correlation coefficient of 0.91. Using a C-score cutoff > -1.5 for the models of correct topology, both false positive and false negative rates are below 0.1. Combining C-score and protein length, the accuracy of the I-TASSER models can be predicted with an average error of 0.08 for TM-score and 2 A for RMSD. CONCLUSION The I-TASSER server has been developed to generate automated full-length 3D protein structural predictions where the benchmarked scoring system helps users to obtain quantitative assessments of the I-TASSER models. The output of the I-TASSER server for each query includes up to five full-length models, the confidence score, the estimated TM-score and RMSD, and the standard deviation of the estimations. The I-TASSER server is freely available to the academic community at http://zhang.bioinformatics.ku.edu/I-TASSER.
Collapse
Affiliation(s)
- Yang Zhang
- Center for Bioinformatics and Department of Molecular Bioscience, University of Kansas, 2030 Becker Dr, Lawrence, KS 66047, USA.
| |
Collapse
|
37
|
Latek D, Ekonomiuk D, Kolinski A. Protein structure prediction: combining de novo modeling with sparse experimental data. J Comput Chem 2007; 28:1668-76. [PMID: 17342709 DOI: 10.1002/jcc.20657] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Routine structure prediction of new folds is still a challenging task for computational biology. The challenge is not only in the proper determination of overall fold but also in building models of acceptable resolution, useful for modeling the drug interactions and protein-protein complexes. In this work we propose and test a comprehensive approach to protein structure modeling supported by sparse, and relatively easy to obtain, experimental data. We focus on chemical shift-based restraints from NMR, although other sparse restraints could be easily included. In particular, we demonstrate that combining the typical NMR software with artificial intelligence-based prediction of secondary structure enhances significantly the accuracy of the restraints for molecular modeling. The computational procedure is based on the reduced representation approach implemented in the CABS modeling software, which proved to be a versatile tool for protein structure prediction during the CASP (CASP stands for critical assessment of techniques for protein structure prediction) experiments (see http://predictioncenter/CASP6/org). The method is successfully tested on a small set of representative globular proteins of different size and topology, including the two CASP6 targets, for which the required NMR data already exist. The method is implemented in a semi-automated pipeline applicable to a large scale structural annotation of genomic data. Here, we limit the computations to relatively small set. This enabled, without a loss of generality, a detailed discussion of various factors determining accuracy of the proposed approach to the protein structure prediction.
Collapse
Affiliation(s)
- Dorota Latek
- Faculty of Chemistry, Warsaw University, Pateura 1, 02-093 Warsaw, Poland.
| | | | | |
Collapse
|
38
|
Carr JM, Wales DJ. Global optimization and folding pathways of selected alpha-helical proteins. J Chem Phys 2007; 123:234901. [PMID: 16392943 DOI: 10.1063/1.2135783] [Citation(s) in RCA: 86] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The results of basin-hopping global optimization simulations are presented for four small, alpha-helical proteins described by a coarse-grained potential. A step-taking scheme that incorporates the local conformational preferences extracted from a large number of high-resolution protein structures is compared with an unbiased scheme. In addition, the discrete path sampling method is used to investigate the folding of one of the proteins, namely, the villin headpiece subdomain. Folding times from kinetic Monte Carlo simulations and iterative calculations based on a Markovian first-step analysis for the resulting stationary-point database are in good mutual agreement, but differ significantly from the experimental values, probably because the native state is not the global free energy minimum for the potential employed.
Collapse
Affiliation(s)
- Joanne M Carr
- University Chemical Laboratories, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | | |
Collapse
|
39
|
Gront D, Kmiecik S, Kolinski A. Backbone building from quadrilaterals: a fast and accurate algorithm for protein backbone reconstruction from alpha carbon coordinates. J Comput Chem 2007; 28:1593-1597. [PMID: 17342707 DOI: 10.1002/jcc.20624] [Citation(s) in RCA: 76] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
In this contribution, we present an algorithm for protein backbone reconstruction that comprises very high computational efficiency with high accuracy. Reconstruction of the main chain atomic coordinates from the alpha carbon trace is a common task in protein modeling, including de novo structure prediction, comparative modeling, and processing experimental data. The method employed in this work follows the main idea of some earlier approaches to the problem. The details and careful design of the present approach are new and lead to the algorithm that outperforms all commonly used earlier applications. BBQ (Backbone Building from Quadrilaterals) program has been extensively tested both on native structures as well as on near-native decoy models and compared with the different available existing methods. Obtained results provide a comprehensive benchmark of existing tools and evaluate their applicability to a large scale modeling using a reduced representation of protein conformational space. The BBQ package is available for downloading from our website at http://biocomp.chem.uw.edu.pl/services/BBQ/. This webpage also provides a user manual that describes BBQ functions in detail.
Collapse
Affiliation(s)
- Dominik Gront
- Faculty of Chemistry, Warsaw University, Pasteura 1 02-093, Warsaw
| | | | - Andrzej Kolinski
- Faculty of Chemistry, Warsaw University, Pasteura 1 02-093, Warsaw
| |
Collapse
|
40
|
Lee MS, Olson MA. Evaluation of Poisson solvation models using a hybrid explicit/implicit solvent method. J Phys Chem B 2007; 109:5223-36. [PMID: 16863188 DOI: 10.1021/jp046377z] [Citation(s) in RCA: 56] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Implicit solvent methods have become popular tools in the field of protein dynamics simulations, yet evaluation of their validity has been primarily limited to comparisons with experimental and theoretical data for small molecules. In this paper, we use a recently developed hybrid explicit/implicit solvent methodology to evaluate the accuracy of several Poisson-based implicit solvent models. Specifically, we focus on the calculation of electrostatic solvation free energies of various fixed conformations for two proteins. We show that, among various dielectric boundary definitions, the Lee-Richards molecular surface has the best agreement with hybrid solvent results. Furthermore, certain modifications of the molecular surface Poisson protocol provide varied results. For instance, simple modifications of atomic radii on charged residues generally improve absolute errors but do not significantly reduce relative errors among conformations. On the other hand, using a water-probe radius of 1.0 A, as opposed to the standard value of 1.4 A, to generate the molecular surface, moderately improves both absolute and relative results.
Collapse
Affiliation(s)
- Michael S Lee
- Department of Cell Biology and Biochemistry, U.S. Army Medical Research Institute of Infectious Diseases, 1425 Porter Street, Frederick, Maryland 21702, USA.
| | | |
Collapse
|
41
|
Heath AP, Kavraki LE, Clementi C. From coarse-grain to all-atom: Toward multiscale analysis of protein landscapes. Proteins 2007; 68:646-61. [PMID: 17523187 DOI: 10.1002/prot.21371] [Citation(s) in RCA: 101] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Multiscale methods are becoming increasingly promising as a way to characterize the dynamics of large protein systems on biologically relevant time-scales. The underlying assumption in multiscale simulations is that it is possible to move reliably between different resolutions. We present a method that efficiently generates realistic all-atom protein structures starting from the C(alpha) atom positions, as obtained for instance from extensive coarse-grain simulations. The method, a reconstruction algorithm for coarse-grain structures (RACOGS), is validated by reconstructing ensembles of coarse-grain structures obtained during folding simulations of the proteins src-SH3 and S6. The results show that RACOGS consistently produces low energy, all-atom structures. A comparison of the free energy landscapes calculated using the coarse-grain structures versus the all-atom structures shows good correspondence and little distortion in the protein folding landscape.
Collapse
Affiliation(s)
- Allison P Heath
- Department of Computer Science, Rice University, Houston, Texas 77005, USA
| | | | | |
Collapse
|
42
|
Wu S, Skolnick J, Zhang Y. Ab initio modeling of small proteins by iterative TASSER simulations. BMC Biol 2007; 5:17. [PMID: 17488521 PMCID: PMC1878469 DOI: 10.1186/1741-7007-5-17] [Citation(s) in RCA: 341] [Impact Index Per Article: 20.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2006] [Accepted: 05/08/2007] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Predicting 3-dimensional protein structures from amino-acid sequences is an important unsolved problem in computational structural biology. The problem becomes relatively easier if close homologous proteins have been solved, as high-resolution models can be built by aligning target sequences to the solved homologous structures. However, for sequences without similar folds in the Protein Data Bank (PDB) library, the models have to be predicted from scratch. Progress in the ab initio structure modeling is slow. The aim of this study was to extend the TASSER (threading/assembly/refinement) method for the ab initio modeling and examine systemically its ability to fold small single-domain proteins. RESULTS We developed I-TASSER by iteratively implementing the TASSER method, which is used in the folding test of three benchmarks of small proteins. First, data on 16 small proteins (< 90 residues) were used to generate I-TASSER models, which had an average Calpha-root mean square deviation (RMSD) of 3.8A, with 6 of them having a Calpha-RMSD < 2.5A. The overall result was comparable with the all-atomic ROSETTA simulation, but the central processing unit (CPU) time by I-TASSER was much shorter (150 CPU days vs. 5 CPU hours). Second, data on 20 small proteins (< 120 residues) were used. I-TASSER folded four of them with a Calpha-RMSD < 2.5A. The average Calpha-RMSD of the I-TASSER models was 3.9A, whereas it was 5.9A using TOUCHSTONE-II software. Finally, 20 non-homologous small proteins (< 120 residues) were taken from the PDB library. An average Calpha-RMSD of 3.9A was obtained for the third benchmark, with seven cases having a Calpha-RMSD < 2.5A. CONCLUSION Our simulation results show that I-TASSER can consistently predict the correct folds and sometimes high-resolution models for small single-domain proteins. Compared with other ab initio modeling methods such as ROSETTA and TOUCHSTONE II, the average performance of I-TASSER is either much better or is similar within a lower computational time. These data, together with the significant performance of automated I-TASSER server (the Zhang-Server) in the 'free modeling' section of the recent Critical Assessment of Structure Prediction (CASP)7 experiment, demonstrate new progresses in automated ab initio model generation. The I-TASSER server is freely available for academic users http://zhang.bioinformatics.ku.edu/I-TASSER.
Collapse
Affiliation(s)
- Sitao Wu
- Center for Bioinformatics and Department of Molecular Bioscience, University of Kansas, 2030 Becker Dr, Lawrence, KS 66047, USA
| | - Jeffrey Skolnick
- Center for the Study of Systems Biology, School of Biology, Georgia Institute of Technology, 250 14th Street NW, Atlanta, GA 30318, USA
| | - Yang Zhang
- Center for Bioinformatics and Department of Molecular Bioscience, University of Kansas, 2030 Becker Dr, Lawrence, KS 66047, USA
| |
Collapse
|
43
|
Montero-Morán GM, Li M, Rendòn-Huerta E, Jourdan F, Lowe DJ, Stumpff-Kane AW, Feig M, Scazzocchio C, Hausinger RP. Purification and characterization of the FeII- and alpha-ketoglutarate-dependent xanthine hydroxylase from Aspergillus nidulans. Biochemistry 2007; 46:5293-304. [PMID: 17429948 PMCID: PMC2525507 DOI: 10.1021/bi700065h] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
His6-tagged xanthine/alpha-ketoglutarate (alphaKG) dioxygenase (XanA) of Aspergillus nidulans was purified from both the fungal mycelium and recombinant Escherichia coli cells, and the properties of the two forms of the protein were compared. Evidence was obtained for both N- and O-linked glycosylation on the fungus-derived XanA, which aggregates into an apparent dodecamer, while bacterium-derived XanA is free of glycosylation and behaves as a monomer. Immunological methods identify phosphothreonine in both forms of XanA, with phosphoserine also detected in the bacterium-derived protein. Mass spectrometric analysis confirms glycosylation and phosphorylation of the fungus-derived sample, which also undergoes extensive truncation at its amino terminus. Despite the major differences in the properties of these proteins, their kinetic parameters are similar (kcat = 30-70 s-1, Km of alphaKG = 31-50 muM, Km of xanthine approximately 45 muM, and pH optima at 7.0-7.4). The enzyme exhibits no significant isotope effect when [8-2H]xanthine is used; however, it demonstrates a 2-fold solvent deuterium isotope effect. CuII and ZnII potently inhibit the FeII-specific enzyme, whereas CoII, MnII, and NiII are weaker inhibitors. NaCl decreases the kcat and increases the Km of both alphaKG and xanthine. The alphaKG cosubstrate can be substituted with alpha-ketoadipate (9-fold decrease in kcat and 5-fold increase in the Km compared to those of the normal alpha-keto acid), while the alphaKG analogue N-oxalylglycine is a competitive inhibitor (Ki = 0.12 muM). No alternative purines effectively substitute for xanthine as a substrate, and only one purine analogue (6,8-dihydroxypurine) results in significant inhibition. Quenching of the endogenous fluorescence of the two enzyme forms by xanthine, alphaKG, and DHP was used to characterize their binding properties. A XanA homology model was generated on the basis of the structure of the related enzyme TauD (PDB entry 1OS7) and provided insights into the sites of posttranslational modification and substrate binding. These studies represent the first biochemical characterization of purified xanthine/alphaKG dioxygenase.
Collapse
Affiliation(s)
- Gabriela M Montero-Morán
- Institut de Génétique et de Microbiologie, Université Paris-Sud, Bâtiment 409, UMR 8621 CNRS, 91405 Orsay Cedex, France
| | | | | | | | | | | | | | | | | |
Collapse
|
44
|
Müller TA, Zavodszky MI, Feig M, Kuhn LA, Hausinger RP. Structural basis for the enantiospecificities of R- and S-specific phenoxypropionate/alpha-ketoglutarate dioxygenases. Protein Sci 2006; 15:1356-68. [PMID: 16731970 PMCID: PMC2242530 DOI: 10.1110/ps.052059406] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Abstract
(R)- and (S)-dichlorprop/alpha-ketoglutarate dioxygenases (RdpA and SdpA) catalyze the oxidative cleavage of 2-(2,4-dichlorophenoxy)propanoic acid (dichlorprop) and 2-(4-chloro-2-methyl-phenoxy)propanoic acid (mecoprop) to form pyruvate plus the corresponding phenol concurrent with the conversion of alpha-ketoglutarate (alphaKG) to succinate plus CO2. RdpA and SdpA are strictly enantiospecific, converting only the (R) or the (S) enantiomer, respectively. Homology models were generated for both enzymes on the basis of the structure of the related enzyme TauD (PDB code 1OS7). Docking was used to predict the orientation of the appropriate mecoprop enantiomer in each protein, and the predictions were tested by characterizing the activities of site-directed variants of the enzymes. Mutant proteins that changed at residues predicted to interact with (R)- or (S)-mecoprop exhibited significantly reduced activity, often accompanied by increased Km values, consistent with roles for these residues in substrate binding. Four of the designed SdpA variants were (slightly) active with (R)-mecoprop. The results of the kinetic investigations are consistent with the identification of key interactions in the structural models and demonstrate that enantiospecificity is coordinated by the interactions of a number of residues in RdpA and SdpA. Most significantly, residues Phe171 in RdpA and Glu69 in SdpA apparently act by hindering the binding of the wrong enantiomer more than the correct one, as judged by the observed decreases in Km when these side chains are replaced by Ala.
Collapse
Affiliation(s)
- Tina A Müller
- Department of Microbiology, Michigan State University, East Lansing, Michigan 48824-4320, USA
| | | | | | | | | |
Collapse
|
45
|
Stumpff-Kane AW, Feig M. A correlation-based method for the enhancement of scoring functions on funnel-shaped energy landscapes. Proteins 2006; 63:155-64. [PMID: 16397892 DOI: 10.1002/prot.20853] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
A correlation-based approach is introduced for enhancing the ability of structure-scoring methods to identify and distinguish native-like conformations. The proposed method relies on a funnel-shaped scoring function that decreases steadily toward the native state. It takes advantage of the idea that the structure from a given ensemble that is closest to the native basin leads to the highest correlation coefficient between a given score and distance to that structure as an approximation of the native state for the entire ensemble. The method is applied successfully to a number of different test cases that demonstrate substantial improvements in the correlation of the score with the distance from the true native state but also result in the selection of more native-like structures compared to the original score.
Collapse
Affiliation(s)
- Andrew W Stumpff-Kane
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan 48824-1319, USA
| | | |
Collapse
|
46
|
Koliński A, Bujnicki JM. Generalized protein structure prediction based on combination of fold-recognition with de novo folding and evaluation of models. Proteins 2006; 61 Suppl 7:84-90. [PMID: 16187348 DOI: 10.1002/prot.20723] [Citation(s) in RCA: 85] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
To predict the tertiary structure of full-length sequences of all targets in CASP6, regardless of their potential category (from easy comparative modeling to fold recognition to apparent new folds) we used a novel combination of two very different approaches developed independently in our laboratories, which ranked quite well in different categories in CASP5. First, the GeneSilico metaserver was used to identify domains, predict secondary structure, and generate fold recognition (FR) alignments, which were converted to full-atom models using the "FRankenstein's Monster" approach for comparative modeling (CM) by recombination of protein fragments. Additional models generated "de novo" by fully automated servers were obtained from the CASP website. All these models were evaluated by VERIFY3D, and residues with scores better than 0.2 were used as a source of spatial restraints. Second, a new implementation of the lattice-based protein modeling tool CABS was used to carry out folding guided by the above-mentioned restraints with the Replica Exchange Monte Carlo sampling technique. Decoys generated in the course of simulation were subject to the average linkage hierarchical clustering. For a representative decoy from each cluster, a full-atom model was rebuilt. Finally, five models were selected for submission based on combination of various criteria, including the size, density, and average energy of the corresponding cluster, and the visual evaluation of the full-atom structures and their relationship to the original templates. The combination of FRankenstein and CABS was one of the best-performing algorithms over all categories in CASP6 (it is important to note that our human intervention was very limited, and all steps in our method can be easily automated). We were able to generate a number of very good models, especially in the Comparative Modeling and New Folds categories. Frequently, the best models were closer to the native structure than any of the templates used. The main problem we encountered was in the ranking of the final models (the only step of significant human intervention), due to the insufficient computational power, which precluded the possibility of full-atom refinement and energy-based evaluation.
Collapse
|
47
|
Zhang Y, Arakaki AK, Skolnick J. TASSER: an automated method for the prediction of protein tertiary structures in CASP6. Proteins 2006; 61 Suppl 7:91-98. [PMID: 16187349 DOI: 10.1002/prot.20724] [Citation(s) in RCA: 128] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
The recently developed TASSER (Threading/ASSembly/Refinement) method is applied to predict the tertiary structures of all CASP6 targets. TASSER is a hierarchical approach that consists of template identification by the threading program PROSPECTOR_3, followed by tertiary structure assembly via rearranging continuous template fragments. Assembly occurs using parallel hyperbolic Monte Carlo sampling under the guide of an optimized, reduced force field that includes knowledge-based statistical potentials and spatial restraints extracted from threading alignments. Models are automatically selected from the Monte Carlo trajectories in the low-temperature replicas using the clustering program SPICKER. For all 90 CASP targets/domains, PROSPECTOR_3 generates initial alignments with an average root-mean-square deviation (RMSD) to native of 8.4 A with 79% coverage. After TASSER reassembly, the average RMSD decreases to 5.4 A over the same aligned residues; the overall cumulative TM-score increases from 39.44 to 52.53. Despite significant improvements over the PROSPECTOR_3 template alignment observed in all target categories, the overall quality of the final models is essentially dictated by the quality of threading templates: The average TM-scores of TASSER models in the three categories are, respectively, 0.79 [comparative modeling (CM), 43 targets/domains], 0.47 [fold recognition (FR), 37 targets/domains], and 0.30 [new fold (NF), 10 targets/domains]. This highlights the need to develop novel (or improved) approaches to identify very distant targets as well as better NF algorithms.
Collapse
Affiliation(s)
- Yang Zhang
- Center of Excellence in Bioinformatics, University at Buffalo, Buffalo, New York 14203, USA
| | | | | |
Collapse
|
48
|
Feig M, Im W, Brooks CL. Implicit solvation based on generalized Born theory in different dielectric environments. J Chem Phys 2006; 120:903-11. [PMID: 15267926 DOI: 10.1063/1.1631258] [Citation(s) in RCA: 121] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
In this paper we are investigating the effect of the dielectric environment on atomic Born radii used in generalized Born (GB) methods. Motivated by the Kirkwood expression for the reaction field of a single off-center charge in a spherical cavity, we are proposing extended formalisms for the calculation of Born radii as a function of external and internal dielectric constants. We demonstrate that reaction field energies calculated from environmentally dependent Born radii lead to much improved agreement with Poisson-Boltzmann solutions for low dielectric external environments, such as biological membranes or organic solvent, compared to previous methods where the calculation of Born radii does not depend on the environment. We also examine how this new approach can be applied for the calculation of transfer free energies from vacuum to a given external dielectric for a system with an internal dielectric larger than one. This has not been possible with standard GB theory but is relevant when scoring minimized or average structures with implicit solvent.
Collapse
Affiliation(s)
- Michael Feig
- Department of Molecular Biology, TPC6, The Scripps Research Institute, La Jolla, California 92037, USA
| | | | | |
Collapse
|
49
|
Zhang Y, DeVries ME, Skolnick J. Structure modeling of all identified G protein-coupled receptors in the human genome. PLoS Comput Biol 2006; 2:e13. [PMID: 16485037 PMCID: PMC1364505 DOI: 10.1371/journal.pcbi.0020013] [Citation(s) in RCA: 151] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2005] [Accepted: 01/11/2005] [Indexed: 12/22/2022] Open
Abstract
G protein–coupled receptors (GPCRs), encoded by about 5% of human genes, comprise the largest family of integral membrane proteins and act as cell surface receptors responsible for the transduction of endogenous signal into a cellular response. Although tertiary structural information is crucial for function annotation and drug design, there are few experimentally determined GPCR structures. To address this issue, we employ the recently developed threading assembly refinement (TASSER) method to generate structure predictions for all 907 putative GPCRs in the human genome. Unlike traditional homology modeling approaches, TASSER modeling does not require solved homologous template structures; moreover, it often refines the structures closer to native. These features are essential for the comprehensive modeling of all human GPCRs when close homologous templates are absent. Based on a benchmarked confidence score, approximately 820 predicted models should have the correct folds. The majority of GPCR models share the characteristic seven-transmembrane helix topology, but 45 ORFs are predicted to have different structures. This is due to GPCR fragments that are predominantly from extracellular or intracellular domains as well as database annotation errors. Our preliminary validation includes the automated modeling of bovine rhodopsin, the only solved GPCR in the Protein Data Bank. With homologous templates excluded, the final model built by TASSER has a global Cα root-mean-squared deviation from native of 4.6 Å, with a root-mean-squared deviation in the transmembrane helix region of 2.1 Å. Models of several representative GPCRs are compared with mutagenesis and affinity labeling data, and consistent agreement is demonstrated. Structure clustering of the predicted models shows that GPCRs with similar structures tend to belong to a similar functional class even when their sequences are diverse. These results demonstrate the usefulness and robustness of the in silico models for GPCR functional analysis. All predicted GPCR models are freely available for noncommercial users on our Web site (http://www.bioinformatics.buffalo.edu/GPCR). G protein–coupled receptors (GPCRs) are a large superfamily of integral membrane proteins that transduce signals across the cell membrane. Because of the breadth and importance of the physiological roles undertaken by the GPCR family, many of its members are important pharmacological targets. Although the knowledge of a protein's native structure can provide important insight into understanding its function and for the design of new drugs, the experimental determination of the three-dimensional structure of GPCR membrane proteins has proved to be very difficult. This is demonstrated by the fact that there is only one solved GPCR structure (from bovine rhodopsin) deposited in the Protein Data Bank library. In contrast, there are no human GPCR structures in the Protein Data Bank. To address the need for the tertiary structures of human GPCRs, using just sequence information, the authors use a newly developed threading-assembly-refinement method to generate models for all 907 registered GPCRs in the human genome. About 820 GPCRs are anticipated to have correct topology and transmembrane helix arrangement. A subset of the resulting models is validated by comparison with mutagenesis experimental data, and consistent agreement is demonstrated.
Collapse
Affiliation(s)
- Yang Zhang
- Center of Excellence in Bioinformatics, University at Buffalo, Buffalo, New York, United States of America
| | - Mark E DeVries
- Center of Excellence in Bioinformatics, University at Buffalo, Buffalo, New York, United States of America
| | - Jeffrey Skolnick
- Center of Excellence in Bioinformatics, University at Buffalo, Buffalo, New York, United States of America
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|
50
|
Feig M, Chocholoušová J, Tanizaki S. Extending the horizon: towards the efficient modeling of large biomolecular complexes in atomic detail. Theor Chem Acc 2005. [DOI: 10.1007/s00214-005-0062-4] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
|