1
|
Saikia B, Baruah A. In silico design of misfolding resistant proteins: the role of structural similarity of a competing conformational ensemble in the optimization of frustration. SOFT MATTER 2024; 20:3283-3298. [PMID: 38529658 DOI: 10.1039/d4sm00171k] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/27/2024]
Abstract
Most state-of-the-art in silico design methods fail due to misfolding of designed sequences to a conformation other than the target. Thus, a method to design misfolding resistant proteins will provide a better understanding of the misfolding phenomenon and will also increase the success rate of in silico design methods. In this work, we optimize the conformational ensemble to be selected for negative design purposes based on the similarity of the conformational ensemble to the target. Five ensembles with different degrees of similarity to the target are created and destabilized and the target is stabilized while designing sequences using mean field theory and Monte Carlo simulation methods. The results suggest that the degree of similarity of the non-native conformations to the target plays a prominent role in designing misfolding resistant protein sequences. The design procedures that destabilize the conformational ensemble with moderate similarity to the target have proven to be more promising. Incorporation of either highly similar or highly dissimilar conformations to the target conformation into the non-native ensemble to be destabilized may lead to sequences with a higher misfolding propensity. This will significantly reduce the conformational space to be considered in any protein design procedure. Interestingly, the results suggest that a sequence with higher frustration in the target structure does not necessarily lead to a misfold prone sequence. A successful design method may purposefully choose a frustrated sequence in the target conformation if that sequence is even more frustrated in the competing non-native conformations.
Collapse
Affiliation(s)
- Bondeepa Saikia
- Department of Chemistry, Dibrugarh University, Dibrugarh 786004, India.
| | - Anupaul Baruah
- Department of Chemistry, Dibrugarh University, Dibrugarh 786004, India.
| |
Collapse
|
2
|
Hendrix E, Motta S, Gahl RF, He Y. Insight into the Initial Stages of the Folding Process in Onconase Revealed by UNRES. J Phys Chem B 2022; 126:7934-7942. [PMID: 36179061 DOI: 10.1021/acs.jpcb.2c04770] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The unfolded state of proteins presents many challenges to elucidate the structural basis for biological function. This state is characterized by a large degree of structural heterogeneity which makes it difficult to generate structural models. However, recent experiments into the initial folding events of the 104-residue ribonuclease homologue onconase (ONC) were able to identify the regions in the protein that participate in the initial folding of this protein. Therefore, to gain additional structural insight into the unfolded state of proteins, this study utilized molecular dynamics simulations using the UNited-RESidue (UNRES) force field to evaluate whether there is a good agreement between the experimentally determined initial structures and the structures identified by computer simulations along a folding pathway. Indeed, these UNRES simulations accurately identified the two regions experimentally observed to form the initial native structure along the folding pathway of ONC. In addition, these regions are determined to be chain folding initiation sites (CFIS) according to methods developed previously. Subsequent self-organization maps (SOM) analysis has revealed key structural states involved in these early folding events.
Collapse
Affiliation(s)
- Emily Hendrix
- Department of Chemistry and Chemical Biology, University of New Mexico, Albuquerque, New Mexico87131, United States
| | - Stefano Motta
- Department of Earth and Environmental Sciences, University of Milano-Bicocca, Milan20126, Italy
| | - Robert F Gahl
- Division of Extramural Activities, National Cancer Institute, National Institutes of Health, Bethesda, Maryland20850, United States
| | - Yi He
- Department of Chemistry and Chemical Biology, University of New Mexico, Albuquerque, New Mexico87131, United States.,Translational Informatics Division, Department of Internal Medicine, University of New Mexico, Albuquerque, New Mexico87131, United States
| |
Collapse
|
3
|
Lubecka EA, Hansmann UHE. Early Stages of RNA-Mediated Conversion of Human Prions. J Phys Chem B 2022; 126:6221-6230. [PMID: 35973105 PMCID: PMC9420815 DOI: 10.1021/acs.jpcb.2c04614] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Prion diseases are characterized by the conversion of prion proteins from a PrPC fold into a disease-causing PrPSC form that is self-replicating. A possible agent to trigger this conversion is polyadenosine RNA, but both mechanism and pathways of the conversion are poorly understood. Using coarse-grained molecular dynamic simulations we study the time evolution of PrPC over 600 μs. We find that both the D178N mutation and interacting with polyadenosine RNA reduce the helicity of the protein and encourage formation of segments with strand-like motifs. We conjecture that these transient β-strands nucleate the conversion of the protein to the scrapie conformation PrPSC.
Collapse
Affiliation(s)
- Emilia A Lubecka
- Faculty of Electronics, Telecommunications and Informatics, Gdansk University of Technology, G. Narutowicza 11/12, 80-233 Gdansk, Poland
| | - Ulrich H E Hansmann
- Department of Chemistry & Biochemistry, University of Oklahoma, Norman, Oklahoma 73019,United States
| |
Collapse
|
4
|
Hassan M, Coutsias EA. Kinematic Reconstruction of Cyclic Peptides and Protein Backbones from Partial Data. J Chem Inf Model 2021; 61:4975-5000. [PMID: 34570494 PMCID: PMC10129052 DOI: 10.1021/acs.jcim.1c00453] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
We present an algorithm, QBKR (Quaternary Backbone Kinematic Reconstruction), a fast analytical method for an all-atom backbone reconstruction of proteins and linear or cyclic peptide chains from Cα coordinate traces. Unlike previous analytical methods for deriving all-atom representations from coarse-grained models that rely on canonical geometry with planar peptides in the trans conformation, our de novo kinematic model incorporates noncanonical, cis-trans, geometry naturally. Perturbations to this geometry can be effected with ease in our formulation, for example, to account for a continuous change from cis to trans geometry. A simple optimization of a spring-based objective function is employed for Cα-Cα distance variations that extend beyond the cis-trans limit. The kinematic construction produces a linked chain of peptide units, Cα-C-N-Cα, hinged at the Cα atoms spanning all possible planar and nonplanar peptide conformations. We have combined our method with a ring closure algorithm for the case of ring peptides and missing loops in a protein structure. Here, the reconstruction proceeding from both the N and C termini of the protein backbone (or in both directions from a starting position for rings) requires freedom in the position of one Cα atom (a capstone) to achieve a successful loop or ring closure. A salient feature of our reconstruction method is the ability to enrich conformational ensembles to produce alternative feasible conformations in which H-bond forming C-O or N-H pairs in the backbone can reverse orientations, thus addressing a well-known shortcoming in Cα-based RMSD structure comparison, wherein very close structures may lead to significantly different overall H-bond behavior. We apply the fixed Cα-based design to the reverse reconstruction from noisy Cryo-EM data, a posteriori to the optimization. Our method can be applied to speed up the process of an all-atom description from voluminous experimental data or subpar electron density maps.
Collapse
Affiliation(s)
- Mosavverul Hassan
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York 11794, United States
| | - Evangelos A Coutsias
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York 11794, United States.,Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York 11794-5252, United States
| |
Collapse
|
5
|
Saikia B, Gogoi CR, Rahman A, Baruah A. Identification of an optimal foldability criterion to design misfolding resistant protein. J Chem Phys 2021; 155:144102. [PMID: 34654294 DOI: 10.1063/5.0057533] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
Proteins achieve their functional, active, and operative three dimensional native structures by overcoming the possibility of being trapped in non-native energy minima present in the energy landscape. The enormous and intricate interactions that play an important role in protein folding also determine the stability of the proteins. The large number of stabilizing/destabilizing interactions makes proteins to be only marginally stable as compared to the other competing structures. Therefore, there are some possibilities that they become trapped in the non-native conformations and thus get misfolded. These misfolded proteins lead to several debilitating diseases. This work performs a comparative study of some existing foldability criteria in the computational design of misfold resistant protein sequences based on self-consistent mean field theory. The foldability criteria selected for this study are Ef, Δ, and Φ that are commonly used in protein design procedures to determine the most efficient foldability criterion for the design of misfolding resistant proteins. The results suggest that the foldability criterion Δ is significantly better in designing a funnel energy landscape stabilizing the target state. The results also suggest that inclusion of negative design features is important for designing misfolding resistant proteins, but more information about the non-native conformations in terms of Φ leads to worse results compared to even simple positive design. The sequences designed using Δ show better resistance to misfolding in the Monte Carlo simulations performed in the study.
Collapse
Affiliation(s)
- Bondeepa Saikia
- Department of Chemistry, Dibrugarh University, Dibrugarh 786004, India
| | - Chimi Rekha Gogoi
- Department of Chemistry, Dibrugarh University, Dibrugarh 786004, India
| | - Aziza Rahman
- Department of Chemistry, Dibrugarh University, Dibrugarh 786004, India
| | - Anupaul Baruah
- Department of Chemistry, Dibrugarh University, Dibrugarh 786004, India
| |
Collapse
|
6
|
Liwo A, Czaplewski C, Sieradzan AK, Lubecka EA, Lipska AG, Golon Ł, Karczyńska A, Krupa P, Mozolewska MA, Makowski M, Ganzynkowicz R, Giełdoń A, Maciejczyk M. Scale-consistent approach to the derivation of coarse-grained force fields for simulating structure, dynamics, and thermodynamics of biopolymers. PROGRESS IN MOLECULAR BIOLOGY AND TRANSLATIONAL SCIENCE 2020; 170:73-122. [PMID: 32145953 DOI: 10.1016/bs.pmbts.2019.12.004] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
In this chapter the scale-consistent approach to the derivation of coarse-grained force fields developed in our laboratory is presented, in which the effective energy function originates from the potential of mean force of the system under consideration and embeds atomistically detailed interactions in the resulting energy terms through use of Kubo's cluster-cumulant expansion, appropriate selection of the major degrees of freedom to be averaged out in the derivation of analytical approximations to the energy terms, and appropriate expression of the interaction energies at the all-atom level in these degrees of freedom. Our approach enables the developers to find correct functional forms of the effective coarse-grained energy terms, without having to import them from all-atom force fields or deriving them on a heuristic basis. In particular, the energy terms derived in such a way exhibit correct dependence on coarse-grained geometry, in particular on site orientation. Moreover, analytical formulas for the multibody (correlation) terms, which appear to be crucial for coarse-grained modeling of many of the regular structures such as, e.g., protein α-helices and β-sheets, can be derived in a systematic way. Implementation of the developed theory to the UNIfied COarse-gRaiNed (UNICORN) model of biological macromolecules, which consists of the UNRES (for proteins), NARES-2P (for nucleic acids), and SUGRES-1P (for polysaccharides) components, and is being developed in our laboratory is described. Successful applications of UNICORN to the prediction of protein structure, simulating the folding and stability of proteins and nucleic acids, and solving biological problems are discussed.
Collapse
Affiliation(s)
- Adam Liwo
- Faculty of Chemistry, University of Gdańsk, Gdańsk, Poland; School of Computational Sciences, Korea Institute for Advanced Study, Seoul, Republic of Korea.
| | | | - Adam K Sieradzan
- Faculty of Chemistry, University of Gdańsk, Gdańsk, Poland; School of Computational Sciences, Korea Institute for Advanced Study, Seoul, Republic of Korea
| | - Emilia A Lubecka
- Institute of Informatics, Faculty of Mathematics, Physics, and Informatics, University of Gdańsk, Gdańsk, Poland
| | | | - Łukasz Golon
- Faculty of Chemistry, University of Gdańsk, Gdańsk, Poland
| | | | - Paweł Krupa
- Institute of Physics, Polish Academy of Sciences, Warsaw, Poland
| | | | | | | | - Artur Giełdoń
- Faculty of Chemistry, University of Gdańsk, Gdańsk, Poland
| | - Maciej Maciejczyk
- Department of Physics and Biophysics, Faculty of Food Science, University of Warmia and Mazury in Olsztyn, Olsztyn, Poland
| |
Collapse
|
7
|
Physics-Based Modeling of Side Chain—Side Chain Interactions in the UNRES Force Field. SPRINGER SERIES ON BIO- AND NEUROSYSTEMS 2019. [DOI: 10.1007/978-3-319-95843-9_4] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
|
8
|
Sieradzan AK, Makowski M, Augustynowicz A, Liwo A. A general method for the derivation of the functional forms of the effective energy terms in coarse-grained energy functions of polymers. I. Backbone potentials of coarse-grained polypeptide chains. J Chem Phys 2018; 146:124106. [PMID: 28388107 DOI: 10.1063/1.4978680] [Citation(s) in RCA: 36] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
Abstract
A general and systematic method for the derivation of the functional expressions for the effective energy terms in coarse-grained force fields of polymer chains is proposed. The method is based on the expansion of the potential of mean force of the system studied in the cluster-cumulant series and expanding the all-atom energy in the Taylor series in the squares of interatomic distances about the squares of the distances between coarse-grained centers, to obtain approximate analytical expressions for the cluster cumulants. The primary degrees of freedom to average about are the angles for collective rotation of the atoms contained in the coarse-grained interaction sites about the respective virtual-bond axes. The approach has been applied to the revision of the virtual-bond-angle, virtual-bond-torsional, and backbone-local-and-electrostatic correlation potentials for the UNited RESidue (UNRES) model of polypeptide chains, demonstrating the strong dependence of the torsional and correlation potentials on virtual-bond angles, not considered in the current UNRES. The theoretical considerations are illustrated with the potentials calculated from the ab initiopotential-energysurface of terminally blocked alanine by numerical integration and with the statistical potentials derived from known protein structures. The revised torsional potentials correctly indicate that virtual-bond angles close to 90° result in the preference for the turn and helical structures, while large virtual-bond angles result in the preference for polyproline II and extended backbone geometry. The revised correlation potentials correctly reproduce the preference for the formation of β-sheet structures for large values of virtual-bond angles and for the formation of α-helical structures for virtual-bond angles close to 90°.
Collapse
Affiliation(s)
- Adam K Sieradzan
- Faculty of Chemistry, University of Gdańsk, ul. Wita-Stwosza 63, 80-308 Gdańsk, Poland
| | - Mariusz Makowski
- Faculty of Chemistry, University of Gdańsk, ul. Wita-Stwosza 63, 80-308 Gdańsk, Poland
| | - Antoni Augustynowicz
- Faculty of Mathematics, Physics, and Informatics, University of Gdańsk, ul. Wita Stwosza 57, 80-308 Gdańsk, Poland
| | - Adam Liwo
- Faculty of Chemistry, University of Gdańsk, ul. Wita-Stwosza 63, 80-308 Gdańsk, Poland
| |
Collapse
|
9
|
Guiding exploration in conformational feature space with Lipschitz underestimation for ab-initio protein structure prediction. Comput Biol Chem 2018; 73:105-119. [PMID: 29475175 DOI: 10.1016/j.compbiolchem.2018.02.003] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2017] [Revised: 01/25/2018] [Accepted: 02/03/2018] [Indexed: 01/04/2023]
Abstract
Computing conformations which are essential to associate structural and functional information with gene sequences, is challenging due to the high dimensionality and rugged energy surface of the protein conformational space. Consequently, the dimension of the protein conformational space should be reduced to a proper level, and an effective exploring algorithm should be proposed. In this paper, a plug-in method for guiding exploration in conformational feature space with Lipschitz underestimation (LUE) for ab-initio protein structure prediction is proposed. The conformational space is converted into ultrafast shape recognition (USR) feature space firstly. Based on the USR feature space, the conformational space can be further converted into Underestimation space according to Lipschitz estimation theory for guiding exploration. As a consequence of the use of underestimation model, the tight lower bound estimate information can be used for exploration guidance, the invalid sampling areas can be eliminated in advance, and the number of energy function evaluations can be reduced. The proposed method provides a novel technique to solve the exploring problem of protein conformational space. LUE is applied to differential evolution (DE) algorithm, and metropolis Monte Carlo(MMC) algorithm which is available in the Rosetta; When LUE is applied to DE and MMC, it will be screened by the underestimation method prior to energy calculation and selection. Further, LUE is compared with DE and MMC by testing on 15 small-to-medium structurally diverse proteins. Test results show that near-native protein structures with higher accuracy can be obtained more rapidly and efficiently with the use of LUE.
Collapse
|
10
|
Xu G, Ma T, Zang T, Wang Q, Ma J. OPUS-CSF: A C-atom-based scoring function for ranking protein structural models. Protein Sci 2017; 27:286-292. [PMID: 29047165 PMCID: PMC5734313 DOI: 10.1002/pro.3327] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2017] [Revised: 10/14/2017] [Accepted: 10/16/2017] [Indexed: 12/12/2022]
Abstract
We report a C‐atom‐based scoring function, named OPUS‐CSF, for ranking protein structural models. Rather than using traditional Boltzmann formula, we built a scoring function (CSF score) based on the native distributions (derived from the entire PDB) of coordinate components of mainchain C (carbonyl) atoms on selected residues of peptide segments of 5, 7, 9, and 11 residues in length. In testing OPUS‐CSF on decoy recognition, it maximally recognized 257 native structures out of 278 targets in 11 commonly used decoy sets, significantly outperforming other popular all‐atom empirical potentials. The average correlation coefficient with TM‐score was also comparable with those of other potentials. OPUS‐CSF is a highly coarse‐grained scoring function, which only requires input of partial mainchain information, and very fast. Thus, it is suitable for applications at early stage of structural building.
Collapse
Affiliation(s)
- Gang Xu
- School of Life Sciences, Tsinghua University, Beijing, China
| | - Tianqi Ma
- Applied Physics Program, Rice University, Houston, Texas.,Department of Bioengineering, Rice University, Houston, Texas
| | - Tianwu Zang
- Applied Physics Program, Rice University, Houston, Texas.,Department of Bioengineering, Rice University, Houston, Texas
| | - Qinghua Wang
- Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, One Baylor Plaza, BCM-125, Houston, Texas
| | - Jianpeng Ma
- School of Life Sciences, Tsinghua University, Beijing, China.,Applied Physics Program, Rice University, Houston, Texas.,Department of Bioengineering, Rice University, Houston, Texas.,Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, One Baylor Plaza, BCM-125, Houston, Texas
| |
Collapse
|
11
|
Hao XH, Zhang GJ, Zhou XG. Conformational Space Sampling Method Using Multi-Subpopulation Differential Evolution for De novo Protein Structure Prediction. IEEE Trans Nanobioscience 2017; 16:618-633. [DOI: 10.1109/tnb.2017.2749243] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
12
|
Abstract
This work investigates the computational design of a pH induced protein fold switch based on a self-consistent mean-field approach by identifying the ensemble averaged characteristics of sequences that encode a fold switch. The primary challenge to balance the alternative sets of interactions present in both target structures is overcome by simultaneously optimizing two foldability criteria corresponding to two target structures. The change in pH is modeled by altering the residual charge on the amino acids. The energy landscape of the fold switch protein is found to be double funneled. The fold switch sequences stabilize the interactions of the sites with similar relative surface accessibility in both target structures. Fold switch sequences have low sequence complexity and hence lower sequence entropy. The pH induced fold switch is mediated by attractive electrostatic interactions rather than hydrophobic-hydrophobic contacts. This study may provide valuable insights to the design of fold switch proteins.
Collapse
Affiliation(s)
- Anupaul Baruah
- Department of Chemistry, University of Delhi, Delhi-110007, India
| | - Parbati Biswas
- Department of Chemistry, University of Delhi, Delhi-110007, India
| |
Collapse
|
13
|
Baruah A, Rani P, Biswas P. Conformational Entropy of Intrinsically Disordered Proteins from Amino Acid Triads. Sci Rep 2015; 5:11740. [PMID: 26138206 PMCID: PMC4490338 DOI: 10.1038/srep11740] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2014] [Accepted: 05/26/2015] [Indexed: 12/17/2022] Open
Abstract
This work quantitatively characterizes intrinsic disorder in proteins in terms of
sequence composition and backbone conformational entropy. Analysis of the normalized
relative composition of the amino acid triads highlights a distinct boundary between
globular and disordered proteins. The conformational entropy is calculated from the
dihedral angles of the middle amino acid in the amino acid triad for the
conformational ensemble of the globular, partially and completely disordered
proteins relative to the non-redundant database. Both Monte Carlo (MC) and Molecular
Dynamics (MD) simulations are used to characterize the conformational ensemble of
the representative proteins of each group. The results show that the globular
proteins span approximately half of the allowed conformational states in the
Ramachandran space, while the amino acid triads in disordered proteins sample the
entire range of the allowed dihedral angle space following Flory’s
isolated-pair hypothesis. Therefore, only the sequence information in terms of the
relative amino acid triad composition may be sufficient to predict protein disorder
and the backbone conformational entropy, even in the absence of well-defined
structure. The predicted entropies are found to agree with those calculated using
mutual information expansion and the histogram method.
Collapse
Affiliation(s)
- Anupaul Baruah
- Department of Chemistry, University of Delhi, Delhi-110007, India
| | - Pooja Rani
- Department of Chemistry, University of Delhi, Delhi-110007, India
| | - Parbati Biswas
- Department of Chemistry, University of Delhi, Delhi-110007, India
| |
Collapse
|
14
|
Frembgen-Kesner T, Andrews CT, Li S, Ngo NA, Shubert SA, Jain A, Olayiwola OJ, Weishaar MR, Elcock AH. Parametrization of Backbone Flexibility in a Coarse-Grained Force Field for Proteins (COFFDROP) Derived from All-Atom Explicit-Solvent Molecular Dynamics Simulations of All Possible Two-Residue Peptides. J Chem Theory Comput 2015; 11:2341-54. [PMID: 26574429 DOI: 10.1021/acs.jctc.5b00038] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Recently, we reported the parametrization of a set of coarse-grained (CG) nonbonded potential functions, derived from all-atom explicit-solvent molecular dynamics (MD) simulations of amino acid pairs and designed for use in (implicit-solvent) Brownian dynamics (BD) simulations of proteins; this force field was named COFFDROP (COarse-grained Force Field for Dynamic Representations Of Proteins). Here, we describe the extension of COFFDROP to include bonded backbone terms derived from fitting to results of explicit-solvent MD simulations of all possible two-residue peptides containing the 20 standard amino acids, with histidine modeled in both its protonated and neutral forms. The iterative Boltzmann inversion (IBI) method was used to optimize new CG potential functions for backbone-related terms by attempting to reproduce angle, dihedral, and distance probability distributions generated by the MD simulations. In a simple test of the transferability of the extended force field, the angle, dihedral, and distance probability distributions obtained from BD simulations of 56 three-residue peptides were compared to results from corresponding explicit-solvent MD simulations. In a more challenging test of the COFFDROP force field, it was used to simulate eight intrinsically disordered proteins and was shown to quite accurately reproduce the experimental hydrodynamic radii (Rhydro), provided that the favorable nonbonded interactions of the force field were uniformly scaled downward in magnitude. Overall, the results indicate that the COFFDROP force field is likely to find use in modeling the conformational behavior of intrinsically disordered proteins and multidomain proteins connected by flexible linkers.
Collapse
Affiliation(s)
| | - Casey T Andrews
- Department of Biochemistry, University of Iowa , Iowa City, Iowa 52242, United States
| | - Shuxiang Li
- Department of Biochemistry, University of Iowa , Iowa City, Iowa 52242, United States
| | - Nguyet Anh Ngo
- Department of Biochemistry, University of Iowa , Iowa City, Iowa 52242, United States
| | - Scott A Shubert
- Department of Biochemistry, University of Iowa , Iowa City, Iowa 52242, United States
| | - Aakash Jain
- Department of Biochemistry, University of Iowa , Iowa City, Iowa 52242, United States
| | - Oluwatoni J Olayiwola
- Department of Biochemistry, University of Iowa , Iowa City, Iowa 52242, United States
| | - Mitch R Weishaar
- Department of Biochemistry, University of Iowa , Iowa City, Iowa 52242, United States
| | - Adrian H Elcock
- Department of Biochemistry, University of Iowa , Iowa City, Iowa 52242, United States
| |
Collapse
|
15
|
Suprun AD, Shmeleva LV. Alpha-helical regions of the protein molecule as organic nanotubes. NANOSCALE RESEARCH LETTERS 2014; 9:200. [PMID: 24872798 PMCID: PMC4024317 DOI: 10.1186/1556-276x-9-200] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/04/2013] [Accepted: 04/08/2014] [Indexed: 06/03/2023]
Abstract
UNLABELLED An α-helical region of protein molecule was considered in a model of nanotube. The molecule is in conditions of quantum excitations. Such model corresponds to a one-dimensional molecular nanocrystal with three molecules in an elementary cell at the presence of excitation. For the analysis of different types of conformational response of the α-helical area of the protein molecule on excitation, the nonlinear response of this area to the intramolecular quantum excitation caused by hydrolysis of adenosine triphosphate (ATP) is taken into account. It has been established that in the simplest case, three types of excitation are realized. As estimates show, each of them 'serves' different kinds of protein. The symmetrical type of excitation, most likely, is realized in the reduction of traversal-striped skeletal muscles. It has the highest excitation energy. This well protects from casual actions. Antisymmetric excitations have intermediate energy (between symmetrical and asymmetrical). They, most likely, are realized in membranous and nucleic proteins. It is shown that the conformational response of the α-helical region of the protein is (in angstroms) a quantity of order N c /5, where N c is the number of spiral turns. For the number of turns typical in this case: N c ~ 10, displacement compounds are a quantity of order 2 Å. It qualitatively corresponds to observable values. Asymmetrical excitations have the lowest energy. Therefore, most likely, they are realized in enzymatic proteins. It was shown that at this type of excitation, the bending of the α-helix is formally directed to the opposite side with respect to the antisymmetric excitations. Also, it has a greater value than the antisymmetric case for N c ≤ 14 and smaller for N c > 14. PACS 92C05. MCS 36.20.Ey.
Collapse
Affiliation(s)
- Anatol D Suprun
- Taras Shevchenko National University of Kyiv, Volodymyrska Street, 64/13, Kyiv 01601, Ukraine
| | - Liudmyla V Shmeleva
- Taras Shevchenko National University of Kyiv, Volodymyrska Street, 64/13, Kyiv 01601, Ukraine
| |
Collapse
|
16
|
Mass spectrometry coupled experiments and protein structure modeling methods. Int J Mol Sci 2013; 14:20635-57. [PMID: 24132151 PMCID: PMC3821635 DOI: 10.3390/ijms141020635] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2013] [Revised: 09/17/2013] [Accepted: 09/19/2013] [Indexed: 01/02/2023] Open
Abstract
With the accumulation of next generation sequencing data, there is increasing interest in the study of intra-species difference in molecular biology, especially in relation to disease analysis. Furthermore, the dynamics of the protein is being identified as a critical factor in its function. Although accuracy of protein structure prediction methods is high, provided there are structural templates, most methods are still insensitive to amino-acid differences at critical points that may change the overall structure. Also, predicted structures are inherently static and do not provide information about structural change over time. It is challenging to address the sensitivity and the dynamics by computational structure predictions alone. However, with the fast development of diverse mass spectrometry coupled experiments, low-resolution but fast and sensitive structural information can be obtained. This information can then be integrated into the structure prediction process to further improve the sensitivity and address the dynamics of the protein structures. For this purpose, this article focuses on reviewing two aspects: the types of mass spectrometry coupled experiments and structural data that are obtainable through those experiments; and the structure prediction methods that can utilize these data as constraints. Also, short review of current efforts in integrating experimental data in the structural modeling is provided.
Collapse
|
17
|
Variation in structure of a protein (H2AX) with knowledge-based interactions. PLoS One 2013; 8:e64507. [PMID: 23741333 PMCID: PMC3669374 DOI: 10.1371/journal.pone.0064507] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2013] [Accepted: 04/16/2013] [Indexed: 11/30/2022] Open
Abstract
The structure of a protein (H2AX) as a function of temperature is examined by three knowledge-based phenomenological interactions, MJ (Miyazawa and Jernigan), BT (Betancourt and Thirumalai), and BFKV (Bastolla et al.) to identify similarities and differences in results. Data from the BT and BFKV residue-residue interactions verify finding with the MJ interaction, i.e., the radius of gyration (Rg) of H2AX depends non-monotonically on temperature. The increase in Rg is followed by a decay on raising the temperature with a maximum at a characteristic value, Tc, which depends on the knowledge-based contact matrix, TcBFKV ≤ TcMJ ≤ TcBT. The range (ΔT) of non-monotonic thermal response and its decay pattern with the temperature are sensitive to interaction. A rather narrow temperature range of ΔTMJ ≈ 0.015–0.022 with the MJ interaction expands and shifts up to ΔTBT ≈ 0.018–0.30 at higher temperatures with the BT interaction and shifts down with the BFKV interaction to ΔTBFKV ≈ 0.011–0.018. The scaling of the structure factor with the wave vector reveals that the structure of the protein undergoes a transformation from a random coil at high temperature to a globular conformation at low temperatures.
Collapse
|
18
|
Pandey RB, Farmer BL. Random coil to globular thermal response of a protein (H3.1) with three knowledge-based coarse-grained potentials. PLoS One 2012; 7:e49352. [PMID: 23166645 PMCID: PMC3498164 DOI: 10.1371/journal.pone.0049352] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2012] [Accepted: 10/10/2012] [Indexed: 11/19/2022] Open
Abstract
The effect of temperature on the conformation of a histone (H3.1) is studied by a coarse-grained Monte Carlo simulation based on three knowledge-based contact potentials (MJ, BT, BFKV). Despite unique energy and mobility profiles of its residues, the histone H3.1 undergoes a systematic (possibly continuous) structural transition from a random coil to a globular conformation on reducing the temperature. The range over which such a systematic response in variation of the radius of gyration (R(g)) with the temperature (T) occurs, however, depends on the potential, i.e. ΔT(MJ) ≈ 0.013-0.020, ΔT(BT) ≈ 0.018-0.026, and ΔT(BFKV) ≈ 0.006-0.013 (in reduced unit). Unlike MJ and BT potentials, results from the BFKV potential show an anomaly where the magnitude of R(g) decreases on raising the temperature in a range ΔT(A) ≈ 0.015-0.018 before reaching its steady-state random coil configuration. Scaling of the structure factor, S(q) ∝ q(-1/ν), with the wave vector, q=2π/λ, and the wavelength, λ, reveals a systematic change in the effective dimension (D(e)∼1/ν) of the histone with all potentials (MJ, BT, BFKV): D(e)∼3 in the globular structure with D(e)∼2 for the random coil. Reproducibility of the general yet unique (monotonic) structural transition of the protein H3.1 with the temperature (in contrast to non-monotonic structural response of a similar but different protein H2AX) with three interaction sets shows that the knowledge-based contact potential is viable tool to investigate structural response of proteins. Caution should be exercise with the quantitative comparisons due to differences in transition regimes with these interactions.
Collapse
Affiliation(s)
- Ras B Pandey
- Department of Physics and Astronomy, University of Southern Mississippi, Hattiesburg, Missouri, USA.
| | | |
Collapse
|
19
|
Sieradzan AK, Hansmann UH, Scheraga HA, Liwo A. Extension of UNRES force field to treat polypeptide chains with D-amino-acid residues. J Chem Theory Comput 2012; 8:4746-4757. [PMID: 24729761 PMCID: PMC3982868 DOI: 10.1021/ct3005563] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Coarse-grained force fields for protein simulations are usually designed and parameterized to treat proteins composed of natural L-amino-acid residues. However, D-amino-acid residues occur in bacterial, fungal (e.g., gramicidins), as well as human-designed proteins. For this reason, we have extended the UNRES coarse-grained force field developed in our laboratory to treat systems with D-amino-acid residues. We developed the respective virtual-bond-torsional and double-torsional potentials for rotation about the C α · · · C α virtual-bond axis and two consecutive C α · · · C α virtual-bond axes, respectively, as functions of virtual-bond-dihedral angles γ. In turn, these were calculated as potentials of mean force (PMFs) from the diabatic energy surfaces of terminally-blocked model compounds for glycine, alanine, and proline. The potential-energy surfaces were calculated by using the ab initio method of molecular quantum mechanics at the Møller-Plesset (MP2) level of theory and the 6-31G(d,p) basis set, with the rotation angles of the peptide groups about [Formula: see text] and [Formula: see text] used as variables, and the energy was minimized with respect to the remaining degrees of freedom. The PMFs were calculated by numerical integration for all pairs and triplets with all possible combinations of types (glycine, alanine, and proline) and chirality (D or L); however, symmetry relations reduce the number of non-equivalent torsional potentials to 13 and the number of double-torsional potentials to 63 for a given C-terminal blocking group. Subsequently, one- (for torsional) and two-dimensional (for double-torsional potentials) Fourier series were fitted to the PMFs to obtain analytical expressions. It was found that the torsional potentials of the x-Y and X-y types, where X and Y are Ala or Pro, respectively, and a lowercase letter denotes D-chirality, have global minima for small absolute values of γ, accounting for the double-helical structure of gramicidin A, which is a dimer of two chains, each possessing an alternating D-Tyr-L-Tyr sequence, and similar peptides. The side-chain and correlation potentials for D-amino-acid residues were obtained by applying the reflection about the [Formula: see text] plane to the respective potentials for the L-amino-acid residues.
Collapse
Affiliation(s)
- Adam K. Sieradzan
- Faculty of Chemistry, University of Gdańsk, Sobieskiego 18, 80-952 Gdańsk, Poland
- Department of Chemistry and Biochemistry, Oklahoma University, Norman, OK, 73019, U.S.A
| | - Ulrich H.E. Hansmann
- Department of Chemistry and Biochemistry, Oklahoma University, Norman, OK, 73019, U.S.A
| | - Harold A. Scheraga
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, N.Y., 14853-1301, U.S.A
| | - Adam Liwo
- Faculty of Chemistry, University of Gdańsk, Sobieskiego 18, 80-952 Gdańsk, Poland
| |
Collapse
|
20
|
Xu D, Zhang Y. Toward optimal fragment generations for ab initio protein structure assembly. Proteins 2012; 81:229-39. [PMID: 22972754 DOI: 10.1002/prot.24179] [Citation(s) in RCA: 170] [Impact Index Per Article: 14.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2012] [Revised: 08/06/2012] [Accepted: 09/03/2012] [Indexed: 01/03/2023]
Abstract
Fragment assembly using structural motifs excised from other solved proteins has shown to be an efficient method for ab initio protein-structure prediction. However, how to construct accurate fragments, how to derive optimal restraints from fragments, and what the best fragment length is are the basic issues yet to be systematically examined. In this work, we developed a gapless-threading method to generate position-specific structure fragments. Distance profiles and torsion angle pairs are then derived from the fragments by statistical consistency analysis, which achieved comparable accuracy with the machine-learning-based methods although the fragments were taken from unrelated proteins. When measured by both accuracies of the derived distance profiles and torsion angle pairs, we come to a consistent conclusion that the optimal fragment length for structural assembly is around 10, and at least 100 fragments at each location are needed to achieve optimal structure assembly. The distant profiles and torsion angle pairs as derived by the fragments have been successfully used in QUARK for ab initio protein structure assembly and are provided by the QUARK online server at http://zhanglab.ccmb. med.umich.edu/QUARK/.
Collapse
Affiliation(s)
- Dong Xu
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan 48109, USA
| | | |
Collapse
|
21
|
Shen H, Xia Z, Li G, Ren P. A Review of Physics-Based Coarse-Grained Potentials for the Simulations of Protein Structure and Dynamics. ANNUAL REPORTS IN COMPUTATIONAL CHEMISTRY VOLUME 8 2012. [DOI: 10.1016/b978-0-444-59440-2.00005-3] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
|
22
|
Liwo A, He Y, Scheraga HA. Coarse-grained force field: general folding theory. Phys Chem Chem Phys 2011; 13:16890-901. [PMID: 21643583 DOI: 10.1039/c1cp20752k] [Citation(s) in RCA: 67] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
We review the coarse-grained UNited RESidue (UNRES) force field for the simulations of protein structure and dynamics, which is being developed in our laboratory over the last several years. UNRES is a physics-based force field, the prototype of which is defined as a potential of mean force of polypeptide chains in water, where all the degrees of freedom except the coordinates of α-carbon atoms and side-chain centers have been integrated out. We describe the initial implementation of UNRES to protein-structure prediction formulated as a search for the global minimum of the potential-energy function and its subsequent molecular dynamics and extensions of molecular-dynamics implementation, which enabled us to study protein-folding pathways and thermodynamics, as well as to reformulate the protein-structure prediction problem as a search for the conformational ensemble with the lowest free energy at temperatures below the folding-transition temperature. Applications of UNRES to study biological problems are also described.
Collapse
Affiliation(s)
- Adam Liwo
- Faculty of Chemistry, University of Gdansk, ul. Sobieskiego 18, 80-952 Gdansk, Poland
| | | | | |
Collapse
|
23
|
Makowski M, Liwo A, Scheraga HA. Simple physics-based analytical formulas for the potentials of mean force of the interaction of amino-acid side chains in water. VI. Oppositely charged side chains. J Phys Chem B 2011; 115:6130-7. [PMID: 21500791 PMCID: PMC3093716 DOI: 10.1021/jp111259e] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The two-site coarse-grained model for the interactions of charged side chains, to be used with our coarse-grained UNRES force field for protein simulations proposed in the accompanying paper, has been extended to pairs of oppositely charged side chains. The potentials of mean force of four pairs of molecules modeling charged amino-acid side chains, i.e., propionate-n-pentylamine cation (for aspartic acid-lysine), butyrate-n-pentylamine cation (for glutamic acid-lysine), propionate-1-butylguanidine (for aspartic acid-arginine), and butyrate-1-butylguanidine (for glutamic acid-arginine) pairs were determined by umbrella-sampling molecular dynamics simulations in explicit water as functions of distance and orientation, and the analytical expression was fitted to the potentials of mean force. Compared to pairs of like-charged side chains discussed in the accompanying paper, an average quadrupole-quadrupole interaction term had to be introduced to reproduce the Coulombic interactions, and a multistate model of charge distribution had to be introduced to fit the potentials of mean force of all oppositely charged pairs well. The model reproduces all salt-bridge minima and, consequently, is likely to improve the performance of the UNRES force field.
Collapse
Affiliation(s)
- Mariusz Makowski
- Faculty of Chemistry, University of Gdańsk, Sobieskiego 18, 80-952 Gdańsk, Poland.
| | | | | |
Collapse
|
24
|
Makowski M, Liwo A, Sobolewski E, Scheraga HA. Simple physics-based analytical formulas for the potentials of mean force of the interaction of amino-acid side chains in water. V. Like-charged side chains. J Phys Chem B 2011; 115:6119-29. [PMID: 21500792 PMCID: PMC3099398 DOI: 10.1021/jp111258p] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
A new model of side-chain-side-chain interactions for charged side-chains of amino acids, to be used in the UNRES force-field, has been developed, in which a side chain consists of a nonpolar and a charged site. The interaction energy between the nonpolar sites is composed of a Gay-Berne and a cavity term; the interaction energy between the charged sites consists of a Lennard-Jones term, a Coulombic term, a generalized-Born term, and a cavity term, while the interaction energy between the nonpolar and charged sites is composed of a Gay-Berne and a polarization term. We parametrized the energy function for the models of all six pairs of natural like-charged amino-acid side chains, namely propionate-propionate (for the aspartic acid-aspartic acid pair), butyrate-butyrate (for the glutamic acid-glutamic acid pair), propionate-butyrate (for the aspartic acid-glutamic acid pair), pentylamine cation-pentylamine cation (for the lysine-lysine pair), 1-butylguanidine cation-1-butylguanidine cation (for the arginine-arginine pair), and pentylamine cation-1-butylguanidine cation (for the lysine-arginine pair). By using umbrella-sampling molecular dynamics simulations in explicit TIP3P water, we determined the potentials of mean force of the above-mentioned pairs as functions of distance and orientation and fitted analytical expressions to them. The positions and depths of the contact minima and the positions and heights of the desolvation maxima, including their dependence on the orientation of the molecules were well represented by analytical expressions for all systems. The values of the parameters of all the energy components are physically reasonable, which justifies use of such potentials in coarse-grain protein-folding simulations.
Collapse
Affiliation(s)
- Mariusz Makowski
- Faculty of Chemistry, University of Gdańsk, Sobieskiego 18, 80-952 Gdańsk, Poland.
| | | | | | | |
Collapse
|
25
|
Han W, Wan CK, Jiang F, Wu YD. PACE Force Field for Protein Simulations. 1. Full Parameterization of Version 1 and Verification. J Chem Theory Comput 2010; 6:3373-89. [DOI: 10.1021/ct1003127] [Citation(s) in RCA: 73] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Affiliation(s)
- Wei Han
- Department of Chemistry, The Hong Kong University of Science & Technology, Clear Water Bay, Kowloon, Hong Kong, China, School of Chemical Biology and Biotechnology, Laboratory of Chemical Genomics, Peking University Shenzhen Graduate School, Shenzhen, China, and College of Chemistry, Peking University, Beijing, China
| | - Cheuk-Kin Wan
- Department of Chemistry, The Hong Kong University of Science & Technology, Clear Water Bay, Kowloon, Hong Kong, China, School of Chemical Biology and Biotechnology, Laboratory of Chemical Genomics, Peking University Shenzhen Graduate School, Shenzhen, China, and College of Chemistry, Peking University, Beijing, China
| | - Fan Jiang
- Department of Chemistry, The Hong Kong University of Science & Technology, Clear Water Bay, Kowloon, Hong Kong, China, School of Chemical Biology and Biotechnology, Laboratory of Chemical Genomics, Peking University Shenzhen Graduate School, Shenzhen, China, and College of Chemistry, Peking University, Beijing, China
| | - Yun-Dong Wu
- Department of Chemistry, The Hong Kong University of Science & Technology, Clear Water Bay, Kowloon, Hong Kong, China, School of Chemical Biology and Biotechnology, Laboratory of Chemical Genomics, Peking University Shenzhen Graduate School, Shenzhen, China, and College of Chemistry, Peking University, Beijing, China
| |
Collapse
|
26
|
Pandey RB, Farmer BL. Globular structure of a human immunodeficiency virus-1 protease (1DIFA dimer) in an effective solvent medium by a Monte Carlo simulation. J Chem Phys 2010; 132:125101. [PMID: 20370150 DOI: 10.1063/1.3358340] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
A coarse-grained model is used to study the structure and dynamics of a human immunodeficiency virus-1 protease (1DIFA dimer) consisting of 198 residues in an effective solvent medium on a cubic lattice by Monte Carlo simulations for a range of interaction strengths. Energy and mobility profiles of residues are found to depend on the interaction strength and exhibit remarkable segmental symmetries in two monomers. Lowest energy residues such as Arg(41) and Arg(140) (most electrostatic and polar) are not the least mobile; despite the higher energy, the hydrophobic residues (Ile, Leu, and Val) are least mobile and form the core by pinning down the local segments for the globular structure. Variations in the gyration radius (R(g)) and energy (E(c)) of the protein show nonmonotonic dependence on the interaction strength with the smallest R(g) around the largest value of E(c). Pinning of the conformations by the hydrophobic residues at high interaction strength seems to provide seed for the protein chain to collapse.
Collapse
Affiliation(s)
- R B Pandey
- Department of Physics and Astronomy, University of Southern Mississippi, Hattiesburg, Mississippi 39406-5046, USA.
| | | |
Collapse
|
27
|
Kozłowska U, Liwo A, Scheraga HA. Determination of side-chain-rotamer and side-chain and backbone virtual-bond-stretching potentials of mean force from AM1 energy surfaces of terminally-blocked amino-acid residues, for coarse-grained simulations of protein structure and folding. I. The method. J Comput Chem 2010; 31:1143-53. [PMID: 20073062 DOI: 10.1002/jcc.21399] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
In this and the accompanying article, we report the development of new physics-based side-chain-rotamer and virtual-bond-deformation potentials which now replace the respective statistical potentials used so far in our physics-based united-reside UNRES force field for large-scale simulations of protein structure and dynamics. In this article, we describe the methodology for determining the corresponding potentials of mean force (PMF's) from the energy surfaces of terminally-blocked amino-acid residues calculated with the AM1 quantum-mechanical semiempirical method. The approach is based on minimization of the AM1 energy for fixed values of the angles lambda for rotation of the peptide groups about the C(alpha)...C(alpha) virtual bonds, and for fixed values of the side-chain dihedral angles chi, which formed a multidimensional grid. A harmonic-approximation approach was developed to extrapolate from the energy at a given grid point to other points of the conformational space to compute the respective contributions to the PMF. To test the applicability of the harmonic approximation, the rotamer PMF's of alanine and valine obtained with this approach have been compared with those obtained by using a Metropolis Monte Carlo method. The PMF surfaces computed with the harmonic approximation are more rugged and have more pronounced minima than the MC-calculated surfaces but the harmonic-approximation- and MC-calculated PMF values are linearly correlated. The potentials derived with the harmonic approximation are, therefore, appropriate for UNRES for which the weights (scaling factors) of the energy terms are determined by force-field optimization for foldability.
Collapse
Affiliation(s)
- Urszula Kozłowska
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, New York 14853-1301, USA.
| | | | | |
Collapse
|
28
|
Májek P, Elber R. A coarse-grained potential for fold recognition and molecular dynamics simulations of proteins. Proteins 2009; 76:822-36. [PMID: 19291741 DOI: 10.1002/prot.22388] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
A coarse-grained potential for protein simulations and fold ranking is presented. The potential is based on a two-point model of individual amino acids and a specific implementation of hydrogen bonding. Parameters are determined for distance dependent pair interactions, pseudo bonds, angles, and torsions. A scaling factor for a hydrogen bonding term is also determined. Iterative sampling for 4867 proteins reproduces distributions of internal coordinates and distances observed in the Protein Data Bank. The adjustment of the potential and resampling are in the spirit of the generalized ensemble approach. No native structure information (e.g., secondary structure) is used in the calculation of the potential or in the simulation of a particular protein. The potential is subject to two tests as follows: (i) simulations of 956 globular proteins in the neighborhood of their native folds (these proteins were not used in the training set) and (ii) discrimination between native and decoy structures for 2470 proteins with 305,000 decoys and the "Decoys 'R' Us" dataset. In the first test, 58% of tested proteins stay within 5 A from the native fold in Molecular Dynamics simulations of more than 20 nanoseconds using the new potential. The potential is also useful in differentiating between correct and approximate folds providing significant signal for structure prediction algorithms. Sampling with the potential consistently regenerates the distribution of distances and internal coordinates it learned. Nevertheless, during Molecular Dynamics simulations structures are found that reproduce the learned distributions but are far from the native fold.
Collapse
Affiliation(s)
- Peter Májek
- Department of Computer Science, Upson Hall 4130, Cornell University, Ithaca, New York 14853-7501, USA
| | | |
Collapse
|
29
|
DeVane R, Shinoda W, Moore PB, Klein ML. A Transferable Coarse Grain Non-bonded Interaction Model For Amino Acids. J Chem Theory Comput 2009; 5:2115-2124. [PMID: 20161179 PMCID: PMC2725330 DOI: 10.1021/ct800441u] [Citation(s) in RCA: 100] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
The large quantity of protein sequences being generated from genomic data has greatly outpaced the throughput of experimental protein structure determining methods and consequently brought urgency to the need for accurate protein structure prediction tools. Reduced resolution, or coarse grained (CG) models, have become a mainstay in computational protein structure prediction perfoming among the best tools available. The quest for high quality generalized CG models presents an extremely challenging yet popular endeavor. To this point, a CG based interaction potential is presented here for the naturally occurring amino acids. In the present approach, three to four heavy atoms and associated hydrogens are condensed into a single CG site. The parameterization of the site-site interaction potential relies on experimental data thus providing a novel approach that is neither based on all-atom (AA) simulations nor experimental protein structural data. Specifically, intermolecular potentials, which are based on Lennard-Jones (LJ) style functional forms, are parameterized using thermodynamic data including surface tension and density. Using this approach, an amino acid potential dataset has been developed for use in modeling peptides and proteins. The potential is evaluated here by comparing the solvent accessible surface area (SASA) to AA representations and ranking of protein decoy data sets provided by Decoys 'R' Us. The model is shown to perform very well compared to other existing prediction models for these properties.
Collapse
Affiliation(s)
- Russell DeVane
- Center for Molecular Modeling and Department of Chemistry, University of Pennsylvania, 231 South 34th Street, Philadelphia, PA 19104-6323
| | - Wataru Shinoda
- Research Institute for Computational Sciences (RICS), National Institute of Advanced Industrial Science and Technology (AIST), Tsukuba Central 2, Umezono 1-1-1, Tsukuba, Ibaraki 305-8568, Japan
| | - Preston B. Moore
- Department of Chemistry and Biochemistry, University of the Sciences in Philadelphia, 600 s. 43rd Street, Philadelphia, PA 19104
| | - Michael L. Klein
- Center for Molecular Modeling and Department of Chemistry, University of Pennsylvania, 231 South 34th Street, Philadelphia, PA 19104-6323
| |
Collapse
|
30
|
Pandey RB, Farmer BL. Residue energy and mobility in sequence to global structure and dynamics of a HIV-1 protease (1DIFA) by a coarse-grained Monte Carlo simulation. J Chem Phys 2009; 130:044906. [PMID: 19191412 DOI: 10.1063/1.3050106] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Energy, mobility, and structural profiles of residues in a specific sequence of human immunodeficiency virus (HIV)-1 protease chain and its global conformation and dynamics are studied by a coarse-grained computer simulation model on a cubic lattice. HIV-1 protease is described by a chain of 99 residues (nodes) in a specific sequence (1DIFA) with N- and C-terminals on the lattice, where empty lattice sites represent an effective solvent medium. Internal structures of the residues are ignored but their specificities are captured via an interaction (epsilon(ij)) matrix (residue-residue, residue-solvent) of the coefficient (fepsilon(ij)) of the Lennard-Jones potential. Simulations are performed for a range of interaction strength (f) with the solvent-residue interaction describing the quality of the solvent. Snapshots of the protein show considerable changes in the conformation of the protein on varying the interaction. From the mobility and energy profiles of the residues, it is possible to identify the active (and not so active) segments of the protein and consequently their role in proteolysis. Contrary to interaction thermodynamics, the hydrophobic residues possess higher configurational energy and lower mobility while the electrostatic and polar residues are more mobile despite their lower interaction energy. Segments of hydrophobic core residues, crucial for the structural evolution of the protein are identified-some of which are consistent with recent molecular dynamics simulation in context to possible clinical observations. Global energy and radius of gyration of the protein exhibit nonmonotonic dependence on the interaction strength (f) with opposite trends, e.g., rapid transition into globular structure with higher energy. Variations of the rms displacement of the protein and that of a tracer residue, Gly(49), with the time steps show how they slow down on increasing the interaction strength.
Collapse
Affiliation(s)
- R B Pandey
- Department of Physics and Astronomy, University of Southern Mississippi, Hattiesburg, Mississippi 39406-5046, USA.
| | | |
Collapse
|
31
|
Al-Mekhnaqi AM, Mayeed MS, Newaz GM. Prediction of protein conformation in water and on surfaces by Monte Carlo simulations using united-atom method. MOLECULAR SIMULATION 2009. [DOI: 10.1080/08927020802468364] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
32
|
|
33
|
Pandey RB, Heinz H, Feng J, Farmer BL, Slocik JM, Drummy LF, Naik RR. Adsorption of peptides (A3, Flg, Pd2, Pd4) on gold and palladium surfaces by a coarse-grained Monte Carlo simulation. Phys Chem Chem Phys 2009; 11:1989-2001. [DOI: 10.1039/b816187a] [Citation(s) in RCA: 86] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
34
|
Makowski M, Sobolewski E, Czaplewski C, Ołdziej S, Liwo A, Scheraga HA. Simple physics-based analytical formulas for the potentials of mean force for the interaction of amino acid side chains in water. IV. Pairs of different hydrophobic side chains. J Phys Chem B 2008; 112:11385-95. [PMID: 18700740 DOI: 10.1021/jp803896b] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The potentials of mean force of 21 heterodimers of the molecules modeling hydrophobic amino acid side chains: ethane (for alanine), propane (for proline), isobutane (for valine), isopentane (for leucine and isoleucine), ethylbenzene (for phenylalanine), methyl propyl sulfide (for methionine), and indole (for tryptophane) were determined by umbrella-sampling molecular dynamics simulations in explicit water as functions of distance and orientation. Analytical expressions consisting of the Gay-Berne term to represent effective van der Waals interactions and the cavity term proposed in our earlier work were fitted to the potentials of mean force. The positions and depths of the contact minima and the positions and heights of the desolvation maxima, including their dependence on the orientation of the molecules, are well represented by the analytical expressions for all systems; large deviations between the MD-determined PMF and the analytical approximations are observed for pairs involving the least spheroidal solutes: ethylbenzene, indole, and methyl propyl sulfide at short distances at which the PMF is high and, consequently, these regions are rarely visited. When data from the PMF within only 10 kcal/mol above the global minimum are considered, the standard deviation between the MD-determined and the fitted PMF is from 0.25 to 0.55 kcal/mol (the relative standard deviation being from 4% to 8%); it is larger for pairs involving nonspherical solute molecules. The free energies of contact computed from the PMF surfaces are well correlated with those determined from protein-crystal data with a slope close to that relating the free energies of transfer of amino acids (from water to n-octanol) to the average contact free energies determined from protein-crystal data. These observations justify future use of the determined potentials in coarse-grained protein-folding simulations.
Collapse
Affiliation(s)
- Mariusz Makowski
- Faculty of Chemistry, University of Gdansk, Sobieskiego 18, 80-952 Gdansk, Poland
| | | | | | | | | | | |
Collapse
|
35
|
Shen H, Czaplewski C, Liwo A, Scheraga HA. Implementation of a Serial Replica Exchange Method in a Physics-Based United-Residue (UNRES) Force Field. J Chem Theory Comput 2008; 4:1386-1400. [PMID: 20011673 DOI: 10.1021/ct800063d] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The kinetic-trapping problem in simulating protein folding can be overcome by using a Replica Exchange Method (REM). However, in implementing REM in molecular dynamics simulations, synchronization between processors on parallel computers is required, and communication between processors limits its ability to sample conformational space in a complex system efficiently. To minimize communication between processors during the simulation, a Serial Replica Exchange Method (SREM) has been proposed recently by Hagan et al. (J. Phys. Chem. B2007, 111, 1416-1423). Here, we report the implementation of this new SREM algorithm with our physics-based united-residue (UNRES) force field. The method has been tested on the protein 1E0L with a temperature-independent UNRES force field and on terminally blocked deca-alanine (Ala(10)) and 1GAB with the recently introduced temperature-dependent UNRES force field. With the temperature-independent force field, SREM reproduces the results of REM but is more efficient in terms of wall-clock time and scales better on distributed-memory machines. However, exact application of SREM to the temperature-dependent UNRES algorithm requires the determination of a four-dimensional distribution of UNRES energy components instead of a one-dimensional energy distribution for each temperature, which is prohibitively expensive. Hence, we assumed that the temperature dependence of the force field can be ignored for neighboring temperatures. This version of SREM worked for Ala(10) which is a simple system but failed to reproduce the thermodynamic results as well as regular REM on the more complex 1GAB protein. Hence, SREM can be applied to the temperature-independent but not to the temperature-dependent UNRES force field.
Collapse
Affiliation(s)
- Hujun Shen
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, New York 14853
| | | | | | | |
Collapse
|
36
|
Pandey RB, Farmer BL. Conformation of a coarse-grained protein chain (an aspartic acid protease) model in effective solvent by a bond-fluctuating Monte Carlo simulation. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2008; 77:031902. [PMID: 18517417 DOI: 10.1103/physreve.77.031902] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/12/2007] [Indexed: 05/26/2023]
Abstract
In a coarse-grained description of a protein chain, all of the 20 amino acid residues can be broadly divided into three groups: Hydrophobic (H) , polar (P) , and electrostatic (E) . A protein can be described by nodes tethered in a chain with a node representing an amino acid group. Aspartic acid protease consists of 99 residues in a well-defined sequence of H , P , and E nodes tethered together by fluctuating bonds. The protein chain is placed on a cubic lattice where empty lattice sites constitute an effective solvent medium. The amino groups (nodes) interact with the solvent (S) sites with appropriate attractive (PS) and repulsive (HS) interactions with the solvent and execute their stochastic movement with the Metropolis algorithm. Variations of the root mean square displacements of the center of mass and that of its center node of the protease chain and its gyration radius with the time steps are examined for different solvent strength. The structure of the protease swells on increasing the solvent interaction strength which tends to enhance the relaxation time to reach the diffusive behavior of the chain. Equilibrium radius of gyration increases linearly on increasing the solvent strength: A slow rate of increase in weak solvent regime is followed by a faster swelling in stronger solvent. Variation of the gyration radius with the time steps suggests that the protein chain moves via contraction and expansion in a somewhat quasiperiodic pattern particularly in strong solvent.
Collapse
Affiliation(s)
- R B Pandey
- Department of Physics and Astronomy, University of Southern Mississippi, Hattiesburg, Mississippi 39406-5046, USA
| | | |
Collapse
|
37
|
Heath AP, Kavraki LE, Clementi C. From coarse-grain to all-atom: Toward multiscale analysis of protein landscapes. Proteins 2007; 68:646-61. [PMID: 17523187 DOI: 10.1002/prot.21371] [Citation(s) in RCA: 101] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Multiscale methods are becoming increasingly promising as a way to characterize the dynamics of large protein systems on biologically relevant time-scales. The underlying assumption in multiscale simulations is that it is possible to move reliably between different resolutions. We present a method that efficiently generates realistic all-atom protein structures starting from the C(alpha) atom positions, as obtained for instance from extensive coarse-grain simulations. The method, a reconstruction algorithm for coarse-grain structures (RACOGS), is validated by reconstructing ensembles of coarse-grain structures obtained during folding simulations of the proteins src-SH3 and S6. The results show that RACOGS consistently produces low energy, all-atom structures. A comparison of the free energy landscapes calculated using the coarse-grain structures versus the all-atom structures shows good correspondence and little distortion in the protein folding landscape.
Collapse
Affiliation(s)
- Allison P Heath
- Department of Computer Science, Rice University, Houston, Texas 77005, USA
| | | | | |
Collapse
|
38
|
Makowski M, Liwo A, Maksimiak K, Makowska J, Scheraga HA. Simple Physics-Based Analytical Formulas for the Potentials of Mean Force for the Interaction of Amino Acid Side Chains in Water. 2. Tests with Simple Spherical Systems. J Phys Chem B 2007; 111:2917-24. [PMID: 17388417 DOI: 10.1021/jp065917k] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Simple analytical functions consisting of electrostatic, polarization, Lennard-Jones or modified Lennard-Jones, and cavity terms are proposed to express the potentials of mean force analytically for spherical particles interacting in water. The cavity term was expressed either through the molecular-surface area of the solute or by using the Gaussian-overlap model of hydrophobic hydration developed in paper 1 of this series. The analytical expressions were fitted to the potentials of mean force of a methane homodimer, heterodimers composed of a methane molecule, and an ammonium cation or a chloride anion, respectively, and dimers consisting of two chloride anions, two ammonium cations, or a chloride ion and an ammonium cation. The potentials of mean force for these dimers were determined by umbrella-sampling molecular dynamics simulations with the AMBER 7.0 force field with TIP3P water either in our earlier work or in this work. For all systems, the analytical formulas fitted the potentials of mean force very well. However, using the molecular-surface area to express the cavity term provided a good fit only when the nonbonded interactions were expressed by an all-repulsive modified Lennard-Jones potential but also resulted in non-physical values of some of the parameters. Conversely, the use of our new Gaussian-overlap-based expression for the cavity term provided a good fit to the potentials of mean force (PMFs) with Lennard-Jones nonbonded potential, and the values of all parameters were physically reasonable.
Collapse
Affiliation(s)
- Mariusz Makowski
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, New York 14853-1301, USA
| | | | | | | | | |
Collapse
|
39
|
Makowski M, Liwo A, Scheraga HA. Simple Physics-Based Analytical Formulas for the Potentials of Mean Force for the Interaction of Amino Acid Side Chains in Water. 1. Approximate Expression for the Free Energy of Hydrophobic Association Based on a Gaussian-Overlap Model. J Phys Chem B 2007; 111:2910-6. [PMID: 17388416 DOI: 10.1021/jp065916s] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
A physics-based model is proposed to derive approximate analytical expressions for the cavity component of the free energy of hydrophobic association of spherical and spheroidal solutes in water. The model is based on the difference between the number and context of the water molecules in the hydration sphere of a hydrophobic dimer and of two isolated hydrophobic solutes. It is assumed that the water molecules touching the convex part of the molecular surface of the dimer and those in the hydration spheres of the monomers contribute equally to the free energy of solvation, and those touching the saddle part of the molecular surface of the dimer result in a more pronounced increase in free energy because of their more restricted mobility (entropy loss) and fewer favorable electrostatic interactions with other water molecules. The density of water in the hydration sphere around a single solute particle is approximated by the derivative of a Gaussian centered on the solute molecule with respect to its standard deviation. On the basis of this approximation, the number of water molecules in different parts of the hydration sphere of the dimer is expressed in terms of the first and the second mixed derivatives of the two Gaussians centered on the first and second solute molecules, respectively, with respect to the standard deviations of these Gaussians, and plausible analytical expressions for the cavity component of the hydrophobic-association energy of spherical and spheroidal solutes are introduced. As opposed to earlier hydration-shell models, our expressions reproduce the desolvation maxima in the potentials of mean force of pairs of nonpolar solutes in water, and their advantage over the models based on molecular-surface area is that they have continuous gradients in the coordinates of solute centers.
Collapse
Affiliation(s)
- Mariusz Makowski
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, New York 14853-1301, USA
| | | | | |
Collapse
|
40
|
Makowski M, Sobolewski E, Czaplewski C, Liwo A, Ołdziej S, No JH, Scheraga HA. Simple physics-based analytical formulas for the potentials of mean force for the interaction of amino acid side chains in water. 3. Calculation and parameterization of the potentials of mean force of pairs of identical hydrophobic side chains. J Phys Chem B 2007; 111:2925-31. [PMID: 17388418 DOI: 10.1021/jp065918c] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The potentials of mean force of homodimers of the molecules modeling hydrophobic amino acid side chains (ethane (for alanine), propane (for proline), isobutane (for valine), isopentane (for leucine and isoleucine), ethylbenzene (for phenylalanine), and methyl propyl sulfide (for methionine)) were determined by umbrella-sampling molecular dynamics simulations in explicit water as functions of distance and orientation. Analytical expressions consisting of the Gay-Berne term to represent effective van der Waals interactions and the cavity term derived in paper 1 of this series were fitted to the potentials of mean force. The positions and depths of the contact minima and the positions and heights of the desolvation maxima, including their dependence on the orientation of the molecules, were well represented by the analytical expressions for all systems, which justifies use of such potentials in coarse-grain protein-folding simulations.
Collapse
Affiliation(s)
- Mariusz Makowski
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, New York 14853-1301, USA
| | | | | | | | | | | | | |
Collapse
|
41
|
Liwo A, Khalili M, Czaplewski C, Kalinowski S, Ołdziej S, Wachucik K, Scheraga HA. Modification and optimization of the united-residue (UNRES) potential energy function for canonical simulations. I. Temperature dependence of the effective energy function and tests of the optimization method with single training proteins. J Phys Chem B 2007; 111:260-85. [PMID: 17201450 PMCID: PMC3236617 DOI: 10.1021/jp065380a] [Citation(s) in RCA: 159] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
We report the modification and parametrization of the united-residue (UNRES) force field for energy-based protein structure prediction and protein folding simulations. We tested the approach on three training proteins separately: 1E0L (beta), 1GAB (alpha), and 1E0G (alpha + beta). Heretofore, the UNRES force field had been designed and parametrized to locate native-like structures of proteins as global minima of their effective potential energy surfaces, which largely neglected the conformational entropy because decoys composed of only lowest-energy conformations were used to optimize the force field. Recently, we developed a mesoscopic dynamics procedure for UNRES and applied it with success to simulate protein folding pathways. However, the force field turned out to be largely biased toward -helical structures in canonical simulations because the conformational entropy had been neglected in the parametrization. We applied the hierarchical optimization method, developed in our earlier work, to optimize the force field; in this method, the conformational space of a training protein is divided into levels, each corresponding to a certain degree of native-likeness. The levels are ordered according to increasing native-likeness; level 0 corresponds to structures with no native-like elements, and the highest level corresponds to the fully native-like structures. The aim of optimization is to achieve the order of the free energies of levels, decreasing as their native-likeness increases. The procedure is iterative, and decoys of the training protein(s) generated with the energy function parameters of the preceding iteration are used to optimize the force field in a current iteration. We applied the multiplexing replica-exchange molecular dynamics (MREMD) method, recently implemented in UNRES, to generate decoys; with this modification, conformational entropy is taken into account. Moreover, we optimized the free-energy gaps between levels at temperatures corresponding to a predominance of folded or unfolded structures, as well as to structures at the putative folding-transition temperature, changing the sign of the gaps at the transition temperature. This enabled us to obtain force fields characterized by a single peak in the heat capacity at the transition temperature. Furthermore, we introduced temperature dependence to the UNRES force field; this is consistent with the fact that it is a free-energy and not a potential energy function. beta
Collapse
Affiliation(s)
- Adam Liwo
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, N.Y., 14853-1301, U.S.A
| | - Mey Khalili
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, N.Y., 14853-1301, U.S.A
| | - Cezary Czaplewski
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, N.Y., 14853-1301, U.S.A
| | - Sebastian Kalinowski
- Faculty of Chemistry, University of Gdańsk, Sobieskiego 18, 80-952 Gdańsk, Poland
| | - Stanisław Ołdziej
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, N.Y., 14853-1301, U.S.A
| | - Katarzyna Wachucik
- Faculty of Chemistry, University of Gdańsk, Sobieskiego 18, 80-952 Gdańsk, Poland
| | - Harold A. Scheraga
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, N.Y., 14853-1301, U.S.A
| |
Collapse
|
42
|
Maupetit J, Gautier R, Tufféry P. SABBAC: online Structural Alphabet-based protein BackBone reconstruction from Alpha-Carbon trace. Nucleic Acids Res 2006; 34:W147-51. [PMID: 16844979 PMCID: PMC1538914 DOI: 10.1093/nar/gkl289] [Citation(s) in RCA: 65] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
SABBAC is an on-line service devoted to protein backbone reconstruction from alpha-carbon trace. It is based on the assembly of fragments taken from a library of reduced size, selected from the encoding of the protein trace in a hidden Markov model-derived structural alphabet. The assembly of the fragments is achieved by a greedy algorithm, using an energy-based scoring. Alpha-carbon coordinates remain unaffected. SABBAC simply positions the missing backbone atoms, no further refinement is performed. From our tests, SABBAC performs equal or better than other similar on-line approach and is robust to deviations on the alpha-carbon coordinates. It can be accessed at http://bioserv.rpbs.jussieu.fr/SABBAC.html.
Collapse
Affiliation(s)
| | - R. Gautier
- Institut de Pharmacologie Moléculaire et Cellulaire, UMR 6097 CNRS/UNSA660 route des Lucioles, 06560 Sophia Antipolis, France
| | - Pierre Tufféry
- To whom correspondence should be addressed. Tel: 331 44 27 77 33; Fax: 331 43 26 38 30;
| |
Collapse
|
43
|
Nanias M, Czaplewski C, Scheraga HA. Replica Exchange and Multicanonical Algorithms with the coarse-grained UNRES force field. J Chem Theory Comput 2006; 2:513-528. [PMID: 18797518 DOI: 10.1021/ct050253o] [Citation(s) in RCA: 60] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Three algorithms, namely a Replica Exchange method (REM), a Replica Exchange Multicanonical method (REMUCA), and Replica Exchange Multicanonical with Replica Exchange (REMUCAREM), were implemented with the coarse-grained united-residue force field (UNRES) in both Monte Carlo and Molecular Dynamics versions. The MD algorithms use the constant-temperature Berendsen thermostat, with the velocity Verlet algorithm and variable time step. The algorithms were applied to one peptide (20 residues of Alanine with free ends; ala(20)) and two small proteins, namely an α-helical protein of 46 residues (the B-domain of the staphylococal protein A; 1BDD), and an α+β-protein of 48 residues (the E. Coli Mltd Lysm Domain; 1E0G). Calculated thermodynamic averages, such as canonical average energy and heat capacity, are in good agreement among all simulations for poly-L-alanine, showing that the algorithms were implemented correctly, and that all three algorithms are equally effective for small systems. For protein A, all algorithms performed reasonably well, although some variability in the calculated results was observed whereas, for a more complicated α+β-protein (1E0G), only Replica Exchange was capable of producing reliable statistics for calculating thermodynamic quantities. Finally, from the Replica Exchange molecular dynamics results, we calculated free energy maps as functions of RMSD and radius of gyration for different temperatures. The free energy calculations show correct folding behavior for poly-L-alanine and protein A while, for 1E0G, the native structure had the lowest free energy only at very low temperatures. Hence, the entropy contribution for 1E0G is larger than that for protein A at the same temperature. A larger contribution from entropy means that there are more accessible conformations at a given temperature, making it more difficult to obtain an efficient coverage of conformational space to obtain reliable thermodynamic properties. At the same temperature, ala(20) has the smallest entropy contribution, followed by protein A, and then by 1E0G.
Collapse
Affiliation(s)
- Marian Nanias
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, NY 14853-1301, U.S.A
| | | | | |
Collapse
|
44
|
Ołdziej S, Czaplewski C, Liwo A, Chinchio M, Nanias M, Vila JA, Khalili M, Arnautova YA, Jagielska A, Makowski M, Schafroth HD, Kaźmierkiewicz R, Ripoll DR, Pillardy J, Saunders JA, Kang YK, Gibson KD, Scheraga HA. Physics-based protein-structure prediction using a hierarchical protocol based on the UNRES force field: assessment in two blind tests. Proc Natl Acad Sci U S A 2005; 102:7547-52. [PMID: 15894609 PMCID: PMC1140449 DOI: 10.1073/pnas.0502655102] [Citation(s) in RCA: 112] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Recent improvements in the protein-structure prediction method developed in our laboratory, based on the thermodynamic hypothesis, are described. The conformational space is searched extensively at the united-residue level by using our physics-based UNRES energy function and the conformational space annealing method of global optimization. The lowest-energy coarse-grained structures are then converted to an all-atom representation and energy-minimized with the ECEPP/3 force field. The procedure was assessed in two recent blind tests of protein-structure prediction. During the first blind test, we predicted large fragments of alpha and alpha+beta proteins [60-70 residues with C(alpha) rms deviation (rmsd) <6 A]. However, for alpha+beta proteins, significant topological errors occurred despite low rmsd values. In the second exercise, we predicted whole structures of five proteins (two alpha and three alpha+beta, with sizes of 53-235 residues) with remarkably good accuracy. In particular, for the genomic target TM0487 (a 102-residue alpha+beta protein from Thermotoga maritima), we predicted the complete, topologically correct structure with 7.3-A C(alpha) rmsd. So far this protein is the largest alpha+beta protein predicted based solely on the amino acid sequence and a physics-based potential-energy function and search procedure. For target T0198, a phosphate transport system regulator PhoU from T. maritima (a 235-residue mainly alpha-helical protein), we predicted the topology of the whole six-helix bundle correctly within 8 A rmsd, except the 32 C-terminal residues, most of which form a beta-hairpin. These and other examples described in this work demonstrate significant progress in physics-based protein-structure prediction.
Collapse
Affiliation(s)
- S Ołdziej
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, NY 14853-1301, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
45
|
Nanias M, Chinchio M, Ołdziej S, Czaplewski C, Scheraga HA. Protein structure prediction with the UNRES force-field using Replica-Exchange Monte Carlo-with-Minimization; Comparison with MCM, CSA, and CFMC. J Comput Chem 2005; 26:1472-86. [PMID: 16088925 DOI: 10.1002/jcc.20286] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Two current methods of global optimization are coupled to produce the Replica-Exchange method together with Monte Carlo-with-Minimization (REMCM). Its performance is compared with each separate component and with other global optimization techniques. REMCM was applied to search the conformational space of coarse grain protein systems described by the UNRES force field. The method consists of several noninteracting copies of Monte Carlo simulation, and minimization was used after every perturbation to enhance the sampling of low-energy conformations. REMCM was applied to five proteins of different topology, and the results were compared to those from other optimization methods, namely Monte Carlo-with-Minimization (MCM), Conformational Space Annealing (CSA), and Conformational Family Monte Carlo (CFMC). REMCM located global minima for four proteins faster and more consistently than either MCM or CFMC, and it converged faster than CSA for three of the five proteins tested. A performance comparison was also carried out between REMCM and the traditional Replica Exchange method (REM) for one protein, with REMCM showing a significant improvement. Moreover, because of its simplicity, REMCM was easy to implement, thereby offering an alternative to other global optimization methods used in protein structure prediction.
Collapse
Affiliation(s)
- Marian Nanias
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, NY 14853-1301, USA
| | | | | | | | | |
Collapse
|
46
|
Bostick DL, Shen M, Vaisman II. A simple topological representation of protein structure: implications for new, fast, and robust structural classification. Proteins 2004; 56:487-501. [PMID: 15229882 DOI: 10.1002/prot.20146] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
A topological representation of proteins is developed that makes use of two metrics: the Euclidean metric for identifying natural nearest neighboring residues via the Delaunay tessellation in Cartesian space and the distance between residues in sequence space. Using this representation, we introduce a quantitative and computationally inexpensive method for the comparison of protein structural topology. The method ultimately results in a numerical score quantifying the distance between proteins in a heuristically defined topological space. The properties of this scoring scheme are investigated and correlated with the standard Calpha distance root-mean-square deviation measure of protein similarity calculated by rigid body structural alignment. The topological comparison method is shown to have a characteristic dependence on protein conformational differences and secondary structure. This distinctive behavior is also observed in the comparison of proteins within families of structural relatives. The ability of the comparison method to successfully classify proteins into classes, superfamilies, folds, and families that are consistent with standard classification methods, both automated and human-driven, is demonstrated. Furthermore, it is shown that the scoring method allows for a fine-grained classification on the family, protein, and species level that agrees very well with currently established phylogenetic hierarchies. This fine classification is achieved without requiring visual inspection of proteins, sequence analysis, or the use of structural superimposition methods. Implications of the method for a fast, automated, topological hierarchical classification of proteins are discussed.
Collapse
Affiliation(s)
- David L Bostick
- Department of Physics and Program in Molecular/Cell Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | | | | |
Collapse
|
47
|
Czaplewski C, Liwo A, Pillardy J, Ołdziej S, Scheraga HA. Improved conformational space annealing method to treat β-structure with the UNRES force-field and to enhance scalability of parallel implementation. POLYMER 2004. [DOI: 10.1016/j.polymer.2003.10.081] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
48
|
Kaźmierkiewicz R, Liwo A, Scheraga HA. Addition of side chains to a known backbone with defined side-chain centroids. Biophys Chem 2003; 100:261-80. [PMID: 12646370 DOI: 10.1016/s0301-4622(02)00285-5] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
An automatic procedure is proposed for adding side chains to a protein backbone; it is based on optimization of a simplified energy function for peptide side chains, given its backbone and positions of side-chain centroids. The energy is expressed as a sum of the energies of interaction between side chains, and a harmonic penalty function accounting for the preservation of the positions of the C(alpha) atoms and the side-chain centroids. The energy of side-chain interactions is calculated with the soft-sphere ECEPP/3 potential. A Monte Carlo search is carried out to explore all possible side-chain orientations within a fixed backbone and side-chain centroid positions. The initial, usually extended, side-chain conformations are taken directly from the ECEPP/3 database. The procedure was tested on six experimental (X-ray or NMR) structures: immunoglobulin binding protein (PDB code 1IGD, an alpha+beta-protein); transcription factor PML (PDB code 1BOR, a 49-104 fragment of the ring finger domain, predominantly beta-protein); bovine pancreatic trypsin inhibitor (crystal form II) (PDB code 1BPI, an alpha+beta-protein); the monomer of human deoxyhemoglobin (PDB code 1BZ0, an alpha-helical structure); chain A of alcohol dehydrogenase from Drosophila lebanonensis (PDB code 1A4U); as well as on the 10-55 portion of the B domain of staphylococcal protein A (PDB code 1BDD). In all cases except 1BPI, the data for the algorithm (i.e. the backbone or C(alpha) coordinates and the positions of side-chain centroids) were taken from the experimental structures. For protein A, the C(alpha) coordinates and positions of side-chain centroids were also taken from the 1.9-A-resolution model predicted by the UNRES force field. In all comparisons with experimental structures, complete side-chain geometry was reconstructed with a root-mean-square (RMS) deviation of approximately 0.6-0.9 A from the heavy atoms when complete backbone and side-chain-centroid coordinates were used in reconstruction, or approximately 1.0 A when the C(alpha) and centroid coordinates were used.
Collapse
Affiliation(s)
- Rajmund Kaźmierkiewicz
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, NY 14853-1301, USA
| | | | | |
Collapse
|
49
|
Juranić N, Macura S, Prendergast FG. H-bonding mediates polarization of peptide groups in folded proteins. Protein Sci 2003; 12:2633-6. [PMID: 14573874 PMCID: PMC2366962 DOI: 10.1110/ps.03127103] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2003] [Revised: 06/04/2003] [Accepted: 06/18/2003] [Indexed: 10/26/2022]
Abstract
The carbon-nitrogen J-couplings in the hydrogen bonding chains of proteins show that H-bonding mediates peptide-group polarization, which results in the general reduction of peptide-group polarity of folded proteins in solution. The net effect is to make large regions of protein secondary structure, especially beta-sheets, intrinsically more hydrophobic, contributing thereby to overall stability of the tertiary structure.
Collapse
Affiliation(s)
- Nenad Juranić
- Department of Biochemistry and Molecular Biology, Mayo Graduate School, Mayo Clinic and Foundation, Rochester, Minnesota 55905, USA.
| | | | | |
Collapse
|
50
|
DePristo MA, De Bakker PIW, Shetty RP, Blundell TL. Discrete restraint-based protein modeling and the Calpha-trace problem. Protein Sci 2003; 12:2032-46. [PMID: 12931001 PMCID: PMC2323999 DOI: 10.1110/ps.0386903] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
We present a novel de novo method to generate protein models from sparse, discretized restraints on the conformation of the main chain and side chain atoms. We focus on Calpha-trace generation, the problem of constructing an accurate and complete model from approximate knowledge of the positions of the Calpha atoms and, in some cases, the side chain centroids. Spatial restraints on the Calpha atoms and side chain centroids are supplemented by constraints on main chain geometry, phi/xi angles, rotameric side chain conformations, and inter-atomic separations derived from analyses of known protein structures. A novel conformational search algorithm, combining features of tree-search and genetic algorithms, generates models consistent with these restraints by propensity-weighted dihedral angle sampling. Models with ideal geometry, good phi/xi angles, and no inter-atomic overlaps are produced with 0.8 A main chain and, with side chain centroid restraints, 1.0 A all-atom root-mean-square deviation (RMSD) from the crystal structure over a diverse set of target proteins. The mean model derived from 50 independently generated models is closer to the crystal structure than any individual model, with 0.5 A main chain RMSD under only Calpha restraints and 0.7 A all-atom RMSD under both Calpha and centroid restraints. The method is insensitive to randomly distributed errors of up to 4 A in the Calpha restraints. The conformational search algorithm is efficient, with computational cost increasing linearly with protein size. Issues relating to decoy set generation, experimental structure determination, efficiency of conformational sampling, and homology modeling are discussed.
Collapse
Affiliation(s)
- Mark A DePristo
- Department of Biochemistry, University of Cambridge, Cambridge CB2 1GA, England.
| | | | | | | |
Collapse
|