1
|
Shimono Y, Hakamada M, Mabuchi M. NPEX: Never give up protein exploration with deep reinforcement learning. J Mol Graph Model 2024; 131:108802. [PMID: 38838617 DOI: 10.1016/j.jmgm.2024.108802] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2024] [Revised: 05/05/2024] [Accepted: 05/24/2024] [Indexed: 06/07/2024]
Abstract
Elucidating unknown structures of proteins, such as metastable states, is critical in designing therapeutic agents. Protein structure exploration has been performed using advanced computational methods, especially molecular dynamics and Markov chain Monte Carlo simulations, which require untenably long calculation times and prior structural knowledge. Here, we developed an innovative method for protein structure determination called never give up protein exploration (NPEX) with deep reinforcement learning. The NPEX method leverages the soft actor-critic algorithm and the intrinsic reward system, effectively adding a bias potential without the need for prior knowledge. To demonstrate the method's effectiveness, we applied it to four models: a double well, a triple well, the alanine dipeptide, and the tryptophan cage. Compared with Markov chain Monte Carlo simulations, NPEX had markedly greater sampling efficiency. The significantly enhanced computational efficiency and lack of prior domain knowledge requirements of the NPEX method will revolutionize protein structure exploration.
Collapse
Affiliation(s)
- Yuta Shimono
- Graduate School of Energy Science, Kyoto University, Yoshidahonmachi, Sakyo-ku, Kyoto, 606-8501, Japan
| | - Masataka Hakamada
- Graduate School of Energy Science, Kyoto University, Yoshidahonmachi, Sakyo-ku, Kyoto, 606-8501, Japan.
| | - Mamoru Mabuchi
- Graduate School of Energy Science, Kyoto University, Yoshidahonmachi, Sakyo-ku, Kyoto, 606-8501, Japan
| |
Collapse
|
2
|
Nicy, Morgan JWR, Wales DJ. Energy landscapes for clusters of hexapeptides. J Chem Phys 2024; 161:054112. [PMID: 39092941 DOI: 10.1063/5.0220652] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2024] [Accepted: 07/17/2024] [Indexed: 08/04/2024] Open
Abstract
We present the results for energy landscapes of hexapeptides obtained using interfaces to the Large-scale Atomic/Molecular Massively Parallel Simulator (LAMMPS) program. We have used basin-hopping global optimization and discrete path sampling to explore the landscapes of hexapeptide monomers, dimers, and oligomers containing 10, 100, and 200 monomers modeled using a residue-level coarse-grained potential, Mpipi, implemented in LAMMPS. We find that the dimers of peptides containing amino acid residues that are better at promoting phase separation, such as tyrosine and arginine, have melting peaks at higher temperature in their heat capacity compared to phenylalanine and lysine, respectively. This observation correlates with previous work on the same uncapped hexapeptide monomers modeled using atomistic potential. For oligomers, we compare the variation in monomer conformations with radial distance and observe trends for selected angles calculated for each monomer. The LAMMPS interfaces to the GMIN and OPTIM programs for landscape exploration offer new opportunities to investigate larger systems and provide access to the coarse-grained potentials implemented within LAMMPS.
Collapse
Affiliation(s)
- Nicy
- Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - John W R Morgan
- Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - David J Wales
- Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| |
Collapse
|
3
|
Hooten M, Banerjee A, Dutt M. Multiscale, Multiresolution Coarse-Grained Model via a Hybrid Approach: Solvation, Structure, and Self-Assembly of Aromatic Tripeptides. J Chem Theory Comput 2024; 20:1689-1703. [PMID: 37931005 DOI: 10.1021/acs.jctc.3c00458] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2023]
Abstract
Short aromatic peptides have been observed to assemble into diverse nanostructures, including fibers, tubes, and vesicles, using computational techniques. However, the computational studies have employed top-down coarse-grained (CG) models, which are unable to capture the assembly along with the conformation, packing, and organization of the peptides within the aggregates in a manner that is consistent with the all atom (AA) representation of the molecules. In this study, a hybrid structure- and force-based approach is adapted to develop a bottom-up CG force field of triphenylalanine using reference data from AA trajectories. This approach follows a flexible methodology to approximate the chemical complexity of the underlying AA representation with the chosen CG representation. Two CG models are developed with distinct representations of the aromatic side chains. The first uses a simple single-bead representation, while the second uses a three-bead representation to more accurately represent the planarity of the ring. The one-bead model yields nanorods, while the three-bead model results in nanospheres. The role of different chemical groups in the assembly of nanostructures is identified, along with the importance of steric effects on the packing of the peptides within assemblies.
Collapse
Affiliation(s)
- Mason Hooten
- Department of Biomedical Engineering, Rutgers, The State University of New Jersey, Piscataway, New Jersey 08854, United States
| | - Akash Banerjee
- Department of Chemical and Biochemical Engineering, Rutgers, The State University of New Jersey, Piscataway, New Jersey 08854, United States
| | - Meenakshi Dutt
- Department of Chemical and Biochemical Engineering, Rutgers, The State University of New Jersey, Piscataway, New Jersey 08854, United States
| |
Collapse
|
4
|
Majewski M, Pérez A, Thölke P, Doerr S, Charron NE, Giorgino T, Husic BE, Clementi C, Noé F, De Fabritiis G. Machine learning coarse-grained potentials of protein thermodynamics. Nat Commun 2023; 14:5739. [PMID: 37714883 PMCID: PMC10504246 DOI: 10.1038/s41467-023-41343-1] [Citation(s) in RCA: 19] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2023] [Accepted: 08/29/2023] [Indexed: 09/17/2023] Open
Abstract
A generalized understanding of protein dynamics is an unsolved scientific problem, the solution of which is critical to the interpretation of the structure-function relationships that govern essential biological processes. Here, we approach this problem by constructing coarse-grained molecular potentials based on artificial neural networks and grounded in statistical mechanics. For training, we build a unique dataset of unbiased all-atom molecular dynamics simulations of approximately 9 ms for twelve different proteins with multiple secondary structure arrangements. The coarse-grained models are capable of accelerating the dynamics by more than three orders of magnitude while preserving the thermodynamics of the systems. Coarse-grained simulations identify relevant structural states in the ensemble with comparable energetics to the all-atom systems. Furthermore, we show that a single coarse-grained potential can integrate all twelve proteins and can capture experimental structural features of mutated proteins. These results indicate that machine learning coarse-grained potentials could provide a feasible approach to simulate and understand protein dynamics.
Collapse
Affiliation(s)
- Maciej Majewski
- Computational Science Laboratory, Universitat Pompeu Fabra, Barcelona Biomedical Research Park (PRBB), Carrer Dr. Aiguader 88, 08003, Barcelona, Spain
- Acellera Labs, Doctor Trueta 183, 08005, Barcelona, Spain
| | - Adrià Pérez
- Computational Science Laboratory, Universitat Pompeu Fabra, Barcelona Biomedical Research Park (PRBB), Carrer Dr. Aiguader 88, 08003, Barcelona, Spain
- Acellera Labs, Doctor Trueta 183, 08005, Barcelona, Spain
| | - Philipp Thölke
- Computational Science Laboratory, Universitat Pompeu Fabra, Barcelona Biomedical Research Park (PRBB), Carrer Dr. Aiguader 88, 08003, Barcelona, Spain
| | - Stefan Doerr
- Acellera Labs, Doctor Trueta 183, 08005, Barcelona, Spain
| | - Nicholas E Charron
- Department of Physics, Rice University, Houston, TX, 77005, USA
- Center for Theoretical Biological Physics, Rice University, Houston, TX, 77005, USA
- Department of Physics, FU Berlin, Arnimallee 12, 14195, Berlin, Germany
| | - Toni Giorgino
- Biophysics Institute, National Research Council (CNR-IBF), 20133, Milan, Italy
| | - Brooke E Husic
- Department of Mathematics and Computer Science, FU Berlin, Arnimallee 12, 14195, Berlin, Germany
- Lewis Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ, 08540, USA
- Princeton Center for Theoretical Science, Princeton University, Princeton, NJ, 08540, USA
- Center for the Physics of Biological Function, Princeton University, Princeton, NJ, 08540, USA
| | - Cecilia Clementi
- Department of Physics, Rice University, Houston, TX, 77005, USA.
- Center for Theoretical Biological Physics, Rice University, Houston, TX, 77005, USA.
- Department of Physics, FU Berlin, Arnimallee 12, 14195, Berlin, Germany.
- Department of Chemistry, Rice University, Houston, TX, 77005, USA.
| | - Frank Noé
- Department of Physics, FU Berlin, Arnimallee 12, 14195, Berlin, Germany.
- Department of Mathematics and Computer Science, FU Berlin, Arnimallee 12, 14195, Berlin, Germany.
- Department of Chemistry, Rice University, Houston, TX, 77005, USA.
- Microsoft Research AI4Science, Karl-Liebknecht Str. 32, 10178, Berlin, Germany.
| | - Gianni De Fabritiis
- Computational Science Laboratory, Universitat Pompeu Fabra, Barcelona Biomedical Research Park (PRBB), Carrer Dr. Aiguader 88, 08003, Barcelona, Spain.
- Acellera Labs, Doctor Trueta 183, 08005, Barcelona, Spain.
- Institució Catalana de Recerca i Estudis Avançats (ICREA), Passeig Lluis Companys 23, 08010, Barcelona, Spain.
| |
Collapse
|
5
|
Su R, Zeng J, Marcink TC, Porotto M, Moscona A, O’Shaughnessy B. Host Cell Membrane Capture by the SARS-CoV-2 Spike Protein Fusion Intermediate. ACS CENTRAL SCIENCE 2023; 9:1213-1228. [PMID: 37396856 PMCID: PMC10255576 DOI: 10.1021/acscentsci.3c00158] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/07/2023] [Indexed: 07/04/2023]
Abstract
Cell entry by SARS-CoV-2 is accomplished by the S2 subunit of the spike S protein on the virion surface by capture of the host cell membrane and fusion with the viral envelope. Capture and fusion require the prefusion S2 to transit to its potent fusogenic form, the fusion intermediate (FI). However, the FI structure is unknown, detailed computational models of the FI are unavailable, and the mechanisms and timing of membrane capture and fusion are not established. Here, we constructed a full-length model of the SARS-CoV-2 FI by extrapolating from known SARS-CoV-2 pre- and postfusion structures. In atomistic and coarse-grained molecular dynamics simulations the FI was remarkably flexible and executed giant bending and extensional fluctuations due to three hinges in the C-terminal base. The simulated configurations and their giant fluctuations are quantitatively consistent with SARS-CoV-2 FI configurations measured recently using cryo-electron tomography. Simulations suggested a host cell membrane capture time of ∼2 ms. Isolated fusion peptide simulations identified an N-terminal helix that directed and maintained binding to the membrane but grossly underestimated the binding time, showing that the fusion peptide environment is radically altered when attached to its host fusion protein. The large configurational fluctuations of the FI generated a substantial exploration volume that aided capture of the target membrane, and may set the waiting time for fluctuation-triggered refolding of the FI that draws the viral envelope and host cell membrane together for fusion. These results describe the FI as machinery that uses massive configurational fluctuations for efficient membrane capture and suggest novel potential drug targets.
Collapse
Affiliation(s)
- Rui Su
- Department
of Chemical Engineering, Columbia University, New York, New York 10027, United States
| | - Jin Zeng
- Department
of Chemical Engineering, Columbia University, New York, New York 10027, United States
| | - Tara C. Marcink
- Department
of Pediatrics, Columbia University Vagelos
College of Physicians & Surgeons, New York, New York 10032, United States
- Center
for Host−Pathogen Interaction, Columbia
University Vagelos College of Physicians & Surgeons, New York, New York 10032, United States
| | - Matteo Porotto
- Department
of Pediatrics, Columbia University Vagelos
College of Physicians & Surgeons, New York, New York 10032, United States
- Center
for Host−Pathogen Interaction, Columbia
University Vagelos College of Physicians & Surgeons, New York, New York 10032, United States
- Department
of Experimental Medicine, University of
Campania “Luigi Vanvitelli”, 81100 Caserta, Italy
| | - Anne Moscona
- Department
of Pediatrics, Columbia University Vagelos
College of Physicians & Surgeons, New York, New York 10032, United States
- Center
for Host−Pathogen Interaction, Columbia
University Vagelos College of Physicians & Surgeons, New York, New York 10032, United States
- Department
of Microbiology & Immunology, Columbia
University Vagelos College of Physicians & Surgeons, New York, New York 10032, United States
- Department
of Physiology, Columbia University Vagelos
College of Physicians & Surgeons, New York, New York 10032, United States
| | - Ben O’Shaughnessy
- Department
of Chemical Engineering, Columbia University, New York, New York 10027, United States
| |
Collapse
|
6
|
Durumeric AEP, Charron NE, Templeton C, Musil F, Bonneau K, Pasos-Trejo AS, Chen Y, Kelkar A, Noé F, Clementi C. Machine learned coarse-grained protein force-fields: Are we there yet? Curr Opin Struct Biol 2023; 79:102533. [PMID: 36731338 PMCID: PMC10023382 DOI: 10.1016/j.sbi.2023.102533] [Citation(s) in RCA: 25] [Impact Index Per Article: 25.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2022] [Revised: 12/14/2022] [Accepted: 12/18/2022] [Indexed: 02/04/2023]
Abstract
The successful recent application of machine learning methods to scientific problems includes the learning of flexible and accurate atomic-level force-fields for materials and biomolecules from quantum chemical data. In parallel, the machine learning of force-fields at coarser resolutions is rapidly gaining relevance as an efficient way to represent the higher-body interactions needed in coarse-grained force-fields to compensate for the omitted degrees of freedom. Coarse-grained models are important for the study of systems at time and length scales exceeding those of atomistic simulations. However, the development of transferable coarse-grained models via machine learning still presents significant challenges. Here, we discuss recent developments in this field and current efforts to address the remaining challenges.
Collapse
Affiliation(s)
- Aleksander E P Durumeric
- Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany
| | - Nicholas E Charron
- Department of Physics and Astronomy, Rice University, 6100 Main Street, Houston, 77005, Texas, USA; Department of Physics, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany; Center for Theoretical Biological Physics, Rice University, 6100 Main Street, Houston, 77005, Texas, USA
| | - Clark Templeton
- Department of Physics, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany. https://twitter.com/pbrun03
| | - Félix Musil
- Department of Physics, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany. https://twitter.com/FelixMusil
| | - Klara Bonneau
- Department of Physics, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany
| | - Aldo S Pasos-Trejo
- Department of Physics, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany. https://twitter.com/sayeg84
| | - Yaoyi Chen
- Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany. https://twitter.com/hello_yaoyi
| | - Atharva Kelkar
- Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany
| | - Frank Noé
- Microsoft Research AI4Science, Karl-Liebknecht Str. 32, Berlin, 10178, Berlin, Germany; Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany; Department of Physics, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany; Department of Chemistry, Rice University, 6100 Main Street, Houston, 77005, Texas, USA. https://twitter.com/FrankNoeBerlin
| | - Cecilia Clementi
- Department of Physics, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany; Center for Theoretical Biological Physics, Rice University, 6100 Main Street, Houston, 77005, Texas, USA; Department of Chemistry, Rice University, 6100 Main Street, Houston, 77005, Texas, USA; Department of Physics and Astronomy, Rice University, 6100 Main Street, Houston, 77005, Texas, USA.
| |
Collapse
|
7
|
Banerjee A, Dutt M. A hybrid approach for coarse-graining helical peptoids: Solvation, secondary structure, and assembly. J Chem Phys 2023; 158:114105. [PMID: 36948821 DOI: 10.1063/5.0138510] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/24/2023] Open
Abstract
Protein mimics such as peptoids form self-assembled nanostructures whose shape and function are governed by the side chain chemistry and secondary structure. Experiments have shown that a peptoid sequence with a helical secondary structure assembles into microspheres that are stable under various conditions. The conformation and organization of the peptoids within the assemblies remains unknown and is elucidated in this study via a hybrid, bottom-up coarse-graining approach. The resultant coarse-grained (CG) model preserves the chemical and structural details that are critical for capturing the secondary structure of the peptoid. The CG model accurately captures the overall conformation and solvation of the peptoids in an aqueous solution. Furthermore, the model resolves the assembly of multiple peptoids into a hemispherical aggregate that is in qualitative agreement with the corresponding results from experiments. The mildly hydrophilic peptoid residues are placed along the curved interface of the aggregate. The composition of the residues on the exterior of the aggregate is determined by two conformations adopted by the peptoid chains. Hence, the CG model simultaneously captures sequence-specific features and the assembly of a large number of peptoids. This multiscale, multiresolution coarse-graining approach could help in predicting the organization and packing of other tunable oligomeric sequences of relevance to biomedicine and electronics.
Collapse
Affiliation(s)
- Akash Banerjee
- Chemical and Biochemical Engineering, Rutgers, The State University of New Jersey, Piscataway, New Jersey 08854, USA
| | - Meenakshi Dutt
- Chemical and Biochemical Engineering, Rutgers, The State University of New Jersey, Piscataway, New Jersey 08854, USA
| |
Collapse
|
8
|
Zhang Y, Wang Y, Xia F, Cao Z, Xu X. Accurate and Efficient Estimation of Lennard-Jones Interactions for Coarse-Grained Particles via a Potential Matching Method. J Chem Theory Comput 2022; 18:4879-4890. [PMID: 35838523 DOI: 10.1021/acs.jctc.2c00513] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The Lennard-Jones (LJ) energy functions are commonly used to describe the nonbonded interactions in bulk coarse-grained (CG) models, which contribute significantly to the stabilization of a local binding configuration or a self-assembly system. In many cases, systematic development of the LJ interaction parameters in a CG model requires a comprehensive sampling of the objective molecules at the all-atom (AA) level, which is therefore extremely time-consuming for large systems. Inspired by the concept of electrostatic potential (ESP), we define the LJ static potential (LJSP), by which the embedding potential energy surface can be constructed analytically. A semianalytic approach, namely, the LJSP matching method, is developed here to derive the CG parameters by minimizing the LJSP difference between the AA and the CG models, which provides a universal way to derive the CG LJ parameters from the AA models without doing presampling. The LJSP matching method is successful not only in deriving the LJ interaction energy landscape in the CG models for proteins, lipids, and DNA but also in reproducing the critical properties such as intermediate structures and enthalpy contributions as exemplified in simulating the self-assembly process of the dipalmitoylphosphatidylcholine (DPPC) lipids.
Collapse
Affiliation(s)
- Yuwei Zhang
- Collaborative Innovation Center of Chemistry for Energy Materials, Shanghai Key Laboratory of Molecular Catalysis and Innovative Materials, MOE Key Laboratory of Computational Physical Sciences, Departments of Chemistry, Fudan University, Shanghai 200433, China
| | - Yunchu Wang
- LSEC, Institute of Computational Mathematics and Scientific/Engineering Computing, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China
| | - Fei Xia
- School of Chemistry and Molecular Engineering, NYU-ECNU Center for Computational Chemistry at NYU Shanghai, East China Normal University, Shanghai 200062, China
| | - Zexing Cao
- State Key Laboratory of Physical Chemistry of Solid Surfaces and Fujian Provincial Key Laboratory of Theoretical and Computational Chemistry, College of Chemistry and Chemistry Engineering, Xiamen University, Xiamen 361005, China
| | - Xin Xu
- Collaborative Innovation Center of Chemistry for Energy Materials, Shanghai Key Laboratory of Molecular Catalysis and Innovative Materials, MOE Key Laboratory of Computational Physical Sciences, Departments of Chemistry, Fudan University, Shanghai 200433, China
| |
Collapse
|
9
|
Magi Meconi G, Sasselli IR, Bianco V, Onuchic JN, Coluzza I. Key aspects of the past 30 years of protein design. REPORTS ON PROGRESS IN PHYSICS. PHYSICAL SOCIETY (GREAT BRITAIN) 2022; 85:086601. [PMID: 35704983 DOI: 10.1088/1361-6633/ac78ef] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/03/2021] [Accepted: 06/15/2022] [Indexed: 06/15/2023]
Abstract
Proteins are the workhorse of life. They are the building infrastructure of living systems; they are the most efficient molecular machines known, and their enzymatic activity is still unmatched in versatility by any artificial system. Perhaps proteins' most remarkable feature is their modularity. The large amount of information required to specify each protein's function is analogically encoded with an alphabet of just ∼20 letters. The protein folding problem is how to encode all such information in a sequence of 20 letters. In this review, we go through the last 30 years of research to summarize the state of the art and highlight some applications related to fundamental problems of protein evolution.
Collapse
Affiliation(s)
- Giulia Magi Meconi
- Computational Biophysics Lab, Center for Cooperative Research in Biomaterials (CIC biomaGUNE), Basque Research and Technology Alliance (BRTA), Paseo de Miramon 182, 20014, Donostia-San Sebastián, Spain
| | - Ivan R Sasselli
- Computational Biophysics Lab, Center for Cooperative Research in Biomaterials (CIC biomaGUNE), Basque Research and Technology Alliance (BRTA), Paseo de Miramon 182, 20014, Donostia-San Sebastián, Spain
| | | | - Jose N Onuchic
- Center for Theoretical Biological Physics, Department of Physics & Astronomy, Department of Chemistry, Department of Biosciences, Rice University, Houston, TX 77251, United States of America
| | - Ivan Coluzza
- BCMaterials, Basque Center for Materials, Applications and Nanostructures, Bld. Martina Casiano, UPV/EHU Science Park, Barrio Sarriena s/n, 48940 Leioa, Spain
- Basque Foundation for Science, Ikerbasque, 48009, Bilbao, Spain
| |
Collapse
|
10
|
Banerjee A, Lu CY, Dutt M. A hybrid coarse-grained model for structure, solvation and assembly of lipid-like peptides. Phys Chem Chem Phys 2021; 24:1553-1568. [PMID: 34940778 DOI: 10.1039/d1cp04205j] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Reconstituted photosynthetic proteins which are activated upon exposure to solar energy hold enormous potential for powering future solid state devices and solar cells. The functionality and integration of these proteins into such devices has been successfully enabled by lipid-like peptides. Yet, a fundamental understanding of the organization of these peptides with respect to the photosynthetic proteins and themselves remains unknown and is critical for guiding the design of such light-activated devices. This study investigates the relative organization of one such peptide sequence V6K2 (V: valine and K: lysine) within assemblies. Given the expansive spatiotemporal scales associated with this study, a hybrid coarse-grained (CG) model which captures the structure, conformation and aggregation of the peptide is adopted. The CG model uses a combination of iterative Boltzmann inversion and force matching to provide insight into the relative organization of V6K2 in assemblies. The CG model reproduces the structure of a V6K2 peptide sequence along with its all atom (AA) solvation structure. The relative organization of multiple peptides in an assembly, as captured by CG simulations, is in agreement with corresponding results from AA simulations. Also, a backmapping procedure reintroduces the AA details of the peptides within the aggregates captured by the CG model to demonstrate the relative organization of the peptides. Furthermore, a large number of peptides self-assemble into an elongated micelle in the CG simulation, which is consistent with experimental findings. The coarse-graining procedure is tested for transferability to longer peptide sequences, and hence can be extended to other amphiphilic peptide sequences.
Collapse
Affiliation(s)
- Akash Banerjee
- Chemical and Biochemical Engineering, Rutgers, The State University of New Jersey, Piscataway, New Jersey 08854, USA.
| | - Chien Yu Lu
- Chemical and Biochemical Engineering, Rutgers, The State University of New Jersey, Piscataway, New Jersey 08854, USA.
| | - Meenakshi Dutt
- Chemical and Biochemical Engineering, Rutgers, The State University of New Jersey, Piscataway, New Jersey 08854, USA.
| |
Collapse
|
11
|
Vaiwala R, Ayappa KG. A generic force field for simulating native protein structures using dissipative particle dynamics. SOFT MATTER 2021; 17:9772-9785. [PMID: 34651150 DOI: 10.1039/d1sm01194d] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
A coarse-grained force field for molecular dynamics simulations of the native structures of proteins in a dissipative particle dynamics (DPD) framework is developed. The parameters for bonded interactions are derived by mapping the bonds and angles for 20 amino acids onto target distributions obtained from fully atomistic simulations in explicit solvent. A dual-basin potential is introduced for stabilizing backbone angles, to cover a wide spectrum of protein secondary structures. The backbone dihedral potential enables folding of the protein from an unfolded initial state to the folded native structure. The proposed force field is validated by evaluating the structural properties of several model peptides and proteins including the SARS-CoV-2 fusion peptide, consisting of α-helices, β-sheets, loops and turns. Detailed comparisons with fully atomistic simulations are carried out to assess the ability of the proposed force field to stabilize the different secondary structures present in proteins. The compact conformations of the native states were evident from the radius of gyration and the high intensity peaks of the root mean square deviation histograms, which were found to be within 0.4 nm. The Ramachandran-like energy landscape on the phase space of backbone angles (θ) and dihedrals (ϕ) effectively captured the conformational phase space of α-helices at ∼(ϕ = 50°,θ = 90°) and β-strands at ∼(ϕ = ±180°,θ = 90-120°). Furthermore, the residue-residue native contacts were also well reproduced by the proposed DPD model. The applicability of the model to multidomain complexes was assessed using lysozyme and a large α-helical bacterial pore-forming toxin, cytolysin A. Our study illustrates that the proposed force field is generic, and can potentially be extended for efficient in silico investigations of membrane bound polypeptides and proteins using DPD simulations.
Collapse
Affiliation(s)
- Rakesh Vaiwala
- Department of Chemical Engineering, Indian Institute of Science, Bangalore 560012, India.
| | - K Ganapathy Ayappa
- Department of Chemical Engineering, Indian Institute of Science, Bangalore 560012, India.
- Centre for BioSystems Science and Engineering, Indian Institute of Science, Bangalore 560012, India
| |
Collapse
|
12
|
Joseph JA, Reinhardt A, Aguirre A, Chew PY, Russell KO, Espinosa JR, Garaizar A, Collepardo-Guevara R. Physics-driven coarse-grained model for biomolecular phase separation with near-quantitative accuracy. NATURE COMPUTATIONAL SCIENCE 2021; 1:732-743. [PMID: 35795820 PMCID: PMC7612994 DOI: 10.1038/s43588-021-00155-3] [Citation(s) in RCA: 113] [Impact Index Per Article: 37.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/10/2023]
Abstract
Various physics- and data-driven sequence-dependent protein coarse-grained models have been developed to study biomolecular phase separation and elucidate the dominant physicochemical driving forces. Here, we present Mpipi, a multiscale coarse-grained model that describes almost quantitatively the change in protein critical temperatures as a function of amino-acid sequence. The model is parameterised from both atomistic simulations and bioinformatics data and accounts for the dominant role of π-π and hybrid cation-π/π-π interactions and the much stronger attractive contacts established by arginines than lysines. We provide a comprehensive set of benchmarks for Mpipi and seven other residue-level coarse-grained models against experimental radii of gyration and quantitative in-vitro phase diagrams; Mpipi predictions agree well with experiment on both fronts. Moreover, it can account for protein-RNA interactions, correctly predicts the multiphase behaviour of a charge-matched poly-arginine/poly-lysine/RNA system, and recapitulates experimental LLPS trends for sequence mutations on FUS, DDX4 and LAF-1 proteins.
Collapse
Affiliation(s)
- Jerelle A. Joseph
- Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge, CB2 1EW, UK
- Department of Physics, University of Cambridge, Cambridge, CB3 0HE, UK
- Department of Genetics, University of Cambridge, Cambridge, CB2 3EH, UK
| | - Aleks Reinhardt
- Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge, CB2 1EW, UK
| | - Anne Aguirre
- Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge, CB2 1EW, UK
| | - Pin Yu Chew
- Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge, CB2 1EW, UK
| | - Kieran O. Russell
- Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge, CB2 1EW, UK
| | - Jorge R. Espinosa
- Department of Physics, University of Cambridge, Cambridge, CB3 0HE, UK
| | - Adiran Garaizar
- Department of Physics, University of Cambridge, Cambridge, CB3 0HE, UK
| | - Rosana Collepardo-Guevara
- Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge, CB2 1EW, UK
- Department of Physics, University of Cambridge, Cambridge, CB3 0HE, UK
- Department of Genetics, University of Cambridge, Cambridge, CB2 3EH, UK
| |
Collapse
|
13
|
Signorini LF, Perego C, Potestio R. Protein self-entanglement modulates successful folding to the native state: A multi-scale modeling study. J Chem Phys 2021; 155:115101. [PMID: 34551527 DOI: 10.1063/5.0063254] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
The computer-aided investigation of protein folding has greatly benefited from coarse-grained models, that is, simplified representations at a resolution level lower than atomistic, providing access to qualitative and quantitative details of the folding process that would be hardly attainable, via all-atom descriptions, for medium to long molecules. Nonetheless, the effectiveness of low-resolution models is itself hampered by the presence, in a small but significant number of proteins, of nontrivial topological self-entanglements. Features such as native state knots or slipknots introduce conformational bottlenecks, affecting the probability to fold into the correct conformation; this limitation is particularly severe in the context of coarse-grained models. In this work, we tackle the relationship between folding probability, protein folding pathway, and protein topology in a set of proteins with a nontrivial degree of topological complexity. To avoid or mitigate the risk of incurring in kinetic traps, we make use of the elastic folder model, a coarse-grained model based on angular potentials optimized toward successful folding via a genetic procedure. This light-weight representation allows us to estimate in silico folding probabilities, which we find to anti-correlate with a measure of topological complexity as well as to correlate remarkably well with experimental measurements of the folding rate. These results strengthen the hypothesis that the topological complexity of the native state decreases the folding probability and that the force-field optimization mimics the evolutionary process these proteins have undergone to avoid kinetic traps.
Collapse
Affiliation(s)
- Lorenzo Federico Signorini
- The George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel and Department of Physics, University of Trento, Trento, Italy
| | - Claudio Perego
- Department of Innovative Technologies, University of Applied Sciences and Arts of Southern Switzerland, Manno, Switzerland and Polymer Theory Department, Max Planck Institute for Polymer Research, Mainz, Germany
| | - Raffaello Potestio
- Department of Physics, University of Trento, Trento, Italy and INFN-TIFPA, Trento Institute for Fundamental Physics and Applications, Trento, Italy
| |
Collapse
|
14
|
Computational methods for exploring protein conformations. Biochem Soc Trans 2021; 48:1707-1724. [PMID: 32756904 PMCID: PMC7458412 DOI: 10.1042/bst20200193] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2020] [Revised: 07/07/2020] [Accepted: 07/09/2020] [Indexed: 12/13/2022]
Abstract
Proteins are dynamic molecules that can transition between a potentially wide range of structures comprising their conformational ensemble. The nature of these conformations and their relative probabilities are described by a high-dimensional free energy landscape. While computer simulation techniques such as molecular dynamics simulations allow characterisation of the metastable conformational states and the transitions between them, and thus free energy landscapes, to be characterised, the barriers between states can be high, precluding efficient sampling without substantial computational resources. Over the past decades, a dizzying array of methods have emerged for enhancing conformational sampling, and for projecting the free energy landscape onto a reduced set of dimensions that allow conformational states to be distinguished, known as collective variables (CVs), along which sampling may be directed. Here, a brief description of what biomolecular simulation entails is followed by a more detailed exposition of the nature of CVs and methods for determining these, and, lastly, an overview of the myriad different approaches for enhancing conformational sampling, most of which rely upon CVs, including new advances in both CV determination and conformational sampling due to machine learning.
Collapse
|
15
|
Han C, Zhang P, Bluestein D, Cong G, Deng Y. Artificial Intelligence for Accelerating Time Integrations in Multiscale Modeling. JOURNAL OF COMPUTATIONAL PHYSICS 2021; 427:110053. [PMID: 35821963 PMCID: PMC9273111 DOI: 10.1016/j.jcp.2020.110053] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
We developed a novel data-driven Artificial Intelligence-enhanced Adaptive Time Stepping algorithm (AI-ATS) that can adapt timestep sizes to underlying biophysical dynamics. We demonstrated its values in solving a complex biophysical problem, at multiple spatiotemporal scales, that describes platelet dynamics in shear blood flow. In order to achieve a significant speedup of this computationally demanding problem, we integrated a framework of novel AI algorithms into the solution of the platelet dynamics equations. Our framework involves recurrent neural network-based autoencoders by the Long Short-Term Memory and the Gated Recurrent Units as the first step for memorizing the dynamic states in long-term dependencies for the input time series, followed by two fully-connected neural networks to optimize timestep sizes and step jumps. The computational efficiency of our AI-ATS is underscored by assessing the accuracy and speed of a multiscale simulation of the platelet with the standard time stepping algorithm (STS). By adapting the timestep size, our AI-ATS guides the omission of multiple redundant time steps without sacrificing significant accuracy of the dynamics. Compared to the STS, our AI-ATS achieved a reduction of 40% unnecessary calculations while bounding the errors of mechanical and thermodynamic properties to 3%.
Collapse
Affiliation(s)
- Changnian Han
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, NY 11794, USA
| | - Peng Zhang
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, NY 11794, USA
- Department of Biomedical Engineering, Stony Brook University, Stony Brook, NY 11790, USA
| | - Danny Bluestein
- Department of Biomedical Engineering, Stony Brook University, Stony Brook, NY 11790, USA
| | - Guojing Cong
- IBM Thomas J. Watson Research Center, Yorktown Heights, NY 10598, USA
| | - Yuefan Deng
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, NY 11794, USA
| |
Collapse
|
16
|
Heilmann N, Wolf M, Kozlowska M, Sedghamiz E, Setzler J, Brieg M, Wenzel W. Sampling of the conformational landscape of small proteins with Monte Carlo methods. Sci Rep 2020; 10:18211. [PMID: 33097750 PMCID: PMC7585447 DOI: 10.1038/s41598-020-75239-7] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2020] [Accepted: 10/12/2020] [Indexed: 12/24/2022] Open
Abstract
Computer simulation provides an increasingly realistic picture of large-scale conformational change of proteins, but investigations remain fundamentally constrained by the femtosecond timestep of molecular dynamics simulations. For this reason, many biologically interesting questions cannot be addressed using accessible state-of-the-art computational resources. Here, we report the development of an all-atom Monte Carlo approach that permits the modelling of the large-scale conformational change of proteins using standard off-the-shelf computational hardware and standard all-atom force fields. We demonstrate extensive thermodynamic characterization of the folding process of the α-helical Trp-cage, the Villin headpiece and the β-sheet WW-domain. We fully characterize the free energy landscape, transition states, energy barriers between different states, and the per-residue stability of individual amino acids over a wide temperature range. We demonstrate that a state-of-the-art intramolecular force field can be combined with an implicit solvent model to obtain a high quality of the folded structures and also discuss limitations that still remain.
Collapse
Affiliation(s)
- Nana Heilmann
- Institute of Nanotechnology (INT), Karlsruhe Institute of Technology (KIT), Hermann-von-Helmholtz-Platz 1, 76344, Eggenstein-Leopoldshafen, Germany
| | - Moritz Wolf
- Institute of Nanotechnology (INT), Karlsruhe Institute of Technology (KIT), Hermann-von-Helmholtz-Platz 1, 76344, Eggenstein-Leopoldshafen, Germany
| | - Mariana Kozlowska
- Institute of Nanotechnology (INT), Karlsruhe Institute of Technology (KIT), Hermann-von-Helmholtz-Platz 1, 76344, Eggenstein-Leopoldshafen, Germany
| | - Elaheh Sedghamiz
- Institute of Nanotechnology (INT), Karlsruhe Institute of Technology (KIT), Hermann-von-Helmholtz-Platz 1, 76344, Eggenstein-Leopoldshafen, Germany
| | - Julia Setzler
- Institute of Nanotechnology (INT), Karlsruhe Institute of Technology (KIT), Hermann-von-Helmholtz-Platz 1, 76344, Eggenstein-Leopoldshafen, Germany
| | - Martin Brieg
- Institute of Nanotechnology (INT), Karlsruhe Institute of Technology (KIT), Hermann-von-Helmholtz-Platz 1, 76344, Eggenstein-Leopoldshafen, Germany
| | - Wolfgang Wenzel
- Institute of Nanotechnology (INT), Karlsruhe Institute of Technology (KIT), Hermann-von-Helmholtz-Platz 1, 76344, Eggenstein-Leopoldshafen, Germany.
| |
Collapse
|
17
|
Liguori N, Croce R, Marrink SJ, Thallmair S. Molecular dynamics simulations in photosynthesis. PHOTOSYNTHESIS RESEARCH 2020; 144:273-295. [PMID: 32297102 PMCID: PMC7203591 DOI: 10.1007/s11120-020-00741-y] [Citation(s) in RCA: 37] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/11/2019] [Accepted: 03/24/2020] [Indexed: 05/12/2023]
Abstract
Photosynthesis is regulated by a dynamic interplay between proteins, enzymes, pigments, lipids, and cofactors that takes place on a large spatio-temporal scale. Molecular dynamics (MD) simulations provide a powerful toolkit to investigate dynamical processes in (bio)molecular ensembles from the (sub)picosecond to the (sub)millisecond regime and from the Å to hundreds of nm length scale. Therefore, MD is well suited to address a variety of questions arising in the field of photosynthesis research. In this review, we provide an introduction to the basic concepts of MD simulations, at atomistic and coarse-grained level of resolution. Furthermore, we discuss applications of MD simulations to model photosynthetic systems of different sizes and complexity and their connection to experimental observables. Finally, we provide a brief glance on which methods provide opportunities to capture phenomena beyond the applicability of classical MD.
Collapse
Affiliation(s)
- Nicoletta Liguori
- Department of Physics and Astronomy and Institute for Lasers, Life and Biophotonics, Faculty of Sciences, De Boelelaan 1081, 1081 HV, Amsterdam, The Netherlands.
| | - Roberta Croce
- Department of Physics and Astronomy and Institute for Lasers, Life and Biophotonics, Faculty of Sciences, De Boelelaan 1081, 1081 HV, Amsterdam, The Netherlands
| | - Siewert J Marrink
- Groningen Biomolecular Sciences and Biotechnology Institute & Zernike Institute for Advanced Materials, University of Groningen, Nijenborgh 7, 9747 AG, Groningen, The Netherlands
| | - Sebastian Thallmair
- Groningen Biomolecular Sciences and Biotechnology Institute & Zernike Institute for Advanced Materials, University of Groningen, Nijenborgh 7, 9747 AG, Groningen, The Netherlands.
| |
Collapse
|
18
|
Exploring Successful Parameter Region for Coarse-Grained Simulation of Biomolecules by Bayesian Optimization and Active Learning. Biomolecules 2020; 10:biom10030482. [PMID: 32245275 PMCID: PMC7175118 DOI: 10.3390/biom10030482] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2020] [Revised: 03/11/2020] [Accepted: 03/19/2020] [Indexed: 11/19/2022] Open
Abstract
Accompanied with an increase of revealed biomolecular structures owing to advancements in structural biology, the molecular dynamics (MD) approach, especially coarse-grained (CG) MD suitable for macromolecules, is becoming increasingly important for elucidating their dynamics and behavior. In fact, CG-MD simulation has succeeded in qualitatively reproducing numerous biological processes for various biomolecules such as conformational changes and protein folding with reasonable calculation costs. However, CG-MD simulations strongly depend on various parameters, and selecting an appropriate parameter set is necessary to reproduce a particular biological process. Because exhaustive examination of all candidate parameters is inefficient, it is important to identify successful parameters. Furthermore, the successful region, in which the desired process is reproducible, is essential for describing the detailed mechanics of functional processes and environmental sensitivity and robustness. We propose an efficient search method for identifying the successful region by using two machine learning techniques, Bayesian optimization and active learning. We evaluated its performance using F1-ATPase, a biological rotary motor, with CG-MD simulations. We successfully identified the successful region with lower computational costs (12.3% in the best case) without sacrificing accuracy compared to exhaustive search. This method can accelerate not only parameter search but also biological discussion of the detailed mechanics of functional processes and environmental sensitivity based on MD simulation studies.
Collapse
|
19
|
Mirzaie M. Identification of native protein structures captured by principal interactions. BMC Bioinformatics 2019; 20:604. [PMID: 31752663 PMCID: PMC6873546 DOI: 10.1186/s12859-019-3186-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2019] [Accepted: 11/01/2019] [Indexed: 11/20/2022] Open
Abstract
Background Evaluation of protein structure is based on trustworthy potential function. The total potential of a protein structure is approximated as the summation of all pair-wise interaction potentials. Knowledge-based potentials (KBP) are one type of potential functions derived by known experimentally determined protein structures. Although several KBP functions with different methods have been introduced, the key interactions that capture the total potential have not studied yet. Results In this study, we seek the interaction types that preserve as much of the total potential as possible. We employ a procedure based on the principal component analysis (PCA) to extract the significant and key interactions in native protein structures. We call these interactions as principal interactions and show that the results of the model that considers only these interactions are very close to the full interaction model that considers all interactions in protein fold recognition. In fact, the principal interactions maintain the discriminative power of the full interaction model. This method was evaluated on 3 KBPs with different contact definitions and thresholds of distance and revealed that their corresponding principal interactions are very similar and have a lot in common. Additionally, the principal interactions consisted of 20 % of the full interactions on average, and they are between residues, which are considered important in protein folding. Conclusions This work shows that all interaction types are not equally important in discrimination of native structure. The results of the reduced model based on principal interactions that were very close to the full interaction model suggest that a new strategy is needed to capture the role of remaining interactions (non-principal interactions) to improve the power of knowledge-based potential functions.
Collapse
Affiliation(s)
- Mehdi Mirzaie
- Department of Applied Mathematics, Faculty of Mathematical Sciences, Tarbiat Modares University, Jalal Ale Ahmad Highway, P.O.Box: 14115-134, Tehran, Iran.
| |
Collapse
|
20
|
Honorato RV, Roel-Touris J, Bonvin AMJJ. MARTINI-Based Protein-DNA Coarse-Grained HADDOCKing. Front Mol Biosci 2019; 6:102. [PMID: 31632986 PMCID: PMC6779769 DOI: 10.3389/fmolb.2019.00102] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2019] [Accepted: 09/17/2019] [Indexed: 11/13/2022] Open
Abstract
Modeling biomolecular assemblies is an important field in computational structural biology. The inherent complexity of their energy landscape and the computational cost associated with modeling large and complex assemblies are major drawbacks for integrative modeling approaches. The so-called coarse-graining approaches, which reduce the degrees of freedom of the system by grouping several atoms into larger “pseudo-atoms,” have been shown to alleviate some of those limitations, facilitating the identification of the global energy minima assumed to correspond to the native state of the complex, while making the calculations more efficient. Here, we describe and assess the implementation of the MARTINI force field for DNA into HADDOCK, our integrative modeling platform. We combine it with our previous implementation for protein-protein coarse-grained docking, enabling coarse-grained modeling of protein-nucleic acid complexes. The system is modeled using MARTINI topologies and interaction parameters during the rigid body docking and semi-flexible refinement stages of HADDOCK, and the resulting models are then converted back to atomistic resolution by an atom-to-bead distance restraints-guided protocol. We first demonstrate the performance of this protocol using 44 complexes from the protein-DNA docking benchmark, which shows an overall ~6-fold speed increase and maintains similar accuracy as compared to standard atomistic calculations. As a proof of concept, we then model the interaction between the PRC1 and the nucleosome (a former CAPRI target in round 31), using the same information available at the time the target was offered, and compare all-atom and coarse-grained models.
Collapse
Affiliation(s)
- Rodrigo V Honorato
- Faculty of Science-Chemistry, Bijvoet Center for Biomolecular Research, Utrecht University, Utrecht, Netherlands.,Brazilian Biosciences National Laboratory (LNBio), Brazilian Center for Research in Energy and Materials (CNPEM), Campinas, Brazil
| | - Jorge Roel-Touris
- Faculty of Science-Chemistry, Bijvoet Center for Biomolecular Research, Utrecht University, Utrecht, Netherlands
| | - Alexandre M J J Bonvin
- Faculty of Science-Chemistry, Bijvoet Center for Biomolecular Research, Utrecht University, Utrecht, Netherlands
| |
Collapse
|
21
|
Marrink SJ, Corradi V, Souza PC, Ingólfsson HI, Tieleman DP, Sansom MS. Computational Modeling of Realistic Cell Membranes. Chem Rev 2019; 119:6184-6226. [PMID: 30623647 PMCID: PMC6509646 DOI: 10.1021/acs.chemrev.8b00460] [Citation(s) in RCA: 422] [Impact Index Per Article: 84.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2018] [Indexed: 12/15/2022]
Abstract
Cell membranes contain a large variety of lipid types and are crowded with proteins, endowing them with the plasticity needed to fulfill their key roles in cell functioning. The compositional complexity of cellular membranes gives rise to a heterogeneous lateral organization, which is still poorly understood. Computational models, in particular molecular dynamics simulations and related techniques, have provided important insight into the organizational principles of cell membranes over the past decades. Now, we are witnessing a transition from simulations of simpler membrane models to multicomponent systems, culminating in realistic models of an increasing variety of cell types and organelles. Here, we review the state of the art in the field of realistic membrane simulations and discuss the current limitations and challenges ahead.
Collapse
Affiliation(s)
- Siewert J. Marrink
- Groningen
Biomolecular Sciences and Biotechnology Institute & Zernike Institute
for Advanced Materials, University of Groningen, Nijenborgh 7, 9747 AG Groningen, The Netherlands
| | - Valentina Corradi
- Centre
for Molecular Simulation and Department of Biological Sciences, University of Calgary, 2500 University Drive NW, Calgary, Alberta T2N 1N4, Canada
| | - Paulo C.T. Souza
- Groningen
Biomolecular Sciences and Biotechnology Institute & Zernike Institute
for Advanced Materials, University of Groningen, Nijenborgh 7, 9747 AG Groningen, The Netherlands
| | - Helgi I. Ingólfsson
- Biosciences
and Biotechnology Division, Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, 7000 East Avenue, Livermore, California 94550, United States
| | - D. Peter Tieleman
- Centre
for Molecular Simulation and Department of Biological Sciences, University of Calgary, 2500 University Drive NW, Calgary, Alberta T2N 1N4, Canada
| | - Mark S.P. Sansom
- Department
of Biochemistry, University of Oxford, South Parks Road, Oxford OX1 3QU, U.K.
| |
Collapse
|
22
|
Machado MR, Barrera EE, Klein F, Sóñora M, Silva S, Pantano S. The SIRAH 2.0 Force Field: Altius, Fortius, Citius. J Chem Theory Comput 2019; 15:2719-2733. [PMID: 30810317 DOI: 10.1021/acs.jctc.9b00006] [Citation(s) in RCA: 86] [Impact Index Per Article: 17.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
A new version of the coarse-grained (CG) SIRAH force field for proteins has been developed. Modifications to bonded and non-bonded interactions on the existing molecular topologies significantly ameliorate the structural description and flexibility of a non-redundant set of proteins. The SIRAH 2.0 force field has also been ported to the popular simulation package AMBER, which along with the former implementation in GROMACS expands significantly the potential range of users and performance of this CG force field on CPU/GPU codes. As a non-trivial example of its application, we undertook the structural and dynamical analysis of the most abundant and conserved calcium-binding protein, calmodulin (CaM). CaM is composed of two calcium-binding motifs called EF-hands, which in the presence of calcium specifically recognize a cognate peptide by embracing it. CG simulations of CaM bound to four calcium ions in the presence or absence of a binding peptide (holo and apo forms, respectively) resulted in good and stable ion coordination. The simulation of the holo form starting from an experimental structure sampled near-native conformations, retrieving quasi-atomistic precision. Removing the binding peptide enabled the EF-hands to perform large reciprocal movements, comparable to those observed in NMR structures. On the other hand, the isolated peptide starting from the helical conformation experienced spontaneous unfolding, in agreement with previous experimental data. However, repositioning the peptide in the neighborhood of one EF-hand not only prevented the peptide from unfolding but also drove CaM to a fully bound conformation, with both EF-hands embracing the cognate peptide, resembling the experimental holo structure. Therefore, SIRAH 2.0 shows the capacity to handle a number of structurally and dynamically challenging situations, including metal ion coordination, unbiased conformational sampling, and specific protein-peptide recognition.
Collapse
Affiliation(s)
- Matías R Machado
- Biomolecular Simulations Group , Institut Pasteur de Montevideo , Mataojo 2020 , CP 11400 Montevideo , Uruguay
| | - Exequiel E Barrera
- Biomolecular Simulations Group , Institut Pasteur de Montevideo , Mataojo 2020 , CP 11400 Montevideo , Uruguay
| | - Florencia Klein
- Biomolecular Simulations Group , Institut Pasteur de Montevideo , Mataojo 2020 , CP 11400 Montevideo , Uruguay
| | - Martín Sóñora
- Biomolecular Simulations Group , Institut Pasteur de Montevideo , Mataojo 2020 , CP 11400 Montevideo , Uruguay
| | - Steffano Silva
- Biomolecular Simulations Group , Institut Pasteur de Montevideo , Mataojo 2020 , CP 11400 Montevideo , Uruguay
| | - Sergio Pantano
- Biomolecular Simulations Group , Institut Pasteur de Montevideo , Mataojo 2020 , CP 11400 Montevideo , Uruguay
| |
Collapse
|
23
|
Leonard AN, Wang E, Monje-Galvan V, Klauda JB. Developing and Testing of Lipid Force Fields with Applications to Modeling Cellular Membranes. Chem Rev 2019; 119:6227-6269. [DOI: 10.1021/acs.chemrev.8b00384] [Citation(s) in RCA: 53] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
|
24
|
Conformational preferences and phase behavior of intrinsically disordered low complexity sequences: insights from multiscale simulations. Curr Opin Struct Biol 2018; 56:1-10. [PMID: 30439585 DOI: 10.1016/j.sbi.2018.10.003] [Citation(s) in RCA: 64] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2018] [Revised: 10/17/2018] [Accepted: 10/19/2018] [Indexed: 11/22/2022]
Abstract
While many proteins and protein regions utilize a complex repertoire of amino acids to achieve their biological function, a subset of protein sequences are enriched in a reduced set of amino acids. These so-called low complexity (LC) sequences, specifically intrinsically disordered variants of LC sequences, have been the focus of recent investigations owing to their roles in a range of biological functions, specifically phase separation. Computational studies of LC sequences have provided rich insights into their behavior both as individual proteins in dilute solutions and as the drivers and modulators of higher-order assemblies. Here, we review how simulations performed across distinct resolutions have provided different types of insights into the biological role of LC sequences.
Collapse
|
25
|
Orekhov PS, Kholina EG, Bozdaganyan ME, Nesterenko AM, Kovalenko IB, Strakhovskaya MG. Molecular Mechanism of Uptake of Cationic Photoantimicrobial Phthalocyanine across Bacterial Membranes Revealed by Molecular Dynamics Simulations. J Phys Chem B 2018; 122:3711-3722. [PMID: 29553736 DOI: 10.1021/acs.jpcb.7b11707] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
Phthalocyanines are aromatic macrocyclic compounds, which are structurally related to porphyrins. In clinical practice, phthalocyanines are used in fluorescence imaging and photodynamic therapy of cancer and noncancer lesions. Certain forms of the substituted polycationic metallophthalocyanines have been previously shown to be active in photodynamic inactivation of both Gram-negative and Gram-positive bacteria; one of them is zinc octakis(cholinyl)phthalocyanine (ZnPcChol8+). However, the molecular details of how these compounds translocate across bacterial membranes still remain unclear. In the present work, we have developed a coarse-grained (CG) molecular model of ZnPcChol8+ within the framework of the popular MARTINI CG force field. The obtained model was used to probe the solvation behavior of phthalocyanine molecules, which agreed with experimental results. Subsequently, it was used to investigate the molecular details of interactions between phthalocyanines and membranes of various compositions. The results demonstrate that ZnPcChol8+ has high affinity to both the inner and the outer model membranes of Gram-negative bacteria, although this species does not show noticeable affinity to the 1-palmitoyl-2-oleoyl- sn-glycero-3-phosphatidylcholine membrane. Furthermore, we found out that the process of ZnPcChol8+ penetration toward the center of the outer bacterial membrane is energetically favorable and leads to its overall disturbance and formation of the aqueous pore. Such intramembrane localization of ZnPcChol8+ suggests their twofold cytotoxic effect on bacterial cells: (1) via induction of lipid peroxidation by enhanced production of reactive oxygen species (i.e., photodynamic toxicity); (2) via rendering the bacterial membrane more permeable for additional Pc molecules as well as other compounds. We also found that the kinetics of penetration depends on the presence of phospholipid defects in the lipopolysaccharide leaflet of the outer membrane and the type of counterions, which stabilize it. Thus, the results of our simulations provide a detailed molecular view of ZnPcChol8+ "self-promoted uptake", the pathway previously proposed for some small molecules crossing the outer bacterial membrane.
Collapse
Affiliation(s)
- Philipp S Orekhov
- Moscow Institute of Physics and Technology , Dolgoprudny 141700 , Russia.,Sechenov University , Trubetskaya 8-2 , Moscow 119991 , Russia
| | | | - Marine E Bozdaganyan
- Federal Research and Clinical Center of Specialized Medical Care and Medical Technologies , Federal Medical and Biological Agency of Russia , Moscow 115682 , Russia
| | | | - Ilya B Kovalenko
- Federal Research and Clinical Center of Specialized Medical Care and Medical Technologies , Federal Medical and Biological Agency of Russia , Moscow 115682 , Russia.,Astrakhan State University , Astrakhan 414056 , Russia.,Scientific and Technological Center of Unique Instrumentation of the Russian Academy of Sciences , Moscow 117342 , Russia
| | - Marina G Strakhovskaya
- Federal Research and Clinical Center of Specialized Medical Care and Medical Technologies , Federal Medical and Biological Agency of Russia , Moscow 115682 , Russia
| |
Collapse
|
26
|
Hills RD. Refining amino acid hydrophobicity for dynamics simulation of membrane proteins. PeerJ 2018; 6:e4230. [PMID: 29340240 PMCID: PMC5767086 DOI: 10.7717/peerj.4230] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2017] [Accepted: 12/14/2017] [Indexed: 11/20/2022] Open
Abstract
Coarse-grained (CG) models have been successful in simulating the chemical properties of lipid bilayers, but accurate treatment of membrane proteins and lipid-protein molecular interactions remains a challenge. The CgProt force field, original developed with the multiscale coarse graining method, is assessed by comparing the potentials of mean force for sidechain insertion in a DOPC bilayer to results reported for atomistic molecular dynamics simulations. Reassignment of select CG sidechain sites from the apolar to polar site type was found to improve the attractive interfacial behavior of tyrosine, phenylalanine and asparagine as well as charged lysine and arginine residues. The solvation energy at membrane depths of 0, 1.3 and 1.7 nm correlates with experimental partition coefficients in aqueous mixtures of cyclohexane, octanol and POPC, respectively, for sidechain analogs and Wimley-White peptides. These experimental values serve as important anchor points in choosing between alternate CG models based on their observed permeation profiles, particularly for Arg, Lys and Gln residues where the all-atom OPLS solvation energy does not agree well with experiment. Available partitioning data was also used to reparameterize the representation of the peptide backbone, which needed to be made less attractive for the bilayer hydrophobic core region. The newly developed force field, CgProt 2.4, correctly predicts the global energy minimum in the potentials of mean force for insertion of the uncharged membrane-associated peptides LS3 and WALP23. CgProt will find application in studies of lipid-protein interactions and the conformational properties of diverse membrane protein systems.
Collapse
Affiliation(s)
- Ronald D Hills
- Department of Pharmaceutical Sciences, College of Pharmacy, University of New England, Portland, ME, United States of America
| |
Collapse
|
27
|
Deichmann G, van der Vegt NFA. Conditional Reversible Work Coarse-Grained Models of Molecular Liquids with Coulomb Electrostatics – A Proof of Concept Study on Weakly Polar Organic Molecules. J Chem Theory Comput 2017; 13:6158-6166. [DOI: 10.1021/acs.jctc.7b00611] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Gregor Deichmann
- Eduard-Zintl-Institut für
Anorganische und Physikalische Chemie, Technische Universität Darmstadt, Alarich-Weiss-Straße 10, 64287 Darmstadt, Germany
| | - Nico F. A. van der Vegt
- Eduard-Zintl-Institut für
Anorganische und Physikalische Chemie, Technische Universität Darmstadt, Alarich-Weiss-Straße 10, 64287 Darmstadt, Germany
| |
Collapse
|
28
|
Rudzinski JF, Lu K, Milner ST, Maranas JK, Noid WG. Extended Ensemble Approach to Transferable Potentials for Low-Resolution Coarse-Grained Models of Ionomers. J Chem Theory Comput 2017; 13:2185-2201. [DOI: 10.1021/acs.jctc.6b01160] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Joseph F. Rudzinski
- Department
of Chemistry and ‡Department of Chemical Engineering, The Pennsylvania State University, University Park, Pennsylvania 16802, United States
| | - Keran Lu
- Department
of Chemistry and ‡Department of Chemical Engineering, The Pennsylvania State University, University Park, Pennsylvania 16802, United States
| | - Scott T. Milner
- Department
of Chemistry and ‡Department of Chemical Engineering, The Pennsylvania State University, University Park, Pennsylvania 16802, United States
| | - Janna K. Maranas
- Department
of Chemistry and ‡Department of Chemical Engineering, The Pennsylvania State University, University Park, Pennsylvania 16802, United States
| | - William G. Noid
- Department
of Chemistry and ‡Department of Chemical Engineering, The Pennsylvania State University, University Park, Pennsylvania 16802, United States
| |
Collapse
|
29
|
Coluzza I. Computational protein design: a review. JOURNAL OF PHYSICS. CONDENSED MATTER : AN INSTITUTE OF PHYSICS JOURNAL 2017; 29:143001. [PMID: 28140371 DOI: 10.1088/1361-648x/aa5c76] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Proteins are one of the most versatile modular assembling systems in nature. Experimentally, more than 110 000 protein structures have been identified and more are deposited every day in the Protein Data Bank. Such an enormous structural variety is to a first approximation controlled by the sequence of amino acids along the peptide chain of each protein. Understanding how the structural and functional properties of the target can be encoded in this sequence is the main objective of protein design. Unfortunately, rational protein design remains one of the major challenges across the disciplines of biology, physics and chemistry. The implications of solving this problem are enormous and branch into materials science, drug design, evolution and even cryptography. For instance, in the field of drug design an effective computational method to design protein-based ligands for biological targets such as viruses, bacteria or tumour cells, could give a significant boost to the development of new therapies with reduced side effects. In materials science, self-assembly is a highly desired property and soon artificial proteins could represent a new class of designable self-assembling materials. The scope of this review is to describe the state of the art in computational protein design methods and give the reader an outline of what developments could be expected in the near future.
Collapse
Affiliation(s)
- Ivan Coluzza
- Computational Physics, Faculty of Physics, University of Vienna, Vienna, Austria
| |
Collapse
|
30
|
Bhadra P, Pal D. Pipeline for inferring protein function from dynamics using coarse-grained molecular mechanics forcefield. Comput Biol Med 2017; 83:134-142. [PMID: 28279862 DOI: 10.1016/j.compbiomed.2017.02.009] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2016] [Revised: 02/18/2017] [Accepted: 02/22/2017] [Indexed: 11/28/2022]
Abstract
Dynamics is integral to the function of proteins, yet the use of molecular dynamics (MD) simulation as a technique remains under-explored for molecular function inference. This is more important in the context of genomics projects where novel proteins are determined with limited evolutionary information. Recently we developed a method to match the query protein's flexible segments to infer function using a novel approach combining analysis of residue fluctuation-graphs and auto-correlation vectors derived from coarse-grained (CG) MD trajectory. The method was validated on a diverse dataset with sequence identity between proteins as low as 3%, with high function-recall rates. Here we share its implementation as a publicly accessible web service, named DynFunc (Dynamics Match for Function) to query protein function from ≥1 µs long CG dynamics trajectory information of protein subunits. Users are provided with the custom-developed coarse-grained molecular mechanics (CGMM) forcefield to generate the MD trajectories for their protein of interest. On upload of trajectory information, the DynFunc web server identifies specific flexible regions of the protein linked to putative molecular function. Our unique application does not use evolutionary information to infer molecular function from MD information and can, therefore, work for all proteins, including moonlighting and the novel ones, whenever structural information is available. Our pipeline is expected to be of utility to all structural biologists working with novel proteins and interested in moonlighting functions.
Collapse
Affiliation(s)
- Pratiti Bhadra
- Institute Mathematics Initiative, Indian Institute of Science, Bengaluru 560012, India
| | - Debnath Pal
- Institute Mathematics Initiative, Indian Institute of Science, Bengaluru 560012, India; Computational and Data Sciences, Indian Institute of Science, Bengaluru 560012, India.
| |
Collapse
|
31
|
Chen PC, Hologne M, Walker O. Computing the Rotational Diffusion of Biomolecules via Molecular Dynamics Simulation and Quaternion Orientations. J Phys Chem B 2017; 121:1812-1823. [DOI: 10.1021/acs.jpcb.6b11703] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Po-chia Chen
- Université de Lyon, CNRS, Université Claude Bernard Lyon1, Ens de Lyon, Institut des Sciences Analytiques, UMR 5280, 5 rue de la Doua, F-69100 Villeurbanne, France
| | - Maggy Hologne
- Université de Lyon, CNRS, Université Claude Bernard Lyon1, Ens de Lyon, Institut des Sciences Analytiques, UMR 5280, 5 rue de la Doua, F-69100 Villeurbanne, France
| | - Olivier Walker
- Université de Lyon, CNRS, Université Claude Bernard Lyon1, Ens de Lyon, Institut des Sciences Analytiques, UMR 5280, 5 rue de la Doua, F-69100 Villeurbanne, France
| |
Collapse
|
32
|
Gooneie A, Schuschnigg S, Holzer C. A Review of Multiscale Computational Methods in Polymeric Materials. Polymers (Basel) 2017; 9:E16. [PMID: 30970697 PMCID: PMC6432151 DOI: 10.3390/polym9010016] [Citation(s) in RCA: 113] [Impact Index Per Article: 16.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2016] [Revised: 12/07/2016] [Accepted: 12/22/2016] [Indexed: 11/17/2022] Open
Abstract
Polymeric materials display distinguished characteristics which stem from the interplay of phenomena at various length and time scales. Further development of polymer systems critically relies on a comprehensive understanding of the fundamentals of their hierarchical structure and behaviors. As such, the inherent multiscale nature of polymer systems is only reflected by a multiscale analysis which accounts for all important mechanisms. Since multiscale modelling is a rapidly growing multidisciplinary field, the emerging possibilities and challenges can be of a truly diverse nature. The present review attempts to provide a rather comprehensive overview of the recent developments in the field of multiscale modelling and simulation of polymeric materials. In order to understand the characteristics of the building blocks of multiscale methods, first a brief review of some significant computational methods at individual length and time scales is provided. These methods cover quantum mechanical scale, atomistic domain (Monte Carlo and molecular dynamics), mesoscopic scale (Brownian dynamics, dissipative particle dynamics, and lattice Boltzmann method), and finally macroscopic realm (finite element and volume methods). Afterwards, different prescriptions to envelope these methods in a multiscale strategy are discussed in details. Sequential, concurrent, and adaptive resolution schemes are presented along with the latest updates and ongoing challenges in research. In sequential methods, various systematic coarse-graining and backmapping approaches are addressed. For the concurrent strategy, we aimed to introduce the fundamentals and significant methods including the handshaking concept, energy-based, and force-based coupling approaches. Although such methods are very popular in metals and carbon nanomaterials, their use in polymeric materials is still limited. We have illustrated their applications in polymer science by several examples hoping for raising attention towards the existing possibilities. The relatively new adaptive resolution schemes are then covered including their advantages and shortcomings. Finally, some novel ideas in order to extend the reaches of atomistic techniques are reviewed. We conclude the review by outlining the existing challenges and possibilities for future research.
Collapse
Affiliation(s)
- Ali Gooneie
- Chair of Polymer Processing, Montanuniversitaet Leoben, Otto Gloeckel-Strasse 2, 8700 Leoben, Austria.
| | - Stephan Schuschnigg
- Chair of Polymer Processing, Montanuniversitaet Leoben, Otto Gloeckel-Strasse 2, 8700 Leoben, Austria.
| | - Clemens Holzer
- Chair of Polymer Processing, Montanuniversitaet Leoben, Otto Gloeckel-Strasse 2, 8700 Leoben, Austria.
| |
Collapse
|
33
|
Ando D, Gopinathan A. Cooperative Interactions between Different Classes of Disordered Proteins Play a Functional Role in the Nuclear Pore Complex of Baker's Yeast. PLoS One 2017; 12:e0169455. [PMID: 28068389 PMCID: PMC5222603 DOI: 10.1371/journal.pone.0169455] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2016] [Accepted: 12/16/2016] [Indexed: 12/02/2022] Open
Abstract
Nucleocytoplasmic transport is highly selective, efficient, and is regulated by a poorly understood mechanism involving hundreds of disordered FG nucleoporin proteins (FG nups) lining the inside wall of the nuclear pore complex (NPC). Previous research has concluded that FG nups in Baker's yeast (S. cerevisiae) are present in a bimodal distribution, with the "Forest Model" classifying FG nups as either di-block polymer like "trees" or single-block polymer like "shrubs". Using a combination of coarse-grained modeling and polymer brush modeling, the function of the di-block FG nups has previously been hypothesized in the Di-block Copolymer Brush Gate (DCBG) model to form a higher-order polymer brush architecture which can open and close to regulate transport across the NPC. In this manuscript we work to extend the original DCBG model by first performing coarse grained simulations of the single-block FG nups which confirm that they have a single block polymer structure rather than the di-block structure of tree nups. Our molecular simulations also demonstrate that these single-block FG nups are likely cohesive, compact, collapsed coil polymers, implying that these FG nups are generally localized to their grafting location within the NPC. We find that adding a layer of single-block FG nups to the DCBG model increases the range of cargo sizes which are able to translocate the pore through a cooperative effect involving single-block and di-block FG nups. This effect can explain the puzzling connection between single-block FG nup deletion mutants in S. cerevisiae and the resulting failure of certain large cargo transport through the NPC. Facilitation of large cargo transport via single-block and di-block FG nup cooperativity in the nuclear pore could provide a model mechanism for designing future biomimetic pores of greater applicability.
Collapse
Affiliation(s)
- David Ando
- Physical Biosciences Division, Lawrence Berkeley National Laboratory, Berkeley, CA, United States of America
- Joint BioEnergy Institute, Emeryville, CA, United States of America
| | - Ajay Gopinathan
- Department of Physics, University of California, Merced, CA, United States of America
| |
Collapse
|
34
|
Fosso-Tande J, Black C, G. Aller S, Lu L, D. Hills Jr R. Simulation of lipid-protein interactions with the CgProt force field. AIMS MOLECULAR SCIENCE 2017. [DOI: 10.3934/molsci.2017.3.352] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
|
35
|
Joseph JA, Whittleston CS, Wales DJ. Structure, Thermodynamics, and Folding Pathways for a Tryptophan Zipper as a Function of Local Rigidification. J Chem Theory Comput 2016; 12:6109-6117. [DOI: 10.1021/acs.jctc.6b00734] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Affiliation(s)
- Jerelle A. Joseph
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - Chris S. Whittleston
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - David J. Wales
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| |
Collapse
|
36
|
Click TH, Raj N, Chu JW. Calculation of Enzyme Fluctuograms from All-Atom Molecular Dynamics Simulation. Methods Enzymol 2016; 578:327-42. [PMID: 27497173 DOI: 10.1016/bs.mie.2016.05.024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/18/2023]
Abstract
In this work, a computational framework is presented to compute the time evolution of force constants for a coarse grained (CG) elastic network model along an all-atom molecular dynamics trajectory of a protein system. Obtained via matching distance fluctuations, these force constants represent strengths of mechanical coupling between CG beads. Variation of coupling strengths with time is hence termed the fluctuogram of protein dynamics. In addition to the schematic procedure and implementation considerations, several ways of combining force constants and data analysis are presented to illustrate the potential application of protein fluctuograms. The unique angle provided by the fluctuogram expands the scope of atomistic simulations and is expected to impact upon fundamental understanding of protein dynamics as well as protein engineering technologies.
Collapse
Affiliation(s)
- T H Click
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, Taiwan, ROC
| | - N Raj
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, Taiwan, ROC
| | - J-W Chu
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, Taiwan, ROC; Department of Biological Science and Technology, National Chiao Tung University, Hsinchu, Taiwan, ROC; Institute of Molecular Medicine and Bioengineering, National Chiao Tung University, Hsinchu, Taiwan, ROC.
| |
Collapse
|
37
|
Mahalik JP, Brown KA, Cheng X, Fuentes-Cabrera M. Theoretical Study of the Initial Stages of Self-Assembly of a Carboxysome's Facet. ACS NANO 2016; 10:5751-8. [PMID: 26906087 DOI: 10.1021/acsnano.5b07805] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
Bacterial microcompartments, BMCs, are organelles that exist within wide variety of bacteria and act as nanofactories. Among the different types of known BMCs, the carboxysome has been studied the most. The carboxysome plays an important role in the light-independent part of the photosynthesis process, where its icosahedral-like proteinaceous shell acts as a membrane that controls the transport of metabolites. Although a structural model exists for the carboxysome shell, it remains largely unknown how the shell proteins self-assemble. Understanding the self-assembly process can provide insights into how the shell affects the carboxysome's function and how it can be modified to create new functionalities, such as artificial nanoreactors and artificial protein membranes. Here, we describe a theoretical framework that employs Monte Carlo simulations with a coarse-grain potential that reproduces well the atomistic potential of mean force; employing this framework, we are able to capture the initial stages of the 2D self-assembly of CcmK2 hexamers, a major protein-shell component of the carboxysome's facet. The simulations reveal that CcmK2 hexamers self-assemble into clusters that resemble what was seen experimentally in 2D layers. Further analysis of the simulation results suggests that the 2D self-assembly of carboxysome's facets is driven by a nucleation-growth process, which in turn could play an important role in the hierarchical self-assembly of BMC shells in general.
Collapse
Affiliation(s)
| | - Kirsten A Brown
- Chemistry Department, Mercer University , 1501 Mercer University Drive, Macon, Georgia 31207, United States
| | - Xiaolin Cheng
- Department of Biochemistry and Cellular and Molecular Biology, University of Tennessee , M407 Walters Life Sciences, 1414 Cumberland Avenue, Knoxville, Tennessee 37996, United States
| | | |
Collapse
|
38
|
Kmiecik S, Gront D, Kolinski M, Wieteska L, Dawid AE, Kolinski A. Coarse-Grained Protein Models and Their Applications. Chem Rev 2016; 116:7898-936. [DOI: 10.1021/acs.chemrev.6b00163] [Citation(s) in RCA: 555] [Impact Index Per Article: 69.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Affiliation(s)
- Sebastian Kmiecik
- Faculty
of Chemistry, University of Warsaw, Pasteura 1, 02-093 Warsaw, Poland
| | - Dominik Gront
- Faculty
of Chemistry, University of Warsaw, Pasteura 1, 02-093 Warsaw, Poland
| | - Michal Kolinski
- Bioinformatics
Laboratory, Mossakowski Medical Research Center of the Polish Academy of Sciences, Pawinskiego 5, 02-106 Warsaw, Poland
| | - Lukasz Wieteska
- Faculty
of Chemistry, University of Warsaw, Pasteura 1, 02-093 Warsaw, Poland
- Department
of Medical Biochemistry, Medical University of Lodz, Mazowiecka 6/8, 92-215 Lodz, Poland
| | | | - Andrzej Kolinski
- Faculty
of Chemistry, University of Warsaw, Pasteura 1, 02-093 Warsaw, Poland
| |
Collapse
|
39
|
Ramírez CL, Petruk A, Bringas M, Estrin DA, Roitberg AE, Marti MA, Capece L. Coarse-Grained Simulations of Heme Proteins: Validation and Study of Large Conformational Transitions. J Chem Theory Comput 2016; 12:3390-7. [DOI: 10.1021/acs.jctc.6b00278] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Claudia L. Ramírez
- Dto.
de Química Inorgánica, Analítica y Química
Física, Fac. de Ciencias Exactas y Naturales, Univ. de Buenos Aires/INQUIMAE-CONICET, Buenos Aires, C1428EGA, Argentina
- Dto.
de Química Biologica Fac. de Ciencias Exactas y Naturales, Univ. de Buenos Aires/IQUIBICEN-CONICET, Buenos Aires, C1428EGA, Argentina
| | - Ariel Petruk
- Dto.
de Química Inorgánica, Analítica y Química
Física, Fac. de Ciencias Exactas y Naturales, Univ. de Buenos Aires/INQUIMAE-CONICET, Buenos Aires, C1428EGA, Argentina
| | - Mauro Bringas
- Dto.
de Química Inorgánica, Analítica y Química
Física, Fac. de Ciencias Exactas y Naturales, Univ. de Buenos Aires/INQUIMAE-CONICET, Buenos Aires, C1428EGA, Argentina
| | - Dario A. Estrin
- Dto.
de Química Inorgánica, Analítica y Química
Física, Fac. de Ciencias Exactas y Naturales, Univ. de Buenos Aires/INQUIMAE-CONICET, Buenos Aires, C1428EGA, Argentina
| | - Adrian E. Roitberg
- Department
of Chemistry, University of Florida, Gainesville, Florida 32611-7200, United States
| | - Marcelo A. Marti
- Dto.
de Química Biologica Fac. de Ciencias Exactas y Naturales, Univ. de Buenos Aires/IQUIBICEN-CONICET, Buenos Aires, C1428EGA, Argentina
| | - Luciana Capece
- Dto.
de Química Inorgánica, Analítica y Química
Física, Fac. de Ciencias Exactas y Naturales, Univ. de Buenos Aires/INQUIMAE-CONICET, Buenos Aires, C1428EGA, Argentina
| |
Collapse
|
40
|
Dunn NJH, Noid WG. Bottom-up coarse-grained models with predictive accuracy and transferability for both structural and thermodynamic properties of heptane-toluene mixtures. J Chem Phys 2016; 144:204124. [DOI: 10.1063/1.4952422] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Affiliation(s)
- Nicholas J. H. Dunn
- Department of Chemistry, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - W. G. Noid
- Department of Chemistry, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| |
Collapse
|
41
|
Cao Z, Voth GA. The multiscale coarse-graining method. XI. Accurate interactions based on the centers of charge of coarse-grained sites. J Chem Phys 2016; 143:243116. [PMID: 26723601 DOI: 10.1063/1.4933249] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
It is essential to be able to systematically construct coarse-grained (CG) models that can efficiently and accurately reproduce key properties of higher-resolution models such as all-atom. To fulfill this goal, a mapping operator is needed to transform the higher-resolution configuration to a CG configuration. Certain mapping operators, however, may lose information related to the underlying electrostatic properties. In this paper, a new mapping operator based on the centers of charge of CG sites is proposed to address this issue. Four example systems are chosen to demonstrate this concept. Within the multiscale coarse-graining framework, CG models that use this mapping operator are found to better reproduce the structural correlations of atomistic models. The present work also demonstrates the flexibility of the mapping operator and the robustness of the force matching method. For instance, important functional groups can be isolated and emphasized in the CG model.
Collapse
Affiliation(s)
- Zhen Cao
- Department of Chemistry, James Franck Institute, Institute for Biophysical Dynamics, and Computation Institute, The University of Chicago, 5735 S Ellis Ave., Chicago, Illinois 60637, USA
| | - Gregory A Voth
- Department of Chemistry, James Franck Institute, Institute for Biophysical Dynamics, and Computation Institute, The University of Chicago, 5735 S Ellis Ave., Chicago, Illinois 60637, USA
| |
Collapse
|
42
|
Hills RD, McGlinchey N. Model parameters for simulation of physiological lipids. J Comput Chem 2016; 37:1112-8. [PMID: 26864972 PMCID: PMC5067697 DOI: 10.1002/jcc.24324] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2015] [Revised: 12/19/2015] [Accepted: 01/17/2016] [Indexed: 12/16/2022]
Abstract
Coarse grain simulation of proteins in their physiological membrane environment can offer insight across timescales, but requires a comprehensive force field. Parameters are explored for multicomponent bilayers composed of unsaturated lipids DOPC and DOPE, mixed-chain saturation POPC and POPE, and anionic lipids found in bacteria: POPG and cardiolipin. A nonbond representation obtained from multiscale force matching is adapted for these lipids and combined with an improved bonding description of cholesterol. Equilibrating the area per lipid yields robust bilayer simulations and properties for common lipid mixtures with the exception of pure DOPE, which has a known tendency to form nonlamellar phase. The models maintain consistency with an existing lipid-protein interaction model, making the force field of general utility for studying membrane proteins in physiologically representative bilayers.
Collapse
Affiliation(s)
- Ronald D Hills
- Department of Pharmaceutical Sciences, College of Pharmacy, University of New England, 716 Stevens Ave, Portland, Maine, 04103
| | - Nicholas McGlinchey
- Department of Pharmaceutical Sciences, College of Pharmacy, University of New England, 716 Stevens Ave, Portland, Maine, 04103
| |
Collapse
|
43
|
Jia Z, Chen J. Necessity of high‐resolution for coarse‐grained modeling of flexible proteins. J Comput Chem 2016; 37:1725-33. [DOI: 10.1002/jcc.24391] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2015] [Revised: 03/11/2016] [Accepted: 03/27/2016] [Indexed: 12/13/2022]
Affiliation(s)
- Zhiguang Jia
- Department of Biochemistry and Molecular BiophysicsKansas State UniversityManhattan Kansas66506
| | - Jianhan Chen
- Department of Biochemistry and Molecular BiophysicsKansas State UniversityManhattan Kansas66506
| |
Collapse
|
44
|
Ruff KM, Harmon TS, Pappu RV. CAMELOT: A machine learning approach for coarse-grained simulations of aggregation of block-copolymeric protein sequences. J Chem Phys 2015; 143:243123. [PMID: 26723608 PMCID: PMC4644154 DOI: 10.1063/1.4935066] [Citation(s) in RCA: 58] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2015] [Accepted: 10/21/2015] [Indexed: 01/28/2023] Open
Abstract
We report the development and deployment of a coarse-graining method that is well suited for computer simulations of aggregation and phase separation of protein sequences with block-copolymeric architectures. Our algorithm, named CAMELOT for Coarse-grained simulations Aided by MachinE Learning Optimization and Training, leverages information from converged all atom simulations that is used to determine a suitable resolution and parameterize the coarse-grained model. To parameterize a system-specific coarse-grained model, we use a combination of Boltzmann inversion, non-linear regression, and a Gaussian process Bayesian optimization approach. The accuracy of the coarse-grained model is demonstrated through direct comparisons to results from all atom simulations. We demonstrate the utility of our coarse-graining approach using the block-copolymeric sequence from the exon 1 encoded sequence of the huntingtin protein. This sequence comprises of 17 residues from the N-terminal end of huntingtin (N17) followed by a polyglutamine (polyQ) tract. Simulations based on the CAMELOT approach are used to show that the adsorption and unfolding of the wild type N17 and its sequence variants on the surface of polyQ tracts engender a patchy colloid like architecture that promotes the formation of linear aggregates. These results provide a plausible explanation for experimental observations, which show that N17 accelerates the formation of linear aggregates in block-copolymeric N17-polyQ sequences. The CAMELOT approach is versatile and is generalizable for simulating the aggregation and phase behavior of a range of block-copolymeric protein sequences.
Collapse
Affiliation(s)
- Kiersten M Ruff
- Computational and Systems Biology Program and Center for Biological Systems Engineering, Washington University in St. Louis, St. Louis, Missouri 63130-4899, USA
| | - Tyler S Harmon
- Department of Physics and Center for Biological Systems Engineering, Washington University in St. Louis, St. Louis, Missouri 63130-4899, USA
| | - Rohit V Pappu
- Department of Biomedical Engineering and Center for Biological Systems Engineering, Washington University in St. Louis, CB 1097, St. Louis, Missouri 63130-4899, USA
| |
Collapse
|
45
|
Lombardi LE, Martí MA, Capece L. CG2AA: backmapping protein coarse-grained structures. Bioinformatics 2015; 32:1235-7. [DOI: 10.1093/bioinformatics/btv740] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2015] [Accepted: 12/12/2015] [Indexed: 01/24/2023] Open
|
46
|
Emperador A, Sfriso P, Villarreal MA, Gelpí JL, Orozco M. PACSAB: Coarse-Grained Force Field for the Study of Protein–Protein Interactions and Conformational Sampling in Multiprotein Systems. J Chem Theory Comput 2015; 11:5929-38. [DOI: 10.1021/acs.jctc.5b00660] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Agustí Emperador
- Institute for Research in Biomedicine (IRB Barcelona), Baldiri
i Reixac 10, Barcelona 08028, Spain
- Joint BSC-IRB Research Program in Computational Biology, IRB Barcelona, Barcelona 08028, Spain
| | - Pedro Sfriso
- Institute for Research in Biomedicine (IRB Barcelona), Baldiri
i Reixac 10, Barcelona 08028, Spain
- Joint BSC-IRB Research Program in Computational Biology, IRB Barcelona, Barcelona 08028, Spain
| | - Marcos Ariel Villarreal
- Instituto de Investigaciones en Fisicoquímica de Córdoba
- Departamento de Matemática y Física, CONICET-Universidad Nacional de Córdoba, University City, Córdoba 5000, Argentina
| | - Josep Lluis Gelpí
- Institute for Research in Biomedicine (IRB Barcelona), Baldiri
i Reixac 10, Barcelona 08028, Spain
- Joint BSC-IRB Research Program in Computational Biology, IRB Barcelona, Barcelona 08028, Spain
- Barcelona Supercomputing Center, Jordi Girona
29, Barcelona 08034, Spain
- Departament de Bioquímica, Facultat de Biologia, Avgda Diagonal 645, Barcelona 08028, Spain
| | - Modesto Orozco
- Institute for Research in Biomedicine (IRB Barcelona), Baldiri
i Reixac 10, Barcelona 08028, Spain
- Joint BSC-IRB Research Program in Computational Biology, IRB Barcelona, Barcelona 08028, Spain
- Barcelona Supercomputing Center, Jordi Girona
29, Barcelona 08034, Spain
- Departament de Bioquímica, Facultat de Biologia, Avgda Diagonal 645, Barcelona 08028, Spain
| |
Collapse
|
47
|
Solis AD. Amino acid alphabet reduction preserves fold information contained in contact interactions in proteins. Proteins 2015; 83:2198-216. [DOI: 10.1002/prot.24936] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2015] [Revised: 09/04/2015] [Accepted: 09/04/2015] [Indexed: 12/14/2022]
Affiliation(s)
- Armando D. Solis
- Biological Sciences Department, New York City College of Technology; the City University of New York (CUNY); Brooklyn New York 11201
| |
Collapse
|
48
|
Leherte L. Reduced point charge models of proteins: assessment based on molecular dynamics simulations. MOLECULAR SIMULATION 2015. [DOI: 10.1080/08927022.2015.1044452] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
49
|
Affiliation(s)
- Ivan Coluzza
- Department of Computational Physics, Faculty of Physics, University of Vienna , Vienna, Austria
| |
Collapse
|
50
|
Rudzinski JF, Noid WG. Bottom-Up Coarse-Graining of Peptide Ensembles and Helix–Coil Transitions. J Chem Theory Comput 2015; 11:1278-91. [DOI: 10.1021/ct5009922] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Joseph F. Rudzinski
- Department
of Chemistry, The Pennsylvania State University, University Park, Pennsylvania 16802, United States
| | - William G. Noid
- Department
of Chemistry, The Pennsylvania State University, University Park, Pennsylvania 16802, United States
| |
Collapse
|