1
|
Carrer M, Cezar HM, Bore SL, Ledum M, Cascella M. Learning Force Field Parameters from Differentiable Particle-Field Molecular Dynamics. J Chem Inf Model 2024. [PMID: 38963184 DOI: 10.1021/acs.jcim.4c00564] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/05/2024]
Abstract
We develop ∂-HylleraasMD (∂-HyMD), a fully end-to-end differentiable molecular dynamics software based on the Hamiltonian hybrid particle-field formalism, and use it to establish a protocol for automated optimization of force field parameters. ∂-HyMD is templated on the recently released HylleraaasMD software, while using the JAX autodiff framework as the main engine for the differentiable dynamics. ∂-HyMD exploits an embarrassingly parallel optimization algorithm by spawning independent simulations, whose trajectories are simultaneously processed by reverse mode automatic differentiation to calculate the gradient of the loss function, which is in turn used for iterative optimization of the force-field parameters. We show that parallel organization facilitates the convergence of the minimization procedure, avoiding the known memory and numerical stability issues of differentiable molecular dynamics approaches. We showcase the effectiveness of our implementation by producing a library of force field parameters for standard phospholipids, with either zwitterionic or anionic heads and with saturated or unsaturated tails. Compared to the all-atom reference, the force field obtained by ∂-HyMD yields better density profiles than the parameters derived from previously utilized gradient-free optimization procedures. Moreover, ∂-HyMD models can predict with good accuracy properties not included in the learning objective, such as lateral pressure profiles, and are transferable to other systems, including triglycerides.
Collapse
Affiliation(s)
- Manuel Carrer
- Hylleraas Centre for Quantum Molecular Sciences and Department of Chemistry, University of Oslo, PO Box 1033, Blindern, 0315 Oslo, Norway
| | - Henrique Musseli Cezar
- Hylleraas Centre for Quantum Molecular Sciences and Department of Chemistry, University of Oslo, PO Box 1033, Blindern, 0315 Oslo, Norway
| | - Sigbjørn Løland Bore
- Hylleraas Centre for Quantum Molecular Sciences and Department of Chemistry, University of Oslo, PO Box 1033, Blindern, 0315 Oslo, Norway
| | - Morten Ledum
- Hylleraas Centre for Quantum Molecular Sciences and Department of Chemistry, University of Oslo, PO Box 1033, Blindern, 0315 Oslo, Norway
| | - Michele Cascella
- Hylleraas Centre for Quantum Molecular Sciences and Department of Chemistry, University of Oslo, PO Box 1033, Blindern, 0315 Oslo, Norway
| |
Collapse
|
2
|
Giulini M, Fiorentini R, Tubiana L, Potestio R, Menichetti R. EXCOGITO, an Extensible Coarse-Graining Toolbox for the Investigation of Biomolecules by Means of Low-Resolution Representations. J Chem Inf Model 2024; 64:4912-4927. [PMID: 38860513 DOI: 10.1021/acs.jcim.4c00490] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/12/2024]
Abstract
Bottom-up coarse-grained (CG) models proved to be essential to complement and sometimes even replace all-atom representations of soft matter systems and biological macromolecules. The development of low-resolution models takes the moves from the reduction of the degrees of freedom employed, that is, the definition of a mapping between a system's high-resolution description and its simplified counterpart. Even in the absence of an explicit parametrization and simulation of a CG model, the observation of the atomistic system in simpler terms can be informative: this idea is leveraged by the mapping entropy, a measure of the information loss inherent to the process of coarsening. Mapping entropy lies at the heart of the extensible coarse-graining toolbox, EXCOGITO, developed to perform a number of operations and analyses on molecular systems pivoting around the properties of mappings. EXCOGITO can process an all-atom trajectory to compute the mapping entropy, identify the mapping that minimizes it, and establish quantitative relations between a low-resolution representation and the geometrical, structural, and energetic features of the system. Here, the software, which is available free of charge under an open-source license, is presented and showcased to introduce potential users to its capabilities and usage.
Collapse
Affiliation(s)
- Marco Giulini
- Physics Department, University of Trento, Via Sommarive, 14, Trento I-38123, Italy
- INFN-TIFPA, Trento Institute for Fundamental Physics and Applications, Trento I-38123, Italy
| | - Raffaele Fiorentini
- Physics Department, University of Trento, Via Sommarive, 14, Trento I-38123, Italy
- INFN-TIFPA, Trento Institute for Fundamental Physics and Applications, Trento I-38123, Italy
| | - Luca Tubiana
- Physics Department, University of Trento, Via Sommarive, 14, Trento I-38123, Italy
- INFN-TIFPA, Trento Institute for Fundamental Physics and Applications, Trento I-38123, Italy
| | - Raffaello Potestio
- Physics Department, University of Trento, Via Sommarive, 14, Trento I-38123, Italy
- INFN-TIFPA, Trento Institute for Fundamental Physics and Applications, Trento I-38123, Italy
| | - Roberto Menichetti
- Physics Department, University of Trento, Via Sommarive, 14, Trento I-38123, Italy
- INFN-TIFPA, Trento Institute for Fundamental Physics and Applications, Trento I-38123, Italy
| |
Collapse
|
3
|
Walter LJ, Quoika PK, Zacharias M. Structure-Based Protein Assembly Simulations Including Various Binding Sites and Conformations. J Chem Inf Model 2024; 64:3465-3476. [PMID: 38602938 DOI: 10.1021/acs.jcim.4c00212] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/13/2024]
Abstract
Many biological functions are mediated by large complexes formed by multiple proteins and other cellular macromolecules. Recent progress in experimental structure determination, as well as in integrative modeling and protein structure prediction using deep learning approaches, has resulted in a rapid increase in the number of solved multiprotein assemblies. However, the assembly process of large complexes from their components is much less well-studied. We introduce a rapid computational structure-based (SB) model, GoCa, that allows to follow the assembly process of large multiprotein complexes based on a known native structure. Beyond existing SB Go̅-type models, it distinguishes between intra- and intersubunit interactions, allowing us to include coupled folding and binding. It accounts automatically for the permutation of identical subunits in a complex and allows the definition of multiple minima (native) structures in the case of proteins that undergo global transitions during assembly. The model is successfully tested on several multiprotein complexes. The source code of the GoCa program including a tutorial is publicly available on Github: https://github.com/ZachariasLab/GoCa. We also provide a web source that allows users to quickly generate the necessary input files for a GoCa simulation: https://goca.t38webservices.nat.tum.de.
Collapse
Affiliation(s)
- Luis J Walter
- Center for Functional Protein Assemblies, Technical University of Munich, Ernst-Otto-Fischer-Str. 8, Garching 85748, Germany
| | - Patrick K Quoika
- Center for Functional Protein Assemblies, Technical University of Munich, Ernst-Otto-Fischer-Str. 8, Garching 85748, Germany
| | - Martin Zacharias
- Center for Functional Protein Assemblies, Technical University of Munich, Ernst-Otto-Fischer-Str. 8, Garching 85748, Germany
| |
Collapse
|
4
|
Karmakar T, Soares TA, Merz KM. Enhancing Coarse-Grained Models through Machine Learning. J Chem Inf Model 2024; 64:2931-2932. [PMID: 38644772 DOI: 10.1021/acs.jcim.4c00537] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/23/2024]
Affiliation(s)
- Tarak Karmakar
- Department of Chemistry, Indian Institute of Technology, Delhi, Hauz Khas, New Delhi 110016, India
| | - Thereza A Soares
- Department of Chemistry, FFCLRP, University of São Paulo, Ribeirão Preto 14040-901, Brazil
- Hylleraas Centre for Quantum Molecular Sciences, University of Oslo, Oslo 0315, Norway
| | - Kenneth M Merz
- Department of Chemistry, Michigan State University, Lansing 48824, Michigan, United States
| |
Collapse
|
5
|
Wang Y, Hernandez R. Construction of Multiscale Dissipative Particle Dynamics (DPD) Models from Other Coarse-Grained Models. ACS OMEGA 2024; 9:17667-17680. [PMID: 38645334 PMCID: PMC11025104 DOI: 10.1021/acsomega.4c01868] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/26/2024] [Revised: 03/06/2024] [Accepted: 03/15/2024] [Indexed: 04/23/2024]
Abstract
We present a general scheme for converting coarse-grained models into Dissipative Particle Dynamics (DPD) models. We build the corresponding DPD models by analogy with the de novo DPD coarse-graining scheme suggested by Groot and Warren (J. Chem. Phys., 1997). Electrostatic interactions between charged DPD particles are represented though the addition of a long-range Slater Coulomb potential as suggested by González-Melchor et al. (J. Chem. Phys., 2006). The construction is illustrated by converting MARTINI models for various proteins into a DPD representation, but it not restricted to the usual potential form in the MARTINI model-viz., Lennard-Jones potentials. We further extended the DPD scheme away from the typical use of homogeneous particle sizes, therefore faithfully representing the variations in the particle sizes seen in the underlying MARTINI model. The accuracy of the resulting construction of our generalized DPD models with respect to several structural observables has been benchmarked favorably against all-atom and MARTINI models for a selected set of peptides and proteins, and variations in the scales of the coarse-graining of the water solvent.
Collapse
Affiliation(s)
- Yinhan Wang
- Department of Chemistry, The Johns Hopkins University, Baltimore, Maryland 21218, United States
| | - Rigoberto Hernandez
- Department of Chemistry, The Johns Hopkins University, Baltimore, Maryland 21218, United States
| |
Collapse
|
6
|
Pirnia A, Maqdisi R, Mittal S, Sener M, Singharoy A. Perspective on Integrative Simulations of Bioenergetic Domains. J Phys Chem B 2024; 128:3302-3319. [PMID: 38562105 DOI: 10.1021/acs.jpcb.3c07335] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/04/2024]
Abstract
Bioenergetic processes in cells, such as photosynthesis or respiration, integrate many time and length scales, which makes the simulation of energy conversion with a mere single level of theory impossible. Just like the myriad of experimental techniques required to examine each level of organization, an array of overlapping computational techniques is necessary to model energy conversion. Here, a perspective is presented on recent efforts for modeling bioenergetic phenomena with a focus on molecular dynamics simulations and its variants as a primary method. An overview of the various classical, quantum mechanical, enhanced sampling, coarse-grained, Brownian dynamics, and Monte Carlo methods is presented. Example applications discussed include multiscale simulations of membrane-wide electron transport, rate kinetics of ATP turnover from electrochemical gradients, and finally, integrative modeling of the chromatophore, a photosynthetic pseudo-organelle.
Collapse
Affiliation(s)
- Adam Pirnia
- School of Molecular Sciences, Arizona State University, Tempe, Arizona 85287-1004, United States
| | - Ranel Maqdisi
- School of Molecular Sciences, Arizona State University, Tempe, Arizona 85287-1004, United States
| | - Sumit Mittal
- VIT Bhopal University, Sehore 466114, Madhya Pradesh, India
| | - Melih Sener
- School of Molecular Sciences, Arizona State University, Tempe, Arizona 85287-1004, United States
- Beckman Institute, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Abhishek Singharoy
- School of Molecular Sciences, Arizona State University, Tempe, Arizona 85287-1004, United States
| |
Collapse
|
7
|
Jin J, Reichman DR. Hierarchical Framework for Predicting Entropies in Bottom-Up Coarse-Grained Models. J Phys Chem B 2024; 128:3182-3199. [PMID: 38507575 DOI: 10.1021/acs.jpcb.3c07624] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/22/2024]
Abstract
The thermodynamic entropy of coarse-grained (CG) models stands as one of the most important properties for quantifying the missing information during the CG process and for establishing transferable (or extendible) CG interactions. However, performing additional CG simulations on top of model construction often leads to significant additional computational overhead. In this work, we propose a simple hierarchical framework for predicting the thermodynamic entropies of various molecular CG systems. Our approach employs a decomposition of the CG interactions, enabling the estimation of the CG partition function and thermodynamic properties a priori. Starting from the ideal gas description, we leverage classical perturbation theory to systematically incorporate simple yet essential interactions, ranging from the hard sphere model to the generalized van der Waals model. Additionally, we propose an alternative approach based on multiparticle correlation functions, allowing for systematic improvements through higher-order correlations. Numerical applications to molecular liquids validate the high fidelity of our approach, and our computational protocols demonstrate that a reduced model with simple energetics can reasonably estimate the thermodynamic entropy of CG models without performing any CG simulations. Overall, our findings present a systematic framework for estimating not only the entropy but also other thermodynamic properties of CG models, relying solely on information from the reference system.
Collapse
Affiliation(s)
- Jaehyeok Jin
- Department of Chemistry, Columbia University, 3000 Broadway, New York, New York 10027, United States
| | - David R Reichman
- Department of Chemistry, Columbia University, 3000 Broadway, New York, New York 10027, United States
| |
Collapse
|
8
|
Xie P, Car R, E W. Ab initio generalized Langevin equation. Proc Natl Acad Sci U S A 2024; 121:e2308668121. [PMID: 38551836 PMCID: PMC10998567 DOI: 10.1073/pnas.2308668121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2023] [Accepted: 02/22/2024] [Indexed: 04/08/2024] Open
Abstract
We introduce a machine learning-based approach called ab initio generalized Langevin equation (AIGLE) to model the dynamics of slow collective variables (CVs) in materials and molecules. In this scheme, the parameters are learned from atomistic simulations based on ab initio quantum mechanical models. Force field, memory kernel, and noise generator are constructed in the context of the Mori-Zwanzig formalism, under the constraint of the fluctuation-dissipation theorem. Combined with deep potential molecular dynamics and electronic density functional theory, this approach opens the way to multiscale modeling in a variety of situations. Here, we demonstrate this capability with a study of two mesoscale processes in crystalline lead titanate, namely the field-driven dynamics of a planar ferroelectric domain wall, and the dynamics of an extensive lattice of coarse-grained electric dipoles. In the first case, AIGLE extends the reach of ab initio simulations to a regime of noise-driven motions not accessible to molecular dynamics. In the second case, AIGLE deals with an extensive set of CVs by adopting a local approximation for the memory kernel and retaining only short-range noise correlations. The scheme is computationally more efficient than molecular dynamics by several orders of magnitude and mimics the microscopic dynamics at low frequencies where it reproduces accurately the dominant far-infrared absorption frequency.
Collapse
Affiliation(s)
- Pinchen Xie
- Program in Applied and Computational Mathematics, Princeton University, Princeton, NJ08544
| | - Roberto Car
- Program in Applied and Computational Mathematics, Princeton University, Princeton, NJ08544
- Department of Chemistry and Princeton Materials Institute, Princeton University, Princeton, NJ08544
- Department of Physics, Princeton University, Princeton, NJ08544
| | - Weinan E
- AI for Science Institute, Beijing100080, China
- Center for Machine Learning Research and School of Mathematical Sciences, Peking University, Beijing100084, China
| |
Collapse
|
9
|
Yang S, Song C. Switching Go̅ -Martini for Investigating Protein Conformational Transitions and Associated Protein-Lipid Interactions. J Chem Theory Comput 2024; 20:2618-2629. [PMID: 38447049 DOI: 10.1021/acs.jctc.3c01222] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/08/2024]
Abstract
Proteins are dynamic biomolecules that can transform between different conformational states when exerting physiological functions, which is difficult to simulate using all-atom methods. Coarse-grained (CG) Go̅-like models are widely used to investigate large-scale conformational transitions, which usually adopt implicit solvent models and therefore cannot explicitly capture the interaction between proteins and surrounding molecules, such as water and lipid molecules. Here, we present a new method, named Switching Go̅-Martini, to simulate large-scale protein conformational transitions between different states, based on the switching Go̅ method and the CG Martini 3 force field. The method is straightforward and efficient, as demonstrated by the benchmarking applications for multiple protein systems, including glutamine binding protein (GlnBP), adenylate kinase (AdK), and β2-adrenergic receptor (β2AR). Moreover, by employing the Switching Go̅-Martini method, we can not only unveil the conformational transition from the E2Pi-PL state to E1 state of the type 4 P-type ATPase (P4-ATPase) flippase ATP8A1-CDC50 but also provide insights into the intricate details of lipid transport.
Collapse
Affiliation(s)
- Song Yang
- Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
- Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
| | - Chen Song
- Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
- Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
| |
Collapse
|
10
|
Jin X, Hu X, Chen J, Shan L, Hao D, Zhang R. Electric field induced the changes in structure and function of human transforming growth factor beta receptor type I: from molecular dynamics to docking. J Biomol Struct Dyn 2024:1-12. [PMID: 38516997 DOI: 10.1080/07391102.2024.2329288] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2023] [Accepted: 03/06/2024] [Indexed: 03/23/2024]
Abstract
The transforming growth factor beta (TGF-β) signaling pathway is believed to play essential roles in several physiological activities, including cancer. TGF-β receptor type I (TBR-I) is a key membrane receptor protein in the TGF-β signaling pathway, which relates to many intracellular biological effects. In recent years, cold atmospheric plasma (CAP) has been found to have promising prospects in selective anticancer therapy and has confirmed its essential role in the TGF-β signaling pathway. However, the ambiguous effect of CAP-induced electric field (EF) on TBR-I still limits the application of CAP in clinical therapy. Molecular dynamics is applied to assess the effect of EF on the structure of the extracellular domain of TBR-I using a series of indicators and methods, and then we discuss the ligand binding ability of TBR-I. Results show that moderate EF intensities' structural restraints may contribute to the structural stability and ligand-binding ability of TBR-I, but an EF higher than 0.1 V/nm will be harmful. What's more, EF induces a change in the docking interface of TBR-I, showing the conformation and position of special sequences of residues decide the ligand binding surface. The relevant results suggest that CAP-induced EF plays a crucial role in receptor-receptor interaction and provides significant guidelines for EF-related anticancer therapy.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Xinrui Jin
- School of Energy and Electrical Engineering, Chang'an University, Xi'an, China
| | - Xiaochuan Hu
- School of Energy and Electrical Engineering, Chang'an University, Xi'an, China
| | - Jiayu Chen
- School of Energy and Electrical Engineering, Chang'an University, Xi'an, China
| | - Lequn Shan
- Department of Spine Surgery, Honghui Hospital, Xi'an Jiaotong University, Xi'an, China
| | - Dingjun Hao
- Department of Spine Surgery, Honghui Hospital, Xi'an Jiaotong University, Xi'an, China
| | - Rui Zhang
- Department of Spine Surgery, Honghui Hospital, Xi'an Jiaotong University, Xi'an, China
| |
Collapse
|
11
|
Izvekov S, Kroonblawd MP, Larentzos JP, Brennan JK, Rice BM. Maximum Entropy Theory of Multiscale Coarse-Graining via Matching Thermodynamic Forces: Application to a Molecular Crystal (TATB). J Phys Chem B 2024. [PMID: 38489758 DOI: 10.1021/acs.jpcb.3c07078] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/17/2024]
Abstract
The MSCG/FM (multiscale coarse-graining via force-matching) approach is an efficient supervised machine learning method to develop microscopically informed coarse-grained (CG) models. We present a theory based on the principle of maximum entropy (PME) enveloping the existing MSCG/FM approaches. This theory views the MSCG/FM method as a special case of matching the thermodynamic forces from the extended ensemble described by the set of thermodynamic (relevant) system coordinates. This set may include CG coordinates, the stress tensor, applied external fields, and so forth, and may be characterized by nonequilibrium conditions. Following the presentation of the theory, we discuss the consistent matching of both bonded and nonbonded interactions. The proposed PME formulation is used as a starting point to extend the MSCG/FM method to the constant strain ensemble, which together with the explicit matching of the bonded forces is better suited for coarse-graining anisotropic media at a submolecular resolution. The theory is demonstrated by performing the fine coarse-graining of crystalline 1,3,5-triamino-2,4,6-trinitrobenzene (TATB), a well-known insensitive molecular energetic material, which exhibits highly anisotropic mechanical properties.
Collapse
Affiliation(s)
- Sergei Izvekov
- U.S. Army DEVCOM Army Research Laboratory, Aberdeen Proving Ground, Maryland 21005, United States
| | - Matthew P Kroonblawd
- Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, California 94550, United States
| | - James P Larentzos
- U.S. Army DEVCOM Army Research Laboratory, Aberdeen Proving Ground, Maryland 21005, United States
| | - John K Brennan
- U.S. Army DEVCOM Army Research Laboratory, Aberdeen Proving Ground, Maryland 21005, United States
| | - Betsy M Rice
- U.S. Army DEVCOM Army Research Laboratory, Aberdeen Proving Ground, Maryland 21005, United States
| |
Collapse
|
12
|
Ghosh S, Douglas JF. Phase separation in the presence of fractal aggregates. J Chem Phys 2024; 160:104903. [PMID: 38469910 DOI: 10.1063/5.0190196] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2023] [Accepted: 02/16/2024] [Indexed: 03/13/2024] Open
Abstract
Liquid-liquid phase separation in diverse manufacturing and biological contexts often occurs in the presence of aggregated particles or complex-shaped structures that do not actively participate in the phase separation process, but these "background" structures can serve to direct the macroscale phase separation morphology by their local symmetry-breaking presence. We perform Cahn-Hilliard phase-field simulations in two dimensions to investigate the morphological evolution, wetting, and domain growth phenomena during the phase separation of a binary mixture in contact with model fractal aggregates. Our simulations reveal that phase separation initially accelerates around the fractal due to the driving force of wetting, leading to the formation of the target composition patterns about the fractals, as previously observed for circular particles. After the formation of a wetting layer on the fractal, however, we observe a dramatic slowing-down in the kinetics of phase separation, and the characteristic domain size eventually "pins" to a finite value or approaches an asymptotic scaling regime as an ordinary phase if the phase separation loses memory of the aggregates when the scale of phase separation becomes much larger than the aggregate. Furthermore, we perform simulations to examine the effects of compositional interference between fractals with a view to elucidating interesting novel morphological features in the phase-separating mixture. Our findings should be helpful in understanding the qualitative aspects of the phase separation processes in mixtures containing particle aggregates relevant for coating, catalyst, adhesive, and electronic applications as well as in diverse biological contexts, where phase separation occurs in the presence of irregular heterogeneities.
Collapse
Affiliation(s)
- Supriyo Ghosh
- Metallurgical & Materials Engineering Department, Indian Institute of Technology, Roorkee, Uttarakhand 247667, India
| | - Jack F Douglas
- Materials Science & Engineering Division, National Institute of Standards and Technology, Gaithersburg, Maryland 20899, USA
| |
Collapse
|
13
|
Mondal S, Mukherjee S, Bagchi B. Melting and Bubble Formation in a Double-Stranded DNA: Microscopic Aspects of Early Base-Pair Opening Events and the Role of Water. J Phys Chem B 2024; 128:2076-2086. [PMID: 38389118 DOI: 10.1021/acs.jpcb.3c06519] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/24/2024]
Abstract
Despite its rigid structure, DNA is a remarkably flexible molecule. Flexibility is essential for biological functions (such as transcription and gene repair), which require large-amplitude structural changes such as bubble formation. The bubbles thus formed are required to have a certain stability of their own and survive long on the time scale of molecular motions. A molecular understanding of fluctuations leading to quasi-stable structures is not available. Through extensive atomistic molecular dynamics simulations, we identify a sequence of microscopic events that culminate in local bubble formation, which is initiated by base-pair (BP) opening, resulting from the cleavage of native BP hydrogen bonds (HBs). This is followed by the formation of mismatched BPs with non-native contacts. These metastable structures can either revert to their original forms or undergo a flipping transition to form a local bubble that can span across 3-4 BPs. A substantial distortion of the DNA backbone and a disruption of BP stacking are observed because of the structural changes induced by these local perturbations. We also explored how water helps in the entire process. A small number of water molecules undergo rearrangement to stabilize the intermediate states by forming HBs with DNA bases. Water thus acts as a lubricant that counteracts the enthalpic penalty suffered from the loss of native BP contacts. Although the process of bubble formation is reversible, the sequence of steps involved poses an entropic barrier, preventing it from easily retracing the path to the native state.
Collapse
Affiliation(s)
- Sayantan Mondal
- Department of Chemistry, Boston University, 590 Commonwealth Avenue, Boston, Massachusetts 02215, United States
| | - Saumyak Mukherjee
- Center for Theoretical Chemistry, Ruhr University Bochum, Universitätsstraße 150, Bochum D-44780, Germany
| | - Biman Bagchi
- Solid State and Structural Chemistry Unit, Indian Institute of Science, Bangalore, Karnataka 560012, India
| |
Collapse
|
14
|
Chennakesavalu S, Rotskoff GM. Data-Efficient Generation of Protein Conformational Ensembles with Backbone-to-Side-Chain Transformers. J Phys Chem B 2024; 128:2114-2123. [PMID: 38394363 DOI: 10.1021/acs.jpcb.3c08195] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/25/2024]
Abstract
Excitement at the prospect of using data-driven generative models to sample configurational ensembles of biomolecular systems stems from the extraordinary success of these models on a diverse set of high-dimensional sampling tasks. Unlike image generation or even the closely related problem of protein structure prediction, there are currently no data sources with sufficient breadth to parametrize generative models for conformational ensembles. To enable discovery, a fundamentally different approach to building generative models is required: models should be able to propose rare, albeit physical, conformations that may not arise in even the largest data sets. Here we introduce a modular strategy to generate conformations based on "backmapping" from a fixed protein backbone that (1) maintains conformational diversity of the side chains and (2) couples the side-chain fluctuations using global information about the protein conformation. Our model combines simple statistical models of side-chain conformations based on rotamer libraries with the now ubiquitous transformer architecture to sample with atomistic accuracy. Together, these ingredients provide a strategy for rapid data acquisition and hence a crucial ingredient for scalable physical simulation with generative neural networks.
Collapse
Affiliation(s)
| | - Grant M Rotskoff
- Department of Chemistry, Stanford University, Stanford, California 94305, United States
- Institute for Computational and Mathematical Engineering, Stanford University, Stanford, California 94305, United States
| |
Collapse
|
15
|
Lesniewski MC, Noid WG. Insight into the Density-Dependence of Pair Potentials for Predictive Coarse-Grained Models. J Phys Chem B 2024; 128:1298-1316. [PMID: 38271676 DOI: 10.1021/acs.jpcb.3c06890] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2024]
Abstract
We investigate the temperature- and density-dependence of effective pair potentials for 1-site coarse-grained (CG) models of two industrial solvents, 1,4-dioxane and tetrahydrofuran. We observe that the calculated pair potentials are much more sensitive to density than to temperature. The generalized-Yvon-Born-Green framework reveals that this striking density-dependence reflects corresponding variations in the many-body correlations that determine the environment-mediated indirect contribution to the pair mean force. Moreover, we demonstrate, perhaps surprisingly, that this density-dependence is not important for accurately modeling the intermolecular structure. Accordingly, we adopt a density-independent interaction potential and transfer the density-dependence of the calculated pair potentials into a configuration-independent volume potential. Furthermore, we develop a single global potential that accurately models the intermolecular structure and pressure-volume equation of state across a very wide range of liquid state points. Consequently, this work provides fundamental insight into the density-dependence of effective pair potentials and also provides a significant step toward developing predictive CG models for efficiently modeling industrial solvents.
Collapse
Affiliation(s)
- Maria C Lesniewski
- Department of Chemistry, The Pennsylvania State University, University Park, Pennsylvania 16802, United States
| | - W G Noid
- Department of Chemistry, The Pennsylvania State University, University Park, Pennsylvania 16802, United States
| |
Collapse
|
16
|
Jin J, Reichman DR. Perturbative Expansion in Reciprocal Space: Bridging Microscopic and Mesoscopic Descriptions of Molecular Interactions. J Phys Chem B 2024; 128:1061-1078. [PMID: 38232134 DOI: 10.1021/acs.jpcb.3c06048] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2024]
Abstract
Determining the Fourier representation of various molecular interactions is important for constructing density-based field theories from a microscopic point of view, enabling a multiscale bridge between microscopic and mesoscopic descriptions. However, due to the strongly repulsive nature of short-ranged interactions, interparticle interactions cannot be formally defined in Fourier space, which renders coarse-grained (CG) approaches in k-space somewhat ambiguous. In this paper, we address this issue by designing a perturbative expansion of pair interactions in reciprocal space. Our perturbation theory, starting from reciprocal space, elucidates the microscopic origins underlying zeroth-order (long-range attractions) and divergent repulsive interactions from higher order contributions. We propose a systematic framework for constructing a faithful Fourier-space representation of molecular interactions, capturing key structural correlations in various systems, including simple model systems and molecular CG models of liquids. Building upon the Ornstein-Zernike equation, our approach can be combined with appropriate closure relations, and to further improve the closure approximations, we develop a bottom-up parameterization strategy for inferring the bridge function from microscopic statistics. By incorporating the bridge function into the Fourier representation, our findings suggest a systematic, bottom-up approach to performing coarse-graining in reciprocal space, leading to the systematic construction of a bottom-up classical field theory of complex aqueous systems.
Collapse
Affiliation(s)
- Jaehyeok Jin
- Department of Chemistry, Columbia University, 3000 Broadway, New York, New York 10027, United States
| | - David R Reichman
- Department of Chemistry, Columbia University, 3000 Broadway, New York, New York 10027, United States
| |
Collapse
|
17
|
Sun J, Jia T, Lin PJ, Li Z, Ji L, Li C. Multiscale Canonical Coherence for Functional Corticomuscular Coupling Analysis. IEEE J Biomed Health Inform 2024; 28:812-822. [PMID: 37963005 DOI: 10.1109/jbhi.2023.3332657] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2023]
Abstract
Functional corticomuscular coupling (FCMC) probes multi-level information communication in the sensorimotor system. The canonical Coherence (caCOH) method has been leveraged to measure the FCMC between two multivariate signals within the single scale. In this paper, we propose the concept of multiscale canonical Coherence (MS-caCOH) to disentangle complex multi-layer information and extract coupling features in multivariate spaces from multiple scales. Then, we verified the reliability and effectiveness of MS-caCOH on two types of data sets, i.e., a synthetic multivariate data set and a real-world multivariate data set. Our simulation results showed that compared with caCOH, MS-caCOH enhanced coupling detection and achieved lower pattern recovery errors at multiple frequency scales. Further analysis on experimental data demonstrated that the proposed MS-caCOH method could also capture detailed multiscale spatial-frequency characteristics. This study leverages the multiscale analysis framework and multivariate method to give a new insight into corticomuscular coupling analysis.
Collapse
|
18
|
Del Razo MJ, Crommelin D, Bolhuis PG. Data-driven dynamical coarse-graining for condensed matter systems. J Chem Phys 2024; 160:024108. [PMID: 38193550 DOI: 10.1063/5.0177553] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2023] [Accepted: 12/05/2023] [Indexed: 01/10/2024] Open
Abstract
Simulations of condensed matter systems often focus on the dynamics of a few distinguished components but require integrating the full system. A prime example is a molecular dynamics simulation of a (macro)molecule in a solution, where the molecule(s) and the solvent dynamics need to be integrated, rendering the simulations computationally costly and often unfeasible for physically/biologically relevant time scales. Standard coarse graining approaches can reproduce equilibrium distributions and structural features but do not properly include the dynamics. In this work, we develop a general data-driven coarse-graining methodology inspired by the Mori-Zwanzig formalism, which shows that macroscopic systems with a large number of degrees of freedom can be described by a few relevant variables and additional noise and memory terms. Our coarse-graining method consists of numerical integrators for the distinguished components, where the noise and interaction terms with other system components are substituted by a random variable sampled from a data-driven model. The model is parameterized using data from multiple short-time full-system simulations, and then, it is used to run long-time simulations. Applying our methodology to three systems-a distinguished particle under a harmonic and a bistable potential and a dimer with two metastable configurations-the resulting coarse-grained models are capable of reproducing not only the equilibrium distributions but also the dynamic behavior due to temporal correlations and memory effects. Remarkably, our method even reproduces the transition dynamics between metastable states, which is challenging to capture correctly. Our approach is not constrained to specific dynamics and can be extended to systems beyond Langevin dynamics, and, in principle, even to non-equilibrium dynamics.
Collapse
Affiliation(s)
- Mauricio J Del Razo
- Department of Mathematics and Computer Science, Freie Universität Berlin, Berlin, Germany
- Van't Hoff Institute for Molecular Sciences, University of Amsterdam, PO Box 94157, 1090GD Amsterdam, The Netherlands
- Korteweg-de Vries Institute for Mathematics, University of Amsterdam, PO Box 94248, 1090GD Amsterdam, The Netherlands
- Dutch Institute for Emergent Phenomena, University of Amsterdam, Amsterdam, The Netherlands
| | - Daan Crommelin
- Korteweg-de Vries Institute for Mathematics, University of Amsterdam, PO Box 94248, 1090GD Amsterdam, The Netherlands
- Centrum Wiskunde & Informatica, 1098 XG Amsterdam, The Netherlands
| | - Peter G Bolhuis
- Van't Hoff Institute for Molecular Sciences, University of Amsterdam, PO Box 94157, 1090GD Amsterdam, The Netherlands
| |
Collapse
|
19
|
Heo L, Feig M. One bead per residue can describe all-atom protein structures. Structure 2024; 32:97-111.e6. [PMID: 38000367 PMCID: PMC10872525 DOI: 10.1016/j.str.2023.10.013] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2023] [Revised: 09/16/2023] [Accepted: 10/30/2023] [Indexed: 11/26/2023]
Abstract
Atomistic resolution is the standard for high-resolution biomolecular structures, but experimental structural data are often at lower resolution. Coarse-grained models are also used extensively in computational studies to reach biologically relevant spatial and temporal scales. This study explores the use of advanced machine learning networks for reconstructing atomistic models from reduced representations. The main finding is that a single bead per amino acid residue allows construction of accurate and stereochemically realistic all-atom structures with minimal loss of information. This suggests that lower resolution representations of proteins may be sufficient for many applications when combined with a machine learning framework that encodes knowledge from known structures. Practical applications include the rapid addition of atomistic detail to low-resolution structures from experiment or computational coarse-grained models. The application of rapid, deterministic all-atom reconstruction within multi-scale frameworks is further demonstrated with a rapid protocol for the generation of accurate models from cryo-EM densities close to experimental structures.
Collapse
Affiliation(s)
- Lim Heo
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824, USA
| | - Michael Feig
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824, USA.
| |
Collapse
|
20
|
Airas J, Ding X, Zhang B. Transferable Implicit Solvation via Contrastive Learning of Graph Neural Networks. ACS CENTRAL SCIENCE 2023; 9:2286-2297. [PMID: 38161379 PMCID: PMC10755853 DOI: 10.1021/acscentsci.3c01160] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/19/2023] [Revised: 10/26/2023] [Accepted: 10/31/2023] [Indexed: 01/03/2024]
Abstract
Implicit solvent models are essential for molecular dynamics simulations of biomolecules, striking a balance between computational efficiency and biological realism. Efforts are underway to develop accurate and transferable implicit solvent models and coarse-grained (CG) force fields in general, guided by a bottom-up approach that matches the CG energy function with the potential of mean force (PMF) defined by the finer system. However, practical challenges arise due to the lack of analytical expressions for the PMF and algorithmic limitations in parameterizing CG force fields. To address these challenges, a machine learning-based approach is proposed, utilizing graph neural networks (GNNs) to represent the solvation free energy and potential contrasting for parameter optimization. We demonstrate the effectiveness of the approach by deriving a transferable GNN implicit solvent model using 600,000 atomistic configurations of six proteins obtained from explicit solvent simulations. The GNN model provides solvation free energy estimations much more accurately than state-of-the-art implicit solvent models, reproducing configurational distributions of explicit solvent simulations. We also demonstrate the reasonable transferability of the GNN model outside of the training data. Our study offers valuable insights for deriving systematically improvable implicit solvent models and CG force fields from a bottom-up perspective.
Collapse
Affiliation(s)
- Justin Airas
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139-4307, United
States
| | - Xinqiang Ding
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139-4307, United
States
| | - Bin Zhang
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139-4307, United
States
| |
Collapse
|
21
|
Loose T, Sahrmann PG, Qu TS, Voth GA. Coarse-Graining with Equivariant Neural Networks: A Path Toward Accurate and Data-Efficient Models. J Phys Chem B 2023; 127:10564-10572. [PMID: 38033234 PMCID: PMC10726966 DOI: 10.1021/acs.jpcb.3c05928] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Revised: 10/30/2023] [Accepted: 11/09/2023] [Indexed: 12/02/2023]
Abstract
Machine learning has recently entered into the mainstream of coarse-grained (CG) molecular modeling and simulation. While a variety of methods for incorporating deep learning into these models exist, many of them involve training neural networks to act directly as the CG force field. This has several benefits of which the most significant is accuracy. Neural networks can inherently incorporate multibody effects during the calculation of CG forces, and a well-trained neural network force field outperforms pairwise basis sets generated from essentially any methodology. However, this comes at a significant cost. First, these models are typically slower than pairwise force fields, even when accounting for specialized hardware, which accelerates the training and integration of such networks. The second and the focus of this paper is the need for a considerable amount of data to train such force fields. It is common to use 10s of microseconds of molecular dynamics data to train a single CG model, which approaches the point of eliminating the CG model's usefulness in the first place. As we investigate in this work, this "data-hunger" trap from neural networks for predicting molecular energies and forces can be remediated in part by incorporating equivariant convolutional operations. We demonstrate that, for CG water, networks that incorporate equivariant convolutional operations can produce functional models using data sets as small as a single frame of reference data, while networks without these operations cannot.
Collapse
Affiliation(s)
| | | | - Thomas S. Qu
- Department of Chemistry,
Chicago Center for Theoretical Chemistry, James Franck Institute,
and Institute for Biophysical Dynamics, The University of Chicago, Chicago, Illinois 60637, United States
| | - Gregory A. Voth
- Department of Chemistry,
Chicago Center for Theoretical Chemistry, James Franck Institute,
and Institute for Biophysical Dynamics, The University of Chicago, Chicago, Illinois 60637, United States
| |
Collapse
|
22
|
Wu J, Xue W, Voth GA. K-Means Clustering Coarse-Graining (KMC-CG): A Next Generation Methodology for Determining Optimal Coarse-Grained Mappings of Large Biomolecules. J Chem Theory Comput 2023; 19:8987-8997. [PMID: 37957028 PMCID: PMC10720621 DOI: 10.1021/acs.jctc.3c01053] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2023] [Revised: 10/25/2023] [Accepted: 10/27/2023] [Indexed: 11/21/2023]
Abstract
Coarse-grained (CG) molecular dynamics (MD) has become a method of choice for simulating various large scale biomolecular processes; therefore, the systematic definition of the CG mappings for biomolecules remains an important topic. Appropriate CG mappings can significantly enhance the representability of a CG model and improve its ability to capture critical features of large biomolecules. In this work, we present a systematic and more generalized method called K-means clustering coarse-graining (KMC-CG), which builds on the earlier approach of essential dynamics coarse-graining (ED-CG). KMC-CG removes the sequence-dependent constraints of ED-CG, allowing it to explore a more extensive space and thus enabling the discovery of more physically optimal CG mappings. Furthermore, the implementation of the K-means clustering algorithm can variationally optimize the CG mapping with efficiency and stability. This new method is tested in three cases: ATP-bound G-actin, the HIV-1 CA pentamer, and the Arp2/3 complex. In these examples, the CG models generated by KMC-CG are seen to better capture the structural, dynamic, and functional domains. KMC-CG therefore provides a robust and consistent approach to generating CG models of large biomolecules that can then be more accurately parametrized by either bottom-up or top-down CG force fields.
Collapse
Affiliation(s)
| | | | - Gregory A. Voth
- Department of Chemistry,
Chicago Center for Theoretical Chemistry, The James Franck Institute,
and Institute for Biophysical Dynamics, The University of Chicago, Chicago, Illinois 60637, United States
| |
Collapse
|
23
|
Vlachy V, Kalyuzhnyi YV, Hribar-Lee B, Dill KA. Protein Association in Solution: Statistical Mechanical Modeling. Biomolecules 2023; 13:1703. [PMID: 38136574 PMCID: PMC10742237 DOI: 10.3390/biom13121703] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2023] [Revised: 11/16/2023] [Accepted: 11/20/2023] [Indexed: 12/24/2023] Open
Abstract
Protein molecules associate in solution, often in clusters beyond pairwise, leading to liquid phase separations and high viscosities. It is often impractical to study these multi-protein systems by atomistic computer simulations, particularly in multi-component solvents. Instead, their forces and states can be studied by liquid state statistical mechanics. However, past such approaches, such as the Derjaguin-Landau-Verwey-Overbeek (DLVO) theory, were limited to modeling proteins as spheres, and contained no microscopic structure-property relations. Recently, this limitation has been partly overcome by bringing the powerful Wertheim theory of associating molecules to bear on protein association equilibria. Here, we review these developments.
Collapse
Affiliation(s)
- Vojko Vlachy
- Faculty of Chemistry and Chemical Technology, University of Ljubljana, 1000 Ljubljana, Slovenia;
| | | | - Barbara Hribar-Lee
- Faculty of Chemistry and Chemical Technology, University of Ljubljana, 1000 Ljubljana, Slovenia;
| | - Ken A. Dill
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, New York, NY 11794, USA;
- Department of Chemistry, Physics and Astronomy, Stony Brook University, New York, NY 11790, USA
| |
Collapse
|
24
|
Jin J, Hwang J, Voth GA. Gaussian representation of coarse-grained interactions of liquids: Theory, parametrization, and transferability. J Chem Phys 2023; 159:184105. [PMID: 37942867 DOI: 10.1063/5.0160567] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2023] [Accepted: 10/06/2023] [Indexed: 11/10/2023] Open
Abstract
Coarse-grained (CG) interactions determined via bottom-up methodologies can faithfully reproduce the structural correlations observed in fine-grained (atomistic resolution) systems, yet they can suffer from limited extensibility due to complex many-body correlations. As part of an ongoing effort to understand and improve the applicability of bottom-up CG models, we propose an alternative approach to address both accuracy and transferability. Our main idea draws from classical perturbation theory to partition the hard sphere repulsive term from effective CG interactions. We then introduce Gaussian basis functions corresponding to the system's characteristic length by linking these Gaussian sub-interactions to the local particle densities at each coordination shell. The remaining perturbative long-range interaction can be treated as a collective solvation interaction, which we show exhibits a Gaussian form derived from integral equation theories. By applying this numerical parametrization protocol to CG liquid systems, our microscopic theory elucidates the emergence of Gaussian interactions in common phenomenological CG models. To facilitate transferability for these reduced descriptions, we further infer equations of state to determine the sub-interaction parameter as a function of the system variables. The reduced models exhibit excellent transferability across the thermodynamic state points. Furthermore, we propose a new strategy to design the cross-interactions between distinct CG sites in liquid mixtures. This involves combining each Gaussian in the proper radial domain, yielding accurate CG potentials of mean force and structural correlations for multi-component systems. Overall, our findings establish a solid foundation for constructing transferable bottom-up CG models of liquids with enhanced extensibility.
Collapse
Affiliation(s)
- Jaehyeok Jin
- Department of Chemistry, Chicago Center for Theoretical Chemistry, James Franck Institute, and Institute for Biophysical Dynamics, The University of Chicago, 5735 S. Ellis Ave., Chicago, Illinois 60637, USA
- Department of Chemistry, Columbia University, 3000 Broadway, New York, New York 10027, USA
| | - Jisung Hwang
- Department of Statistics, The University of Chicago, 5747 S. Ellis Ave., Chicago, Illinois 60637, USA
| | - Gregory A Voth
- Department of Chemistry, Chicago Center for Theoretical Chemistry, James Franck Institute, and Institute for Biophysical Dynamics, The University of Chicago, 5735 S. Ellis Ave., Chicago, Illinois 60637, USA
| |
Collapse
|
25
|
Jones MS, Shmilovich K, Ferguson AL. DiAMoNDBack: Diffusion-Denoising Autoregressive Model for Non-Deterministic Backmapping of Cα Protein Traces. J Chem Theory Comput 2023; 19:7908-7923. [PMID: 37906711 DOI: 10.1021/acs.jctc.3c00840] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2023]
Abstract
Coarse-grained molecular models of proteins permit access to length and time scales unattainable by all-atom models and the simulation of processes that occur on long time scales, such as aggregation and folding. The reduced resolution realizes computational accelerations, but an atomistic representation can be vital for a complete understanding of mechanistic details. Backmapping is the process of restoring all-atom resolution to coarse-grained molecular models. In this work, we report DiAMoNDBack (Diffusion-denoising Autoregressive Model for Non-Deterministic Backmapping) as an autoregressive denoising diffusion probability model to restore all-atom details to coarse-grained protein representations retaining only Cα coordinates. The autoregressive generation process proceeds from the protein N-terminus to C-terminus in a residue-by-residue fashion conditioned on the Cα trace and previously backmapped backbone and side-chain atoms within the local neighborhood. The local and autoregressive nature of our model makes it transferable between proteins. The stochastic nature of the denoising diffusion process means that the model generates a realistic ensemble of backbone and side-chain all-atom configurations consistent with the coarse-grained Cα trace. We train DiAMoNDBack over 65k+ structures from the Protein Data Bank (PDB) and validate it in applications to a hold-out PDB test set, intrinsically disordered protein structures from the Protein Ensemble Database (PED), molecular dynamics simulations of fast-folding mini-proteins from DE Shaw Research, and coarse-grained simulation data. We achieve state-of-the-art reconstruction performance in terms of correct bond formation, avoidance of side-chain clashes, and the diversity of the generated side-chain configurational states. We make the DiAMoNDBack model publicly available as a free and open-source Python package.
Collapse
Affiliation(s)
- Michael S Jones
- Pritzker School of Molecular Engineering, University of Chicago, Chicago, Illinois 60637, United States
| | - Kirill Shmilovich
- Pritzker School of Molecular Engineering, University of Chicago, Chicago, Illinois 60637, United States
| | - Andrew L Ferguson
- Pritzker School of Molecular Engineering, University of Chicago, Chicago, Illinois 60637, United States
| |
Collapse
|
26
|
Navarro C, Majewski M, De Fabritiis G. Top-Down Machine Learning of Coarse-Grained Protein Force Fields. J Chem Theory Comput 2023; 19:7518-7526. [PMID: 37874270 PMCID: PMC10777392 DOI: 10.1021/acs.jctc.3c00638] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Indexed: 10/25/2023]
Abstract
Developing accurate and efficient coarse-grained representations of proteins is crucial for understanding their folding, function, and interactions over extended time scales. Our methodology involves simulating proteins with molecular dynamics and utilizing the resulting trajectories to train a neural network potential through differentiable trajectory reweighting. Remarkably, this method requires only the native conformation of proteins, eliminating the need for labeled data derived from extensive simulations or memory-intensive end-to-end differentiable simulations. Once trained, the model can be employed to run parallel molecular dynamics simulations and sample folding events for proteins both within and beyond the training distribution, showcasing its extrapolation capabilities. By applying Markov state models, native-like conformations of the simulated proteins can be predicted from the coarse-grained simulations. Owing to its theoretical transferability and ability to use solely experimental static structures as training data, we anticipate that this approach will prove advantageous for developing new protein force fields and further advancing the study of protein dynamics, folding, and interactions.
Collapse
Affiliation(s)
- Carles Navarro
- Acellera
Labs, Doctor Trueta 183, 08005 Barcelona, Spain
| | | | - Gianni De Fabritiis
- Computational
Science Laboratory, Universitat Pompeu Fabra, Barcelona Biomedical Research Park (PRBB), Carrer Dr. Aiguader 88, 08003 Barcelona, Spain
- Acellera
Ltd., Devonshire House
582, Middlesex HA7 1JS, United Kingdom
- Institució
Catalana de Recerca i Estudis Avançats (ICREA), Passeig Lluis Companys 23, 08010 Barcelona, Spain
| |
Collapse
|
27
|
Junier I, Ghobadpour E, Espeli O, Everaers R. DNA supercoiling in bacteria: state of play and challenges from a viewpoint of physics based modeling. Front Microbiol 2023; 14:1192831. [PMID: 37965550 PMCID: PMC10642903 DOI: 10.3389/fmicb.2023.1192831] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2023] [Accepted: 09/25/2023] [Indexed: 11/16/2023] Open
Abstract
DNA supercoiling is central to many fundamental processes of living organisms. Its average level along the chromosome and over time reflects the dynamic equilibrium of opposite activities of topoisomerases, which are required to relax mechanical stresses that are inevitably produced during DNA replication and gene transcription. Supercoiling affects all scales of the spatio-temporal organization of bacterial DNA, from the base pair to the large scale chromosome conformation. Highlighted in vitro and in vivo in the 1960s and 1970s, respectively, the first physical models were proposed concomitantly in order to predict the deformation properties of the double helix. About fifteen years later, polymer physics models demonstrated on larger scales the plectonemic nature and the tree-like organization of supercoiled DNA. Since then, many works have tried to establish a better understanding of the multiple structuring and physiological properties of bacterial DNA in thermodynamic equilibrium and far from equilibrium. The purpose of this essay is to address upcoming challenges by thoroughly exploring the relevance, predictive capacity, and limitations of current physical models, with a specific focus on structural properties beyond the scale of the double helix. We discuss more particularly the problem of DNA conformations, the interplay between DNA supercoiling with gene transcription and DNA replication, its role on nucleoid formation and, finally, the problem of scaling up models. Our primary objective is to foster increased collaboration between physicists and biologists. To achieve this, we have reduced the respective jargon to a minimum and we provide some explanatory background material for the two communities.
Collapse
Affiliation(s)
- Ivan Junier
- CNRS, UMR 5525, VetAgro Sup, Grenoble INP, TIMC, Université Grenoble Alpes, Grenoble, France
| | - Elham Ghobadpour
- CNRS, UMR 5525, VetAgro Sup, Grenoble INP, TIMC, Université Grenoble Alpes, Grenoble, France
- École Normale Supérieure (ENS) de Lyon, CNRS, Laboratoire de Physique and Centre Blaise Pascal de l'ENS de Lyon, Lyon, France
| | - Olivier Espeli
- Center for Interdisciplinary Research in Biology (CIRB), Collège de France, CNRS, INSERM, Université PSL, Paris, France
| | - Ralf Everaers
- École Normale Supérieure (ENS) de Lyon, CNRS, Laboratoire de Physique and Centre Blaise Pascal de l'ENS de Lyon, Lyon, France
| |
Collapse
|
28
|
Peng Y, Pak AJ, Durumeric AEP, Sahrmann PG, Mani S, Jin J, Loose TD, Beiter J, Voth GA. OpenMSCG: A Software Tool for Bottom-Up Coarse-Graining. J Phys Chem B 2023; 127:8537-8550. [PMID: 37791670 PMCID: PMC10577682 DOI: 10.1021/acs.jpcb.3c04473] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2023] [Revised: 09/05/2023] [Indexed: 10/05/2023]
Abstract
The "bottom-up" approach to coarse-graining, for building accurate and efficient computational models to simulate large-scale and complex phenomena and processes, is an important approach in computational chemistry, biophysics, and materials science. As one example, the Multiscale Coarse-Graining (MS-CG) approach to developing CG models can be rigorously derived using statistical mechanics applied to fine-grained, i.e., all-atom simulation data for a given system. Under a number of circumstances, a systematic procedure, such as MS-CG modeling, is particularly valuable. Here, we present the development of the OpenMSCG software, a modularized open-source software that provides a collection of successful and widely applied bottom-up CG methods, including Boltzmann Inversion (BI), Force-Matching (FM), Ultra-Coarse-Graining (UCG), Relative Entropy Minimization (REM), Essential Dynamics Coarse-Graining (EDCG), and Heterogeneous Elastic Network Modeling (HeteroENM). OpenMSCG is a high-performance and comprehensive toolset that can be used to derive CG models from large-scale fine-grained simulation data in file formats from common molecular dynamics (MD) software packages, such as GROMACS, LAMMPS, and NAMD. OpenMSCG is modularized in the Python programming framework, which allows users to create and customize modeling "recipes" for reproducible results, thus greatly improving the reliability, reproducibility, and sharing of bottom-up CG models and their applications.
Collapse
Affiliation(s)
- Yuxing Peng
- NVIDIA
Corporation, 2788 San Tomas Expressway, Santa Clara, California 95051, United States
| | - Alexander J. Pak
- Department
of Chemical and Biological Engineering, Colorado School of Mines, Golden, Colorado 80401, United States
| | | | - Patrick G. Sahrmann
- Department
of Chemistry, Chicago Center for Theoretical Chemistry, James Franck
Institute, and Institute for Biophysical Dynamics, The University of Chicago, Chicago, Illinois 60637, United States
| | - Sriramvignesh Mani
- Department
of Chemistry, Chicago Center for Theoretical Chemistry, James Franck
Institute, and Institute for Biophysical Dynamics, The University of Chicago, Chicago, Illinois 60637, United States
| | - Jaehyeok Jin
- Department
of Chemistry, Chicago Center for Theoretical Chemistry, James Franck
Institute, and Institute for Biophysical Dynamics, The University of Chicago, Chicago, Illinois 60637, United States
| | - Timothy D. Loose
- Department
of Chemistry, Chicago Center for Theoretical Chemistry, James Franck
Institute, and Institute for Biophysical Dynamics, The University of Chicago, Chicago, Illinois 60637, United States
| | - Jeriann Beiter
- Department
of Chemistry, Chicago Center for Theoretical Chemistry, James Franck
Institute, and Institute for Biophysical Dynamics, The University of Chicago, Chicago, Illinois 60637, United States
| | - Gregory A. Voth
- Department
of Chemistry, Chicago Center for Theoretical Chemistry, James Franck
Institute, and Institute for Biophysical Dynamics, The University of Chicago, Chicago, Illinois 60637, United States
| |
Collapse
|
29
|
Arts M, Garcia Satorras V, Huang CW, Zügner D, Federici M, Clementi C, Noé F, Pinsler R, van den Berg R. Two for One: Diffusion Models and Force Fields for Coarse-Grained Molecular Dynamics. J Chem Theory Comput 2023; 19:6151-6159. [PMID: 37688551 DOI: 10.1021/acs.jctc.3c00702] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/11/2023]
Abstract
Coarse-grained (CG) molecular dynamics enables the study of biological processes at temporal and spatial scales that would be intractable at an atomistic resolution. However, accurately learning a CG force field remains a challenge. In this work, we leverage connections between score-based generative models, force fields, and molecular dynamics to learn a CG force field without requiring any force inputs during training. Specifically, we train a diffusion generative model on protein structures from molecular dynamics simulations, and we show that its score function approximates a force field that can directly be used to simulate CG molecular dynamics. While having a vastly simplified training setup compared to previous work, we demonstrate that our approach leads to improved performance across several protein simulations for systems up to 56 amino acids, reproducing the CG equilibrium distribution and preserving the dynamics of all-atom simulations such as protein folding events.
Collapse
Affiliation(s)
- Marloes Arts
- Department of Computer Science, University of Copenhagen, Universitetsparken 1, Copenhagen 2100, Denmark
| | - Victor Garcia Satorras
- AI4Science, Microsoft Research, Evert van de Beekstraat 354, Amsterdam 1118 CZ, The Netherlands
| | - Chin-Wei Huang
- AI4Science, Microsoft Research, Evert van de Beekstraat 354, Amsterdam 1118 CZ, The Netherlands
| | - Daniel Zügner
- AI4Science, Microsoft Research, Karl-Liebknecht-Straße 32, Berlin 10178, Germany
| | - Marco Federici
- Informatics Institute, University of Amsterdam, Science Park 904, Amsterdam 1098 XH, The Netherlands
| | - Cecilia Clementi
- AI4Science, Microsoft Research, Karl-Liebknecht-Straße 32, Berlin 10178, Germany
- Department of Physics, Freie Universität Berlin, Arnimalle 12, Berlin 14195, Germany
| | - Frank Noé
- AI4Science, Microsoft Research, Karl-Liebknecht-Straße 32, Berlin 10178, Germany
| | - Robert Pinsler
- AI4Science, Microsoft Research, 21 Station Road, Cambridge CB1 2FB, U.K
| | - Rianne van den Berg
- AI4Science, Microsoft Research, Evert van de Beekstraat 354, Amsterdam 1118 CZ, The Netherlands
| |
Collapse
|
30
|
Majewski M, Pérez A, Thölke P, Doerr S, Charron NE, Giorgino T, Husic BE, Clementi C, Noé F, De Fabritiis G. Machine learning coarse-grained potentials of protein thermodynamics. Nat Commun 2023; 14:5739. [PMID: 37714883 PMCID: PMC10504246 DOI: 10.1038/s41467-023-41343-1] [Citation(s) in RCA: 16] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2023] [Accepted: 08/29/2023] [Indexed: 09/17/2023] Open
Abstract
A generalized understanding of protein dynamics is an unsolved scientific problem, the solution of which is critical to the interpretation of the structure-function relationships that govern essential biological processes. Here, we approach this problem by constructing coarse-grained molecular potentials based on artificial neural networks and grounded in statistical mechanics. For training, we build a unique dataset of unbiased all-atom molecular dynamics simulations of approximately 9 ms for twelve different proteins with multiple secondary structure arrangements. The coarse-grained models are capable of accelerating the dynamics by more than three orders of magnitude while preserving the thermodynamics of the systems. Coarse-grained simulations identify relevant structural states in the ensemble with comparable energetics to the all-atom systems. Furthermore, we show that a single coarse-grained potential can integrate all twelve proteins and can capture experimental structural features of mutated proteins. These results indicate that machine learning coarse-grained potentials could provide a feasible approach to simulate and understand protein dynamics.
Collapse
Affiliation(s)
- Maciej Majewski
- Computational Science Laboratory, Universitat Pompeu Fabra, Barcelona Biomedical Research Park (PRBB), Carrer Dr. Aiguader 88, 08003, Barcelona, Spain
- Acellera Labs, Doctor Trueta 183, 08005, Barcelona, Spain
| | - Adrià Pérez
- Computational Science Laboratory, Universitat Pompeu Fabra, Barcelona Biomedical Research Park (PRBB), Carrer Dr. Aiguader 88, 08003, Barcelona, Spain
- Acellera Labs, Doctor Trueta 183, 08005, Barcelona, Spain
| | - Philipp Thölke
- Computational Science Laboratory, Universitat Pompeu Fabra, Barcelona Biomedical Research Park (PRBB), Carrer Dr. Aiguader 88, 08003, Barcelona, Spain
| | - Stefan Doerr
- Acellera Labs, Doctor Trueta 183, 08005, Barcelona, Spain
| | - Nicholas E Charron
- Department of Physics, Rice University, Houston, TX, 77005, USA
- Center for Theoretical Biological Physics, Rice University, Houston, TX, 77005, USA
- Department of Physics, FU Berlin, Arnimallee 12, 14195, Berlin, Germany
| | - Toni Giorgino
- Biophysics Institute, National Research Council (CNR-IBF), 20133, Milan, Italy
| | - Brooke E Husic
- Department of Mathematics and Computer Science, FU Berlin, Arnimallee 12, 14195, Berlin, Germany
- Lewis Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ, 08540, USA
- Princeton Center for Theoretical Science, Princeton University, Princeton, NJ, 08540, USA
- Center for the Physics of Biological Function, Princeton University, Princeton, NJ, 08540, USA
| | - Cecilia Clementi
- Department of Physics, Rice University, Houston, TX, 77005, USA.
- Center for Theoretical Biological Physics, Rice University, Houston, TX, 77005, USA.
- Department of Physics, FU Berlin, Arnimallee 12, 14195, Berlin, Germany.
- Department of Chemistry, Rice University, Houston, TX, 77005, USA.
| | - Frank Noé
- Department of Physics, FU Berlin, Arnimallee 12, 14195, Berlin, Germany.
- Department of Mathematics and Computer Science, FU Berlin, Arnimallee 12, 14195, Berlin, Germany.
- Department of Chemistry, Rice University, Houston, TX, 77005, USA.
- Microsoft Research AI4Science, Karl-Liebknecht Str. 32, 10178, Berlin, Germany.
| | - Gianni De Fabritiis
- Computational Science Laboratory, Universitat Pompeu Fabra, Barcelona Biomedical Research Park (PRBB), Carrer Dr. Aiguader 88, 08003, Barcelona, Spain.
- Acellera Labs, Doctor Trueta 183, 08005, Barcelona, Spain.
- Institució Catalana de Recerca i Estudis Avançats (ICREA), Passeig Lluis Companys 23, 08010, Barcelona, Spain.
| |
Collapse
|
31
|
Airas J, Ding X, Zhang B. Transferable Coarse Graining via Contrastive Learning of Graph Neural Networks. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.09.08.556923. [PMID: 37745447 PMCID: PMC10515757 DOI: 10.1101/2023.09.08.556923] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/26/2023]
Abstract
Coarse-grained (CG) force fields are essential for molecular dynamics simulations of biomolecules, striking a balance between computational efficiency and biological realism. These simulations employ simplified models grouping atoms into interaction sites, enabling the study of complex biomolecular systems over biologically relevant timescales. Efforts are underway to develop accurate and transferable CG force fields, guided by a bottom-up approach that matches the CG energy function with the potential of mean force (PMF) defined by the finer system. However, practical challenges arise due to many-body effects, lack of analytical expressions for the PMF, and limitations in parameterizing CG force fields. To address these challenges, a machine learning-based approach is proposed, utilizing graph neural networks (GNNs) to represent CG force fields and potential contrasting for parameterization from atomistic simulation data. We demonstrate the effectiveness of the approach by deriving a transferable GNN implicit solvent model using 600,000 atomistic configurations of six proteins obtained from explicit solvent simulations. The GNN model provides solvation free energy estimations much more accurately than state-of-the-art implicit solvent models, reproducing configurational distributions of explicit solvent simulations. We also demonstrate the reasonable transferability of the GNN model outside the training data. Our study offers valuable insights for building accurate coarse-grained models bottom-up.
Collapse
Affiliation(s)
- Justin Airas
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Xinqiang Ding
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Bin Zhang
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA, USA
| |
Collapse
|
32
|
Strahan J, Finkel J, Dinner AR, Weare J. Predicting rare events using neural networks and short-trajectory data. JOURNAL OF COMPUTATIONAL PHYSICS 2023; 488:112152. [PMID: 37332834 PMCID: PMC10270692 DOI: 10.1016/j.jcp.2023.112152] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/20/2023]
Abstract
Estimating the likelihood, timing, and nature of events is a major goal of modeling stochastic dynamical systems. When the event is rare in comparison with the timescales of simulation and/or measurement needed to resolve the elemental dynamics, accurate prediction from direct observations becomes challenging. In such cases a more effective approach is to cast statistics of interest as solutions to Feynman-Kac equations (partial differential equations). Here, we develop an approach to solve Feynman-Kac equations by training neural networks on short-trajectory data. Our approach is based on a Markov approximation but otherwise avoids assumptions about the underlying model and dynamics. This makes it applicable to treating complex computational models and observational data. We illustrate the advantages of our method using a low-dimensional model that facilitates visualization, and this analysis motivates an adaptive sampling strategy that allows on-the-fly identification of and addition of data to regions important for predicting the statistics of interest. Finally, we demonstrate that we can compute accurate statistics for a 75-dimensional model of sudden stratospheric warming. This system provides a stringent test bed for our method.
Collapse
Affiliation(s)
- John Strahan
- Department of Chemistry and James Franck Institute, the University of Chicago, Chicago, IL 60637
| | - Justin Finkel
- Department of Earth, Atmospheric, and Planetary Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139
| | - Aaron R. Dinner
- Department of Chemistry and James Franck Institute, the University of Chicago, Chicago, IL 60637
- Committee on Computational and Applied Mathematics, the University of Chicago, Chicago, IL 60637
| | - Jonathan Weare
- Courant Institute of Mathematical Sciences, New York University, New York, New York 10012
| |
Collapse
|
33
|
Wellawatte GP, Hocky GM, White AD. Neural potentials of proteins extrapolate beyond training data. J Chem Phys 2023; 159:085103. [PMID: 37642255 PMCID: PMC10474891 DOI: 10.1063/5.0147240] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2023] [Accepted: 07/31/2023] [Indexed: 08/31/2023] Open
Abstract
We evaluate neural network (NN) coarse-grained (CG) force fields compared to traditional CG molecular mechanics force fields. We conclude that NN force fields are able to extrapolate and sample from unseen regions of the free energy surface when trained with limited data. Our results come from 88 NN force fields trained on different combinations of clustered free energy surfaces from four protein mapped trajectories. We used a statistical measure named total variation similarity to assess the agreement between reference free energy surfaces from mapped atomistic simulations and CG simulations from trained NN force fields. Our conclusions support the hypothesis that NN CG force fields trained with samples from one region of the proteins' free energy surface can, indeed, extrapolate to unseen regions. Additionally, the force matching error was found to only be weakly correlated with a force field's ability to reconstruct the correct free energy surface.
Collapse
Affiliation(s)
- Geemi P. Wellawatte
- Department of Chemistry, University of Rochester, Rochester, New York 14627, USA
| | - Glen M. Hocky
- Department of Chemistry, Simons Center for Computational Physical Chemistry, New York University, New York, New York 10003, USA
| | - Andrew D. White
- Department of Chemical Engineering, University of Rochester, Rochester, New York 14627, USA
| |
Collapse
|
34
|
Sokhan VP, Seaton MA, Todorov IT. Phase behaviour of coarse-grained fluids. SOFT MATTER 2023. [PMID: 37470164 DOI: 10.1039/d3sm00835e] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/21/2023]
Abstract
Soft condensed matter structures often challenge us with complex many-body phenomena governed by collective modes spanning wide spatial and temporal domains. In order to successfully tackle such problems, mesoscopic coarse-grained (CG) statistical models are being developed, providing a dramatic reduction in computational complexity. CG models provide an intermediate step in the complex statistical framework of linking the thermodynamics of condensed phases with the properties of their constituent atoms and molecules. These allow us to offload part of the problem to the CG model itself and reformulate the remainder in terms of reduced CG phase space. However, such exchange of pawns to chess pieces, or 'Hamiltonian renormalization', is a radical step and the thermodynamics of the primary atomic and CG models could be quite distinct. Here, we present a comprehensive study of the phase diagram including binodal and interfacial properties of a dissipative particle dynamics (DPD) model, extended to include finite-range attraction to support the liquid-gas equilibrium. Despite the similarities with the atomic model potentials, its phase envelope is markedly different featuring several anomalies such as an unusually broad liquid range, change in concavity of the liquid coexistence branch with variation of the model parameters, volume contraction on fusion, temperature of maximum density in the liquid phase and negative thermal expansion in the solid phase. These results provide new insight into the connection between simple potential models and complex emergent condensed matter phenomena.
Collapse
Affiliation(s)
- V P Sokhan
- Scientific Computing Department, Science and Technology Facilities Council, STFC Daresbury Laboratory, Sci-Tech Daresbury, Keckwick Lane, Daresbury, Cheshire WA4 4AD, UK.
| | - M A Seaton
- Scientific Computing Department, Science and Technology Facilities Council, STFC Daresbury Laboratory, Sci-Tech Daresbury, Keckwick Lane, Daresbury, Cheshire WA4 4AD, UK.
| | - I T Todorov
- Scientific Computing Department, Science and Technology Facilities Council, STFC Daresbury Laboratory, Sci-Tech Daresbury, Keckwick Lane, Daresbury, Cheshire WA4 4AD, UK.
| |
Collapse
|
35
|
Hudait A, Hurley JH, Voth GA. Dynamics of upstream ESCRT organization at the HIV-1 budding site. Biophys J 2023; 122:2655-2674. [PMID: 37218128 PMCID: PMC10397573 DOI: 10.1016/j.bpj.2023.05.020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2022] [Revised: 03/27/2023] [Accepted: 05/11/2023] [Indexed: 05/24/2023] Open
Abstract
In the late stages of the HIV-1 life cycle, membrane localization and self-assembly of Gag polyproteins induce membrane deformation and budding. Release of the virion requires direct interaction between immature Gag lattice and upstream ESCRT machinery at the viral budding site, followed by assembly of downstream ESCRT-III factors, culminating in membrane scission. However, molecular details of upstream ESCRT assembly dynamics at the viral budding site remain unclear. In this work, using coarse-grained (CG) molecular dynamics (MD) simulations, we investigated the interactions between Gag, ESCRT-I, ESCRT-II, and membrane to delineate the dynamical mechanisms by which upstream ESCRTs assemble templated by late-stage immature Gag lattice. We first systematically derived "bottom-up" CG molecular models and interactions of upstream ESCRT proteins from experimental structural data and extensive all-atom MD simulations. Using these molecular models, we performed CG MD simulations of ESCRT-I oligomerization and ESCRT-I/II supercomplex formation at the neck of the budding virion. Our simulations demonstrate that ESCRT-I can effectively oligomerize to higher-order complexes templated by the immature Gag lattice both in the absence of ESCRT-II and when multiple copies of ESCRT-II are localized at the bud neck. The ESCRT-I/II supercomplexes formed in our simulations exhibit predominantly columnar structures, which has important implications for the nucleation pathway of downstream ESCRT-III polymers. Importantly, ESCRT-I/II supercomplexes bound to Gag initiate membrane neck constriction by pulling the inner edge of the bud neck closer to the ESCRT-I headpiece ring. Our findings serve to elucidate a network of interactions between upstream ESCRT machinery, immature Gag lattice, and membrane neck that regulate protein assembly dynamics at the HIV-1 budding site.
Collapse
Affiliation(s)
- Arpa Hudait
- Department of Chemistry, Chicago Center for Theoretical Chemistry, Institute for Biophysical Dynamics, and James Franck Institute, The University of Chicago, Chicago, Illinois
| | - James H Hurley
- Department of Molecular and Cell Biology and California Institute for Quantitative Biosciences, University of California, Berkeley, Berkeley, California; California Institute for Quantitative Biosciences, University of California, Berkeley, Berkeley, California
| | - Gregory A Voth
- Department of Chemistry, Chicago Center for Theoretical Chemistry, Institute for Biophysical Dynamics, and James Franck Institute, The University of Chicago, Chicago, Illinois.
| |
Collapse
|
36
|
Durumeric AEP, Voth GA. Using classifiers to understand coarse-grained models and their fidelity with the underlying all-atom systems. J Chem Phys 2023; 158:234101. [PMID: 37318166 DOI: 10.1063/5.0146812] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2023] [Accepted: 05/26/2023] [Indexed: 06/16/2023] Open
Abstract
Bottom-up coarse-grained (CG) molecular dynamics models are parameterized using complex effective Hamiltonians. These models are typically optimized to approximate high dimensional data from atomistic simulations. However, human validation of these models is often limited to low dimensional statistics that do not necessarily differentiate between the CG model and said atomistic simulations. We propose that classification can be used to variationally estimate high dimensional error and that explainable machine learning can help convey this information to scientists. This approach is demonstrated using Shapley additive explanations and two CG protein models. This framework may also be valuable for ascertaining whether allosteric effects at the atomistic level are accurately propagated to a CG model.
Collapse
Affiliation(s)
- Aleksander E P Durumeric
- Department of Chemistry, Chicago Center for Theoretical Chemistry, James Franck Institute, and Institute for Biophysical Dynamics, The University of Chicago, 5735 S. Ellis Ave., Chicago, Illinois 60637, USA
| | - Gregory A Voth
- Department of Chemistry, Chicago Center for Theoretical Chemistry, James Franck Institute, and Institute for Biophysical Dynamics, The University of Chicago, 5735 S. Ellis Ave., Chicago, Illinois 60637, USA
| |
Collapse
|
37
|
Hsieh YC, Delarue M, Orland H, Koehl P. Analyzing the Geometry and Dynamics of Viral Structures: A Review of Computational Approaches Based on Alpha Shape Theory, Normal Mode Analysis, and Poisson-Boltzmann Theories. Viruses 2023; 15:1366. [PMID: 37376665 DOI: 10.3390/v15061366] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2023] [Revised: 06/05/2023] [Accepted: 06/09/2023] [Indexed: 06/29/2023] Open
Abstract
The current SARS-CoV-2 pandemic highlights our fragility when we are exposed to emergent viruses either directly or through zoonotic diseases. Fortunately, our knowledge of the biology of those viruses is improving. In particular, we have more and more structural information on virions, i.e., the infective form of a virus that includes its genomic material and surrounding protective capsid, and on their gene products. It is important to have methods that enable the analyses of structural information on such large macromolecular systems. We review some of those methods in this paper. We focus on understanding the geometry of virions and viral structural proteins, their dynamics, and their energetics, with the ambition that this understanding can help design antiviral agents. We discuss those methods in light of the specificities of those structures, mainly that they are huge. We focus on three of our own methods based on the alpha shape theory for computing geometry, normal mode analyses to study dynamics, and modified Poisson-Boltzmann theories to study the organization of ions and co-solvent and solvent molecules around biomacromolecules. The corresponding software has computing times that are compatible with the use of regular desktop computers. We show examples of their applications on some outer shells and structural proteins of the West Nile Virus.
Collapse
Affiliation(s)
- Yin-Chen Hsieh
- Institute for Arctic and Marine Biology, Department of Biosciences, Fisheries, and Economics, UiT The Arctic University of Norway, 9037 Tromso, Norway
| | - Marc Delarue
- Institut Pasteur, Université Paris-Cité and CNRS, UMR 3528, Unité Architecture et Dynamique des Macromolécules Biologiques, 75015 Paris, France
| | - Henri Orland
- Institut de Physique Théorique, CEA, CNRS, Université Paris-Saclay, 91191 Gif-sur-Yvette, France
| | - Patrice Koehl
- Department of Computer Science, University of California, Davis, CA 95616, USA
| |
Collapse
|
38
|
Izvekov S, Rice BM. Hierarchical Machine Learning of Low-Resolution Coarse-Grained Free Energy Potentials. J Chem Theory Comput 2023. [PMID: 37256918 DOI: 10.1021/acs.jctc.3c00128] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]
Abstract
A force-matching-based method for supervised machine learning (ML) of coarse-grained (CG) free energy (FE) potentials─known as multiscale coarse-graining via force-matching (MSCG/FM)─is an efficient method to develop microscopically informed CG models that are thermodynamically and statistically equivalent to the reference microscopic models. For low-resolution models, when the coarse-graining is at supramolecular scales, objective-oriented clustering of nonbonded particles is required and the reduced description becomes a function of the clustering algorithm. In the present work, we explore the dependence of the ML of the CG Helmholtz FE potential on the clustering algorithm. We consider coarse-graining based on partitional (k-means, leading to Voronoi diagram) and hierarchical agglomerative (bottom-up) clustering algorithms common in unsupervised ML and develop theory connecting the MSCG/FM learned CG Helmholtz potential and the clustering statistics. By combining the agglomerative clustering and the MSCG/FM learning in a recursive manner, we propose an efficient ML methodology to develop the fine-to-low resolution hierarchies of the CG models. The methodology does not suffer from degrading accuracy or increased computational cost to construct larger hierarchies and as such does not impose an upper size limitation of the CG particles resulting from the extended hierarchies. The utility of the methodology is demonstrated by obtaining the bottom-up agglomerative hierarchy for liquid nitromethane from all-atom molecular dynamics (MD) simulations. For agglomerative hierarchies, we prove the existence of renormalization group transformations that indicate self-similarity and allow for learning the low-resolution MSCG/FM potentials at low computational cost by rescaling and renormalizing the certain finer-resolution members of the hierarchy. The hierarchies of the CG models can be used to carry out simulations under constant-pressure conditions.
Collapse
Affiliation(s)
- Sergei Izvekov
- U.S. Army DEVCOM Army Research Laboratory, Aberdeen Proving Ground, Maryland 21005, United States
| | - Betsy M Rice
- U.S. Army DEVCOM Army Research Laboratory, Aberdeen Proving Ground, Maryland 21005, United States
| |
Collapse
|
39
|
Tillotson MJ, Diamantonis NI, Buda C, Bolton LW, Müller EA. Molecular modelling of the thermophysical properties of fluids: expectations, limitations, gaps and opportunities. Phys Chem Chem Phys 2023; 25:12607-12628. [PMID: 37114325 DOI: 10.1039/d2cp05423j] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/29/2023]
Abstract
This manuscript provides an overview of the current state of the art in terms of the molecular modelling of the thermophysical properties of fluids. It is intended to manage the expectations and serve as guidance to practising physical chemists, chemical physicists and engineers in terms of the scope and accuracy of the more commonly available intermolecular potentials along with the peculiarities of the software and methods employed in molecular simulations while providing insights on the gaps and opportunities available in this field. The discussion is focused around case studies which showcase both the precision and the limitations of frequently used workflows.
Collapse
Affiliation(s)
- Marcus J Tillotson
- Department of Chemical Engineering, Imperial College London, London, UK.
| | | | | | | | - Erich A Müller
- Department of Chemical Engineering, Imperial College London, London, UK.
| |
Collapse
|
40
|
Weiand E, Ewen JP, Roiter Y, Koenig PH, Page SH, Rodriguez-Ropero F, Angioletti-Uberti S, Dini D. Nanoscale friction of biomimetic hair surfaces. NANOSCALE 2023; 15:7086-7104. [PMID: 36987934 DOI: 10.1039/d2nr05545g] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
We investigate the nanoscale friction between biomimetic hair surfaces using chemical colloidal probe atomic force microscopy experiments and nonequilibrium molecular dynamics simulations. In the experiments, friction is measured between water-lubricated silica surfaces functionalised with monolayers formed from either octadecyl or sulfonate groups, which are representative of the surfaces of virgin and ultimately bleached hair, respectively. In the simulations, friction is monitored between coarse-grained model hair surfaces with different levels of chemical damage, where a specified amount of grafted octadecyl groups are randomly replaced with sulfonate groups. The sliding velocity dependence of friction in the simulations can be described using an extended stress-augmented thermally activation model. As the damage level increases in the simulations, the friction coefficient generally increases, but its sliding velocity-dependence decreases. At low sliding velocities, which are closer to those encountered experimentally and physiologically, we observe a monotonic increase of the friction coefficient with damage ratio, which is consistent with our new experiments using biomimetic surfaces and previous ones using real hair. This observation demonstrates that modified surface chemistry, rather than roughness changes or subsurface damage, control the increase in nanoscale friction of bleached or chemically damaged hair. We expect the methods and biomimetic surfaces proposed here to be useful to screen the tribological performance of hair care formulations both experimentally and computationally.
Collapse
Affiliation(s)
- Erik Weiand
- Department of Mechanical Engineering, Imperial College London, South Kensington Campus, SW7 2AZ London, UK.
- Institute of Molecular Science and Engineering, Imperial College London, South Kensington Campus, SW7 2AZ London, UK
- Thomas Young Centre for the Theory and Simulation of Materials, Imperial College London, South Kensington Campus, SW7 2AZ London, UK
| | - James P Ewen
- Department of Mechanical Engineering, Imperial College London, South Kensington Campus, SW7 2AZ London, UK.
- Institute of Molecular Science and Engineering, Imperial College London, South Kensington Campus, SW7 2AZ London, UK
- Thomas Young Centre for the Theory and Simulation of Materials, Imperial College London, South Kensington Campus, SW7 2AZ London, UK
| | - Yuri Roiter
- Corporate Functions Analytical and Data & Modeling Sciences, Mason Business Center, The Procter and Gamble Company, Mason, 45040 Ohio, USA
| | - Peter H Koenig
- Corporate Functions Analytical and Data & Modeling Sciences, Mason Business Center, The Procter and Gamble Company, Mason, 45040 Ohio, USA
| | - Steven H Page
- Corporate Functions Analytical and Data & Modeling Sciences, Mason Business Center, The Procter and Gamble Company, Mason, 45040 Ohio, USA
| | - Francisco Rodriguez-Ropero
- Corporate Functions Analytical and Data & Modeling Sciences, Mason Business Center, The Procter and Gamble Company, Mason, 45040 Ohio, USA
| | | | - Daniele Dini
- Department of Mechanical Engineering, Imperial College London, South Kensington Campus, SW7 2AZ London, UK.
- Institute of Molecular Science and Engineering, Imperial College London, South Kensington Campus, SW7 2AZ London, UK
- Thomas Young Centre for the Theory and Simulation of Materials, Imperial College London, South Kensington Campus, SW7 2AZ London, UK
| |
Collapse
|
41
|
Bryer AJ, Rey JS, Perilla JR. Performance efficient macromolecular mechanics via sub-nanometer shape based coarse graining. Nat Commun 2023; 14:2014. [PMID: 37037809 PMCID: PMC10086035 DOI: 10.1038/s41467-023-37801-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2022] [Accepted: 03/30/2023] [Indexed: 04/12/2023] Open
Abstract
Dimensionality reduction via coarse grain modeling is a valuable tool in biomolecular research. For large assemblies, ultra coarse models are often knowledge-based, relying on a priori information to parameterize models thus hindering general predictive capability. Here, we present substantial advances to the shape based coarse graining (SBCG) method, which we refer to as SBCG2. SBCG2 utilizes a revitalized formulation of the topology representing network which makes high-granularity modeling possible, preserving atomistic details that maintain assembly characteristics. Further, we present a method of granularity selection based on charge density Fourier Shell Correlation and have additionally developed a refinement method to optimize, adjust and validate high-granularity models. We demonstrate our approach with the conical HIV-1 capsid and heteromultimeric cofilin-2 bound actin filaments. Our approach is available in the Visual Molecular Dynamics (VMD) software suite, and employs a CHARMM-compatible Hamiltonian that enables high-performance simulation in the GPU-resident NAMD3 molecular dynamics engine.
Collapse
Affiliation(s)
- Alexander J Bryer
- Department of Chemistry and Biochemistry, University of Delaware, Newark, DE, 19716, USA
| | - Juan S Rey
- Department of Chemistry and Biochemistry, University of Delaware, Newark, DE, 19716, USA
| | - Juan R Perilla
- Department of Chemistry and Biochemistry, University of Delaware, Newark, DE, 19716, USA.
| |
Collapse
|
42
|
Hammes-Schiffer S. Exploring Proton-Coupled Electron Transfer at Multiple Scales. NATURE COMPUTATIONAL SCIENCE 2023; 3:291-300. [PMID: 37577057 PMCID: PMC10416817 DOI: 10.1038/s43588-023-00422-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/22/2022] [Accepted: 02/23/2023] [Indexed: 08/15/2023]
Abstract
The coupling of electron and proton transfer is critical for chemical and biological processes spanning a wide range of length and time scales and often occurring in complex environments. Thus, diverse modeling strategies, including analytical theories, quantum chemistry, molecular dynamics, and kinetic modeling, are essential for a comprehensive understanding of such proton-coupled electron transfer reactions. Each of these computational methods provides one piece of the puzzle, and all these pieces must be viewed together to produce the full picture.
Collapse
|
43
|
Hirschmann F, Lopez H, Roosen-Runge F, Seydel T, Schreiber F, Oettel M. Effects of flexibility in coarse-grained models for bovine serum albumin and immunoglobulin G. J Chem Phys 2023; 158:084112. [PMID: 36859072 DOI: 10.1063/5.0132493] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/03/2023] Open
Abstract
We construct a coarse-grained, structure-based, low-resolution, 6-bead flexible model of bovine serum albumin (BSA, PDB: 4F5S), which is a popular example of a globular protein in biophysical research. The model is obtained via direct Boltzmann inversion using all-atom simulations of a single molecule, and its particular form is selected from a large pool of 6-bead coarse-grained models using two suitable metrics that quantify the agreement in the distribution of collective coordinates between all-atom and coarse-grained Brownian dynamics simulations of solutions in the dilute limit. For immunoglobulin G (IgG), a similar structure-based 12-bead model has been introduced in the literature [Chaudhri et al., J. Phys. Chem. B 116, 8045 (2012)] and is employed here to compare findings for the compact BSA molecule and the more anisotropic IgG molecule. We define several modified coarse-grained models of BSA and IgG, which differ in their internal constraints and thus account for a variation of flexibility. We study denser solutions of the coarse-grained models with purely repulsive molecules (achievable by suitable salt conditions) and address the effect of packing and flexibility on dynamic and static behavior. Translational and rotational self-diffusivity is enhanced for more elastic models. Finally, we discuss a number of effective sphere sizes for the BSA molecule, which can be defined from its static and dynamic properties. Here, it is found that the effective sphere diameters lie between 4.9 and 6.1 nm, corresponding to a relative spread of about ±10% around a mean of 5.5 nm.
Collapse
Affiliation(s)
- Frank Hirschmann
- Institute for Applied Physics, University of Tübingen, Auf der Morgenstelle 10, 72076 Tübingen, Germany
| | - Hender Lopez
- School of Physics, Clinical and Optometric Sciences, Technological University Dublin, Grangegorman D07 ADY7, Ireland
| | - Felix Roosen-Runge
- Department of Biomedical Sciences and Biofilms-Research Center for Biointerfaces (BRCB), Malmö University, 20506 Malmö, Sweden
| | - Tilo Seydel
- Institut Max von Laue-Paul Langevin, 71 Avenue des Martyrs, 38042 Grenoble, France
| | - Frank Schreiber
- Institute for Applied Physics, University of Tübingen, Auf der Morgenstelle 10, 72076 Tübingen, Germany
| | - Martin Oettel
- Institute for Applied Physics, University of Tübingen, Auf der Morgenstelle 10, 72076 Tübingen, Germany
| |
Collapse
|
44
|
Jin J, Voth GA. Statistical Mechanical Design Principles for Coarse-Grained Interactions across Different Conformational Free Energy Surfaces. J Phys Chem Lett 2023; 14:1354-1362. [PMID: 36728761 PMCID: PMC9940719 DOI: 10.1021/acs.jpclett.2c03844] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2022] [Accepted: 01/27/2023] [Indexed: 06/18/2023]
Abstract
Systematic bottom-up coarse-graining (CG) of molecular systems provides a means to explore different coupled length and time scales while treating the molecular-scale physics at a reduced level. However, the configuration dependence of CG interactions often results in CG models with limited applicability for exploring the parametrized configurations. We propose a statistical mechanical theory to design CG interactions across different configurations and conditions. In order to span wide ranges of conformational space, distinct classical CG free energy surfaces for characteristic configurations are identified using molecular collective variables. The coupling interaction between different CG free energy surfaces can then be systematically determined by analogy to quantum mechanical approaches describing coupled states. The present theory can accurately capture the underlying many-body potentials of mean force in the CG variables for various order parameters applied to liquids, interfaces, and in principle proteins, uncovering the complex nature underlying the coupling interaction and imparting a new protocol for the design of predictive multiscale models.
Collapse
Affiliation(s)
| | - Gregory A. Voth
- Department of Chemistry,
Chicago Center for Theoretical Chemistry, Institute for Biophysical
Dynamics, and James Franck Institute, The
University of Chicago, Chicago, Illinois 60637, United States
| |
Collapse
|
45
|
Köhler J, Chen Y, Krämer A, Clementi C, Noé F. Flow-Matching: Efficient Coarse-Graining of Molecular Dynamics without Forces. J Chem Theory Comput 2023; 19:942-952. [PMID: 36668906 DOI: 10.1021/acs.jctc.3c00016] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
Coarse-grained (CG) molecular simulations have become a standard tool to study molecular processes on time and length scales inaccessible to all-atom simulations. Parametrizing CG force fields to match all-atom simulations has mainly relied on force-matching or relative entropy minimization, which require many samples from costly simulations with all-atom or CG resolutions, respectively. Here we present flow-matching, a new training method for CG force fields that combines the advantages of both methods by leveraging normalizing flows, a generative deep learning method. Flow-matching first trains a normalizing flow to represent the CG probability density, which is equivalent to minimizing the relative entropy without requiring iterative CG simulations. Subsequently, the flow generates samples and forces according to the learned distribution in order to train the desired CG free energy model via force-matching. Even without requiring forces from the all-atom simulations, flow-matching outperforms classical force-matching by an order of magnitude in terms of data efficiency and produces CG models that can capture the folding and unfolding transitions of small proteins.
Collapse
Affiliation(s)
- Jonas Köhler
- Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 12, 14195Berlin, Germany
| | - Yaoyi Chen
- Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 12, 14195Berlin, Germany
| | - Andreas Krämer
- Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 12, 14195Berlin, Germany
| | - Cecilia Clementi
- Department of Physics, Freie Universität Berlin, Arnimallee 12, 14195Berlin, Germany.,Center for Theoretical Biological Physics, Rice University, Houston, Texas77005, United States.,Department of Physics, Rice University, Houston, Texas77005, United States.,Department of Chemistry, Rice University, Houston, Texas77005, United States
| | - Frank Noé
- Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 12, 14195Berlin, Germany.,Department of Physics, Freie Universität Berlin, Arnimallee 12, 14195Berlin, Germany.,Department of Chemistry, Rice University, Houston, Texas77005, United States.,Microsoft Research AI4Science, Karl-Liebknecht Strasse 32, 10178Berlin, Germany
| |
Collapse
|
46
|
Gao A, Remsing RC, Weeks JD. Local Molecular Field Theory for Coulomb Interactions in Aqueous Solutions. J Phys Chem B 2023; 127:809-821. [PMID: 36669139 DOI: 10.1021/acs.jpcb.2c06988] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
Coulomb interactions play a crucial role in a wide array of processes in aqueous solutions but present conceptual and computational challenges to both theory and simulations. We review recent developments in an approach addressing these challenges─local molecular field (LMF) theory. LMF theory exploits an exact and physically suggestive separation of intermolecular Coulomb interactions into strong short-range and uniformly slowly varying long-range components. This allows us to accurately determine the averaged effects of the long-range components on the short-range structure using effective single particle fields and analytical corrections, greatly reducing the need for complex lattice summation techniques used in most standard approaches. The simplest use of these ideas in aqueous solutions leads to the short solvent (SS) model, where both solvent-solvent and solute-solvent Coulomb interactions have only short-range components. Here we use the SS model to give a simple description of pairing of nucleobases and biologically relevant ions in water.
Collapse
Affiliation(s)
- Ang Gao
- Department of Physics, Beijing University of Posts and Telecommunications, Beijing, China 100876
| | - Richard C Remsing
- Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey 08854, United States
| | - John D Weeks
- Institute for Physical Science and Technology and Department of Chemistry and Biochemistry, University of Maryland, College Park, Maryland 20742, United States
| |
Collapse
|
47
|
Wang TY, Li JF, Zhang HD, Chen JZY. Designs to Improve Capability of Neural Networks to Make Structural Predictions. CHINESE JOURNAL OF POLYMER SCIENCE 2023. [DOI: 10.1007/s10118-023-2910-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/09/2023]
|
48
|
Jin J, Schweizer KS, Voth GA. Understanding dynamics in coarse-grained models. II. Coarse-grained diffusion modeled using hard sphere theory. J Chem Phys 2023; 158:034104. [PMID: 36681632 DOI: 10.1063/5.0116300] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
The first paper of this series [J. Chem. Phys. 158, 034103 (2023)] demonstrated that excess entropy scaling holds for both fine-grained and corresponding coarse-grained (CG) systems. Despite its universality, a more exact determination of the scaling relationship was not possible due to the semi-empirical nature. In this second paper, an analytical excess entropy scaling relation is derived for bottom-up CG systems. At the single-site CG resolution, effective hard sphere systems are constructed that yield near-identical dynamical properties as the target CG systems by taking advantage of how hard sphere dynamics and excess entropy can be analytically expressed in terms of the liquid packing fraction. Inspired by classical equilibrium perturbation theories and recent advances in constructing hard sphere models for predicting activated dynamics of supercooled liquids, we propose a new approach for understanding the diffusion of molecular liquids in the normal regime using hard sphere reference fluids. The proposed "fluctuation matching" is designed to have the same amplitude of long wavelength density fluctuations (dimensionless compressibility) as the CG system. Utilizing the Enskog theory to derive an expression for hard sphere diffusion coefficients, a bridge between the CG dynamics and excess entropy is then established. The CG diffusion coefficient can be roughly estimated using various equations of the state, and an accurate prediction of accelerated CG dynamics at different temperatures is also possible in advance of running any CG simulation. By introducing another layer of coarsening, these findings provide a more rigorous method to assess excess entropy scaling and understand the accelerated CG dynamics of molecular fluids.
Collapse
Affiliation(s)
- Jaehyeok Jin
- Department of Chemistry, Chicago Center for Theoretical Chemistry, Institute for Biophysical Dynamics, and James Franck Institute, The University of Chicago, Chicago, Illinois 60637, USA
| | - Kenneth S Schweizer
- Department of Material Science, Department of Chemistry, Department of Chemical and Biomolecular Engineering, and Materials Research Laboratory, University of Illinois, Urbana, Illinois 61801, USA
| | - Gregory A Voth
- Department of Chemistry, Chicago Center for Theoretical Chemistry, Institute for Biophysical Dynamics, and James Franck Institute, The University of Chicago, Chicago, Illinois 60637, USA
| |
Collapse
|
49
|
Jin J, Schweizer KS, Voth GA. Understanding dynamics in coarse-grained models. I. Universal excess entropy scaling relationship. J Chem Phys 2023; 158:034103. [PMID: 36681649 DOI: 10.1063/5.0116299] [Citation(s) in RCA: 15] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
Coarse-grained (CG) models facilitate an efficient exploration of complex systems by reducing the unnecessary degrees of freedom of the fine-grained (FG) system while recapitulating major structural correlations. Unlike structural properties, assessing dynamic properties in CG modeling is often unfeasible due to the accelerated dynamics of the CG models, which allows for more efficient structural sampling. Therefore, the ultimate goal of the present series of articles is to establish a better correspondence between the FG and CG dynamics. To assess and compare dynamical properties in the FG and the corresponding CG models, we utilize the excess entropy scaling relationship. For Paper I of this series, we provide evidence that the FG and the corresponding CG counterpart follow the same universal scaling relationship. By carefully reviewing and examining the literature, we develop a new theory to calculate excess entropies for the FG and CG systems while accounting for entropy representability. We demonstrate that the excess entropy scaling idea can be readily applied to liquid water and methanol systems at both the FG and CG resolutions. For both liquids, we reveal that the scaling exponents remain unchanged from the coarse-graining process, indicating that the scaling behavior is universal for the same underlying molecular systems. Combining this finding with the concept of mapping entropy in CG models, we show that the missing entropy plays an important role in accelerating the CG dynamics.
Collapse
Affiliation(s)
- Jaehyeok Jin
- Department of Chemistry, Chicago Center for Theoretical Chemistry, Institute for Biophysical Dynamics, and James Franck Institute, The University of Chicago, Chicago, Illinois 60637, USA
| | - Kenneth S Schweizer
- Department of Material Science, Department of Chemistry, Department of Chemical and Biomolecular Engineering, and Materials Research Laboratory, University of Illinois, Urbana, Illinois 61801, USA
| | - Gregory A Voth
- Department of Chemistry, Chicago Center for Theoretical Chemistry, Institute for Biophysical Dynamics, and James Franck Institute, The University of Chicago, Chicago, Illinois 60637, USA
| |
Collapse
|
50
|
Ouyang J, Sheng Y, Wang W. Recent Advances of Studies on Cell-Penetrating Peptides Based on Molecular Dynamics Simulations. Cells 2022; 11:cells11244016. [PMID: 36552778 PMCID: PMC9776715 DOI: 10.3390/cells11244016] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Revised: 12/09/2022] [Accepted: 12/10/2022] [Indexed: 12/14/2022] Open
Abstract
With the ability to transport cargo molecules across cell membranes with low toxicity, cell-penetrating peptides (CPPs) have become promising candidates for next generation peptide-based drug delivery vectors. Over the past three decades since the first CPP was discovered, a great deal of work has been done on the cellular uptake mechanisms and the applications for the delivery of therapeutic molecules, and significant advances have been made. But so far, we still do not have a precise and unified understanding of the structure-activity relationship of the CPPs. Molecular dynamics (MD) simulations provide a method to reveal peptide-membrane interactions at the atomistic level and have become an effective complement to experiments. In this paper, we review the progress of the MD simulations on CPP-membrane interactions, including the computational methods and technical improvements in the MD simulations, the research achievements in the CPP internalization mechanism, CPP decoration and coupling, and the peptide-induced membrane reactions during the penetration process, as well as the comparison of simulated and experimental results.
Collapse
Affiliation(s)
- Jun Ouyang
- School of Public Courses, Bengbu Medical College, Bengbu 233030, China
- Collaborative Innovation Center of Advanced Microstructures, National Laboratory of Solid State Microstructure, Department of Physics, Nanjing University, Nanjing 210093, China
| | - Yuebiao Sheng
- Collaborative Innovation Center of Advanced Microstructures, National Laboratory of Solid State Microstructure, Department of Physics, Nanjing University, Nanjing 210093, China
- High Performance Computing Center, Nanjing University, Nanjing 210093, China
- Correspondence: (Y.S.); (W.W.)
| | - Wei Wang
- Collaborative Innovation Center of Advanced Microstructures, National Laboratory of Solid State Microstructure, Department of Physics, Nanjing University, Nanjing 210093, China
- Correspondence: (Y.S.); (W.W.)
| |
Collapse
|