1
|
Zhao Y, Zhang W, Li T. EPR-Net: constructing a non-equilibrium potential landscape via a variational force projection formulation. Natl Sci Rev 2024; 11:nwae052. [PMID: 38883298 PMCID: PMC11173252 DOI: 10.1093/nsr/nwae052] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2023] [Revised: 01/07/2024] [Accepted: 01/29/2024] [Indexed: 06/18/2024] Open
Abstract
We present EPR-Net, a novel and effective deep learning approach that tackles a crucial challenge in biophysics: constructing potential landscapes for high-dimensional non-equilibrium steady-state systems. EPR-Net leverages a nice mathematical fact that the desired negative potential gradient is simply the orthogonal projection of the driving force of the underlying dynamics in a weighted inner-product space. Remarkably, our loss function has an intimate connection with the steady entropy production rate (EPR), enabling simultaneous landscape construction and EPR estimation. We introduce an enhanced learning strategy for systems with small noise, and extend our framework to include dimensionality reduction and the state-dependent diffusion coefficient case in a unified fashion. Comparative evaluations on benchmark problems demonstrate the superior accuracy, effectiveness and robustness of EPR-Net compared to existing methods. We apply our approach to challenging biophysical problems, such as an eight-dimensional (8D) limit cycle and a 52D multi-stability problem, which provide accurate solutions and interesting insights on constructed landscapes. With its versatility and power, EPR-Net offers a promising solution for diverse landscape construction problems in biophysics.
Collapse
Affiliation(s)
- Yue Zhao
- Center for Data Science, Peking University, Beijing 100871, China
| | - Wei Zhang
- Zuse Institute Berlin, Berlin 14195, Germany
- Department of Mathematics and Computer Science, Freie Universität Berlin, Berlin 14195, Germany
| | - Tiejun Li
- Center for Data Science, Peking University, Beijing 100871, China
- Laboratory of Mathematics and Applied Mathematics (LMAM) and School of Mathematical Sciences, Peking University, Beijing 100871, China
- Center for Machine Learning Research, Peking University, Beijing 100871, China
| |
Collapse
|
2
|
Noid WG, Szukalo RJ, Kidder KM, Lesniewski MC. Rigorous Progress in Coarse-Graining. Annu Rev Phys Chem 2024; 75:21-45. [PMID: 38941523 DOI: 10.1146/annurev-physchem-062123-010821] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/30/2024]
Abstract
Low-resolution coarse-grained (CG) models provide remarkable computational and conceptual advantages for simulating soft materials. In principle, bottom-up CG models can reproduce all structural and thermodynamic properties of atomically detailed models that can be observed at the resolution of the CG model. This review discusses recent progress in developing theory and computational methods for achieving this promise. We first briefly review variational approaches for parameterizing interaction potentials and their relationship to machine learning methods. We then discuss recent approaches for simultaneously improving both the transferability and thermodynamic properties of bottom-up models by rigorously addressing the density and temperature dependence of these potentials. We also briefly discuss exciting progress in modeling high-resolution observables with low-resolution CG models. More generally, we highlight the essential role of the bottom-up framework not only for fundamentally understanding the limitations of prior CG models but also for developing robust computational methods that resolve these limitations in practice.
Collapse
Affiliation(s)
- W G Noid
- Department of Chemistry, Pennsylvania State University, University Park, Pennsylvania, USA;
| | - Ryan J Szukalo
- Department of Chemistry, Pennsylvania State University, University Park, Pennsylvania, USA;
- Current affiliation: Department of Chemistry, Princeton University, Princeton, New Jersey, USA
| | - Katherine M Kidder
- Department of Chemistry, Pennsylvania State University, University Park, Pennsylvania, USA;
| | - Maria C Lesniewski
- Department of Chemistry, Pennsylvania State University, University Park, Pennsylvania, USA;
| |
Collapse
|
3
|
Masella M, Léonforté F. The multi-scale polarizable pseudo-particle solvent coarse-grained approach: From NaCl salt solutions to polyelectrolyte hydration. J Chem Phys 2024; 160:204902. [PMID: 38780384 DOI: 10.1063/5.0194968] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2023] [Accepted: 04/22/2024] [Indexed: 05/25/2024] Open
Abstract
We discuss key parameters that affect the reliability of hybrid simulations in the aqueous phase based on an efficient multi-scale coarse-grained polarizable pseudo-particle approach, denoted as pppl, to model the solvent water, whereas solutes are modeled using an all atom polarizable force field. Among those parameters, the extension of the solvent domain (SD) at the solute vicinity (domain in which each solvent particle corresponds to a single water molecule) and the magnitude of solute/solvent short range polarization damping effects are shown to be pivotal to model NaCl salty aqueous solutions and the hydration of charged systems, such as the hydrophobic polyelectrolyte polymer that we have recently investigated [Masella et al., J. Chem. Phys. 155, 114903 (2021)]. Strong short range damping is pivotal to simulate aqueous salt NaCl solutions at moderate concentration (up to 1.0M). The SD extension (as well as short range damping) has a weak effect on the polymer conformation; however, it plays a pivotal role in computing accurate polymer/solvent interaction energies. As the pppl approach is up to two orders of magnitude computationally more efficient than all atom polarizable force field methods, our results show it to be an efficient alternative route to investigate the equilibrium properties of complex charged molecular systems in extended chemical environments.
Collapse
Affiliation(s)
- Michel Masella
- Laboratoire de Biologie Structurale et Radiobiologie, Service de Bioénergétique, Biologie Structurale et Mécanismes, Institut de Biologie et de Technologies de Saclay, CEA Saclay, F-91191 Gif sur Yvette Cedex, France
| | - Fabien Léonforté
- L'Oréal Group, Research and Innovation, Aulnay-Sous-Bois, France
| |
Collapse
|
4
|
Argudo PG. Lipids and proteins: Insights into the dynamics of assembly, recognition, condensate formation. What is still missing? Biointerphases 2024; 19:038501. [PMID: 38922634 DOI: 10.1116/6.0003662] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2024] [Accepted: 06/03/2024] [Indexed: 06/27/2024] Open
Abstract
Lipid membranes and proteins, which are part of us throughout our lives, have been studied for decades. However, every year, new discoveries show how little we know about them. In a reader-friendly manner for people not involved in the field, this paper tries to serve as a bridge between physicists and biologists and new young researchers diving into the field to show its relevance, pointing out just some of the plethora of lines of research yet to be unraveled. It illustrates how new ways, from experimental to theoretical approaches, are needed in order to understand the structures and interactions that take place in a single lipid, protein, or multicomponent system, as we are still only scratching the surface.
Collapse
Affiliation(s)
- Pablo G Argudo
- Max Planck Institute for Polymer Research (MPI-P), Mainz 55128, Germany
| |
Collapse
|
5
|
Bag S, Meinel MK, Müller-Plathe F. Synthetic Force-Field Database for Training Machine Learning Models to Predict Mobility-Preserving Coarse-Grained Molecular-Simulation Potentials. J Chem Theory Comput 2024; 20:3046-3060. [PMID: 38593205 DOI: 10.1021/acs.jctc.4c00242] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/11/2024]
Abstract
Balancing accuracy and efficiency is a common problem in molecular simulation. This tradeoff is evident in coarse-grained molecular dynamics simulation, which prioritizes efficiency, and all-atom molecular simulation, which prioritizes accuracy. Despite continuous efforts, creating a coarse-grained model that accurately captures both the system's structure and dynamics remains elusive. In this article, we present a data-driven approach for constructing coarse-grained models that aim to describe both the structure and dynamics of the system equally well. While the development of machine learning models is well-received in the scientific community, the significance of dataset creation for these models is often overlooked. However, data-driven approaches cannot progress without a robust dataset. To address this, we construct a database of synthetic coarse-grained potentials generated from unphysical all-atom models. A neural network is trained with the generated database to predict the coarse-grained potentials of real liquids. We evaluate their quality by calculating the combined loss of structural and dynamical accuracy upon coarse-graining. When we compare our machine learning-based coarse-grained potential with the one from iterative Boltzmann inversion, the machine learning prediction turns out better for all eight hydrocarbon liquids we studied. As all-atom surfaces turn more nonspherical, both ways of coarse-graining degrade. Still, the neural network outperforms iterative Boltzmann inversion in constructing good quality coarse-grained models for such cases. The synthetic database and the developed machine learning models are freely available to the community, and we believe that our approach will generate interest in efficiently deriving accurate coarse-grained models for liquids.
Collapse
Affiliation(s)
- Saientan Bag
- Eduard-Zintl-Institut für Anorganische und Physikalische Chemie, Technische Universität Darmstadt, Peter-Grünberg-Str. 8, 64287 Darmstadt, Germany
| | - Melissa K Meinel
- Eduard-Zintl-Institut für Anorganische und Physikalische Chemie, Technische Universität Darmstadt, Peter-Grünberg-Str. 8, 64287 Darmstadt, Germany
| | - Florian Müller-Plathe
- Eduard-Zintl-Institut für Anorganische und Physikalische Chemie, Technische Universität Darmstadt, Peter-Grünberg-Str. 8, 64287 Darmstadt, Germany
| |
Collapse
|
6
|
Karmakar T, Soares TA, Merz KM. Enhancing Coarse-Grained Models through Machine Learning. J Chem Inf Model 2024; 64:2931-2932. [PMID: 38644772 DOI: 10.1021/acs.jcim.4c00537] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/23/2024]
Affiliation(s)
- Tarak Karmakar
- Department of Chemistry, Indian Institute of Technology, Delhi, Hauz Khas, New Delhi 110016, India
| | - Thereza A Soares
- Department of Chemistry, FFCLRP, University of São Paulo, Ribeirão Preto 14040-901, Brazil
- Hylleraas Centre for Quantum Molecular Sciences, University of Oslo, Oslo 0315, Norway
| | - Kenneth M Merz
- Department of Chemistry, Michigan State University, Lansing 48824, Michigan, United States
| |
Collapse
|
7
|
Sai L, Fu L, Zhao J. Predicting Binding Energies and Electronic Properties of Boron Nitride Fullerenes Using a Graph Convolutional Network. J Chem Inf Model 2024; 64:2645-2653. [PMID: 38117935 DOI: 10.1021/acs.jcim.3c01708] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2023]
Abstract
As isoelectronic counterparts of carbon fullerenes, medium-sized boron nitride clusters also prefer cage structures composed of even-sized polygons. As the cluster size increases, the number of cage isomers grows rapidly, and determining the ground state structure requires a tremendous amount of DFT calculations. Herein, we develop a graph convolutional network (GCN) that can describe the energy of a (BN)n cage by its topology connection. We define a vertex feature vector on a dual polyhedron by the permutation of the neighbor vertices' degree and aggregate the information on vertices by two graph convolutional layers to learn the local feature of the dual polyhedron. The GCN is trained on (BN)28 and subsequently tested on (BN)23 and (BN)24 data sets, which satisfactorily reproduce the order of isomer energies from DFT calculations. We further employ the trained GCN to predict the ground state structures within the size range of n = 25-32, which agree well with DFT results. Using the same GCN framework, we also successfully trained the highest-occupied or lowest-unoccupied orbital energies of (BN)28 isomers. The present graph convolutional network establishes a direct mapping between the topological connection and the energetic or electronic properties of a cage-like cluster or molecule.
Collapse
Affiliation(s)
- Linwei Sai
- Department of Mathematics, Hohai University, Changzhou 213200, China
| | - Li Fu
- Key Laboratory of Materials Modification by Laser, Ion and Electron Beams, Dalian University of Technology, Ministry of Education, Dalian 116024, China
| | - Jijun Zhao
- Key Laboratory of Materials Modification by Laser, Ion and Electron Beams, Dalian University of Technology, Ministry of Education, Dalian 116024, China
| |
Collapse
|
8
|
Xie P, Car R, E W. Ab initio generalized Langevin equation. Proc Natl Acad Sci U S A 2024; 121:e2308668121. [PMID: 38551836 PMCID: PMC10998567 DOI: 10.1073/pnas.2308668121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2023] [Accepted: 02/22/2024] [Indexed: 04/08/2024] Open
Abstract
We introduce a machine learning-based approach called ab initio generalized Langevin equation (AIGLE) to model the dynamics of slow collective variables (CVs) in materials and molecules. In this scheme, the parameters are learned from atomistic simulations based on ab initio quantum mechanical models. Force field, memory kernel, and noise generator are constructed in the context of the Mori-Zwanzig formalism, under the constraint of the fluctuation-dissipation theorem. Combined with deep potential molecular dynamics and electronic density functional theory, this approach opens the way to multiscale modeling in a variety of situations. Here, we demonstrate this capability with a study of two mesoscale processes in crystalline lead titanate, namely the field-driven dynamics of a planar ferroelectric domain wall, and the dynamics of an extensive lattice of coarse-grained electric dipoles. In the first case, AIGLE extends the reach of ab initio simulations to a regime of noise-driven motions not accessible to molecular dynamics. In the second case, AIGLE deals with an extensive set of CVs by adopting a local approximation for the memory kernel and retaining only short-range noise correlations. The scheme is computationally more efficient than molecular dynamics by several orders of magnitude and mimics the microscopic dynamics at low frequencies where it reproduces accurately the dominant far-infrared absorption frequency.
Collapse
Affiliation(s)
- Pinchen Xie
- Program in Applied and Computational Mathematics, Princeton University, Princeton, NJ08544
| | - Roberto Car
- Program in Applied and Computational Mathematics, Princeton University, Princeton, NJ08544
- Department of Chemistry and Princeton Materials Institute, Princeton University, Princeton, NJ08544
- Department of Physics, Princeton University, Princeton, NJ08544
| | - Weinan E
- AI for Science Institute, Beijing100080, China
- Center for Machine Learning Research and School of Mathematical Sciences, Peking University, Beijing100084, China
| |
Collapse
|
9
|
Wu Z, Zhou T. Structural Coarse-Graining via Multiobjective Optimization with Differentiable Simulation. J Chem Theory Comput 2024; 20:2605-2617. [PMID: 38483262 DOI: 10.1021/acs.jctc.3c01348] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/27/2024]
Abstract
In the realm of multiscale molecular simulations, structure-based coarse-graining is a prominent approach for creating efficient coarse-grained (CG) representations of soft matter systems, such as polymers. This involves optimizing CG interactions by matching static correlation functions of the corresponding degrees of freedom in all-atom (AA) models. Here, we present a versatile method, namely, differentiable coarse-graining (DiffCG), which combines multiobjective optimization and differentiable simulation. The DiffCG approach is capable of constructing robust CG models by iteratively optimizing the effective potentials to simultaneously match multiple target properties. We demonstrate our approach by concurrently optimizing bonded and nonbonded potentials of a CG model of polystyrene (PS) melts. The resulting CG-PS model effectively reproduces both the structural characteristics, such as the equilibrium probability distribution of microscopic degrees of freedom and the thermodynamic pressure of the AA counterpart. More importantly, leveraging the multiobjective optimization capability, we develop a precise and efficient CG model for PS melts that is transferable across a wide range of temperatures, i.e., from 400 to 600 K. It is achieved via optimizing a pairwise potential with nonlinear temperature dependence in the CG model to simultaneously match target data from AA-MD simulations at multiple thermodynamic states. The temperature transferable CG-PS model demonstrates its ability to accurately predict the radial distribution functions and density at different temperatures, including those that are not included in the target thermodynamic states. Our work opens up a promising route for developing accurate and transferable CG models of complex soft-matter systems through multiobjective optimization with differentiable simulation.
Collapse
Affiliation(s)
- Zhenghao Wu
- Department of Chemistry, Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, P. R. China
| | - Tianhang Zhou
- College of Carbon Neutrality Future Technology, State Key Laboratory of Heavy Oil Processing, China University of Petroleum (Beijing), Beijing 102249, P. R. China
| |
Collapse
|
10
|
Nie Y, Zheng Z, Li C, Zhan H, Kou L, Gu Y, Lü C. Resolving the dynamic properties of entangled linear polymers in non-equilibrium coarse grain simulation with a priori scaling factors. NANOSCALE 2024. [PMID: 38494916 DOI: 10.1039/d3nr06185j] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/19/2024]
Abstract
The molecular weight of polymers can influence the material properties, but the molecular weight at the experiment level sometimes can be a huge burden for property prediction with full-atomic simulations. The traditional bottom-up coarse grain (CG) simulation can reduce the computation cost. However, the dynamic properties predicted by the CG simulation can deviate from the full-atomic simulation result. Usually, in CG simulations, the diffusion is faster and the viscosity and modulus are much lower. The fast dynamics in CG are usually solved by a posteriori scaling on time, temperature, or potential modifications, which usually have poor transferability to other non-fitted physical properties because of a lack of fundamental physics. In this work, a priori scaling factors were calculated by the loss of degrees of freedom and implemented in the iterative Boltzmann inversion. According to the simulation results on 3 different CG levels at different temperatures and loading rates, such a priori scaling factors can help in reproducing some dynamic properties of polycaprolactone in CG simulation more accurately, such as heat capacity, Young's modulus, and viscosity, while maintaining the accuracy in the structural distribution prediction. The transferability of entropy-enthalpy compensation and a dissipative particle dynamics thermostat is also presented for comparison. The proposed method reveals the huge potential for developing customized CG thermostats and offers a simple way to rebuild multiphysics CG models for polymers with good transferability.
Collapse
Affiliation(s)
- Yihan Nie
- College of Civil Engineering and Architecture, Zhejiang University, Hangzhou 310058, China
| | - Zhuoqun Zheng
- School of Astronautics, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China
| | - Chengkai Li
- School of Materials Science and Engineering, Taiyuan University of Science and Technology, Taiyuan 030024, China
| | - Haifei Zhan
- College of Civil Engineering and Architecture, Zhejiang University, Hangzhou 310058, China
- School of Mechanical, Medical and Process Engineering, Queensland University of Technology (QUT), Brisbane QLD 4001, Australia
- Center for Materials Science, Queensland University of Technology (QUT), Brisbane QLD 4001, Australia
| | - Liangzhi Kou
- School of Mechanical, Medical and Process Engineering, Queensland University of Technology (QUT), Brisbane QLD 4001, Australia
- Center for Materials Science, Queensland University of Technology (QUT), Brisbane QLD 4001, Australia
| | - Yuantong Gu
- School of Mechanical, Medical and Process Engineering, Queensland University of Technology (QUT), Brisbane QLD 4001, Australia
- Center for Materials Science, Queensland University of Technology (QUT), Brisbane QLD 4001, Australia
| | - Chaofeng Lü
- Faculty of Mechanical Engineering & Mechanics, Ningbo University, Ningbo 315211, China
- College of Civil Engineering and Architecture, Zhejiang University, Hangzhou 310058, China
| |
Collapse
|
11
|
Christians LF, Halingstad EV, Kram E, Okolovitch EM, Pak AJ. Formalizing Coarse-Grained Representations of Anisotropic Interactions at Multimeric Protein Interfaces Using Virtual Sites. J Phys Chem B 2024; 128:1394-1406. [PMID: 38316012 DOI: 10.1021/acs.jpcb.3c07023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2024]
Abstract
Molecular simulations of biomacromolecules that assemble into multimeric complexes remain a challenge due to computationally inaccessible length and time scales. Low-resolution and implicit-solvent coarse-grained modeling approaches using traditional nonbonded interactions (both pairwise and spherically isotropic) have been able to partially address this gap. However, these models may fail to capture the complex anisotropic interactions present at macromolecular interfaces unless higher-order interaction potentials are incorporated at the expense of the computational cost. In this work, we introduce an alternate and systematic approach to represent directional interactions at protein-protein interfaces by using virtual sites restricted to pairwise interactions. We show that virtual site interaction parameters can be optimized within a relative entropy minimization framework by using only information from known statistics between coarse-grained sites. We compare our virtual site models to traditional coarse-grained models using two case studies of multimeric protein assemblies and find that the virtual site models predict pairwise correlations with higher fidelity and, more importantly, assembly behavior that is morphologically consistent with experiments. Our study underscores the importance of anisotropic interaction representations and paves the way for more accurate yet computationally efficient coarse-grained simulations of macromolecular assembly in future research.
Collapse
Affiliation(s)
- Luc F Christians
- Department of Chemical and Biological Engineering, Colorado School of Mines, Golden, Colorado 80401, United States
| | - Ethan V Halingstad
- Department of Chemical and Biological Engineering, Colorado School of Mines, Golden, Colorado 80401, United States
| | - Emiel Kram
- Department of Chemical and Biological Engineering, Colorado School of Mines, Golden, Colorado 80401, United States
| | - Evan M Okolovitch
- Department of Chemical and Biological Engineering, Colorado School of Mines, Golden, Colorado 80401, United States
| | - Alexander J Pak
- Department of Chemical and Biological Engineering, Colorado School of Mines, Golden, Colorado 80401, United States
- Quantitative Biosciences and Engineering Program, Colorado School of Mines, Golden, Colorado 80401, United States
- Materials Science Program, Colorado School of Mines, Golden, Colorado 80401, United States
| |
Collapse
|
12
|
Wang XX, Song T, Lei ZS, Sun XW, Tian JH, Liu ZJ. Study of high-pressure thermophysical properties of orthocarbonate Sr 3CO 5 using deep learning molecular dynamics simulations. Phys Chem Chem Phys 2024; 26:6351-6361. [PMID: 38315085 DOI: 10.1039/d3cp04833k] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2024]
Abstract
The exploration of the physical attributes of the recently discovered orthocarbonate Sr3CO5 is significant for comprehending the carbon cycle and storage mechanisms within the Earth's interior. In this study, first-principles calculations are initially used to examine the structural phase transitions of Sr3CO5 polymorphs within the range of lower mantle pressures. The results suggest that Sr3CO5 with the Cmcm phase exhibits a minimal enthalpy between 8.3 and 30.3 GPa. As the pressure exceeds 30.3 GPa, the Cmcm phase undergoes a transition to the I4/mcm phase, while the experimentally observed Pnma phase remains metastable under our studied pressure. Furthermore, the structural data of SrO, SrCO3, and Sr3CO5 polymorphs are utilized to develop a deep learning potential model suitable for the Sr-C-O system, and the pressure-volume relationship and elastic constants calculated using the potential model are in line with the available results. Subsequently, the elastic properties of Cmcm and I4/mcm phases in Sr3CO5 at high temperature and pressure are calculated using the molecular dynamics method. The results indicate that the I4/mcm phase exhibits higher temperature sensitivity in terms of elastic moduli and wave velocities compared to the Cmcm phase. Finally, the thermodynamic properties of the Cmcm and I4/mcm phases are predicted in the range of 0-2000 K and 10-120 GPa, revealing that the heat capacity and bulk thermal expansion coefficient of both phases increase with temperature, with the constant volume heat capacity gradually approaching the Dulong-Petit limit as the temperature rises.
Collapse
Affiliation(s)
- Xin-Xuan Wang
- School of Mathematics and Physics, Lanzhou Jiaotong University, Lanzhou 730070, China.
| | - Ting Song
- School of Mathematics and Physics, Lanzhou Jiaotong University, Lanzhou 730070, China.
| | - Zhen-Shuai Lei
- Faculty of Science, Wuhan University of Technology, Wuhan 430079, China
| | - Xiao-Wei Sun
- School of Mathematics and Physics, Lanzhou Jiaotong University, Lanzhou 730070, China.
| | - Jun-Hong Tian
- School of Mathematics and Physics, Lanzhou Jiaotong University, Lanzhou 730070, China.
| | - Zi-Jiang Liu
- School of Mathematics and Physics, Lanzhou Jiaotong University, Lanzhou 730070, China.
| |
Collapse
|
13
|
Maier JC, Wang CI, Jackson NE. Distilling coarse-grained representations of molecular electronic structure with continuously gated message passing. J Chem Phys 2024; 160:024109. [PMID: 38193551 DOI: 10.1063/5.0179253] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2023] [Accepted: 12/14/2023] [Indexed: 01/10/2024] Open
Abstract
Bottom-up methods for coarse-grained (CG) molecular modeling are critically needed to establish rigorous links between atomistic reference data and reduced molecular representations. For a target molecule, the ideal reduced CG representation is a function of both the conformational ensemble of the system and the target physical observable(s) to be reproduced at the CG resolution. However, there is an absence of algorithms for selecting CG representations of molecules from which complex properties, including molecular electronic structure, can be accurately modeled. We introduce continuously gated message passing (CGMP), a graph neural network (GNN) method for atomically decomposing molecular electronic structure sampled over conformational ensembles. CGMP integrates 3D-invariant GNNs and a novel gated message passing system to continuously reduce the atomic degrees of freedom accessible for electronic predictions, resulting in a one-shot importance ranking of atoms contributing to a target molecular property. Moreover, CGMP provides the first approach by which to quantify the degeneracy of "good" CG representations conditioned on specific prediction targets, facilitating the development of more transferable CG representations. We further show how CGMP can be used to highlight multiatom correlations, illuminating a path to developing CG electronic Hamiltonians in terms of interpretable collective variables for arbitrarily complex molecules.
Collapse
Affiliation(s)
- J Charlie Maier
- Department of Physics, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
| | - Chun-I Wang
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
| | - Nicholas E Jackson
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
| |
Collapse
|
14
|
Coste A, Slejko E, Zavadlav J, Praprotnik M. Developing an Implicit Solvation Machine Learning Model for Molecular Simulations of Ionic Media. J Chem Theory Comput 2024; 20:411-420. [PMID: 38118122 PMCID: PMC10782447 DOI: 10.1021/acs.jctc.3c00984] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2023] [Revised: 12/04/2023] [Accepted: 12/04/2023] [Indexed: 12/22/2023]
Abstract
Molecular dynamics (MD) simulations of biophysical systems require accurate modeling of their native environment, i.e., aqueous ionic solution, as it critically impacts the structure and function of biomolecules. On the other hand, the models should be computationally efficient to enable simulations of large spatiotemporal scales. Here, we present the deep implicit solvation model for sodium chloride solutions that satisfies both requirements. Owing to the use of the neural network potential, the model can capture the many-body potential of mean force, while the implicit water treatment renders the model inexpensive. We demonstrate our approach first for pure ionic solutions with concentrations ranging from physiological to 2 M. We then extend the model to capture the effective ion interactions in the vicinity and far away from a DNA molecule. In both cases, the structural properties are in good agreement with all-atom MD, showcasing a general methodology for the efficient and accurate modeling of ionic media.
Collapse
Affiliation(s)
- Amaury Coste
- Laboratory
for Molecular Modeling, National Institute of Chemistry, Ljubljana SI-1001, Slovenia
| | - Ema Slejko
- Laboratory
for Molecular Modeling, National Institute of Chemistry, Ljubljana SI-1001, Slovenia
- Department
of Physics, Faculty of Mathematics and Physics, University of Ljubljana, Ljubljana SI-1000, Slovenia
| | - Julija Zavadlav
- Professorship
of Multiscale Modeling of Fluid Materials, TUM School of Engineering
and Design, Technical University of Munich, Garching Near Munich DE-85748, Germany
| | - Matej Praprotnik
- Laboratory
for Molecular Modeling, National Institute of Chemistry, Ljubljana SI-1001, Slovenia
- Department
of Physics, Faculty of Mathematics and Physics, University of Ljubljana, Ljubljana SI-1000, Slovenia
| |
Collapse
|
15
|
Nitta H, Ozawa T, Yasuoka K. Construction of full-atomistic polymer amorphous structures using reverse-mapping from Kremer-Grest models. J Chem Phys 2023; 159:194903. [PMID: 37982485 DOI: 10.1063/5.0159722] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2023] [Accepted: 10/30/2023] [Indexed: 11/21/2023] Open
Abstract
We propose a method to build full-atomistic (FA) amorphous polymer structures using reverse-mapping from coarse-grained (CG) models. In this method, three models with different resolutions are utilized, namely the CG1, CG2, and FA models. It is assumed that the CG1 model is more abstract than the CG2 model. The CG1 is utilized to equilibrate the system, and then sequential reverse-mapping procedures from the CG1 to the CG2 models and from the CG2 to the FA models are conducted. A mapping relation between the CG1 and the FA models is necessary to generate a polymer structure with a given density and radius of chains. Actually, we have used the Kremer-Grest (KG) model as the CG1 and the monomer-level CG model as the CG2 model. Utilizing the mapping relation, we have developed a scheme that constructs an FA polymer model from the KG model. In the scheme, the KG model, the monomer level CG model, and the FA model are successively constructed. The scheme is applied to polyethylene (PE), cis 1,4-polybutadiene (PB), and poly(methyl methacrylate) (PMMA). As a validation, the structures of PE and PB constructed by the scheme were carefully checked through comparison with those obtained using long-time FA molecular dynamics (MD) simulations. We found that both short- and long-range chain structures constructed by the scheme reproduced those obtained by the FA MD simulations. Then, as an interesting application, the scheme is applied to generate an entangled PMMA structure. The results showed that the scheme provides an efficient and easy way to construct amorphous structures of FA polymers.
Collapse
Affiliation(s)
- Hiroya Nitta
- JSOL Corporation, KUDAN-KAIKAN TERRACE 1-6-5, Kudanminami, Chiyoda-ku, Tokyo 102-0074, Japan
| | - Taku Ozawa
- JSOL Corporation, KUDAN-KAIKAN TERRACE 1-6-5, Kudanminami, Chiyoda-ku, Tokyo 102-0074, Japan
| | - Kenji Yasuoka
- Department of Mechanical Engineering, Keio University, 3-14-1 Hiyoshi, Kohoku-ku, Yokohama, Kanagawa 223-8522, Japan
| |
Collapse
|
16
|
Jones MS, Shmilovich K, Ferguson AL. DiAMoNDBack: Diffusion-Denoising Autoregressive Model for Non-Deterministic Backmapping of Cα Protein Traces. J Chem Theory Comput 2023; 19:7908-7923. [PMID: 37906711 DOI: 10.1021/acs.jctc.3c00840] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2023]
Abstract
Coarse-grained molecular models of proteins permit access to length and time scales unattainable by all-atom models and the simulation of processes that occur on long time scales, such as aggregation and folding. The reduced resolution realizes computational accelerations, but an atomistic representation can be vital for a complete understanding of mechanistic details. Backmapping is the process of restoring all-atom resolution to coarse-grained molecular models. In this work, we report DiAMoNDBack (Diffusion-denoising Autoregressive Model for Non-Deterministic Backmapping) as an autoregressive denoising diffusion probability model to restore all-atom details to coarse-grained protein representations retaining only Cα coordinates. The autoregressive generation process proceeds from the protein N-terminus to C-terminus in a residue-by-residue fashion conditioned on the Cα trace and previously backmapped backbone and side-chain atoms within the local neighborhood. The local and autoregressive nature of our model makes it transferable between proteins. The stochastic nature of the denoising diffusion process means that the model generates a realistic ensemble of backbone and side-chain all-atom configurations consistent with the coarse-grained Cα trace. We train DiAMoNDBack over 65k+ structures from the Protein Data Bank (PDB) and validate it in applications to a hold-out PDB test set, intrinsically disordered protein structures from the Protein Ensemble Database (PED), molecular dynamics simulations of fast-folding mini-proteins from DE Shaw Research, and coarse-grained simulation data. We achieve state-of-the-art reconstruction performance in terms of correct bond formation, avoidance of side-chain clashes, and the diversity of the generated side-chain configurational states. We make the DiAMoNDBack model publicly available as a free and open-source Python package.
Collapse
Affiliation(s)
- Michael S Jones
- Pritzker School of Molecular Engineering, University of Chicago, Chicago, Illinois 60637, United States
| | - Kirill Shmilovich
- Pritzker School of Molecular Engineering, University of Chicago, Chicago, Illinois 60637, United States
| | - Andrew L Ferguson
- Pritzker School of Molecular Engineering, University of Chicago, Chicago, Illinois 60637, United States
| |
Collapse
|
17
|
Navarro C, Majewski M, De Fabritiis G. Top-Down Machine Learning of Coarse-Grained Protein Force Fields. J Chem Theory Comput 2023; 19:7518-7526. [PMID: 37874270 PMCID: PMC10777392 DOI: 10.1021/acs.jctc.3c00638] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Indexed: 10/25/2023]
Abstract
Developing accurate and efficient coarse-grained representations of proteins is crucial for understanding their folding, function, and interactions over extended time scales. Our methodology involves simulating proteins with molecular dynamics and utilizing the resulting trajectories to train a neural network potential through differentiable trajectory reweighting. Remarkably, this method requires only the native conformation of proteins, eliminating the need for labeled data derived from extensive simulations or memory-intensive end-to-end differentiable simulations. Once trained, the model can be employed to run parallel molecular dynamics simulations and sample folding events for proteins both within and beyond the training distribution, showcasing its extrapolation capabilities. By applying Markov state models, native-like conformations of the simulated proteins can be predicted from the coarse-grained simulations. Owing to its theoretical transferability and ability to use solely experimental static structures as training data, we anticipate that this approach will prove advantageous for developing new protein force fields and further advancing the study of protein dynamics, folding, and interactions.
Collapse
Affiliation(s)
- Carles Navarro
- Acellera
Labs, Doctor Trueta 183, 08005 Barcelona, Spain
| | | | - Gianni De Fabritiis
- Computational
Science Laboratory, Universitat Pompeu Fabra, Barcelona Biomedical Research Park (PRBB), Carrer Dr. Aiguader 88, 08003 Barcelona, Spain
- Acellera
Ltd., Devonshire House
582, Middlesex HA7 1JS, United Kingdom
- Institució
Catalana de Recerca i Estudis Avançats (ICREA), Passeig Lluis Companys 23, 08010 Barcelona, Spain
| |
Collapse
|
18
|
Lyu L, Lei H. Construction of Coarse-Grained Molecular Dynamics with Many-Body Non-Markovian Memory. PHYSICAL REVIEW LETTERS 2023; 131:177301. [PMID: 37955502 DOI: 10.1103/physrevlett.131.177301] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/05/2023] [Accepted: 09/19/2023] [Indexed: 11/14/2023]
Abstract
We introduce a machine-learning-based coarse-grained molecular dynamics model that faithfully retains the many-body nature of the intermolecular dissipative interactions. Unlike the common empirical coarse-grained models, the present model is constructed based on the Mori-Zwanzig formalism and naturally inherits the heterogeneous state-dependent memory term rather than matching the mean-field metrics such as the velocity autocorrelation function. Numerical results show that preserving the many-body nature of the memory term is crucial for predicting the collective transport and diffusion processes, where empirical forms generally show limitations.
Collapse
Affiliation(s)
- Liyao Lyu
- Department of Computational Mathematics, Science and Engineering, Michigan State University, East Lansing, Michigan 48824, USA
| | - Huan Lei
- Department of Computational Mathematics, Science and Engineering, Michigan State University, East Lansing, Michigan 48824, USA
- Department of Statistics and Probability, Michigan State University, East Lansing, Michigan 48824, USA
| |
Collapse
|
19
|
Nadkarni I, Wu H, Aluru NR. Data-Driven Approach to Coarse-Graining Simple Liquids in Confinement. J Chem Theory Comput 2023; 19:7358-7370. [PMID: 37791529 DOI: 10.1021/acs.jctc.3c00633] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/05/2023]
Abstract
We propose a data-driven framework for identifying coarse-grained (CG) Lennard-Jones (LJ) potential parameters in confined systems for simple liquids. Our approach involves the use of a Deep Neural Network (DNN) that is trained to approximate the solution of the Inverse Liquid State (ILST) problem for confined systems. The DNN model inherently incorporates essential physical characteristics specific to confined fluids, enabling an accurate prediction of inhomogeneity effects. By utilizing transfer learning, we predict single-site LJ potentials of simple multiatomic liquids confined in a slit-like channel, which effectively replicate both the fluid structure and molecular force of the target All-Atom (AA) system when the electrostatic interactions are not dominant. In addition, we showcase the synergy between the data-driven approach and the well-known Bottom-Up coarse-graining method utilizing Relative-Entropy (RE) Minimization. Through the sequential utilization of these two methods, the robustness of the iterative RE method is significantly augmented, leading to a remarkable enhancement in convergence.
Collapse
Affiliation(s)
- Ishan Nadkarni
- Walker Department of Mechanical Engineering, The University of Texas at Austin, Austin, Texas 78712, United States
| | - Haiyi Wu
- Oden Institute for Computational Engineering and Sciences, The University of Texas at Austin, Austin, Texas 78712, United States
| | - Narayana R Aluru
- Walker Department of Mechanical Engineering, The University of Texas at Austin, Austin, Texas 78712, United States
- Oden Institute for Computational Engineering and Sciences, The University of Texas at Austin, Austin, Texas 78712, United States
| |
Collapse
|
20
|
Lederer J, Gastegger M, Schütt KT, Kampffmeyer M, Müller KR, Unke OT. Automatic identification of chemical moieties. Phys Chem Chem Phys 2023; 25:26370-26379. [PMID: 37750554 PMCID: PMC10548786 DOI: 10.1039/d3cp03845a] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Accepted: 08/18/2023] [Indexed: 09/27/2023]
Abstract
In recent years, the prediction of quantum mechanical observables with machine learning methods has become increasingly popular. Message-passing neural networks (MPNNs) solve this task by constructing atomic representations, from which the properties of interest are predicted. Here, we introduce a method to automatically identify chemical moieties (molecular building blocks) from such representations, enabling a variety of applications beyond property prediction, which otherwise rely on expert knowledge. The required representation can either be provided by a pretrained MPNN, or be learned from scratch using only structural information. Beyond the data-driven design of molecular fingerprints, the versatility of our approach is demonstrated by enabling the selection of representative entries in chemical databases, the automatic construction of coarse-grained force fields, as well as the identification of reaction coordinates.
Collapse
Affiliation(s)
- Jonas Lederer
- Berlin Institute of Technology (TU Berlin), 10587 Berlin, Germany.
- BIFOLD - Berlin Institute for the Foundations of Learning and Data, Germany
| | - Michael Gastegger
- Berlin Institute of Technology (TU Berlin), 10587 Berlin, Germany.
- BIFOLD - Berlin Institute for the Foundations of Learning and Data, Germany
| | - Kristof T Schütt
- Berlin Institute of Technology (TU Berlin), 10587 Berlin, Germany.
- BIFOLD - Berlin Institute for the Foundations of Learning and Data, Germany
| | - Michael Kampffmeyer
- Department of Physics and Technology, UiT The Arctic University of Norway, 9019 Tromsø, Norway
| | - Klaus-Robert Müller
- Berlin Institute of Technology (TU Berlin), 10587 Berlin, Germany.
- BIFOLD - Berlin Institute for the Foundations of Learning and Data, Germany
- Google Deepmind, Germany
- Department of Artificial Intelligence, Korea University, Seoul 136-713, Korea
- Max Planck Institut für Informatik, 66123 Saarbrücken, Germany
| | - Oliver T Unke
- Berlin Institute of Technology (TU Berlin), 10587 Berlin, Germany.
- BIFOLD - Berlin Institute for the Foundations of Learning and Data, Germany
- Google Deepmind, Germany
| |
Collapse
|
21
|
Faure Beaulieu Z, Nicholas TC, Gardner JLA, Goodwin AL, Deringer VL. Coarse-grained versus fully atomistic machine learning for zeolitic imidazolate frameworks. Chem Commun (Camb) 2023; 59:11405-11408. [PMID: 37668310 PMCID: PMC10513772 DOI: 10.1039/d3cc02265j] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2023] [Accepted: 08/22/2023] [Indexed: 09/06/2023]
Abstract
Zeolitic imidazolate frameworks are widely thought of as being analogous to inorganic AB2 phases. We test the validity of this assumption by comparing simplified and fully atomistic machine-learning models for local environments in ZIFs. Our work addresses the central question to what extent chemical information can be "coarse-grained" in hybrid framework materials.
Collapse
Affiliation(s)
- Zoé Faure Beaulieu
- Department of Chemistry, Inorganic Chemistry Laboratory, University of Oxford, Oxford OX1 3QR, UK.
| | - Thomas C Nicholas
- Department of Chemistry, Inorganic Chemistry Laboratory, University of Oxford, Oxford OX1 3QR, UK.
| | - John L A Gardner
- Department of Chemistry, Inorganic Chemistry Laboratory, University of Oxford, Oxford OX1 3QR, UK.
| | - Andrew L Goodwin
- Department of Chemistry, Inorganic Chemistry Laboratory, University of Oxford, Oxford OX1 3QR, UK.
| | - Volker L Deringer
- Department of Chemistry, Inorganic Chemistry Laboratory, University of Oxford, Oxford OX1 3QR, UK.
| |
Collapse
|
22
|
Majewski M, Pérez A, Thölke P, Doerr S, Charron NE, Giorgino T, Husic BE, Clementi C, Noé F, De Fabritiis G. Machine learning coarse-grained potentials of protein thermodynamics. Nat Commun 2023; 14:5739. [PMID: 37714883 PMCID: PMC10504246 DOI: 10.1038/s41467-023-41343-1] [Citation(s) in RCA: 19] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2023] [Accepted: 08/29/2023] [Indexed: 09/17/2023] Open
Abstract
A generalized understanding of protein dynamics is an unsolved scientific problem, the solution of which is critical to the interpretation of the structure-function relationships that govern essential biological processes. Here, we approach this problem by constructing coarse-grained molecular potentials based on artificial neural networks and grounded in statistical mechanics. For training, we build a unique dataset of unbiased all-atom molecular dynamics simulations of approximately 9 ms for twelve different proteins with multiple secondary structure arrangements. The coarse-grained models are capable of accelerating the dynamics by more than three orders of magnitude while preserving the thermodynamics of the systems. Coarse-grained simulations identify relevant structural states in the ensemble with comparable energetics to the all-atom systems. Furthermore, we show that a single coarse-grained potential can integrate all twelve proteins and can capture experimental structural features of mutated proteins. These results indicate that machine learning coarse-grained potentials could provide a feasible approach to simulate and understand protein dynamics.
Collapse
Affiliation(s)
- Maciej Majewski
- Computational Science Laboratory, Universitat Pompeu Fabra, Barcelona Biomedical Research Park (PRBB), Carrer Dr. Aiguader 88, 08003, Barcelona, Spain
- Acellera Labs, Doctor Trueta 183, 08005, Barcelona, Spain
| | - Adrià Pérez
- Computational Science Laboratory, Universitat Pompeu Fabra, Barcelona Biomedical Research Park (PRBB), Carrer Dr. Aiguader 88, 08003, Barcelona, Spain
- Acellera Labs, Doctor Trueta 183, 08005, Barcelona, Spain
| | - Philipp Thölke
- Computational Science Laboratory, Universitat Pompeu Fabra, Barcelona Biomedical Research Park (PRBB), Carrer Dr. Aiguader 88, 08003, Barcelona, Spain
| | - Stefan Doerr
- Acellera Labs, Doctor Trueta 183, 08005, Barcelona, Spain
| | - Nicholas E Charron
- Department of Physics, Rice University, Houston, TX, 77005, USA
- Center for Theoretical Biological Physics, Rice University, Houston, TX, 77005, USA
- Department of Physics, FU Berlin, Arnimallee 12, 14195, Berlin, Germany
| | - Toni Giorgino
- Biophysics Institute, National Research Council (CNR-IBF), 20133, Milan, Italy
| | - Brooke E Husic
- Department of Mathematics and Computer Science, FU Berlin, Arnimallee 12, 14195, Berlin, Germany
- Lewis Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ, 08540, USA
- Princeton Center for Theoretical Science, Princeton University, Princeton, NJ, 08540, USA
- Center for the Physics of Biological Function, Princeton University, Princeton, NJ, 08540, USA
| | - Cecilia Clementi
- Department of Physics, Rice University, Houston, TX, 77005, USA.
- Center for Theoretical Biological Physics, Rice University, Houston, TX, 77005, USA.
- Department of Physics, FU Berlin, Arnimallee 12, 14195, Berlin, Germany.
- Department of Chemistry, Rice University, Houston, TX, 77005, USA.
| | - Frank Noé
- Department of Physics, FU Berlin, Arnimallee 12, 14195, Berlin, Germany.
- Department of Mathematics and Computer Science, FU Berlin, Arnimallee 12, 14195, Berlin, Germany.
- Department of Chemistry, Rice University, Houston, TX, 77005, USA.
- Microsoft Research AI4Science, Karl-Liebknecht Str. 32, 10178, Berlin, Germany.
| | - Gianni De Fabritiis
- Computational Science Laboratory, Universitat Pompeu Fabra, Barcelona Biomedical Research Park (PRBB), Carrer Dr. Aiguader 88, 08003, Barcelona, Spain.
- Acellera Labs, Doctor Trueta 183, 08005, Barcelona, Spain.
- Institució Catalana de Recerca i Estudis Avançats (ICREA), Passeig Lluis Companys 23, 08010, Barcelona, Spain.
| |
Collapse
|
23
|
Badini S, Regondi S, Pugliese R. Unleashing the Power of Artificial Intelligence in Materials Design. MATERIALS (BASEL, SWITZERLAND) 2023; 16:5927. [PMID: 37687620 PMCID: PMC10488647 DOI: 10.3390/ma16175927] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Revised: 08/25/2023] [Accepted: 08/28/2023] [Indexed: 09/10/2023]
Abstract
The integration of artificial intelligence (AI) algorithms in materials design is revolutionizing the field of materials engineering thanks to their power to predict material properties, design de novo materials with enhanced features, and discover new mechanisms beyond intuition. In addition, they can be used to infer complex design principles and identify high-quality candidates more rapidly than trial-and-error experimentation. From this perspective, herein we describe how these tools can enable the acceleration and enrichment of each stage of the discovery cycle of novel materials with optimized properties. We begin by outlining the state-of-the-art AI models in materials design, including machine learning (ML), deep learning, and materials informatics tools. These methodologies enable the extraction of meaningful information from vast amounts of data, enabling researchers to uncover complex correlations and patterns within material properties, structures, and compositions. Next, a comprehensive overview of AI-driven materials design is provided and its potential future prospects are highlighted. By leveraging such AI algorithms, researchers can efficiently search and analyze databases containing a wide range of material properties, enabling the identification of promising candidates for specific applications. This capability has profound implications across various industries, from drug development to energy storage, where materials performance is crucial. Ultimately, AI-based approaches are poised to revolutionize our understanding and design of materials, ushering in a new era of accelerated innovation and advancement.
Collapse
|
24
|
Wellawatte GP, Hocky GM, White AD. Neural potentials of proteins extrapolate beyond training data. J Chem Phys 2023; 159:085103. [PMID: 37642255 PMCID: PMC10474891 DOI: 10.1063/5.0147240] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2023] [Accepted: 07/31/2023] [Indexed: 08/31/2023] Open
Abstract
We evaluate neural network (NN) coarse-grained (CG) force fields compared to traditional CG molecular mechanics force fields. We conclude that NN force fields are able to extrapolate and sample from unseen regions of the free energy surface when trained with limited data. Our results come from 88 NN force fields trained on different combinations of clustered free energy surfaces from four protein mapped trajectories. We used a statistical measure named total variation similarity to assess the agreement between reference free energy surfaces from mapped atomistic simulations and CG simulations from trained NN force fields. Our conclusions support the hypothesis that NN CG force fields trained with samples from one region of the proteins' free energy surface can, indeed, extrapolate to unseen regions. Additionally, the force matching error was found to only be weakly correlated with a force field's ability to reconstruct the correct free energy surface.
Collapse
Affiliation(s)
- Geemi P. Wellawatte
- Department of Chemistry, University of Rochester, Rochester, New York 14627, USA
| | - Glen M. Hocky
- Department of Chemistry, Simons Center for Computational Physical Chemistry, New York University, New York, New York 10003, USA
| | - Andrew D. White
- Department of Chemical Engineering, University of Rochester, Rochester, New York 14627, USA
| |
Collapse
|
25
|
Sahrmann P, Loose TD, Durumeric AEP, Voth GA. Utilizing Machine Learning to Greatly Expand the Range and Accuracy of Bottom-Up Coarse-Grained Models through Virtual Particles. J Chem Theory Comput 2023; 19:4402-4413. [PMID: 36802592 PMCID: PMC10373655 DOI: 10.1021/acs.jctc.2c01183] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Indexed: 02/22/2023]
Abstract
Coarse-grained (CG) models parametrized using atomistic reference data, i.e., "bottom up" CG models, have proven useful in the study of biomolecules and other soft matter. However, the construction of highly accurate, low resolution CG models of biomolecules remains challenging. We demonstrate in this work how virtual particles, CG sites with no atomistic correspondence, can be incorporated into CG models within the context of relative entropy minimization (REM) as latent variables. The methodology presented, variational derivative relative entropy minimization (VD-REM), enables optimization of virtual particle interactions through a gradient descent algorithm aided by machine learning. We apply this methodology to the challenging case of a solvent-free CG model of a 1,2-dioleoyl-sn-glycero-3-phosphocholine (DOPC) lipid bilayer and demonstrate that introduction of virtual particles captures solvent-mediated behavior and higher-order correlations which REM alone cannot capture in a more standard CG model based only on the mapping of collections of atoms to the CG sites.
Collapse
Affiliation(s)
- Patrick
G. Sahrmann
- Department of Chemistry, Chicago Center
for Theoretical Chemistry, James Franck Institute, and Institute for
Biophysical Dynamics, The University of
Chicago, Chicago, Illinois 60637, United
States
| | - Timothy D. Loose
- Department of Chemistry, Chicago Center
for Theoretical Chemistry, James Franck Institute, and Institute for
Biophysical Dynamics, The University of
Chicago, Chicago, Illinois 60637, United
States
| | - Aleksander E. P. Durumeric
- Department of Chemistry, Chicago Center
for Theoretical Chemistry, James Franck Institute, and Institute for
Biophysical Dynamics, The University of
Chicago, Chicago, Illinois 60637, United
States
| | - Gregory A. Voth
- Department of Chemistry, Chicago Center
for Theoretical Chemistry, James Franck Institute, and Institute for
Biophysical Dynamics, The University of
Chicago, Chicago, Illinois 60637, United
States
| |
Collapse
|
26
|
Krämer A, Durumeric AEP, Charron NE, Chen Y, Clementi C, Noé F. Statistically Optimal Force Aggregation for Coarse-Graining Molecular Dynamics. J Phys Chem Lett 2023; 14:3970-3979. [PMID: 37079800 DOI: 10.1021/acs.jpclett.3c00444] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
Machine-learned coarse-grained (CG) models have the potential for simulating large molecular complexes beyond what is possible with atomistic molecular dynamics. However, training accurate CG models remains a challenge. A widely used methodology for learning bottom-up CG force fields maps forces from all-atom molecular dynamics to the CG representation and matches them with a CG force field on average. We show that there is flexibility in how to map all-atom forces to the CG representation and that the most commonly used mapping methods are statistically inefficient and potentially even incorrect in the presence of constraints in the all-atom simulation. We define an optimization statement for force mappings and demonstrate that substantially improved CG force fields can be learned from the same simulation data when using optimized force maps. The method is demonstrated on the miniproteins chignolin and tryptophan cage and published as open-source code.
Collapse
Affiliation(s)
- Andreas Krämer
- Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 12, 14195 Berlin, Germany
| | - Aleksander E P Durumeric
- Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 12, 14195 Berlin, Germany
| | - Nicholas E Charron
- Department of Physics and Astronomy, Rice University, Houston, Texas 77005, United States
- Center for Theoretical Biological Physics, Rice University, Houston, Texas 77251, United States
- Department of Physics, Freie Universität Berlin, Arnimallee 12, 14195 Berlin, Germany
| | - Yaoyi Chen
- Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 12, 14195 Berlin, Germany
- International Max Planck Research School for Biology and Computation (IMPRS-BAC), Max Planck Institute for Molecular Genetics, 14195 Berlin, Germany
| | - Cecilia Clementi
- Department of Physics and Astronomy, Rice University, Houston, Texas 77005, United States
- Center for Theoretical Biological Physics, Rice University, Houston, Texas 77251, United States
- Department of Physics, Freie Universität Berlin, Arnimallee 12, 14195 Berlin, Germany
- Department of Chemistry, Rice University, Houston, Texas 77005, United States
| | - Frank Noé
- Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 12, 14195 Berlin, Germany
- Department of Physics, Freie Universität Berlin, Arnimallee 12, 14195 Berlin, Germany
- Department of Chemistry, Rice University, Houston, Texas 77005, United States
- Microsoft Research AI4Science, Karl-Liebknecht Straße 32, 10178 Berlin, Germany
| |
Collapse
|
27
|
Ricci E, Vergadou N. Integrating Machine Learning in the Coarse-Grained Molecular Simulation of Polymers. J Phys Chem B 2023; 127:2302-2322. [PMID: 36888553 DOI: 10.1021/acs.jpcb.2c06354] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/09/2023]
Abstract
Machine learning (ML) is having an increasing impact on the physical sciences, engineering, and technology and its integration into molecular simulation frameworks holds great potential to expand their scope of applicability to complex materials and facilitate fundamental knowledge and reliable property predictions, contributing to the development of efficient materials design routes. The application of ML in materials informatics in general, and polymer informatics in particular, has led to interesting results, however great untapped potential lies in the integration of ML techniques into the multiscale molecular simulation methods for the study of macromolecular systems, specifically in the context of Coarse Grained (CG) simulations. In this Perspective, we aim at presenting the pioneering recent research efforts in this direction and discussing how these new ML-based techniques can contribute to critical aspects of the development of multiscale molecular simulation methods for bulk complex chemical systems, especially polymers. Prerequisites for the implementation of such ML-integrated methods and open challenges that need to be met toward the development of general systematic ML-based coarse graining schemes for polymers are discussed.
Collapse
Affiliation(s)
- Eleonora Ricci
- Institute of Nanoscience and Nanotechnology, National Center for Scientific Research "Demokritos", GR-15341 Agia Paraskevi, Athens, Greece
- Institute of Informatics and Telecommunications, National Center for Scientific Research "Demokritos", GR-15341 Agia Paraskevi, Athens, Greece
| | - Niki Vergadou
- Institute of Nanoscience and Nanotechnology, National Center for Scientific Research "Demokritos", GR-15341 Agia Paraskevi, Athens, Greece
| |
Collapse
|
28
|
Yang W, Templeton C, Rosenberger D, Bittracher A, Nüske F, Noé F, Clementi C. Slicing and Dicing: Optimal Coarse-Grained Representation to Preserve Molecular Kinetics. ACS CENTRAL SCIENCE 2023; 9:186-196. [PMID: 36844497 PMCID: PMC9951291 DOI: 10.1021/acscentsci.2c01200] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/10/2022] [Indexed: 05/05/2023]
Abstract
The aim of molecular coarse-graining approaches is to recover relevant physical properties of the molecular system via a lower-resolution model that can be more efficiently simulated. Ideally, the lower resolution still accounts for the degrees of freedom necessary to recover the correct physical behavior. The selection of these degrees of freedom has often relied on the scientist's chemical and physical intuition. In this article, we make the argument that in soft matter contexts desirable coarse-grained models accurately reproduce the long-time dynamics of a system by correctly capturing the rare-event transitions. We propose a bottom-up coarse-graining scheme that correctly preserves the relevant slow degrees of freedom, and we test this idea for three systems of increasing complexity. We show that in contrast to this method existing coarse-graining schemes such as those from information theory or structure-based approaches are not able to recapitulate the slow time scales of the system.
Collapse
Affiliation(s)
- Wangfei Yang
- Center
for Theoretical Biological Physics, Rice
University, Houston, Texas77005, United States
- Graduate
Program in Systems, Synthetic and Physical Biology, Rice University, Houston, Texas77005, United States
| | - Clark Templeton
- Department
of Physics, Freie Universität Berlin, Arnimallee 12, 14195Berlin, Germany
| | - David Rosenberger
- Department
of Physics, Freie Universität Berlin, Arnimallee 12, 14195Berlin, Germany
| | - Andreas Bittracher
- Department
of Mathematics and Computer Science, Freie
Universität Berlin, Arnimallee 12, 14195Berlin, Germany
| | - Feliks Nüske
- Max
Planck Institute for Dynamics of Complex Technical Systems, Sandtorstrasse 1, 39106Magdeburg, Germany
| | - Frank Noé
- Center
for Theoretical Biological Physics, Rice
University, Houston, Texas77005, United States
- Department
of Physics, Freie Universität Berlin, Arnimallee 12, 14195Berlin, Germany
- Department
of Mathematics and Computer Science, Freie
Universität Berlin, Arnimallee 12, 14195Berlin, Germany
- Department
of Chemistry, Rice University, Houston, Texas77005, United States
| | - Cecilia Clementi
- Center
for Theoretical Biological Physics, Rice
University, Houston, Texas77005, United States
- Department
of Physics, Freie Universität Berlin, Arnimallee 12, 14195Berlin, Germany
- Department
of Chemistry, Rice University, Houston, Texas77005, United States
- Department
of Physics, Rice University, Houston, Texas77005, United States
- E-mail:
| |
Collapse
|
29
|
Ge P, Zhang L, Lei H. Machine learning assisted coarse-grained molecular dynamics modeling of meso-scale interfacial fluids. J Chem Phys 2023; 158:064104. [PMID: 36792498 DOI: 10.1063/5.0131567] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023] Open
Abstract
A hallmark of meso-scale interfacial fluids is the multi-faceted, scale-dependent interfacial energy, which often manifests different characteristics across the molecular and continuum scale. The multi-scale nature imposes a challenge to construct reliable coarse-grained (CG) models, where the CG potential function needs to faithfully encode the many-body interactions arising from the unresolved atomistic interactions and account for the heterogeneous density distributions across the interface. We construct the CG models of both single- and two-component polymeric fluid systems based on the recently developed deep coarse-grained potential [Zhang et al., J. Chem. Phys. 149, 034101 (2018)] scheme, where each polymer molecule is modeled as a CG particle. By only using the training samples of the instantaneous force under the thermal equilibrium state, the constructed CG models can accurately reproduce both the probability density function of the void formation in bulk and the spectrum of the capillary wave across the fluid interface. More importantly, the CG models accurately predict the volume-to-area scaling transition for the apolar solvation energy, illustrating the effectiveness to probe the meso-scale collective behaviors encoded with molecular-level fidelity.
Collapse
Affiliation(s)
- Pei Ge
- Department of Computational Mathematics, Science and Engineering, Michigan State University, East Lansing, Michigan 48824, USA
| | | | - Huan Lei
- Department of Computational Mathematics, Science and Engineering, Michigan State University, East Lansing, Michigan 48824, USA
| |
Collapse
|
30
|
Wang W, Wu Z, Dietschreit JCB, Gómez-Bombarelli R. Learning pair potentials using differentiable simulations. J Chem Phys 2023; 158:044113. [PMID: 36725529 DOI: 10.1063/5.0126475] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
Learning pair interactions from experimental or simulation data is of great interest for molecular simulations. We propose a general stochastic method for learning pair interactions from data using differentiable simulations (DiffSim). DiffSim defines a loss function based on structural observables, such as the radial distribution function, through molecular dynamics (MD) simulations. The interaction potentials are then learned directly by stochastic gradient descent, using backpropagation to calculate the gradient of the structural loss metric with respect to the interaction potential through the MD simulation. This gradient-based method is flexible and can be configured to simulate and optimize multiple systems simultaneously. For example, it is possible to simultaneously learn potentials for different temperatures or for different compositions. We demonstrate the approach by recovering simple pair potentials, such as Lennard-Jones systems, from radial distribution functions. We find that DiffSim can be used to probe a wider functional space of pair potentials compared with traditional methods like iterative Boltzmann inversion. We show that our methods can be used to simultaneously fit potentials for simulations at different compositions and temperatures to improve the transferability of the learned potentials.
Collapse
Affiliation(s)
- Wujie Wang
- Department of Materials Science and Engineering, Massachusetts Institute of Technology, 77 Massachusetts Ave., Cambridge, Massachusetts 02139, USA
| | - Zhenghao Wu
- Eduard-Zintl-Institut für Anorganische und Physikalische Chemie, Technische Universität Darmstadt, Alarich-Weiss-Str. 8, 64287 Darmstadt, Germany
| | - Johannes C B Dietschreit
- Department of Materials Science and Engineering, Massachusetts Institute of Technology, 77 Massachusetts Ave., Cambridge, Massachusetts 02139, USA
| | - Rafael Gómez-Bombarelli
- Department of Materials Science and Engineering, Massachusetts Institute of Technology, 77 Massachusetts Ave., Cambridge, Massachusetts 02139, USA
| |
Collapse
|
31
|
Jeong KJ, Jeong S, Lee S, Son CY. Predictive Molecular Models for Charged Materials Systems: From Energy Materials to Biomacromolecules. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2023; 35:e2204272. [PMID: 36373701 DOI: 10.1002/adma.202204272] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/11/2022] [Revised: 07/05/2022] [Indexed: 06/16/2023]
Abstract
Electrostatic interactions play a dominant role in charged materials systems. Understanding the complex correlation between macroscopic properties with microscopic structures is of critical importance to develop rational design strategies for advanced materials. But the complexity of this challenging task is augmented by interfaces present in the charged materials systems, such as electrode-electrolyte interfaces or biological membranes. Over the last decades, predictive molecular simulations that are founded in fundamental physics and optimized for charged interfacial systems have proven their value in providing molecular understanding of physicochemical properties and functional mechanisms for diverse materials. Novel design strategies utilizing predictive models have been suggested as promising route for the rational design of materials with tailored properties. Here, an overview of recent advances in the understanding of charged interfacial systems aided by predictive molecular simulations is presented. Focusing on three types of charged interfaces found in energy materials and biomacromolecules, how the molecular models characterize ion structure, charge transport, morphology relation to the environment, and the thermodynamics/kinetics of molecular binding at the interfaces is discussed. The critical analysis brings two prominent field of energy materials and biological science under common perspective, to stimulate crossover in both research field that have been largely separated.
Collapse
Affiliation(s)
- Kyeong-Jun Jeong
- Department of Chemistry, Pohang University of Science and Technology (POSTECH), Pohang, 790-784, South Korea
| | - Seungwon Jeong
- Department of Chemistry, Pohang University of Science and Technology (POSTECH), Pohang, 790-784, South Korea
| | - Sangmin Lee
- Department of Chemistry, Pohang University of Science and Technology (POSTECH), Pohang, 790-784, South Korea
| | - Chang Yun Son
- Department of Chemistry, Pohang University of Science and Technology (POSTECH), Pohang, 790-784, South Korea
| |
Collapse
|
32
|
Thaler S, Stupp M, Zavadlav J. Deep coarse-grained potentials via relative entropy minimization. J Chem Phys 2022; 157:244103. [PMID: 36586977 DOI: 10.1063/5.0124538] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
Neural network (NN) potentials are a natural choice for coarse-grained (CG) models. Their many-body capacity allows highly accurate approximations of the potential of mean force, promising CG simulations of unprecedented accuracy. CG NN potentials trained bottom-up via force matching (FM), however, suffer from finite data effects: They rely on prior potentials for physically sound predictions outside the training data domain, and the corresponding free energy surface is sensitive to errors in the transition regions. The standard alternative to FM for classical potentials is relative entropy (RE) minimization, which has not yet been applied to NN potentials. In this work, we demonstrate, for benchmark problems of liquid water and alanine dipeptide, that RE training is more data efficient, due to accessing the CG distribution during training, resulting in improved free energy surfaces and reduced sensitivity to prior potentials. In addition, RE learns to correct time integration errors, allowing larger time steps in CG molecular dynamics simulation, while maintaining accuracy. Thus, our findings support the use of training objectives beyond FM, as a promising direction for improving CG NN potential's accuracy and reliability.
Collapse
Affiliation(s)
- Stephan Thaler
- Multiscale Modeling of Fluid Materials, Department of Engineering Physics and Computation, TUM School of Engineering and Design, Technical University of Munich, Munich, Germany
| | - Maximilian Stupp
- Multiscale Modeling of Fluid Materials, Department of Engineering Physics and Computation, TUM School of Engineering and Design, Technical University of Munich, Munich, Germany
| | - Julija Zavadlav
- Multiscale Modeling of Fluid Materials, Department of Engineering Physics and Computation, TUM School of Engineering and Design, Technical University of Munich, Munich, Germany
| |
Collapse
|
33
|
Abstract
Coarse-grained models have proven helpful for simulating complex systems over long time scales to provide molecular insights into various processes. Methodologies for systematic parametrization of the underlying energy function or force field that describes the interactions among different components of the system are of great interest for ensuring simulation accuracy. We present a new method, potential contrasting, to enable efficient learning of force fields that can accurately reproduce the conformational distribution produced with all-atom simulations. Potential contrasting generalizes the noise contrastive estimation method with umbrella sampling to better learn the complex energy landscape of molecular systems. When applied to the Trp-cage protein, we found that the technique produces force fields that thoroughly capture the thermodynamics of the folding process despite the use of only α-carbons in the coarse-grained model. We further showed that potential contrasting could be applied over large data sets that combine the conformational ensembles of many proteins to improve force field transferability. We anticipate potential contrasting as a powerful tool for building general-purpose coarse-grained force fields.
Collapse
Affiliation(s)
- Xinqiang Ding
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Bin Zhang
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| |
Collapse
|
34
|
Jin J, Pak AJ, Durumeric AEP, Loose TD, Voth GA. Bottom-up Coarse-Graining: Principles and Perspectives. J Chem Theory Comput 2022; 18:5759-5791. [PMID: 36070494 PMCID: PMC9558379 DOI: 10.1021/acs.jctc.2c00643] [Citation(s) in RCA: 79] [Impact Index Per Article: 39.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Indexed: 01/14/2023]
Abstract
Large-scale computational molecular models provide scientists a means to investigate the effect of microscopic details on emergent mesoscopic behavior. Elucidating the relationship between variations on the molecular scale and macroscopic observable properties facilitates an understanding of the molecular interactions driving the properties of real world materials and complex systems (e.g., those found in biology, chemistry, and materials science). As a result, discovering an explicit, systematic connection between microscopic nature and emergent mesoscopic behavior is a fundamental goal for this type of investigation. The molecular forces critical to driving the behavior of complex heterogeneous systems are often unclear. More problematically, simulations of representative model systems are often prohibitively expensive from both spatial and temporal perspectives, impeding straightforward investigations over possible hypotheses characterizing molecular behavior. While the reduction in resolution of a study, such as moving from an atomistic simulation to that of the resolution of large coarse-grained (CG) groups of atoms, can partially ameliorate the cost of individual simulations, the relationship between the proposed microscopic details and this intermediate resolution is nontrivial and presents new obstacles to study. Small portions of these complex systems can be realistically simulated. Alone, these smaller simulations likely do not provide insight into collectively emergent behavior. However, by proposing that the driving forces in both smaller and larger systems (containing many related copies of the smaller system) have an explicit connection, systematic bottom-up CG techniques can be used to transfer CG hypotheses discovered using a smaller scale system to a larger system of primary interest. The proposed connection between different CG systems is prescribed by (i) the CG representation (mapping) and (ii) the functional form and parameters used to represent the CG energetics, which approximate potentials of mean force (PMFs). As a result, the design of CG methods that facilitate a variety of physically relevant representations, approximations, and force fields is critical to moving the frontier of systematic CG forward. Crucially, the proposed connection between the system used for parametrization and the system of interest is orthogonal to the optimization used to approximate the potential of mean force present in all systematic CG methods. The empirical efficacy of machine learning techniques on a variety of tasks provides strong motivation to consider these approaches for approximating the PMF and analyzing these approximations.
Collapse
Affiliation(s)
- Jaehyeok Jin
- Department of Chemistry,
Chicago Center for Theoretical Chemistry, Institute for Biophysical
Dynamics, and James Franck Institute, The
University of Chicago, Chicago, Illinois 60637, United States
| | - Alexander J. Pak
- Department of Chemistry,
Chicago Center for Theoretical Chemistry, Institute for Biophysical
Dynamics, and James Franck Institute, The
University of Chicago, Chicago, Illinois 60637, United States
| | - Aleksander E. P. Durumeric
- Department of Chemistry,
Chicago Center for Theoretical Chemistry, Institute for Biophysical
Dynamics, and James Franck Institute, The
University of Chicago, Chicago, Illinois 60637, United States
| | - Timothy D. Loose
- Department of Chemistry,
Chicago Center for Theoretical Chemistry, Institute for Biophysical
Dynamics, and James Franck Institute, The
University of Chicago, Chicago, Illinois 60637, United States
| | - Gregory A. Voth
- Department of Chemistry,
Chicago Center for Theoretical Chemistry, Institute for Biophysical
Dynamics, and James Franck Institute, The
University of Chicago, Chicago, Illinois 60637, United States
| |
Collapse
|
35
|
Mohajerani F, Tyukodi B, Schlicksup CJ, Hadden-Perilla JA, Zlotnick A, Hagan MF. Multiscale Modeling of Hepatitis B Virus Capsid Assembly and Its Dimorphism. ACS NANO 2022; 16:13845-13859. [PMID: 36054910 PMCID: PMC10273259 DOI: 10.1021/acsnano.2c02119] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Hepatitis B virus (HBV) is an endemic, chronic virus that leads to 800000 deaths per year. Central to the HBV lifecycle, the viral core has a protein capsid assembled from many copies of a single protein. The capsid protein adopts different (quasi-equivalent) conformations to form icosahedral capsids containing 180 or 240 proteins: T = 3 or T = 4, respectively, in Caspar-Klug nomenclature. HBV capsid assembly has become an important target for recently developed antivirals; nonetheless, the assembly pathways and mechanisms that control HBV dimorphism remain unclear. We describe computer simulations of the HBV assembly, using a coarse-grained model that has parameters learned from all-atom molecular dynamics simulations of a complete HBV capsid and yet is computationally tractable. Dynamical simulations with the resulting model reproduce experimental observations of HBV assembly pathways and products. By constructing Markov state models and employing transition path theory, we identify pathways leading to T = 3, T = 4, and other experimentally observed capsid morphologies. The analysis shows that capsid polymorphism is promoted by the low HBV capsid bending modulus, where the key factors controlling polymorphism are the conformational energy landscape and protein-protein binding affinities.
Collapse
Affiliation(s)
- Farzaneh Mohajerani
- Martin A. Fisher School of Physics, Brandeis University, Waltham, Massachusetts02453, United States
| | - Botond Tyukodi
- Martin A. Fisher School of Physics, Brandeis University, Waltham, Massachusetts02453, United States
- Department of Physics, Babeş-Bolyai University, 400084Cluj-Napoca, Romania
| | - Christopher J Schlicksup
- Molecular and Cellular Biochemistry Department, Indiana University, Bloomington, Indiana47405, United States
| | - Jodi A Hadden-Perilla
- Department of Chemistry & Biochemistry, University of Delaware, Newark, Delaware19716, United States
| | - Adam Zlotnick
- Molecular and Cellular Biochemistry Department, Indiana University, Bloomington, Indiana47405, United States
| | - Michael F Hagan
- Martin A. Fisher School of Physics, Brandeis University, Waltham, Massachusetts02453, United States
| |
Collapse
|
36
|
Avery C, Patterson J, Grear T, Frater T, Jacobs DJ. Protein Function Analysis through Machine Learning. Biomolecules 2022; 12:1246. [PMID: 36139085 PMCID: PMC9496392 DOI: 10.3390/biom12091246] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2022] [Revised: 08/22/2022] [Accepted: 08/31/2022] [Indexed: 11/16/2022] Open
Abstract
Machine learning (ML) has been an important arsenal in computational biology used to elucidate protein function for decades. With the recent burgeoning of novel ML methods and applications, new ML approaches have been incorporated into many areas of computational biology dealing with protein function. We examine how ML has been integrated into a wide range of computational models to improve prediction accuracy and gain a better understanding of protein function. The applications discussed are protein structure prediction, protein engineering using sequence modifications to achieve stability and druggability characteristics, molecular docking in terms of protein-ligand binding, including allosteric effects, protein-protein interactions and protein-centric drug discovery. To quantify the mechanisms underlying protein function, a holistic approach that takes structure, flexibility, stability, and dynamics into account is required, as these aspects become inseparable through their interdependence. Another key component of protein function is conformational dynamics, which often manifest as protein kinetics. Computational methods that use ML to generate representative conformational ensembles and quantify differences in conformational ensembles important for function are included in this review. Future opportunities are highlighted for each of these topics.
Collapse
Affiliation(s)
- Chris Avery
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC 28223, USA
| | - John Patterson
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC 28223, USA
| | - Tyler Grear
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC 28223, USA
- Department of Physics and Optical Science, University of North Carolina at Charlotte, Charlotte, NC 28223, USA
| | - Theodore Frater
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC 28223, USA
| | - Donald J. Jacobs
- Department of Physics and Optical Science, University of North Carolina at Charlotte, Charlotte, NC 28223, USA
| |
Collapse
|
37
|
Meng Q, Chen J, Ma J, Zhang X, Chen J. Adiabatic models for the quantum dynamics of surface scattering with lattice effects. Phys Chem Chem Phys 2022; 24:16415-16436. [PMID: 35766107 DOI: 10.1039/d2cp01560a] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
In this contribution, we review models for the lattice effects in quantum dynamics calculations on surface scattering, which is important to modeling heterogeneous catalysis for achieving an interpretation of experimental measurements. Unlike dynamics models for reactions in the gas phase, those for heterogeneous reactions have to include the effects of the surface. For manageable computational costs in calculations, the effects of static surface (SS) are firstly modeled as this is simply and easily implemented. Then, the SS model has to be improved to include the effects of the flexible surface, that is the lattice effects. To do this, various surface models have been designed where the coordinates of the surface atoms are introduced in the Hamiltonian operator, especially those of the top surface atom. Based on this model Hamiltonian operator, extensive multi-dimension quantum dynamics calculations can be performed to recover the lattice effects. Here, we first review an overview of the techniques in constructing the Hamiltonian operator, which is a sum of the kinetic energy operator (KEO) and potential energy surface (PES). Since the PES containing the coordinates of the surface atoms in a cell is still expensive, the SS model is often accepted. We consider a mathematical model, called the coupled harmonic oscillator (CHO) model, to introduce the concepts of adiabatic and diabatic representations for separating the molecule and surface. Under the adiabatic model, we further introduce the expansion model where the potential function is Taylor expanded around the optimized geometry of the surface. By an expansion model truncated at the first and second order, various coupling surface models between the molecule and surface are derived. Moreover, by further and deeply understanding the adiabatic representation, an effective Hamiltonian operator is obtained by optimizing the total wave function in factorized form. By this factorized form of wave function and effective Hamiltonian operator, the geometry phase of the surface wave function is theoretically found. This theoretical prediction may be measured by carefully designing experiments. Finally, discussions on the adiabatic representation, the PES construction, and possibility of the classical-dynamics solutions are given. Based on these discussions, a simple outlook on the dynamics of photocatalytics is finally given.
Collapse
Affiliation(s)
- Qingyong Meng
- Department of Chemistry, Northwestern Polytechnical University, West Youyi Road 127, 710072 Xi'an, China.
| | - Junbo Chen
- Department of Chemistry, Northwestern Polytechnical University, West Youyi Road 127, 710072 Xi'an, China. .,Xi'an Modern Chemistry Research Institute, China North Industries Group Corp., Ltd., East Zhangba Road 168, 710065 Xi'an, China
| | - Jianxing Ma
- Department of Chemistry, Northwestern Polytechnical University, West Youyi Road 127, 710072 Xi'an, China.
| | - Xingyu Zhang
- Department of Chemistry, Northwestern Polytechnical University, West Youyi Road 127, 710072 Xi'an, China.
| | - Jun Chen
- State Key Laboratory of Structural Chemistry, Fujian Institute of Research on the Structure of Matter, Chinese Academy of Sciences, Yangqiao Road West 155, 350002 Fuzhou, China.,Fujian Science & Technology Innovation Laboratory for Optoelectronic Information of China, Optoelectronic Industry Base at High-tech Zone, 350108 Fuzhou, China
| |
Collapse
|
38
|
Moradzadeh A, Aluru NR. Many-Body Neural Network-Based Force Field for Structure-Based Coarse-Graining of Water. J Phys Chem A 2022; 126:2031-2041. [PMID: 35316059 DOI: 10.1021/acs.jpca.1c09786] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
High-fidelity results from atomistic simulations can only be obtained by using accurate force-field (FF) parameters. Although empirical FFs are commonly used in the modeling of atomistic systems due to their simplicity, they have many limitations inherent in the crude approximations associated with their analytical form. Recent advances in neural network-based FFs have led to more accurate FFs by using symmetry functions or full many-body expansions. However, this approach leads to several issues including the arbitrariness of the symmetry functions, and the intangible and uninterpretable interactions which are only known once the positions of all atoms are set. More importantly, training is another bottleneck, as high-quality force and energy information is required, which is usually not accessible from experimental data. To solve these issues within the context of structure-based coarse-graining methods, we switch in this work to a local-search method to target the reference structure instead of using conventional backpropagation algorithms used to target the forces and energies of the reference structure. Our FF is decomposed into two-, three-, and higher-order terms, where each term is modeled with a separate neural network. To show the versatility of our method, we study four different systems, namely, Stillinger-Weber particles as an atomistic case and three water models, namely SPC/E, MB-pol, and ab initio, as coarse-graining cases. We show the successful application of our approach, by reproducing structural properties of different water models, followed by providing insight into the role of two-and three-body interactions. The results of all models indicate that the double-well isotropic pair potential, the signature of water-like behavior in an isotropic system, vanishes upon inclusion of the three-body interaction, showing dominance of the three-body interaction over the two-body interaction in water-like behavior with the single-well isotropic pair potential.
Collapse
Affiliation(s)
- A Moradzadeh
- Department of Mechanical Science and Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - N R Aluru
- Oden Institute for Computational Engineering and Sciences, Walker Department of Mechanical Engineering, The University of Texas at Austin, Austin, Texas 78712, United States
| |
Collapse
|
39
|
Jeong J, Moradzadeh A, Aluru NR. Extended DeepILST for Various Thermodynamic States and Applications in Coarse-Graining. J Phys Chem A 2022; 126:1562-1570. [PMID: 35201773 DOI: 10.1021/acs.jpca.1c10865] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Molecular dynamics (MD) simulations are widely used to obtain the microscopic properties of atomistic systems when the interatomic potential or the coarse-grained potential is known. In many practical situations, however, it is necessary to predict the interatomic or coarse-grained potential, which is a tremendous challenge. Many approaches have been developed to predict the potential parameters based on various techniques, including the relative entropy method, integral equation theory, etc., but these methods lack transferability and are limited to a specific range of thermodynamic states. Recently, data-driven and machine learning approaches have been developed to overcome such limitations. In this study, we expand the range of thermodynamic states used to train deep inverse liquid-state theory (DeepILST)1, a deep learning framework for solving the inverse problem of liquid-state theory. We also assess the performance of DeepILST in coarse-graining various multiatom molecules and identify the molecular characteristics that affect the coarse-graining performance of DeepILST.
Collapse
Affiliation(s)
- J Jeong
- Department of Mechanical Science and Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801 United States
| | - A Moradzadeh
- Department of Mechanical Science and Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801 United States
| | - N R Aluru
- Walker Department of Mechanical Engineering, Oden Institute for Computational Engineering & Sciences, The University of Texas at Austin, Austin, Texas 78712 United States
| |
Collapse
|
40
|
DeLyser MR, Noid WG. Coarse-grained models for local density gradients. J Chem Phys 2022; 156:034106. [DOI: 10.1063/5.0075291] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023] Open
Affiliation(s)
- Michael R. DeLyser
- Department of Chemistry, Penn State University, University Park, Pennsylvania 16802, USA
| | - W. G. Noid
- Department of Chemistry, Penn State University, University Park, Pennsylvania 16802, USA
| |
Collapse
|
41
|
Guardiani C, Cecconi F, Chiodo L, Cottone G, Malgaretti P, Maragliano L, Barabash ML, Camisasca G, Ceccarelli M, Corry B, Roth R, Giacomello A, Roux B. Computational methods and theory for ion channel research. ADVANCES IN PHYSICS: X 2022; 7:2080587. [PMID: 35874965 PMCID: PMC9302924 DOI: 10.1080/23746149.2022.2080587] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2022] [Accepted: 05/15/2022] [Indexed: 06/15/2023] Open
Abstract
Ion channels are fundamental biological devices that act as gates in order to ensure selective ion transport across cellular membranes; their operation constitutes the molecular mechanism through which basic biological functions, such as nerve signal transmission and muscle contraction, are carried out. Here, we review recent results in the field of computational research on ion channels, covering theoretical advances, state-of-the-art simulation approaches, and frontline modeling techniques. We also report on few selected applications of continuum and atomistic methods to characterize the mechanisms of permeation, selectivity, and gating in biological and model channels.
Collapse
Affiliation(s)
- C. Guardiani
- Dipartimento di Ingegneria Meccanica e Aerospaziale, Sapienza Università di Roma, Rome, Italy
| | - F. Cecconi
- CNR - Istituto dei Sistemi Complessi, Rome, Italy and Istituto Nazionale di Fisica Nucleare, INFN, Roma1 section. 00185, Roma, Italy
| | - L. Chiodo
- Department of Engineering, Campus Bio-Medico University, Rome, Italy
| | - G. Cottone
- Department of Physics and Chemistry-Emilio Segrè, University of Palermo, Palermo, Italy
| | - P. Malgaretti
- Helmholtz Institute Erlangen-Nürnberg for Renewable Energy (IEK-11), Forschungszentrum Jülich, Erlangen, Germany
| | - L. Maragliano
- Department of Life and Environmental Sciences, Polytechnic University of Marche, Ancona, Italy, and Center for Synaptic Neuroscience and Technology, Istituto Italiano di Tecnologia, Genova, Italy
| | - M. L. Barabash
- Department of Materials Science and Nanoengineering, Rice University, Houston, TX 77005, USA
| | - G. Camisasca
- Dipartimento di Ingegneria Meccanica e Aerospaziale, Sapienza Università di Roma, Rome, Italy
- Dipartimento di Fisica, Università Roma Tre, Rome, Italy
| | - M. Ceccarelli
- Department of Physics and CNR-IOM, University of Cagliari, Monserrato 09042-IT, Italy
| | - B. Corry
- Research School of Biology, The Australian National University, Canberra, ACT 2600, Australia
| | - R. Roth
- Institut Für Theoretische Physik, Eberhard Karls Universität Tübingen, Tübingen, Germany
| | - A. Giacomello
- Dipartimento di Ingegneria Meccanica e Aerospaziale, Sapienza Università di Roma, Rome, Italy
| | - B. Roux
- Department of Biochemistry & Molecular Biology, University of Chicago, Chicago IL, USA
| |
Collapse
|
42
|
Vlachas PR, Zavadlav J, Praprotnik M, Koumoutsakos P. Accelerated Simulations of Molecular Systems through Learning of Effective Dynamics. J Chem Theory Comput 2021; 18:538-549. [PMID: 34890204 DOI: 10.1021/acs.jctc.1c00809] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Simulations are vital for understanding and predicting the evolution of complex molecular systems. However, despite advances in algorithms and special purpose hardware, accessing the time scales necessary to capture the structural evolution of biomolecules remains a daunting task. In this work, we present a novel framework to advance simulation time scales by up to 3 orders of magnitude by learning the effective dynamics (LED) of molecular systems. LED augments the equation-free methodology by employing a probabilistic mapping between coarse and fine scales using mixture density network (MDN) autoencoders and evolves the non-Markovian latent dynamics using long short-term memory MDNs. We demonstrate the effectiveness of LED in the Müller-Brown potential, the Trp cage protein, and the alanine dipeptide. LED identifies explainable reduced-order representations, i.e., collective variables, and can generate, at any instant, all-atom molecular trajectories consistent with the collective variables. We believe that the proposed framework provides a dramatic increase to simulation capabilities and opens new horizons for the effective modeling of complex molecular systems.
Collapse
Affiliation(s)
- Pantelis R Vlachas
- Computational Science and Engineering Laboratory, ETH Zurich, CH-8092, Switzerland
| | - Julija Zavadlav
- Professorship of Multiscale Modeling of Fluid Materials, TUM School of Engineering and Design, Technical University of Munich, 85748 Garching bei München, Germany.,Munich Data Science Institute, Technical University of Munich, 85748 Munich, Germany
| | - Matej Praprotnik
- Laboratory for Molecular Modeling, National Institute of Chemistry, SI-1001 Ljubljana, Slovenia.,Department of Physics, Faculty of Mathematics and Physics, University of Ljubljana, SI-1000 Ljubljana, Slovenia
| | - Petros Koumoutsakos
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, Massachusetts 02138, United States
| |
Collapse
|
43
|
Xu P, Mou X, Guo Q, Fu T, Ren H, Wang G, Li Y, Li G. Coarse-grained molecular dynamics study based on TorchMD. CHINESE J CHEM PHYS 2021. [DOI: 10.1063/1674-0068/cjcp2110218] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Affiliation(s)
- Peijun Xu
- Liaoning Normal University, Dalian 116029, China
| | - Xiaohong Mou
- Liaoning Normal University, Dalian 116029, China
| | - Qiuhan Guo
- Liaoning Normal University, Dalian 116029, China
| | - Ting Fu
- Pharmacy Department of Affiliated Zhongshan Hospital of Dalian University, Dalian 116001, China
| | - Hong Ren
- Department of Ophthalmology Aerospace Center Hospital, Beijing 100049, China
| | - Guiyan Wang
- Dalian Ocean University, Dalian 116029, China
| | - Yan Li
- Dalian Institute of Chemical Physics, State Key Laboratory of Molecular Reaction Dynamics, Dalian 116023, China
| | - Guohui Li
- Dalian Institute of Chemical Physics, State Key Laboratory of Molecular Reaction Dynamics, Dalian 116023, China
| |
Collapse
|
44
|
Duong VT, Diessner EM, Grazioli G, Martin RW, Butts CT. Neural Upscaling from Residue-Level Protein Structure Networks to Atomistic Structures. Biomolecules 2021; 11:biom11121788. [PMID: 34944432 PMCID: PMC8698800 DOI: 10.3390/biom11121788] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2021] [Revised: 11/11/2021] [Accepted: 11/19/2021] [Indexed: 01/01/2023] Open
Abstract
Coarse-graining is a powerful tool for extending the reach of dynamic models of proteins and other biological macromolecules. Topological coarse-graining, in which biomolecules or sets thereof are represented via graph structures, is a particularly useful way of obtaining highly compressed representations of molecular structures, and simulations operating via such representations can achieve substantial computational savings. A drawback of coarse-graining, however, is the loss of atomistic detail—an effect that is especially acute for topological representations such as protein structure networks (PSNs). Here, we introduce an approach based on a combination of machine learning and physically-guided refinement for inferring atomic coordinates from PSNs. This “neural upscaling” procedure exploits the constraints implied by PSNs on possible configurations, as well as differences in the likelihood of observing different configurations with the same PSN. Using a 1 μs atomistic molecular dynamics trajectory of Aβ1–40, we show that neural upscaling is able to effectively recapitulate detailed structural information for intrinsically disordered proteins, being particularly successful in recovering features such as transient secondary structure. These results suggest that scalable network-based models for protein structure and dynamics may be used in settings where atomistic detail is desired, with upscaling employed to impute atomic coordinates from PSNs.
Collapse
Affiliation(s)
- Vy T. Duong
- Department of Chemistry, University of California, Irvine, CA 92697, USA; (V.T.D.); (E.M.D.)
| | - Elizabeth M. Diessner
- Department of Chemistry, University of California, Irvine, CA 92697, USA; (V.T.D.); (E.M.D.)
| | - Gianmarc Grazioli
- Department of Chemistry, San Jose State University, San Jose, CA 95192, USA;
| | - Rachel W. Martin
- Department of Chemistry, University of California, Irvine, CA 92697, USA; (V.T.D.); (E.M.D.)
- Department of Molecular Biology & Biochemistry, University of California, Irvine, CA 92697, USA
- Correspondence: (R.W.M.); (C.T.B.)
| | - Carter T. Butts
- Departments of Sociology, Statistics and Electrical Engineering & Computer Science, University of California, Irvine, CA 92697, USA
- Correspondence: (R.W.M.); (C.T.B.)
| |
Collapse
|
45
|
Thaler S, Zavadlav J. Learning neural network potentials from experimental data via Differentiable Trajectory Reweighting. Nat Commun 2021; 12:6884. [PMID: 34824254 PMCID: PMC8617111 DOI: 10.1038/s41467-021-27241-4] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2021] [Accepted: 11/09/2021] [Indexed: 11/09/2022] Open
Abstract
In molecular dynamics (MD), neural network (NN) potentials trained bottom-up on quantum mechanical data have seen tremendous success recently. Top-down approaches that learn NN potentials directly from experimental data have received less attention, typically facing numerical and computational challenges when backpropagating through MD simulations. We present the Differentiable Trajectory Reweighting (DiffTRe) method, which bypasses differentiation through the MD simulation for time-independent observables. Leveraging thermodynamic perturbation theory, we avoid exploding gradients and achieve around 2 orders of magnitude speed-up in gradient computation for top-down learning. We show effectiveness of DiffTRe in learning NN potentials for an atomistic model of diamond and a coarse-grained model of water based on diverse experimental observables including thermodynamic, structural and mechanical properties. Importantly, DiffTRe also generalizes bottom-up structural coarse-graining methods such as iterative Boltzmann inversion to arbitrary potentials. The presented method constitutes an important milestone towards enriching NN potentials with experimental data, particularly when accurate bottom-up data is unavailable.
Collapse
Affiliation(s)
- Stephan Thaler
- Professorship of Multiscale Modeling of Fluid Materials, TUM School of Engineering and Design, Technical University of Munich, Munich, Germany.
| | - Julija Zavadlav
- Professorship of Multiscale Modeling of Fluid Materials, TUM School of Engineering and Design, Technical University of Munich, Munich, Germany.
- Munich Data Science Institute, Technical University of Munich, Munich, Germany.
| |
Collapse
|
46
|
Dhamankar S, Webb MA. Chemically specific coarse‐graining of polymers: Methods and prospects. JOURNAL OF POLYMER SCIENCE 2021. [DOI: 10.1002/pol.20210555] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Affiliation(s)
- Satyen Dhamankar
- Department of Chemical and Biological Engineering Princeton University Princeton New Jersey USA
| | - Michael A. Webb
- Department of Chemical and Biological Engineering Princeton University Princeton New Jersey USA
| |
Collapse
|
47
|
Badin M, Martoňák R. Nucleating a Different Coordination in a Crystal under Pressure: A Study of the B1-B2 Transition in NaCl by Metadynamics. PHYSICAL REVIEW LETTERS 2021; 127:105701. [PMID: 34533357 DOI: 10.1103/physrevlett.127.105701] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/05/2021] [Accepted: 07/21/2021] [Indexed: 06/13/2023]
Abstract
Here we propose an NPT metadynamics simulation scheme for pressure-induced structural phase transitions, using coordination number and volume as collective variables, and apply it to the reconstructive structural transformation B1-B2 in NaCl. By studying systems with size up to 64 000 atoms we reach a regime beyond collective mechanism and observe transformations proceeding via nucleation and growth. We also reveal the crossover of the transition mechanism from Buerger-like for smaller systems to Watanabe-Tolédano for larger ones. The scheme is likely to be applicable to a broader class of pressure-induced structural transitions, allowing study of complex nucleation effects and bringing simulations closer to realistic conditions.
Collapse
Affiliation(s)
- Matej Badin
- SISSA - Scuola Internazionale Superiore di Studi Avanzati, Via Bonomea 265, 34136 Trieste, Italy
- Department of Experimental Physics, Faculty of Mathematics, Physics and Informatics, Comenius University in Bratislava, Mlynská Dolina F2, 842 48 Bratislava, Slovakia
| | - Roman Martoňák
- Department of Experimental Physics, Faculty of Mathematics, Physics and Informatics, Comenius University in Bratislava, Mlynská Dolina F2, 842 48 Bratislava, Slovakia
| |
Collapse
|
48
|
Greener JG, Jones DT. Differentiable molecular simulation can learn all the parameters in a coarse-grained force field for proteins. PLoS One 2021; 16:e0256990. [PMID: 34473813 PMCID: PMC8412298 DOI: 10.1371/journal.pone.0256990] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2021] [Accepted: 08/19/2021] [Indexed: 11/26/2022] Open
Abstract
Finding optimal parameters for force fields used in molecular simulation is a challenging and time-consuming task, partly due to the difficulty of tuning multiple parameters at once. Automatic differentiation presents a general solution: run a simulation, obtain gradients of a loss function with respect to all the parameters, and use these to improve the force field. This approach takes advantage of the deep learning revolution whilst retaining the interpretability and efficiency of existing force fields. We demonstrate that this is possible by parameterising a simple coarse-grained force field for proteins, based on training simulations of up to 2,000 steps learning to keep the native structure stable. The learned potential matches chemical knowledge and PDB data, can fold and reproduce the dynamics of small proteins, and shows ability in protein design and model scoring applications. Problems in applying differentiable molecular simulation to all-atom models of proteins are discussed along with possible solutions and the variety of available loss functions. The learned potential, simulation scripts and training code are made available at https://github.com/psipred/cgdms.
Collapse
Affiliation(s)
- Joe G. Greener
- Department of Computer Science, University College London, London, United Kingdom
| | - David T. Jones
- Department of Computer Science, University College London, London, United Kingdom
| |
Collapse
|
49
|
Chen Y, Krämer A, Charron NE, Husic BE, Clementi C, Noé F. Machine learning implicit solvation for molecular dynamics. J Chem Phys 2021; 155:084101. [PMID: 34470360 DOI: 10.1063/5.0059915] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Accurate modeling of the solvent environment for biological molecules is crucial for computational biology and drug design. A popular approach to achieve long simulation time scales for large system sizes is to incorporate the effect of the solvent in a mean-field fashion with implicit solvent models. However, a challenge with existing implicit solvent models is that they often lack accuracy or certain physical properties compared to explicit solvent models as the many-body effects of the neglected solvent molecules are difficult to model as a mean field. Here, we leverage machine learning (ML) and multi-scale coarse graining (CG) in order to learn implicit solvent models that can approximate the energetic and thermodynamic properties of a given explicit solvent model with arbitrary accuracy, given enough training data. Following the previous ML-CG models CGnet and CGSchnet, we introduce ISSNet, a graph neural network, to model the implicit solvent potential of mean force. ISSNet can learn from explicit solvent simulation data and be readily applied to molecular dynamics simulations. We compare the solute conformational distributions under different solvation treatments for two peptide systems. The results indicate that ISSNet models can outperform widely used generalized Born and surface area models in reproducing the thermodynamics of small protein systems with respect to explicit solvent. The success of this novel method demonstrates the potential benefit of applying machine learning methods in accurate modeling of solvent effects for in silico research and biomedical applications.
Collapse
Affiliation(s)
- Yaoyi Chen
- Department of Mathematics and Computer Science, Freie Universität, Berlin, Germany
| | - Andreas Krämer
- Department of Mathematics and Computer Science, Freie Universität, Berlin, Germany
| | | | - Brooke E Husic
- Department of Mathematics and Computer Science, Freie Universität, Berlin, Germany
| | - Cecilia Clementi
- Department of Physics, Rice University, Houston, Texas 77005, USA
| | - Frank Noé
- Department of Mathematics and Computer Science, Freie Universität, Berlin, Germany
| |
Collapse
|
50
|
Keith JA, Vassilev-Galindo V, Cheng B, Chmiela S, Gastegger M, Müller KR, Tkatchenko A. Combining Machine Learning and Computational Chemistry for Predictive Insights Into Chemical Systems. Chem Rev 2021; 121:9816-9872. [PMID: 34232033 PMCID: PMC8391798 DOI: 10.1021/acs.chemrev.1c00107] [Citation(s) in RCA: 190] [Impact Index Per Article: 63.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2021] [Indexed: 12/23/2022]
Abstract
Machine learning models are poised to make a transformative impact on chemical sciences by dramatically accelerating computational algorithms and amplifying insights available from computational chemistry methods. However, achieving this requires a confluence and coaction of expertise in computer science and physical sciences. This Review is written for new and experienced researchers working at the intersection of both fields. We first provide concise tutorials of computational chemistry and machine learning methods, showing how insights involving both can be achieved. We follow with a critical review of noteworthy applications that demonstrate how computational chemistry and machine learning can be used together to provide insightful (and useful) predictions in molecular and materials modeling, retrosyntheses, catalysis, and drug design.
Collapse
Affiliation(s)
- John A. Keith
- Department
of Chemical and Petroleum Engineering Swanson School of Engineering, University of Pittsburgh, Pittsburgh, Pennsylvania 15261, United States
| | - Valentin Vassilev-Galindo
- Department
of Physics and Materials Science, University
of Luxembourg, L-1511 Luxembourg City, Luxembourg
| | - Bingqing Cheng
- Accelerate
Programme for Scientific Discovery, Department
of Computer Science and Technology, 15 J. J. Thomson Avenue, Cambridge CB3 0FD, United Kingdom
| | - Stefan Chmiela
- Department
of Software Engineering and Theoretical Computer Science, Technische Universität Berlin, 10587, Berlin, Germany
| | - Michael Gastegger
- Department
of Software Engineering and Theoretical Computer Science, Technische Universität Berlin, 10587, Berlin, Germany
| | - Klaus-Robert Müller
- Machine
Learning Group, Technische Universität
Berlin, 10587, Berlin, Germany
- Department
of Artificial Intelligence, Korea University, Anam-dong, Seongbuk-gu, Seoul, 02841, Korea
- Max-Planck-Institut für Informatik, 66123 Saarbrücken, Germany
- Google Research, Brain Team, 10117 Berlin, Germany
| | - Alexandre Tkatchenko
- Department
of Physics and Materials Science, University
of Luxembourg, L-1511 Luxembourg City, Luxembourg
| |
Collapse
|