51
|
Keith JA, Vassilev-Galindo V, Cheng B, Chmiela S, Gastegger M, Müller KR, Tkatchenko A. Combining Machine Learning and Computational Chemistry for Predictive Insights Into Chemical Systems. Chem Rev 2021; 121:9816-9872. [PMID: 34232033 PMCID: PMC8391798 DOI: 10.1021/acs.chemrev.1c00107] [Citation(s) in RCA: 211] [Impact Index Per Article: 70.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2021] [Indexed: 12/23/2022]
Abstract
Machine learning models are poised to make a transformative impact on chemical sciences by dramatically accelerating computational algorithms and amplifying insights available from computational chemistry methods. However, achieving this requires a confluence and coaction of expertise in computer science and physical sciences. This Review is written for new and experienced researchers working at the intersection of both fields. We first provide concise tutorials of computational chemistry and machine learning methods, showing how insights involving both can be achieved. We follow with a critical review of noteworthy applications that demonstrate how computational chemistry and machine learning can be used together to provide insightful (and useful) predictions in molecular and materials modeling, retrosyntheses, catalysis, and drug design.
Collapse
Affiliation(s)
- John A. Keith
- Department
of Chemical and Petroleum Engineering Swanson School of Engineering, University of Pittsburgh, Pittsburgh, Pennsylvania 15261, United States
| | - Valentin Vassilev-Galindo
- Department
of Physics and Materials Science, University
of Luxembourg, L-1511 Luxembourg City, Luxembourg
| | - Bingqing Cheng
- Accelerate
Programme for Scientific Discovery, Department
of Computer Science and Technology, 15 J. J. Thomson Avenue, Cambridge CB3 0FD, United Kingdom
| | - Stefan Chmiela
- Department
of Software Engineering and Theoretical Computer Science, Technische Universität Berlin, 10587, Berlin, Germany
| | - Michael Gastegger
- Department
of Software Engineering and Theoretical Computer Science, Technische Universität Berlin, 10587, Berlin, Germany
| | - Klaus-Robert Müller
- Machine
Learning Group, Technische Universität
Berlin, 10587, Berlin, Germany
- Department
of Artificial Intelligence, Korea University, Anam-dong, Seongbuk-gu, Seoul, 02841, Korea
- Max-Planck-Institut für Informatik, 66123 Saarbrücken, Germany
- Google Research, Brain Team, 10117 Berlin, Germany
| | - Alexandre Tkatchenko
- Department
of Physics and Materials Science, University
of Luxembourg, L-1511 Luxembourg City, Luxembourg
| |
Collapse
|
52
|
Wang J, Charron N, Husic B, Olsson S, Noé F, Clementi C. Multi-body effects in a coarse-grained protein force field. J Chem Phys 2021; 154:164113. [PMID: 33940848 DOI: 10.1063/5.0041022] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
Abstract
The use of coarse-grained (CG) models is a popular approach to study complex biomolecular systems. By reducing the number of degrees of freedom, a CG model can explore long time- and length-scales inaccessible to computational models at higher resolution. If a CG model is designed by formally integrating out some of the system's degrees of freedom, one expects multi-body interactions to emerge in the effective CG model's energy function. In practice, it has been shown that the inclusion of multi-body terms indeed improves the accuracy of a CG model. However, no general approach has been proposed to systematically construct a CG effective energy that includes arbitrary orders of multi-body terms. In this work, we propose a neural network based approach to address this point and construct a CG model as a multi-body expansion. By applying this approach to a small protein, we evaluate the relative importance of the different multi-body terms in the definition of an accurate model. We observe a slow convergence in the multi-body expansion, where up to five-body interactions are needed to reproduce the free energy of an atomistic model.
Collapse
Affiliation(s)
- Jiang Wang
- Center for Theoretical Biological Physics, Rice University, Houston, Texas 77005, USA
| | - Nicholas Charron
- Center for Theoretical Biological Physics, Rice University, Houston, Texas 77005, USA
| | - Brooke Husic
- Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 6, 14195 Berlin, Germany
| | - Simon Olsson
- Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 6, 14195 Berlin, Germany
| | - Frank Noé
- Center for Theoretical Biological Physics, Rice University, Houston, Texas 77005, USA
| | - Cecilia Clementi
- Center for Theoretical Biological Physics, Rice University, Houston, Texas 77005, USA
| |
Collapse
|
53
|
Luo S, Thachuk M. Conservative Potentials for a Lattice-Mapped Coarse-Grained Scheme. J Phys Chem A 2021; 125:6486-6497. [PMID: 34264666 DOI: 10.1021/acs.jpca.1c02000] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The conservative potential, arising from a coarse-grain (CG) mapping scheme for nonbonded atomistic particles, is studied. This is a bottom-up approach from first-principles that maps atomistic particles to fluid element-like subcells whose centers lie on a regular, cubic lattice. Unlike standard CG mapping schemes, the current one uses dynamic labeling which on-the-fly changes the CG labels of the particles. The subcells can also be different sizes and shapes, in principle. Equilibrium atomistic molecular dynamics trajectories for different Lennard-Jones fluids are calculated and converted to CG ones, from which CG probability distribution functions are calculated. Correlation studies show position and mass CG variables are uncoupled in a given subcell, as are different vector components of position. Furthermore, the strongest coupling occurs with neighboring cells in specific directions, and the resulting distribution is well described by a multivariate Gaussian. This implies the CG potential has a generalized quadratic form, whose derivative can be determined analytically. A microscopic rationalization is provided for the signs and relative magnitudes of different correlation coefficients, and in some cases, a connection is made with bulk properties of the fluid. We argue the generalized quadratic form should be robust to changes in the particulars of the CG scheme, as well as the nature of the atomistic intermolecular potential. Only a few potential parameters need to be calculated from the underlying atomistic system. This is significant because it indicates the transferability of this form to other, more complex systems. This transferability will be tested in future work, where mapping schemes with fuzzy boundaries will be considered.
Collapse
Affiliation(s)
- Siwei Luo
- Department of Chemistry, University of British Columbia,Vancouver V6T 1Z1, Canada
| | - Mark Thachuk
- Department of Chemistry, University of British Columbia,Vancouver V6T 1Z1, Canada
| |
Collapse
|
54
|
Ma Z, Wang S, Kim M, Liu K, Chen CL, Pan W. Transfer learning of memory kernels for transferable coarse-graining of polymer dynamics. SOFT MATTER 2021; 17:5864-5877. [PMID: 34096961 DOI: 10.1039/d1sm00364j] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
The present work concerns the transferability of coarse-grained (CG) modeling in reproducing the dynamic properties of the reference atomistic systems across a range of parameters. In particular, we focus on implicit-solvent CG modeling of polymer solutions. The CG model is based on the generalized Langevin equation, where the memory kernel plays the critical role in determining the dynamics in all time scales. Thus, we propose methods for transfer learning of memory kernels. The key ingredient of our methods is Gaussian process regression. By integration with the model order reduction via proper orthogonal decomposition and the active learning technique, the transfer learning can be practically efficient and requires minimum training data. Through two example polymer solution systems, we demonstrate the accuracy and efficiency of the proposed transfer learning methods in the construction of transferable memory kernels. The transferability allows for out-of-sample predictions, even in the extrapolated domain of parameters. Built on the transferable memory kernels, the CG models can reproduce the dynamic properties of polymers in all time scales at different thermodynamic conditions (such as temperature and solvent viscosity) and for different systems with varying concentrations and lengths of polymers.
Collapse
Affiliation(s)
- Zhan Ma
- Department of Mechanical Engineering, University of Wisconsin-Madison, Madison, WI 53706, USA.
| | - Shu Wang
- Department of Mechanical Engineering, University of Wisconsin-Madison, Madison, WI 53706, USA.
| | - Minhee Kim
- Department of Industrial and Systems Engineering, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Kaibo Liu
- Department of Industrial and Systems Engineering, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Chun-Long Chen
- Physical Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352, USA
| | - Wenxiao Pan
- Department of Mechanical Engineering, University of Wisconsin-Madison, Madison, WI 53706, USA.
| |
Collapse
|
55
|
Terao T. Semi-supervised learning for the study of structural formation in colloidal systems via image recognition. JOURNAL OF PHYSICS. CONDENSED MATTER : AN INSTITUTE OF PHYSICS JOURNAL 2021; 33:325901. [PMID: 33962403 DOI: 10.1088/1361-648x/abfee4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/29/2021] [Accepted: 05/07/2021] [Indexed: 06/12/2023]
Abstract
The analysis of the structural formation of colloidal systems using machine learning techniques has recently attracted much attention. In many of these studies, local bond-order parameters (LBOPs) were employed as descriptors, where such LBOPs are suitable mainly for the detection of crystal structures. On the other hand, image-based convolutional neural networks (CNNs) are quite effective in detecting not only crystals but also random structures, and the author demonstrated their efficiency in a previous paper. However, in supervised learning, it is difficult to obtain a correct result when there is an unexpected new phase that was unknown when training the CNN. In this paper, we propose a hybrid scheme that consists of supervised and unsupervised learning techniques, employing two different approaches: image-based CNN and generalized LBOP. The proposed method was applied to two-dimensional colloidal systems, and its efficiency was demonstrated.
Collapse
Affiliation(s)
- Takamichi Terao
- Department of Electrical, Electronic and Computer Engineering, Gifu University, Gifu, Japan
| |
Collapse
|
56
|
Yang X, Zhuang Y, Zhu J, Le J, Cheng J. Recent progress on multiscale modeling of electrochemistry. WIRES COMPUTATIONAL MOLECULAR SCIENCE 2021. [DOI: 10.1002/wcms.1559] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Affiliation(s)
- Xiao‐Hui Yang
- State Key Laboratory of Physical Chemistry of Solid Surfaces iChEM, College of Chemistry and Chemical Engineering, Xiamen University Xiamen China
| | - Yong‐Bin Zhuang
- State Key Laboratory of Physical Chemistry of Solid Surfaces iChEM, College of Chemistry and Chemical Engineering, Xiamen University Xiamen China
| | - Jia‐Xin Zhu
- State Key Laboratory of Physical Chemistry of Solid Surfaces iChEM, College of Chemistry and Chemical Engineering, Xiamen University Xiamen China
| | - Jia‐Bo Le
- Ningbo Institute of Materials Technology and Engineering Chinese Academy of Sciences Ningbo China
| | - Jun Cheng
- State Key Laboratory of Physical Chemistry of Solid Surfaces iChEM, College of Chemistry and Chemical Engineering, Xiamen University Xiamen China
| |
Collapse
|
57
|
Westermayr J, Gastegger M, Schütt KT, Maurer RJ. Perspective on integrating machine learning into computational chemistry and materials science. J Chem Phys 2021; 154:230903. [PMID: 34241249 DOI: 10.1063/5.0047760] [Citation(s) in RCA: 67] [Impact Index Per Article: 22.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
Abstract
Machine learning (ML) methods are being used in almost every conceivable area of electronic structure theory and molecular simulation. In particular, ML has become firmly established in the construction of high-dimensional interatomic potentials. Not a day goes by without another proof of principle being published on how ML methods can represent and predict quantum mechanical properties-be they observable, such as molecular polarizabilities, or not, such as atomic charges. As ML is becoming pervasive in electronic structure theory and molecular simulation, we provide an overview of how atomistic computational modeling is being transformed by the incorporation of ML approaches. From the perspective of the practitioner in the field, we assess how common workflows to predict structure, dynamics, and spectroscopy are affected by ML. Finally, we discuss how a tighter and lasting integration of ML methods with computational chemistry and materials science can be achieved and what it will mean for research practice, software development, and postgraduate training.
Collapse
Affiliation(s)
- Julia Westermayr
- Department of Chemistry, University of Warwick, Gibbet Hill Road, Coventry CV4 7AL, United Kingdom
| | - Michael Gastegger
- Machine Learning Group, Technische Universität Berlin, 10587 Berlin, Germany
| | - Kristof T Schütt
- Machine Learning Group, Technische Universität Berlin, 10587 Berlin, Germany
| | - Reinhard J Maurer
- Department of Chemistry, University of Warwick, Gibbet Hill Road, Coventry CV4 7AL, United Kingdom
| |
Collapse
|
58
|
Giulini M, Rigoli M, Mattiotti G, Menichetti R, Tarenzi T, Fiorentini R, Potestio R. From System Modeling to System Analysis: The Impact of Resolution Level and Resolution Distribution in the Computer-Aided Investigation of Biomolecules. Front Mol Biosci 2021; 8:676976. [PMID: 34164432 PMCID: PMC8215203 DOI: 10.3389/fmolb.2021.676976] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2021] [Accepted: 05/06/2021] [Indexed: 12/18/2022] Open
Abstract
The ever increasing computer power, together with the improved accuracy of atomistic force fields, enables researchers to investigate biological systems at the molecular level with remarkable detail. However, the relevant length and time scales of many processes of interest are still hardly within reach even for state-of-the-art hardware, thus leaving important questions often unanswered. The computer-aided investigation of many biological physics problems thus largely benefits from the usage of coarse-grained models, that is, simplified representations of a molecule at a level of resolution that is lower than atomistic. A plethora of coarse-grained models have been developed, which differ most notably in their granularity; this latter aspect determines one of the crucial open issues in the field, i.e. the identification of an optimal degree of coarsening, which enables the greatest simplification at the expenses of the smallest information loss. In this review, we present the problem of coarse-grained modeling in biophysics from the viewpoint of system representation and information content. In particular, we discuss two distinct yet complementary aspects of protein modeling: on the one hand, the relationship between the resolution of a model and its capacity of accurately reproducing the properties of interest; on the other hand, the possibility of employing a lower resolution description of a detailed model to extract simple, useful, and intelligible information from the latter.
Collapse
Affiliation(s)
- Marco Giulini
- Physics Department, University of Trento, Trento, Italy.,INFN-TIFPA, Trento Institute for Fundamental Physics and Applications, Trento, Italy
| | - Marta Rigoli
- Physics Department, University of Trento, Trento, Italy.,INFN-TIFPA, Trento Institute for Fundamental Physics and Applications, Trento, Italy
| | - Giovanni Mattiotti
- Physics Department, University of Trento, Trento, Italy.,INFN-TIFPA, Trento Institute for Fundamental Physics and Applications, Trento, Italy
| | - Roberto Menichetti
- Physics Department, University of Trento, Trento, Italy.,INFN-TIFPA, Trento Institute for Fundamental Physics and Applications, Trento, Italy
| | - Thomas Tarenzi
- Physics Department, University of Trento, Trento, Italy.,INFN-TIFPA, Trento Institute for Fundamental Physics and Applications, Trento, Italy
| | - Raffaele Fiorentini
- Physics Department, University of Trento, Trento, Italy.,INFN-TIFPA, Trento Institute for Fundamental Physics and Applications, Trento, Italy
| | - Raffaello Potestio
- Physics Department, University of Trento, Trento, Italy.,INFN-TIFPA, Trento Institute for Fundamental Physics and Applications, Trento, Italy
| |
Collapse
|
59
|
Friederich P, Häse F, Proppe J, Aspuru-Guzik A. Machine-learned potentials for next-generation matter simulations. NATURE MATERIALS 2021; 20:750-761. [PMID: 34045696 DOI: 10.1038/s41563-020-0777-6] [Citation(s) in RCA: 123] [Impact Index Per Article: 41.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/21/2019] [Accepted: 07/17/2020] [Indexed: 05/18/2023]
Abstract
The choice of simulation methods in computational materials science is driven by a fundamental trade-off: bridging large time- and length-scales with highly accurate simulations at an affordable computational cost. Venturing the investigation of complex phenomena on large scales requires fast yet accurate computational methods. We review the emerging field of machine-learned potentials, which promises to reach the accuracy of quantum mechanical computations at a substantially reduced computational cost. This Review will summarize the basic principles of the underlying machine learning methods, the data acquisition process and active learning procedures. We highlight multiple recent applications of machine-learned potentials in various fields, ranging from organic chemistry and biomolecules to inorganic crystal structure predictions and surface science. We furthermore discuss the developments required to promote a broader use of ML potentials, and the possibility of using them to help solve open questions in materials science and facilitate fully computational materials design.
Collapse
Affiliation(s)
- Pascal Friederich
- Chemical Physics Theory Group, Department of Chemistry, University of Toronto, Toronto, Ontario, Canada
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
- Institute of Nanotechnology, Karlsruhe Institute of Technology, Eggenstein-Leopoldshafen, Germany
- Institute of Theoretical Informatics, Karlsruhe Institute of Technology, Karlsruhe, Germany
| | - Florian Häse
- Chemical Physics Theory Group, Department of Chemistry, University of Toronto, Toronto, Ontario, Canada
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
- Vector Institute for Artificial Intelligence, Toronto, Ontario, Canada
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA, USA
| | - Jonny Proppe
- Chemical Physics Theory Group, Department of Chemistry, University of Toronto, Toronto, Ontario, Canada
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
- Institute of Physical Chemistry, Georg-August University, Göttingen, Germany
| | - Alán Aspuru-Guzik
- Chemical Physics Theory Group, Department of Chemistry, University of Toronto, Toronto, Ontario, Canada.
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada.
- Vector Institute for Artificial Intelligence, Toronto, Ontario, Canada.
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA, USA.
- Lebovic Fellow, Canadian Institute for Advanced Research (CIFAR), Toronto, Ontario, Canada.
| |
Collapse
|
60
|
Cao X, Tian P. "Dividing and Conquering" and "Caching" in Molecular Modeling. Int J Mol Sci 2021; 22:5053. [PMID: 34068835 PMCID: PMC8126232 DOI: 10.3390/ijms22095053] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2021] [Revised: 04/26/2021] [Accepted: 04/27/2021] [Indexed: 11/17/2022] Open
Abstract
Molecular modeling is widely utilized in subjects including but not limited to physics, chemistry, biology, materials science and engineering. Impressive progress has been made in development of theories, algorithms and software packages. To divide and conquer, and to cache intermediate results have been long standing principles in development of algorithms. Not surprisingly, most important methodological advancements in more than half century of molecular modeling are various implementations of these two fundamental principles. In the mainstream classical computational molecular science, tremendous efforts have been invested on two lines of algorithm development. The first is coarse graining, which is to represent multiple basic particles in higher resolution modeling as a single larger and softer particle in lower resolution counterpart, with resulting force fields of partial transferability at the expense of some information loss. The second is enhanced sampling, which realizes "dividing and conquering" and/or "caching" in configurational space with focus either on reaction coordinates and collective variables as in metadynamics and related algorithms, or on the transition matrix and state discretization as in Markov state models. For this line of algorithms, spatial resolution is maintained but results are not transferable. Deep learning has been utilized to realize more efficient and accurate ways of "dividing and conquering" and "caching" along these two lines of algorithmic research. We proposed and demonstrated the local free energy landscape approach, a new framework for classical computational molecular science. This framework is based on a third class of algorithm that facilitates molecular modeling through partially transferable in resolution "caching" of distributions for local clusters of molecular degrees of freedom. Differences, connections and potential interactions among these three algorithmic directions are discussed, with the hope to stimulate development of more elegant, efficient and reliable formulations and algorithms for "dividing and conquering" and "caching" in complex molecular systems.
Collapse
Affiliation(s)
- Xiaoyong Cao
- School of Life Sciences, Jilin University, Changchun 130012, China;
| | - Pu Tian
- School of Life Sciences, Jilin University, Changchun 130012, China;
- School of Artificial Intelligence, Jilin University, Changchun 130012, China
| |
Collapse
|
61
|
Rudzinski JF, Kloth S, Wörner S, Pal T, Kremer K, Bereau T, Vogel M. Dynamical properties across different coarse-grained models for ionic liquids. JOURNAL OF PHYSICS. CONDENSED MATTER : AN INSTITUTE OF PHYSICS JOURNAL 2021; 33:224001. [PMID: 33592598 DOI: 10.1088/1361-648x/abe6e1] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/30/2020] [Accepted: 02/16/2021] [Indexed: 06/12/2023]
Abstract
Room-temperature ionic liquids (RTILs) stand out among molecular liquids for their rich physicochemical characteristics, including structural and dynamic heterogeneity. The significance of electrostatic interactions in RTILs results in long characteristic length- and timescales, and has motivated the development of a number of coarse-grained (CG) simulation models. In this study, we aim to better understand the connection between certain CG parameterization strategies and the dynamical properties and transferability of the resulting models. We systematically compare five CG models: a model largely parameterized from experimental thermodynamic observables; a refinement of this model to increase its structural accuracy; and three models that reproduce a given set of structural distribution functions by construction, with varying intramolecular parameterizations and reference temperatures. All five CG models display limited structural transferability over temperature, and also result in various effective dynamical speedup factors, relative to a reference atomistic model. On the other hand, the structure-based CG models tend to result in more consistent cation-anion relative diffusion than the thermodynamic-based models, for a single thermodynamic state point. By linking short- and long-timescale dynamical behaviors, we demonstrate that the varying dynamical properties of the different CG models can be largely collapsed onto a single curve, which provides evidence for a route to constructing dynamically-consistent CG models of RTILs.
Collapse
Affiliation(s)
| | - Sebastian Kloth
- Institute of Condensed Matter Physics, Technische Universität Darmstadt, Hochschulstr. 6, 64289 Darmstadt, Germany
| | - Svenja Wörner
- Max Planck Institute for Polymer Research, 55128 Mainz, Germany
| | - Tamisra Pal
- Institute of Condensed Matter Physics, Technische Universität Darmstadt, Hochschulstr. 6, 64289 Darmstadt, Germany
| | - Kurt Kremer
- Max Planck Institute for Polymer Research, 55128 Mainz, Germany
| | - Tristan Bereau
- Max Planck Institute for Polymer Research, 55128 Mainz, Germany
- Van 't Hoff Institute for Molecular Sciences and Informatics Institute, University of Amsterdam, Amsterdam 1098 XH, The Netherlands
| | - Michael Vogel
- Institute of Condensed Matter Physics, Technische Universität Darmstadt, Hochschulstr. 6, 64289 Darmstadt, Germany
| |
Collapse
|
62
|
Abstract
Cellulose is the most common biopolymer and widely used in our daily life. Due to its unique properties and biodegradability, it has been attracting increased attention in the recent years and various new applications of cellulose and its derivatives are constantly being found. The development of new materials with improved properties, however, is not always an easy task, and theoretical models and computer simulations can often help in this process. In this review, we give an overview of different coarse-grained models of cellulose and their applications to various systems. Various coarse-grained models with different mapping schemes are presented, which can efficiently simulate systems from the single cellulose fibril/crystal to the assembly of many fibrils/crystals. We also discuss relevant applications of these models with a focus on the mechanical properties, self-assembly, chiral nematic phases, conversion between cellulose allomorphs, composite materials and interactions with other molecules.
Collapse
|
63
|
Sun T, Minhas V, Korolev N, Mirzoev A, Lyubartsev AP, Nordenskiöld L. Bottom-Up Coarse-Grained Modeling of DNA. Front Mol Biosci 2021; 8:645527. [PMID: 33816559 PMCID: PMC8010198 DOI: 10.3389/fmolb.2021.645527] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2020] [Accepted: 02/22/2021] [Indexed: 12/22/2022] Open
Abstract
Recent advances in methodology enable effective coarse-grained modeling of deoxyribonucleic acid (DNA) based on underlying atomistic force field simulations. The so-called bottom-up coarse-graining practice separates fast and slow dynamic processes in molecular systems by averaging out fast degrees of freedom represented by the underlying fine-grained model. The resulting effective potential of interaction includes the contribution from fast degrees of freedom effectively in the form of potential of mean force. The pair-wise additive potential is usually adopted to construct the coarse-grained Hamiltonian for its efficiency in a computer simulation. In this review, we present a few well-developed bottom-up coarse-graining methods, discussing their application in modeling DNA properties such as DNA flexibility (persistence length), conformation, "melting," and DNA condensation.
Collapse
Affiliation(s)
- Tiedong Sun
- School of Biological Sciences, Nanyang Technological University, Singapore, Singapore
| | - Vishal Minhas
- School of Biological Sciences, Nanyang Technological University, Singapore, Singapore
| | - Nikolay Korolev
- School of Biological Sciences, Nanyang Technological University, Singapore, Singapore
| | - Alexander Mirzoev
- School of Biological Sciences, Nanyang Technological University, Singapore, Singapore
| | - Alexander P. Lyubartsev
- Department of Materials and Environmental Chemistry, Stockholm University, Stockholm, Sweden
| | - Lars Nordenskiöld
- School of Biological Sciences, Nanyang Technological University, Singapore, Singapore
| |
Collapse
|
64
|
Ding J, Xu N, Nguyen MT, Qiao Q, Shi Y, He Y, Shao Q. Machine learning for molecular thermodynamics. Chin J Chem Eng 2021. [DOI: 10.1016/j.cjche.2020.10.044] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
|
65
|
Druchok M, Yarish D, Gurbych O, Maksymenko M. Toward efficient generation, correction, and properties control of unique drug‐like structures. J Comput Chem 2021; 42:746-760. [DOI: 10.1002/jcc.26494] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2020] [Revised: 12/21/2020] [Accepted: 01/25/2021] [Indexed: 01/01/2023]
Affiliation(s)
- Maksym Druchok
- SoftServe, Inc Lviv Ukraine
- Institute for Condensed Matter Physics Lviv Ukraine
| | | | | | | |
Collapse
|
66
|
Ye H, Xian W, Li Y. Machine Learning of Coarse-Grained Models for Organic Molecules and Polymers: Progress, Opportunities, and Challenges. ACS OMEGA 2021; 6:1758-1772. [PMID: 33521417 PMCID: PMC7841771 DOI: 10.1021/acsomega.0c05321] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/31/2020] [Accepted: 01/04/2021] [Indexed: 05/02/2023]
Abstract
Machine learning (ML) has emerged as one of the most powerful tools transforming all areas of science and engineering. The nature of molecular dynamics (MD) simulations, complex and time-consuming calculations, makes them particularly suitable for ML research. This review article focuses on recent advancements in developing efficient and accurate coarse-grained (CG) models using various ML methods, in terms of regulating the coarse-graining process, constructing adequate descriptors/features, generating representative training data sets, and optimization of the loss function. Two classes of the CG models are introduced: bottom-up and top-down CG methods. To illustrate these methods and demonstrate the open methodological questions, we survey several important principles in constructing CG models and how these are incorporated into ML methods and improved with specific learning techniques. Finally, we discuss some key aspects of developing machine-learned CG models with high accuracy and efficiency. Besides, we describe how these aspects are tackled in state-of-the-art methods and which remain to be addressed in the near future. We expect that these machine-learned CG models can address thermodynamic consistent, transferable, and representative issues in classical CG models.
Collapse
Affiliation(s)
- Huilin Ye
- Department
of Mechanical Engineering, University of
Connecticut, Storrs, Connecticut 06269, United States
| | - Weikang Xian
- Department
of Mechanical Engineering, University of
Connecticut, Storrs, Connecticut 06269, United States
| | - Ying Li
- Department
of Mechanical Engineering, University of
Connecticut, Storrs, Connecticut 06269, United States
- Polymer
Program, Institute of Materials Science, University of Connecticut, Storrs, Connecticut 06269, United States
- E-mail: . Phone: +1 860 4867110. Fax: +1 860 4865088
| |
Collapse
|
67
|
Abstract
Four decades of molecular theory and computation have helped form the modern understanding of the physical chemistry of organic semiconductors. Whereas these efforts have historically centered around characterizations of electronic structure at the single-molecule or dimer scale, emerging trends in noncrystalline molecular and polymeric semiconductors are motivating the need for modeling techniques capable of morphological and electronic structure predictions at the mesoscale. Provided the challenges associated with these prediction tasks, the community has begun to evolve a computational toolkit for organic semiconductors incorporating techniques from the fields of soft matter, coarse-graining, and machine learning. Here, we highlight recent advances in coarse-grained methodologies aimed at the multiscale characterization of noncrystalline organic semiconductors. As organic semiconductor performance is dependent on the interplay of mesoscale morphology and molecular electronic structure, specific emphasis is placed on coarse-grained modeling approaches capable of both structural and electronic predictions without recourse to all-atom representations.
Collapse
Affiliation(s)
- Nicholas E Jackson
- Department of Chemistry, University of Illinois, Urbana-Champaign, Urbana, Illinois 61801, United States
| |
Collapse
|
68
|
Ramanathan A, Ma H, Parvatikar A, Chennubhotla SC. Artificial intelligence techniques for integrative structural biology of intrinsically disordered proteins. Curr Opin Struct Biol 2021; 66:216-224. [PMID: 33421906 DOI: 10.1016/j.sbi.2020.12.001] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2020] [Revised: 12/01/2020] [Accepted: 12/03/2020] [Indexed: 12/16/2022]
Abstract
We outline recent developments in artificial intelligence (AI) and machine learning (ML) techniques for integrative structural biology of intrinsically disordered proteins (IDP) ensembles. IDPs challenge the traditional protein structure-function paradigm by adapting their conformations in response to specific binding partners leading them to mediate diverse, and often complex cellular functions such as biological signaling, self-organization and compartmentalization. Obtaining mechanistic insights into their function can therefore be challenging for traditional structural determination techniques. Often, scientists have to rely on piecemeal evidence drawn from diverse experimental techniques to characterize their functional mechanisms. Multiscale simulations can help bridge critical knowledge gaps about IDP structure-function relationships-however, these techniques also face challenges in resolving emergent phenomena within IDP conformational ensembles. We posit that scalable statistical inference techniques can effectively integrate information gleaned from multiple experimental techniques as well as from simulations, thus providing access to atomistic details of these emergent phenomena.
Collapse
Affiliation(s)
- Arvind Ramanathan
- Data Science & Learning Division, Argonne National Laboratory, Lemont, IL 60439, United States; Consortium for Advanced Science and Engineering (CASE), University of Chicago, Hyde Park, IL, United States.
| | - Heng Ma
- Data Science & Learning Division, Argonne National Laboratory, Lemont, IL 60439, United States
| | - Akash Parvatikar
- Department of Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA 15260, United States
| | - S Chakra Chennubhotla
- Department of Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA 15260, United States
| |
Collapse
|
69
|
Gao W, Mahajan SP, Sulam J, Gray JJ. Deep Learning in Protein Structural Modeling and Design. PATTERNS (NEW YORK, N.Y.) 2020; 1:100142. [PMID: 33336200 PMCID: PMC7733882 DOI: 10.1016/j.patter.2020.100142] [Citation(s) in RCA: 82] [Impact Index Per Article: 20.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Deep learning is catalyzing a scientific revolution fueled by big data, accessible toolkits, and powerful computational resources, impacting many fields, including protein structural modeling. Protein structural modeling, such as predicting structure from amino acid sequence and evolutionary information, designing proteins toward desirable functionality, or predicting properties or behavior of a protein, is critical to understand and engineer biological systems at the molecular level. In this review, we summarize the recent advances in applying deep learning techniques to tackle problems in protein structural modeling and design. We dissect the emerging approaches using deep learning techniques for protein structural modeling and discuss advances and challenges that must be addressed. We argue for the central importance of structure, following the "sequence → structure → function" paradigm. This review is directed to help both computational biologists to gain familiarity with the deep learning methods applied in protein modeling, and computer scientists to gain perspective on the biologically meaningful problems that may benefit from deep learning techniques.
Collapse
Affiliation(s)
- Wenhao Gao
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Sai Pooja Mahajan
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Jeremias Sulam
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Jeffrey J. Gray
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| |
Collapse
|
70
|
Vassaux M, Gopalakrishnan K, Sinclair RC, Richardson RA, Coveney PV. Accelerating Heterogeneous Multiscale Simulations of Advanced Materials Properties with Graph‐Based Clustering. ADVANCED THEORY AND SIMULATIONS 2020. [DOI: 10.1002/adts.202000234] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Affiliation(s)
- Maxime Vassaux
- Centre for Computational Sciences University College London 20 Gordon Street London WC1H 0AJ UK
| | | | - Robert C. Sinclair
- Centre for Computational Sciences University College London 20 Gordon Street London WC1H 0AJ UK
| | - Robin. A. Richardson
- Centre for Computational Sciences University College London 20 Gordon Street London WC1H 0AJ UK
- Netherlands eScience Center Science Park 140, 1098 XG Amsterdam The Netherlands
| | - Peter V. Coveney
- Centre for Computational Sciences University College London 20 Gordon Street London WC1H 0AJ UK
| |
Collapse
|
71
|
Rudzinski JF, Bereau T. Coarse-grained conformational surface hopping: Methodology and transferability. J Chem Phys 2020; 153:214110. [DOI: 10.1063/5.0031249] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Affiliation(s)
| | - Tristan Bereau
- Max Planck Institute for Polymer Research, 55128 Mainz, Germany
- Van ’t Hoff Institute for Molecular Sciences and Informatics Institute, University of Amsterdam, Amsterdam 1098 XH, The Netherlands
| |
Collapse
|
72
|
Husic BE, Charron NE, Lemm D, Wang J, Pérez A, Majewski M, Krämer A, Chen Y, Olsson S, de Fabritiis G, Noé F, Clementi C. Coarse graining molecular dynamics with graph neural networks. J Chem Phys 2020; 153:194101. [PMID: 33218238 PMCID: PMC7671749 DOI: 10.1063/5.0026133] [Citation(s) in RCA: 71] [Impact Index Per Article: 17.8] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2020] [Accepted: 10/27/2020] [Indexed: 11/14/2022] Open
Abstract
Coarse graining enables the investigation of molecular dynamics for larger systems and at longer timescales than is possible at an atomic resolution. However, a coarse graining model must be formulated such that the conclusions we draw from it are consistent with the conclusions we would draw from a model at a finer level of detail. It has been proved that a force matching scheme defines a thermodynamically consistent coarse-grained model for an atomistic system in the variational limit. Wang et al. [ACS Cent. Sci. 5, 755 (2019)] demonstrated that the existence of such a variational limit enables the use of a supervised machine learning framework to generate a coarse-grained force field, which can then be used for simulation in the coarse-grained space. Their framework, however, requires the manual input of molecular features to machine learn the force field. In the present contribution, we build upon the advance of Wang et al. and introduce a hybrid architecture for the machine learning of coarse-grained force fields that learn their own features via a subnetwork that leverages continuous filter convolutions on a graph neural network architecture. We demonstrate that this framework succeeds at reproducing the thermodynamics for small biomolecular systems. Since the learned molecular representations are inherently transferable, the architecture presented here sets the stage for the development of machine-learned, coarse-grained force fields that are transferable across molecular systems.
Collapse
Affiliation(s)
| | | | - Dominik Lemm
- Computational Science Laboratory, Universitat Pompeu Fabra, PRBB, C/Dr. Aiguader 88, Barcelona, Spain
| | | | - Adrià Pérez
- Computational Science Laboratory, Universitat Pompeu Fabra, PRBB, C/Dr. Aiguader 88, Barcelona, Spain
| | - Maciej Majewski
- Computational Science Laboratory, Universitat Pompeu Fabra, PRBB, C/Dr. Aiguader 88, Barcelona, Spain
| | - Andreas Krämer
- Department of Mathematics and Computer Science, Freie Universität, Berlin, Germany
| | | | - Simon Olsson
- Department of Mathematics and Computer Science, Freie Universität, Berlin, Germany
| | | | | | | |
Collapse
|
73
|
Zhang Y, Young LS. DNN-assisted statistical analysis of a model of local cortical circuits. Sci Rep 2020; 10:20139. [PMID: 33208805 PMCID: PMC7674455 DOI: 10.1038/s41598-020-76770-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2020] [Accepted: 10/20/2020] [Indexed: 01/27/2023] Open
Abstract
In neuroscience, computational modeling is an effective way to gain insight into cortical mechanisms, yet the construction and analysis of large-scale network models—not to mention the extraction of underlying principles—are themselves challenging tasks, due to the absence of suitable analytical tools and the prohibitive costs of systematic numerical exploration of high-dimensional parameter spaces. In this paper, we propose a data-driven approach assisted by deep neural networks (DNN). The idea is to first discover certain input-output relations, and then to leverage this information and the superior computation speeds of the well-trained DNN to guide parameter searches and to deduce theoretical understanding. To illustrate this novel approach, we used as a test case a medium-size network of integrate-and-fire neurons intended to model local cortical circuits. With the help of an accurate yet extremely efficient DNN surrogate, we revealed the statistics of model responses, providing a detailed picture of model behavior. The information obtained is both general and of a fundamental nature, with direct application to neuroscience. Our results suggest that the methodology proposed can be scaled up to larger and more complex biological networks when used in conjunction with other techniques of biological modeling.
Collapse
Affiliation(s)
- Yaoyu Zhang
- School of Mathematical Sciences, Institute of Natural Sciences, MOE-LSC and Qing Yuan Research Institute, Shanghai Jiao Tong University, Shanghai, 200240, China.
| | - Lai-Sang Young
- School of Mathematics and School of Natural Sciences, Institute for Advanced Study, Princeton, NJ, 08540, USA. .,Courant Institute of Mathematical Sciences, New York University, New York, NY, 10012, USA.
| |
Collapse
|
74
|
Kojima T, Washio T, Hara S, Koishi M. Synthesis of computer simulation and machine learning for achieving the best material properties of filled rubber. Sci Rep 2020; 10:18127. [PMID: 33093549 PMCID: PMC7581745 DOI: 10.1038/s41598-020-75038-0] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2020] [Accepted: 10/09/2020] [Indexed: 12/04/2022] Open
Abstract
Molecular dynamics (MD) simulation is used to analyze the mechanical properties of polymerized and nanoscale filled rubber. Unfortunately, the computation time for a simulation can require several months’ computing power, because the interactions of thousands of filler particles must be calculated. To alleviate this problem, we introduce a surrogate convolutional neural network model to achieve faster and more accurate predictions. The major difficulty when employing machine-learning-based surrogate models is the shortage of training data, contributing to the huge simulation costs. To derive a highly accurate surrogate model using only a small amount of training data, we increase the number of training instances by dividing the large-scale simulation results into 3D images of middle-scale filler morphologies and corresponding regional stresses. The images include fringe regions to reflect the influence of the filler constituents outside the core regions. The resultant surrogate model provides higher prediction accuracy than that trained only by images of the entire region. Afterwards, we extract the fillers that dominate the mechanical properties using the surrogate model and we confirm their validity using MD.
Collapse
Affiliation(s)
- Takashi Kojima
- Research and Advanced Development Division, The Yokohama Rubber Co., Ltd., 2-1 Oiwake, Hiratsuka,, Kanagawa,, 254-8601, Japan. .,Department of Reasoning for Intelligence, The Institute of Scientific and Industrial Research, Osaka University, 8-1, Mihogaoka, Ibarakishi, Osaka, 567-0047, Japan.
| | - Takashi Washio
- Department of Reasoning for Intelligence, The Institute of Scientific and Industrial Research, Osaka University, 8-1, Mihogaoka, Ibarakishi, Osaka, 567-0047, Japan
| | - Satoshi Hara
- Department of Reasoning for Intelligence, The Institute of Scientific and Industrial Research, Osaka University, 8-1, Mihogaoka, Ibarakishi, Osaka, 567-0047, Japan
| | - Masataka Koishi
- Research and Advanced Development Division, The Yokohama Rubber Co., Ltd., 2-1 Oiwake, Hiratsuka,, Kanagawa,, 254-8601, Japan
| |
Collapse
|
75
|
Nicholas TC, Goodwin AL, Deringer VL. Understanding the geometric diversity of inorganic and hybrid frameworks through structural coarse-graining. Chem Sci 2020; 11:12580-12587. [PMID: 34123235 PMCID: PMC8162807 DOI: 10.1039/d0sc03287e] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2020] [Accepted: 10/16/2020] [Indexed: 12/13/2022] Open
Abstract
Much of our understanding of complex structures is based on simplification: for example, metal-organic frameworks are often discussed in the context of "nodes" and "linkers", allowing for a qualitative comparison with simpler inorganic structures. Here we show how such an understanding can be obtained in a systematic and quantitative framework, combining atom-density based similarity (kernel) functions and unsupervised machine learning with the long-standing idea of "coarse-graining" atomic structure. We demonstrate how the latter enables a comparison of vastly different chemical systems, and we use it to create a unified, two-dimensional structure map of experimentally known tetrahedral AB2 networks - including clathrate hydrates, zeolitic imidazolate frameworks (ZIFs), and diverse inorganic phases. The structural relationships that emerge can then be linked to microscopic properties of interest, which we exemplify for structural heterogeneity and tetrahedral density.
Collapse
Affiliation(s)
- Thomas C Nicholas
- Department of Chemistry, Inorganic Chemistry Laboratory, University of Oxford Oxford OX1 3QR UK
| | - Andrew L Goodwin
- Department of Chemistry, Inorganic Chemistry Laboratory, University of Oxford Oxford OX1 3QR UK
| | - Volker L Deringer
- Department of Chemistry, Inorganic Chemistry Laboratory, University of Oxford Oxford OX1 3QR UK
| |
Collapse
|
76
|
Joshi SY, Deshmukh SA. A review of advancements in coarse-grained molecular dynamics simulations. MOLECULAR SIMULATION 2020. [DOI: 10.1080/08927022.2020.1828583] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Affiliation(s)
- Soumil Y. Joshi
- Department of Chemical Engineering, Virginia Tech, Blacksburg, VA, USA
| | | |
Collapse
|
77
|
Webb MA, Jackson NE, Gil PS, de Pablo JJ. Targeted sequence design within the coarse-grained polymer genome. SCIENCE ADVANCES 2020; 6:eabc6216. [PMID: 33087352 PMCID: PMC7577717 DOI: 10.1126/sciadv.abc6216] [Citation(s) in RCA: 66] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/04/2020] [Accepted: 09/02/2020] [Indexed: 05/05/2023]
Abstract
The chemical design of polymers with target structural and/or functional properties represents a grand challenge in materials science. While data-driven design approaches are promising, success with polymers has been limited, largely due to limitations in data availability. Here, we demonstrate the targeted sequence design of single-chain structure in polymers by combining coarse-grained modeling, machine learning, and model optimization. Nearly 2000 unique coarse-grained polymers are simulated to construct and analyze machine learning models. We find that deep neural networks inexpensively and reliably predict structural properties with limited sequence information as input. By coupling trained ML models with sequential model-based optimization, polymer sequences are proposed to exhibit globular, swollen, or rod-like behaviors, which are verified by explicit simulations. This work highlights the promising integration of coarse-grained modeling with data-driven design and represents a necessary and crucial step toward more complex polymer design efforts.
Collapse
Affiliation(s)
- Michael A Webb
- Pritzker School of Molecular Engineering, University of Chicago, Chicago, IL 60615, USA
| | - Nicholas E Jackson
- Pritzker School of Molecular Engineering, University of Chicago, Chicago, IL 60615, USA
- Center for Molecular Engineering and Materials Science Division, Argonne National Laboratory, Lemont, IL 06349, USA
| | - Phwey S Gil
- Pritzker School of Molecular Engineering, University of Chicago, Chicago, IL 60615, USA
| | - Juan J de Pablo
- Pritzker School of Molecular Engineering, University of Chicago, Chicago, IL 60615, USA.
- Center for Molecular Engineering and Materials Science Division, Argonne National Laboratory, Lemont, IL 06349, USA
| |
Collapse
|
78
|
Wang S, Ma Z, Pan W. Data-driven coarse-grained modeling of polymers in solution with structural and dynamic properties conserved. SOFT MATTER 2020; 16:8330-8344. [PMID: 32785383 DOI: 10.1039/d0sm01019g] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
We present data-driven coarse-grained (CG) modeling for polymers in solution, which conserves the dynamic as well as structural properties of the underlying atomistic system. The CG modeling is built upon the framework of the generalized Langevin equation (GLE). The key is to determine each term in the GLE by directly linking it to atomistic data. In particular, we propose a two-stage Gaussian process-based Bayesian optimization method to infer the non-Markovian memory kernel from the data of the velocity autocorrelation function (VACF). Considering that the long-time behaviors of the VACF and memory kernel for polymer solutions can exhibit hydrodynamic scaling (algebraic decay with time), we further develop an active learning method to determine the emergence of hydrodynamic scaling, which can accelerate the inference process of the memory kernel. The proposed methods do not rely on how the mean force or CG potential in the GLE is constructed. Thus, we also compare two methods for constructing the CG potential: a deep learning method and the iterative Boltzmann inversion method. With the memory kernel and CG potential determined, the GLE is mapped onto an extended Markovian process to circumvent the expensive cost of directly solving the GLE. The accuracy and computational efficiency of the proposed CG modeling are assessed in a model star-polymer solution system at three representative concentrations. By comparing with the reference atomistic simulation results, we demonstrate that the proposed CG modeling can robustly and accurately reproduce the dynamic and structural properties of polymers in solution.
Collapse
Affiliation(s)
- Shu Wang
- Department of Mechanical Engineering, University of Wisconsin-Madison, Madison, WI 53706, USA.
| | - Zhan Ma
- Department of Mechanical Engineering, University of Wisconsin-Madison, Madison, WI 53706, USA.
| | - Wenxiao Pan
- Department of Mechanical Engineering, University of Wisconsin-Madison, Madison, WI 53706, USA.
| |
Collapse
|
79
|
Mahmoud SSM, Esposito G, Serra G, Fogolari F. Generalized Born radii computation using linear models and neural networks. Bioinformatics 2020; 36:1757-1764. [PMID: 31693089 DOI: 10.1093/bioinformatics/btz818] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2019] [Revised: 10/08/2019] [Accepted: 10/30/2019] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Implicit solvent models play an important role in describing the thermodynamics and the dynamics of biomolecular systems. Key to an efficient use of these models is the computation of generalized Born (GB) radii, which is accomplished by algorithms based on the electrostatics of inhomogeneous dielectric media. The speed and accuracy of such computations are still an issue especially for their intensive use in classical molecular dynamics. Here, we propose an alternative approach that encodes the physics of the phenomena and the chemical structure of the molecules in model parameters which are learned from examples. RESULTS GB radii have been computed using (i) a linear model and (ii) a neural network. The input is the element, the histogram of counts of neighbouring atoms, divided by atom element, within 16 Å. Linear models are ca. 8 times faster than the most widely used reference method and the accuracy is higher with correlation coefficient with the inverse of 'perfect' GB radii of 0.94 versus 0.80 of the reference method. Neural networks further improve the accuracy of the predictions with correlation coefficient with 'perfect' GB radii of 0.97 and ca. 20% smaller root mean square error. AVAILABILITY AND IMPLEMENTATION We provide a C program implementing the computation using the linear model, including the coefficients appropriate for the set of Bondi radii, as Supplementary Material. We also provide a Python implementation of the neural network model with parameter and example files in the Supplementary Material as well. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Saida Saad Mohamed Mahmoud
- Department of Mathematics, Informatics and Physics, University of Udine, Udine 33100, Italy.,Faculty of Science, Cairo University, Giza 12613, Egypt
| | - Gennaro Esposito
- Science and Math Division, New York University at Abu Dhabi, PO Box 129188, Abu Dhabi, United Arab Emirates
| | - Giuseppe Serra
- Department of Mathematics, Informatics and Physics, University of Udine, Udine 33100, Italy
| | - Federico Fogolari
- Department of Mathematics, Informatics and Physics, University of Udine, Udine 33100, Italy
| |
Collapse
|
80
|
Gkeka P, Stoltz G, Barati Farimani A, Belkacemi Z, Ceriotti M, Chodera JD, Dinner AR, Ferguson AL, Maillet JB, Minoux H, Peter C, Pietrucci F, Silveira A, Tkatchenko A, Trstanova Z, Wiewiora R, Lelièvre T. Machine Learning Force Fields and Coarse-Grained Variables in Molecular Dynamics: Application to Materials and Biological Systems. J Chem Theory Comput 2020; 16:4757-4775. [PMID: 32559068 PMCID: PMC8312194 DOI: 10.1021/acs.jctc.0c00355] [Citation(s) in RCA: 87] [Impact Index Per Article: 21.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Machine learning encompasses tools and algorithms that are now becoming popular in almost all scientific and technological fields. This is true for molecular dynamics as well, where machine learning offers promises of extracting valuable information from the enormous amounts of data generated by simulation of complex systems. We provide here a review of our current understanding of goals, benefits, and limitations of machine learning techniques for computational studies on atomistic systems, focusing on the construction of empirical force fields from ab initio databases and the determination of reaction coordinates for free energy computation and enhanced sampling.
Collapse
Affiliation(s)
- Paraskevi Gkeka
- Integrated Drug Discovery, Sanofi R&D, 91385 Chilly-Mazarin, France
| | - Gabriel Stoltz
- CERMICS, Ecole des Ponts, Marne-la-Vallée, France
- Matherials Project-Team, Inria Paris, 75012 Paris, France
| | | | - Zineb Belkacemi
- Integrated Drug Discovery, Sanofi R&D, 91385 Chilly-Mazarin, France
- CERMICS, Ecole des Ponts, Marne-la-Vallée, France
| | - Michele Ceriotti
- Laboratory of Computational Science and Modelling, Institute of Materials, École Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
| | - John D Chodera
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, New York 10065, United States
| | - Aaron R Dinner
- Department of Chemistry, The University of Chicago, Chicago, Illinois 60637, United States
| | - Andrew L Ferguson
- Pritzker School of Molecular Engineering, University of Chicago, 5640 South Ellis Avenue, Chicago, Illinois 60637, United States
| | | | - Hervé Minoux
- Integrated Drug Discovery, Sanofi R&D, 94403 Vitry-sur-Seine, France
| | | | - Fabio Pietrucci
- UMR CNRS 7590, MNHN, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, Sorbonne Université, 75005 Paris, France
| | - Ana Silveira
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, New York 10065, United States
| | - Alexandre Tkatchenko
- Department of Physics and Materials Science, University of Luxembourg, L-1511 Luxembourg City, Luxembourg
| | - Zofia Trstanova
- School of Mathematics, The University of Edinburgh, Edinburgh EH9 3FD, U.K
| | - Rafal Wiewiora
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, New York 10065, United States
| | - Tony Lelièvre
- CERMICS, Ecole des Ponts, Marne-la-Vallée, France
- Matherials Project-Team, Inria Paris, 75012 Paris, France
| |
Collapse
|
81
|
Li W, Burkhart C, Polińska P, Harmandaris V, Doxastakis M. Backmapping coarse-grained macromolecules: An efficient and versatile machine learning approach. J Chem Phys 2020; 153:041101. [DOI: 10.1063/5.0012320] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Affiliation(s)
- Wei Li
- Department of Chemical and Biomolecular Engineering, University of Tennessee, Knoxville, Tennessee 37996, USA
| | - Craig Burkhart
- The Goodyear Tire and Rubber Company, Akron, Ohio 44305, USA
| | - Patrycja Polińska
- Goodyear Innovation Center Luxembourg, Avenue Gordon Smith, L-7750 Colmar-Berg, Luxembourg
| | - Vagelis Harmandaris
- Department of Applied Mathematics, University of Crete, and IACM FORTH, GR-71110 Heraklion, Greece
- Computation-Based Science and Technology Research Center, The Cyprus Institute, Nicosia 2121, Cyprus
| | - Manolis Doxastakis
- Department of Chemical and Biomolecular Engineering, University of Tennessee, Knoxville, Tennessee 37996, USA
| |
Collapse
|
82
|
Wang J, Chmiela S, Müller KR, Noé F, Clementi C. Ensemble learning of coarse-grained molecular dynamics force fields with a kernel approach. J Chem Phys 2020; 152:194106. [DOI: 10.1063/5.0007276] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Affiliation(s)
- Jiang Wang
- Center for Theoretical Biological Physics, Rice University, Houston, Texas 77005, USA
- Department of Chemistry, Rice University, Houston, Texas 77005, USA
| | - Stefan Chmiela
- Machine Learning Group, Technische Universität Berlin, 10587 Berlin, Germany
| | - Klaus-Robert Müller
- Machine Learning Group, Technische Universität Berlin, 10587 Berlin, Germany
- Department of Brain and Cognitive Engineering, Korea University, Seoul 02841, South Korea
- Max Planck Institute for Informatics, Saarbrücken 66123, Germany
| | - Frank Noé
- Center for Theoretical Biological Physics, Rice University, Houston, Texas 77005, USA
- Department of Chemistry, Rice University, Houston, Texas 77005, USA
- Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 6, 14195 Berlin, Germany
- Department of Physics, Freie Universität Berlin, Arnimallee 14, 14195 Berlin, Germany
| | - Cecilia Clementi
- Center for Theoretical Biological Physics, Rice University, Houston, Texas 77005, USA
- Department of Chemistry, Rice University, Houston, Texas 77005, USA
- Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 6, 14195 Berlin, Germany
- Department of Physics, Freie Universität Berlin, Arnimallee 14, 14195 Berlin, Germany
- Department of Physics, Rice University, Houston, Texas 77005, USA
| |
Collapse
|
83
|
Scherer C, Scheid R, Andrienko D, Bereau T. Kernel-Based Machine Learning for Efficient Simulations of Molecular Liquids. J Chem Theory Comput 2020; 16:3194-3204. [PMID: 32282206 PMCID: PMC7304872 DOI: 10.1021/acs.jctc.9b01256] [Citation(s) in RCA: 34] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2019] [Indexed: 11/29/2022]
Abstract
Current machine learning (ML) models aimed at learning force fields are plagued by their high computational cost at every integration time step. We describe a number of practical and computationally efficient strategies to parametrize traditional force fields for molecular liquids from ML: the particle decomposition ansatz to two- and three-body force fields, the use of kernel-based ML models that incorporate physical symmetries, the incorporation of switching functions close to the cutoff, and the use of covariant meshing to boost the training set size. Results are presented for model molecular liquids: pairwise Lennard-Jones, three-body Stillinger-Weber, and bottom-up coarse-graining of water. Here, covariant meshing proves to be an efficient strategy to learn canonically averaged instantaneous forces. We show that molecular dynamics simulations with tabulated two- and three-body ML potentials are computationally efficient and recover two- and three-body distribution functions. Many-body representations, decomposition, and kernel regression schemes are all implemented in the open-source software package VOTCA.
Collapse
Affiliation(s)
- Christoph Scherer
- Max Planck Institute for
Polymer Research, Ackermannweg 10, 55128 Mainz, Germany
| | - René Scheid
- Max Planck Institute for
Polymer Research, Ackermannweg 10, 55128 Mainz, Germany
| | - Denis Andrienko
- Max Planck Institute for
Polymer Research, Ackermannweg 10, 55128 Mainz, Germany
| | - Tristan Bereau
- Max Planck Institute for
Polymer Research, Ackermannweg 10, 55128 Mainz, Germany
| |
Collapse
|
84
|
Sherman ZM, Howard MP, Lindquist BA, Jadrich RB, Truskett TM. Inverse methods for design of soft materials. J Chem Phys 2020; 152:140902. [DOI: 10.1063/1.5145177] [Citation(s) in RCA: 42] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
Affiliation(s)
- Zachary M. Sherman
- McKetta Department of Chemical Engineering, University of Texas at Austin, Austin, Texas 78712, USA
| | - Michael P. Howard
- McKetta Department of Chemical Engineering, University of Texas at Austin, Austin, Texas 78712, USA
| | - Beth A. Lindquist
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA
| | - Ryan B. Jadrich
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA
- Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA
| | - Thomas M. Truskett
- McKetta Department of Chemical Engineering, University of Texas at Austin, Austin, Texas 78712, USA
- Department of Physics, University of Texas at Austin, Austin, Texas 78712, USA
| |
Collapse
|
85
|
Abstract
Machine learning (ML) is transforming all areas of science. The complex and time-consuming calculations in molecular simulations are particularly suitable for an ML revolution and have already been profoundly affected by the application of existing ML methods. Here we review recent ML methods for molecular simulation, with particular focus on (deep) neural networks for the prediction of quantum-mechanical energies and forces, on coarse-grained molecular dynamics, on the extraction of free energy surfaces and kinetics, and on generative network approaches to sample molecular equilibrium structures and compute thermodynamics. To explain these methods and illustrate open methodological problems, we review some important principles of molecular physics and describe how they can be incorporated into ML structures. Finally, we identify and describe a list of open challenges for the interface between ML and molecular simulation.
Collapse
Affiliation(s)
- Frank Noé
- Department of Mathematics and Computer Science, Freie Universität Berlin, 14195 Berlin, Germany; .,Department of Physics, Freie Universität Berlin, 14195 Berlin, Germany.,Department of Chemistry and Center for Theoretical Biological Physics, Rice University, Houston, Texas 77005, USA;
| | - Alexandre Tkatchenko
- Physics and Materials Science Research Unit, University of Luxembourg, 1511 Luxembourg, Luxembourg;
| | - Klaus-Robert Müller
- Department of Computer Science, Technical University Berlin, 10587 Berlin, Germany; .,Max-Planck-Institut für Informatik, 66123 Saarbrücken, Germany.,Department of Brain and Cognitive Engineering, Korea University, Seoul 136-713, South Korea
| | - Cecilia Clementi
- Department of Mathematics and Computer Science, Freie Universität Berlin, 14195 Berlin, Germany; .,Department of Chemistry and Center for Theoretical Biological Physics, Rice University, Houston, Texas 77005, USA; .,Department of Physics, Rice University, Houston, Texas 77005, USA
| |
Collapse
|
86
|
Zhai C, Li T, Shi H, Yeo J. Discovery and design of soft polymeric bio-inspired materials with multiscale simulations and artificial intelligence. J Mater Chem B 2020; 8:6562-6587. [DOI: 10.1039/d0tb00896f] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Establishing the “Materials 4.0” paradigm requires intimate knowledge of the virtual space in materials design.
Collapse
Affiliation(s)
- Chenxi Zhai
- J2 Lab for Engineering Living Materials
- Sibley School of Mechanical and Aerospace Engineering
- Cornell University
- Ithaca
- USA
| | - Tianjiao Li
- J2 Lab for Engineering Living Materials
- Sibley School of Mechanical and Aerospace Engineering
- Cornell University
- Ithaca
- USA
| | - Haoyuan Shi
- J2 Lab for Engineering Living Materials
- Sibley School of Mechanical and Aerospace Engineering
- Cornell University
- Ithaca
- USA
| | - Jingjie Yeo
- J2 Lab for Engineering Living Materials
- Sibley School of Mechanical and Aerospace Engineering
- Cornell University
- Ithaca
- USA
| |
Collapse
|
87
|
Noé F. Machine Learning for Molecular Dynamics on Long Timescales. MACHINE LEARNING MEETS QUANTUM PHYSICS 2020. [DOI: 10.1007/978-3-030-40245-7_16] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
|
88
|
Jackson NE, Bowen AS, de Pablo JJ. Efficient Multiscale Optoelectronic Prediction for Conjugated Polymers. Macromolecules 2019. [DOI: 10.1021/acs.macromol.9b02020] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Affiliation(s)
- Nicholas E. Jackson
- Center for Molecular Engineering and Materials Science Division, Argonne National Laboratory, Lemont, Illinois 60439, United States
- Pritzker School of Molecular Engineering, University of Chicago, Chicago, Illinois 60637, United States
| | - Alec S. Bowen
- Pritzker School of Molecular Engineering, University of Chicago, Chicago, Illinois 60637, United States
| | - Juan J. de Pablo
- Center for Molecular Engineering and Materials Science Division, Argonne National Laboratory, Lemont, Illinois 60439, United States
- Pritzker School of Molecular Engineering, University of Chicago, Chicago, Illinois 60637, United States
| |
Collapse
|
89
|
Ko HY, Zhang L, Santra B, Wang H, E W, DiStasio Jr RA, Car R. Isotope effects in liquid water via deep potential molecular dynamics. Mol Phys 2019. [DOI: 10.1080/00268976.2019.1652366] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Affiliation(s)
- Hsin-Yu Ko
- Department of Chemistry, Princeton University, Princeton, NJ, USA
| | - Linfeng Zhang
- Program in Applied and Computational Mathematics, Princeton University, Princeton, NJ, USA
| | - Biswajit Santra
- Department of Physics, Temple University, Philadelphia, PA, USA
| | - Han Wang
- Laboratory of Computational Physics, Institute of Applied Physics and Computational Mathematics, Beijing, People's Republic of China
| | - Weinan E
- Program in Applied and Computational Mathematics, Princeton University, Princeton, NJ, USA
- Department of Mathematics, Princeton University, Princeton, NJ, USA
| | | | - Roberto Car
- Department of Chemistry, Princeton University, Princeton, NJ, USA
- Program in Applied and Computational Mathematics, Princeton University, Princeton, NJ, USA
- Department of Physics and Princeton Institute for the Science and Technology of Materials, Princeton University, Princeton, NJ, USA
| |
Collapse
|
90
|
Durumeric AEP, Voth GA. Adversarial-residual-coarse-graining: Applying machine learning theory to systematic molecular coarse-graining. J Chem Phys 2019; 151:124110. [PMID: 31575201 DOI: 10.1063/1.5097559] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open
Abstract
We utilize connections between molecular coarse-graining (CG) approaches and implicit generative models in machine learning to describe a new framework for systematic molecular CG. Focus is placed on the formalism encompassing generative adversarial networks. The resulting method enables a variety of model parameterization strategies, some of which show similarity to previous CG methods. We demonstrate that the resulting framework can rigorously parameterize CG models containing CG sites with no prescribed connection to the reference atomistic system (termed virtual sites); however, this advantage is offset by the lack of a closed-form expression for the CG Hamiltonian at the resolution obtained after integration over the virtual CG sites. Computational examples are provided for cases in which these methods ideally return identical parameters as relative entropy minimization CG but where traditional relative entropy minimization CG optimization equations are not applicable.
Collapse
Affiliation(s)
- Aleksander E P Durumeric
- Department of Chemistry, James Franck Institute, Institute for Biophysical Dynamics, and Computation Institute, The University of Chicago, Chicago, Illinois 60637, USA
| | - Gregory A Voth
- Department of Chemistry, James Franck Institute, Institute for Biophysical Dynamics, and Computation Institute, The University of Chicago, Chicago, Illinois 60637, USA
| |
Collapse
|
91
|
Vassaux M, Sinclair RC, Richardson RA, Suter JL, Coveney PV. Toward High Fidelity Materials Property Prediction from Multiscale Modeling and Simulation. ADVANCED THEORY AND SIMULATIONS 2019. [DOI: 10.1002/adts.201900122] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Affiliation(s)
- Maxime Vassaux
- Centre for Computational SciencesUniversity College London20 Gordon Street London WC1H 0AJ UK
| | - Robert C. Sinclair
- Centre for Computational SciencesUniversity College London20 Gordon Street London WC1H 0AJ UK
| | - Robin A. Richardson
- Centre for Computational SciencesUniversity College London20 Gordon Street London WC1H 0AJ UK
| | - James L. Suter
- Centre for Computational SciencesUniversity College London20 Gordon Street London WC1H 0AJ UK
| | - Peter V. Coveney
- Centre for Computational SciencesUniversity College London20 Gordon Street London WC1H 0AJ UK
- Computational Science LaboratoryInstitute for InformaticsFaculty of ScienceUniversity of Amsterdam Amsterdam 1098XH The Netherlands
| |
Collapse
|
92
|
Pervaje AK, Walker CC, Santiso EE. Molecular simulation of polymers with a SAFT-γ Mie approach. MOLECULAR SIMULATION 2019. [DOI: 10.1080/08927022.2019.1645331] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Affiliation(s)
- Amulya K. Pervaje
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Raleigh, NC, USA
| | - Christopher C. Walker
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Raleigh, NC, USA
| | - Erik E. Santiso
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Raleigh, NC, USA
| |
Collapse
|
93
|
Vergadou N, Theodorou DN. Molecular Modeling Investigations of Sorption and Diffusion of Small Molecules in Glassy Polymers. MEMBRANES 2019; 9:E98. [PMID: 31398889 PMCID: PMC6723301 DOI: 10.3390/membranes9080098] [Citation(s) in RCA: 38] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/12/2019] [Revised: 07/22/2019] [Accepted: 07/23/2019] [Indexed: 11/16/2022]
Abstract
With a wide range of applications, from energy and environmental engineering, such as in gas separations and water purification, to biomedical engineering and packaging, glassy polymeric materials remain in the core of novel membrane and state-of the art barrier technologies. This review focuses on molecular simulation methodologies implemented for the study of sorption and diffusion of small molecules in dense glassy polymeric systems. Basic concepts are introduced and systematic methods for the generation of realistic polymer configurations are briefly presented. Challenges related to the long length and time scale phenomena that govern the permeation process in the glassy polymer matrix are described and molecular simulation approaches developed to address the multiscale problem at hand are discussed.
Collapse
Affiliation(s)
- Niki Vergadou
- Molecular Thermodynamics and Modelling of Materials Laboratory, Institute of Nanoscience and Nanotechnology, National Center for Scientific Research Demokritos, Aghia Paraskevi Attikis, GR-15310 Athens, Greece.
| | - Doros N Theodorou
- School of Chemical Engineering, National Technical University of Athens, GR 15780 Athens, Greece
| |
Collapse
|
94
|
Wang J, Olsson S, Wehmeyer C, Pérez A, Charron NE, de Fabritiis G, Noé F, Clementi C. Machine Learning of Coarse-Grained Molecular Dynamics Force Fields. ACS CENTRAL SCIENCE 2019; 5:755-767. [PMID: 31139712 PMCID: PMC6535777 DOI: 10.1021/acscentsci.8b00913] [Citation(s) in RCA: 199] [Impact Index Per Article: 39.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/09/2018] [Indexed: 05/17/2023]
Abstract
Atomistic or ab initio molecular dynamics simulations are widely used to predict thermodynamics and kinetics and relate them to molecular structure. A common approach to go beyond the time- and length-scales accessible with such computationally expensive simulations is the definition of coarse-grained molecular models. Existing coarse-graining approaches define an effective interaction potential to match defined properties of high-resolution models or experimental data. In this paper, we reformulate coarse-graining as a supervised machine learning problem. We use statistical learning theory to decompose the coarse-graining error and cross-validation to select and compare the performance of different models. We introduce CGnets, a deep learning approach, that learns coarse-grained free energy functions and can be trained by a force-matching scheme. CGnets maintain all physically relevant invariances and allow one to incorporate prior physics knowledge to avoid sampling of unphysical structures. We show that CGnets can capture all-atom explicit-solvent free energy surfaces with models using only a few coarse-grained beads and no solvent, while classical coarse-graining methods fail to capture crucial features of the free energy surface. Thus, CGnets are able to capture multibody terms that emerge from the dimensionality reduction.
Collapse
Affiliation(s)
- Jiang Wang
- Center
for Theoretical Biological Physics, Rice
University, Houston, Texas 77005, United States
- Department
of Chemistry, Rice University, Houston, Texas 77005, United States
| | - Simon Olsson
- Department
of Mathematics and Computer Science, Freie
Universität Berlin, Arnimallee 6, 14195 Berlin, Germany
| | - Christoph Wehmeyer
- Department
of Mathematics and Computer Science, Freie
Universität Berlin, Arnimallee 6, 14195 Berlin, Germany
| | - Adrià Pérez
- Computational
Science Laboratory, Universitat Pompeu Fabra, PRBB, C/Dr Aiguader 88, 08003 Barcelona, Spain
| | - Nicholas E. Charron
- Center
for Theoretical Biological Physics, Rice
University, Houston, Texas 77005, United States
- Department
of Physics, Rice University, Houston, Texas 77005, United States
| | - Gianni de Fabritiis
- Computational
Science Laboratory, Universitat Pompeu Fabra, PRBB, C/Dr Aiguader 88, 08003 Barcelona, Spain
- Institucio
Catalana de Recerca i Estudis Avanats (ICREA), Passeig Lluis Companys 23, 08010 Barcelona, Spain
| | - Frank Noé
- Center
for Theoretical Biological Physics, Rice
University, Houston, Texas 77005, United States
- Department
of Chemistry, Rice University, Houston, Texas 77005, United States
- Department
of Mathematics and Computer Science, Freie
Universität Berlin, Arnimallee 6, 14195 Berlin, Germany
| | - Cecilia Clementi
- Center
for Theoretical Biological Physics, Rice
University, Houston, Texas 77005, United States
- Department
of Chemistry, Rice University, Houston, Texas 77005, United States
- Department
of Mathematics and Computer Science, Freie
Universität Berlin, Arnimallee 6, 14195 Berlin, Germany
- Department
of Physics, Rice University, Houston, Texas 77005, United States
| |
Collapse
|
95
|
Marrink SJ, Corradi V, Souza PC, Ingólfsson HI, Tieleman DP, Sansom MS. Computational Modeling of Realistic Cell Membranes. Chem Rev 2019; 119:6184-6226. [PMID: 30623647 PMCID: PMC6509646 DOI: 10.1021/acs.chemrev.8b00460] [Citation(s) in RCA: 435] [Impact Index Per Article: 87.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2018] [Indexed: 12/15/2022]
Abstract
Cell membranes contain a large variety of lipid types and are crowded with proteins, endowing them with the plasticity needed to fulfill their key roles in cell functioning. The compositional complexity of cellular membranes gives rise to a heterogeneous lateral organization, which is still poorly understood. Computational models, in particular molecular dynamics simulations and related techniques, have provided important insight into the organizational principles of cell membranes over the past decades. Now, we are witnessing a transition from simulations of simpler membrane models to multicomponent systems, culminating in realistic models of an increasing variety of cell types and organelles. Here, we review the state of the art in the field of realistic membrane simulations and discuss the current limitations and challenges ahead.
Collapse
Affiliation(s)
- Siewert J. Marrink
- Groningen
Biomolecular Sciences and Biotechnology Institute & Zernike Institute
for Advanced Materials, University of Groningen, Nijenborgh 7, 9747 AG Groningen, The Netherlands
| | - Valentina Corradi
- Centre
for Molecular Simulation and Department of Biological Sciences, University of Calgary, 2500 University Drive NW, Calgary, Alberta T2N 1N4, Canada
| | - Paulo C.T. Souza
- Groningen
Biomolecular Sciences and Biotechnology Institute & Zernike Institute
for Advanced Materials, University of Groningen, Nijenborgh 7, 9747 AG Groningen, The Netherlands
| | - Helgi I. Ingólfsson
- Biosciences
and Biotechnology Division, Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, 7000 East Avenue, Livermore, California 94550, United States
| | - D. Peter Tieleman
- Centre
for Molecular Simulation and Department of Biological Sciences, University of Calgary, 2500 University Drive NW, Calgary, Alberta T2N 1N4, Canada
| | - Mark S.P. Sansom
- Department
of Biochemistry, University of Oxford, South Parks Road, Oxford OX1 3QU, U.K.
| |
Collapse
|
96
|
Giulini M, Potestio R. A deep learning approach to the structural analysis of proteins. Interface Focus 2019; 9:20190003. [PMID: 31065348 PMCID: PMC6501347 DOI: 10.1098/rsfs.2019.0003] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/13/2019] [Indexed: 02/07/2023] Open
Abstract
Deep learning (DL) algorithms hold great promise for applications in the field of computational biophysics. In fact, the vast amount of available molecular structures, as well as their notable complexity, constitutes an ideal context in which DL-based approaches can be profitably employed. To express the full potential of these techniques, though, it is a prerequisite to express the information contained in a molecule’s atomic positions and distances in a set of input quantities that the network can process. Many of the molecular descriptors devised so far are effective and manageable for relatively small structures, but become complex and cumbersome for larger ones. Furthermore, most of them are defined locally, a feature that could represent a limit for those applications where global properties are of interest. Here, we build a DL architecture capable of predicting non-trivial and intrinsically global quantities, that is, the eigenvalues of a protein’s lowest-energy fluctuation modes. This application represents a first, relatively simple test bed for the development of a neural network approach to the quantitative analysis of protein structures, and demonstrates unexpected use in the identification of mechanically relevant regions of the molecule.
Collapse
Affiliation(s)
- Marco Giulini
- Physics Department, University of Trento, via Sommarive 14, 38123, Trento, Italy.,INFN-TIFPA, Trento Institute for Fundamental Physics and Applications, 38123 Trento, Italy
| | - Raffaello Potestio
- Physics Department, University of Trento, via Sommarive 14, 38123, Trento, Italy.,INFN-TIFPA, Trento Institute for Fundamental Physics and Applications, 38123 Trento, Italy
| |
Collapse
|
97
|
Moradzadeh A, Aluru NR. Transfer-Learning-Based Coarse-Graining Method for Simple Fluids: Toward Deep Inverse Liquid-State Theory. J Phys Chem Lett 2019; 10:1242-1250. [PMID: 30818949 DOI: 10.1021/acs.jpclett.8b03872] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/17/2023]
Abstract
Machine learning is an attractive paradigm to circumvent difficulties associated with the development and optimization of force-field parameters. In this study, a deep neural network (DNN) is used to study the inverse problem of the liquid-state theory, in particular, to obtain the relation between the radial distribution function (RDF) and the Lennard-Jones (LJ) potential parameters at various thermodynamic states. Using molecular dynamics (MD), many observables, including RDF, are determined once the interatomic potential is specified. However, the inverse problem (parametrization of the potential for a specific RDF) is not straightforward. Here we present a framework integrating DNN with big data from 1.5 TB of MD trajectories with a cumulative simulation time of 52 μs for 26 000 distinct systems to predict LJ potential parameters. Our results show that DNN is successful not only in the parametrization of the atomic LJ liquids but also in parametrizing the LJ potential for coarse-grained models of simple multiatom molecules.
Collapse
Affiliation(s)
- Alireza Moradzadeh
- Department of Mechanical Science and Engineering, Beckman Institute for Advanced Science and Technology , University of Illinois at Urbana-Champaign , Urbana , Illinois 61801 , United States
| | - Narayana R Aluru
- Department of Mechanical Science and Engineering, Beckman Institute for Advanced Science and Technology , University of Illinois at Urbana-Champaign , Urbana , Illinois 61801 , United States
| |
Collapse
|
98
|
Jackson NE, Webb MA, de Pablo JJ. Recent advances in machine learning towards multiscale soft materials design. Curr Opin Chem Eng 2019. [DOI: 10.1016/j.coche.2019.03.005] [Citation(s) in RCA: 70] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
|
99
|
Chen WK, Liu XY, Fang WH, Dral PO, Cui G. Deep Learning for Nonadiabatic Excited-State Dynamics. J Phys Chem Lett 2018; 9:6702-6708. [PMID: 30403870 DOI: 10.1021/acs.jpclett.8b03026] [Citation(s) in RCA: 100] [Impact Index Per Article: 16.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/06/2023]
Abstract
In this work we show that deep learning (DL) can be used for exploring complex and highly nonlinear multistate potential energy surfaces of polyatomic molecules and related nonadiabatic dynamics. Our DL is based on deep neural networks (DNNs), which are used as accurate representations of the CASSCF ground- and excited-state potential energy surfaces (PESs) of CH2NH. After geometries near conical intersection are included in the training set, the DNN models accurately reproduce excited-state topological structures; photoisomerization paths; and, importantly, conical intersections. We have also demonstrated that the results from nonadiabatic dynamics run with the DNN models are very close to those from the dynamics run with the pure ab initio method. The present work should encourage further studies of using machine learning methods to explore excited-state potential energy surfaces and nonadiabatic dynamics of polyatomic molecules.
Collapse
Affiliation(s)
- Wen-Kai Chen
- Key Laboratory of Theoretical and Computational Photochemistry, Ministry of Education, College of Chemistry , Beijing Normal University , Beijing 100875 , China
| | - Xiang-Yang Liu
- Key Laboratory of Theoretical and Computational Photochemistry, Ministry of Education, College of Chemistry , Beijing Normal University , Beijing 100875 , China
| | - Wei-Hai Fang
- Key Laboratory of Theoretical and Computational Photochemistry, Ministry of Education, College of Chemistry , Beijing Normal University , Beijing 100875 , China
| | - Pavlo O Dral
- Max-Planck-Institut für Kohlenforschung , Kaiser-Wilhelm-Platz 1 , 45470 Mülheim an der Ruhr , Germany
| | - Ganglong Cui
- Key Laboratory of Theoretical and Computational Photochemistry, Ministry of Education, College of Chemistry , Beijing Normal University , Beijing 100875 , China
| |
Collapse
|
100
|
Guenza MG, Dinpajooh M, McCarty J, Lyubimov IY. Accuracy, Transferability, and Efficiency of Coarse-Grained Models of Molecular Liquids. J Phys Chem B 2018; 122:10257-10278. [DOI: 10.1021/acs.jpcb.8b06687] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- M. G. Guenza
- Department of Chemistry and Biochemistry and Institute of Theoretical Science, University of Oregon, Eugene, Oregon 97403, United States
| | - M. Dinpajooh
- Department of Chemistry and Biochemistry and Institute of Theoretical Science, University of Oregon, Eugene, Oregon 97403, United States
| | - J. McCarty
- Department of Chemistry and Biochemistry and Institute of Theoretical Science, University of Oregon, Eugene, Oregon 97403, United States
| | - I. Y. Lyubimov
- Department of Chemistry and Biochemistry and Institute of Theoretical Science, University of Oregon, Eugene, Oregon 97403, United States
| |
Collapse
|