1
|
Wang J, Wang Y, Zhang H, Yang Z, Liang Z, Shi J, Wang HT, Xing D, Sun J. E(n)-Equivariant cartesian tensor message passing interatomic potential. Nat Commun 2024; 15:7607. [PMID: 39218987 PMCID: PMC11366765 DOI: 10.1038/s41467-024-51886-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2024] [Accepted: 08/16/2024] [Indexed: 09/04/2024] Open
Abstract
Machine learning potential (MLP) has been a popular topic in recent years for its capability to replace expensive first-principles calculations in some large systems. Meanwhile, message passing networks have gained significant attention due to their remarkable accuracy, and a wave of message passing networks based on Cartesian coordinates has emerged. However, the information of the node in these models is usually limited to scalars, and vectors. In this work, we propose High-order Tensor message Passing interatomic Potential (HotPP), an E(n) equivariant message passing neural network that extends the node embedding and message to an arbitrary order tensor. By performing some basic equivariant operations, high order tensors can be coupled very simply and thus the model can make direct predictions of high-order tensors such as dipole moments and polarizabilities without any modifications. The tests in several datasets show that HotPP not only achieves high accuracy in predicting target properties, but also successfully performs tasks such as calculating phonon spectra, infrared spectra, and Raman spectra, demonstrating its potential as a tool for future research.
Collapse
Affiliation(s)
- Junjie Wang
- National Laboratory of Solid State Microstructures, School of Physics and Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing, 210093, China
| | - Yong Wang
- National Laboratory of Solid State Microstructures, School of Physics and Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing, 210093, China
- Department of Chemistry, Princeton University, Princeton, NJ, 08544, USA
| | - Haoting Zhang
- National Laboratory of Solid State Microstructures, School of Physics and Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing, 210093, China
| | - Ziyang Yang
- National Laboratory of Solid State Microstructures, School of Physics and Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing, 210093, China
| | - Zhixin Liang
- National Laboratory of Solid State Microstructures, School of Physics and Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing, 210093, China
| | - Jiuyang Shi
- National Laboratory of Solid State Microstructures, School of Physics and Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing, 210093, China
| | - Hui-Tian Wang
- National Laboratory of Solid State Microstructures, School of Physics and Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing, 210093, China
| | - Dingyu Xing
- National Laboratory of Solid State Microstructures, School of Physics and Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing, 210093, China
| | - Jian Sun
- National Laboratory of Solid State Microstructures, School of Physics and Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing, 210093, China.
| |
Collapse
|
2
|
Schmiedmayer B, Kresse G. Derivative learning of tensorial quantities-Predicting finite temperature infrared spectra from first principles. J Chem Phys 2024; 161:084703. [PMID: 39171710 DOI: 10.1063/5.0217243] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2024] [Accepted: 08/05/2024] [Indexed: 08/23/2024] Open
Abstract
We develop a strategy that integrates machine learning and first-principles calculations to achieve technically accurate predictions of infrared spectra. In particular, the methodology allows one to predict infrared spectra for complex systems at finite temperatures. The method's effectiveness is demonstrated in challenging scenarios, such as the analysis of water and the organic-inorganic halide perovskite MAPbI3, where our results consistently align with experimental data. A distinctive feature of the methodology is the incorporation of derivative learning, which proves indispensable for obtaining accurate polarization data in bulk materials and facilitates the training of a machine learning surrogate model of the polarization adapted to rotational and translational symmetries. We achieve polarization prediction accuracies of about 1% for the water dimer by training only on the predicted Born effective charges.
Collapse
Affiliation(s)
- Bernhard Schmiedmayer
- Faculty of Physics and Center for Computational Materials Science, University of Vienna, Kolingasse 14-16, A-1090 Vienna, Austria
| | - Georg Kresse
- Faculty of Physics and Center for Computational Materials Science, University of Vienna, Kolingasse 14-16, A-1090 Vienna, Austria
- VASP Software GmbH, Sensengasse 8, A-1090 Vienna, Austria
| |
Collapse
|
3
|
Paul A, Rubenstein M, Ruffino A, Masiuk S, Spanier JE, Grinberg I. Accuracy and limitations of the bond polarizability model in modeling of Raman scattering from molecular dynamics simulations. J Chem Phys 2024; 161:064305. [PMID: 39132793 DOI: 10.1063/5.0217227] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2024] [Accepted: 07/22/2024] [Indexed: 08/13/2024] Open
Abstract
Calculation of Raman scattering from molecular dynamics (MD) simulations requires accurate modeling of the evolution of the electronic polarizability of the system along its MD trajectory. For large systems, this necessitates the use of atomistic models to represent the dependence of electronic polarizability on atomic coordinates. The bond polarizability model (BPM) is the simplest such model and has been used for modeling the Raman spectra of molecular systems but has not been applied to solid-state systems. Here, we systematically investigate the accuracy and limitations of the BPM parameterized from the density functional theory results for a series of simple molecules, such as CO2, SO2, H2S, H2O, NH3, and CH4; the more complex CH2O, CH3OH, CH3CH2OH, and thiophene molecules; and the BaTiO3 and CsPbBr3 perovskite solids. We find that BPM can reliably reproduce the overall features of the Raman spectra, such as shifts of peak positions. However, with the exception of highly symmetric systems, the assumption of non-interacting bonds limits the quantitative accuracy of the BPM; this assumption also leads to qualitatively inaccurate polarizability evolution and Raman spectra for systems where large deviations from the ground state structure are present.
Collapse
Affiliation(s)
- Atanu Paul
- Department of Chemistry, Bar-Ilan University, Ramat Gan 5290002, Israel
| | - Maya Rubenstein
- Department of Chemistry, Bar-Ilan University, Ramat Gan 5290002, Israel
| | - Anthony Ruffino
- Department of Physics, Drexel University, Philadelphia, Pennsylvania 19104, USA
| | - Stefan Masiuk
- Department of Mechanical Engineering and Mechanics, Drexel University, Philadelphia, Pennsylvania 19104, USA
| | - Jonathan E Spanier
- Department of Physics, Drexel University, Philadelphia, Pennsylvania 19104, USA
- Department of Mechanical Engineering and Mechanics, Drexel University, Philadelphia, Pennsylvania 19104, USA
| | - Ilya Grinberg
- Department of Chemistry, Bar-Ilan University, Ramat Gan 5290002, Israel
| |
Collapse
|
4
|
Malosso C, Manko N, Izzo MG, Baroni S, Hassanali A. Evidence of ferroelectric features in low-density supercooled water from ab initio deep neural-network simulations. Proc Natl Acad Sci U S A 2024; 121:e2407295121. [PMID: 39083416 PMCID: PMC11317578 DOI: 10.1073/pnas.2407295121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2024] [Accepted: 06/30/2024] [Indexed: 08/02/2024] Open
Abstract
Over the last decade, an increasing body of evidence has emerged, supporting the existence of a metastable liquid-liquid critical point in supercooled water whereby two distinct liquid phases of different densities coexist. Analyzing long molecular dynamics simulations performed using deep neural-network force fields trained to accurate quantum mechanical data, we demonstrate that the low-density liquid phase displays a strong propensity toward spontaneous polarization, as witnessed by large and long-lived collective dipole fluctuations. Our findings suggest that the dynamical stability of the low-density phase, and hence the transition from high-density to low-density liquid, is triggered by a collective process involving an accumulation of rotational angular jumps, which could ignite large dipole fluctuations. This dynamical transition involves subtle changes in the electronic polarizability of water molecules which affects their rotational mobility within the two phases. These findings hold the potential for catalyzing activity in the search for dielectric-based probes of the putative second critical point.
Collapse
Affiliation(s)
- Cesare Malosso
- Scuola Internazionale Superiore di Studi Avanzati, Trieste34136, Italy
| | - Natalia Manko
- Condensed Matter and Statistical Physics (CMSP), The Abdus Salam Centre for Theoretical Physics, Trieste34151, Italy
| | - Maria Grazia Izzo
- Scuola Internazionale Superiore di Studi Avanzati, Trieste34136, Italy
| | - Stefano Baroni
- Scuola Internazionale Superiore di Studi Avanzati, Trieste34136, Italy
- Consiglio Nazionale delle Ricerche-Istituto Officina dei Materiali, Scuola Internazionale Superiore di Studi Avanzati Unit, Trieste34136, Italy
| | - Ali Hassanali
- Condensed Matter and Statistical Physics (CMSP), The Abdus Salam Centre for Theoretical Physics, Trieste34151, Italy
| |
Collapse
|
5
|
Chen G, Jaffrelot Inizan T, Plé T, Lagardère L, Piquemal JP, Maday Y. Advancing Force Fields Parameterization: A Directed Graph Attention Networks Approach. J Chem Theory Comput 2024; 20:5558-5569. [PMID: 38875012 DOI: 10.1021/acs.jctc.3c01421] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/16/2024]
Abstract
Force fields (FFs) are an established tool for simulating large and complex molecular systems. However, parametrizing FFs is a challenging and time-consuming task that relies on empirical heuristics, experimental data, and computational data. Recent efforts aim to automate the assignment of FF parameters using pre-existing databases and on-the-fly ab initio data. In this study, we propose a graph-based force field (GB-FFs) model to directly derive parameters for the Generalized Amber Force Field (GAFF) from chemical environments and research into the influence of functional forms. Our end-to-end parametrization approach predicts parameters by aggregating the basic information in directed molecular graphs, eliminating the need for expert-defined procedures and enhances the accuracy and transferability of GAFF across a broader range of molecular complexes. Simulation results are compared to the original GAFF parametrization. In practice, our results demonstrate an improved transferability of the model, showcasing its improved accuracy in modeling intermolecular and torsional interactions, as well as improved solvation free energies. The optimization approach developed in this work is fully applicable to other nonpolarizable FFs as well as to polarizable ones.
Collapse
Affiliation(s)
- Gong Chen
- Sorbonne Université, CNRS, Université Paris Cité, Laboratoire Jacques-Louis Lions (LJLL), UMR 7598 CNRS, 75005 Paris, France
| | - Théo Jaffrelot Inizan
- Sorbonne Université, Laboratoire de Chimie Théorique (LCT), UMR 7616 CNRS, 75005 Paris, France
| | - Thomas Plé
- Sorbonne Université, Laboratoire de Chimie Théorique (LCT), UMR 7616 CNRS, 75005 Paris, France
| | - Louis Lagardère
- Sorbonne Université, Laboratoire de Chimie Théorique (LCT), UMR 7616 CNRS, 75005 Paris, France
| | - Jean-Philip Piquemal
- Sorbonne Université, Laboratoire de Chimie Théorique (LCT), UMR 7616 CNRS, 75005 Paris, France
| | - Yvon Maday
- Sorbonne Université, CNRS, Université Paris Cité, Laboratoire Jacques-Louis Lions (LJLL), UMR 7598 CNRS, 75005 Paris, France
| |
Collapse
|
6
|
Berrens M, Kundu A, Calegari Andrade MF, Pham TA, Galli G, Donadio D. Nuclear Quantum Effects on the Electronic Structure of Water and Ice. J Phys Chem Lett 2024; 15:6818-6825. [PMID: 38916450 PMCID: PMC11229061 DOI: 10.1021/acs.jpclett.4c01315] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2024] [Revised: 06/17/2024] [Accepted: 06/21/2024] [Indexed: 06/26/2024]
Abstract
The electronic properties and optical response of ice and water are intricately shaped by their molecular structure, including the quantum mechanical nature of the hydrogen atoms. Despite numerous previous studies, a comprehensive understanding of the nuclear quantum effects (NQEs) on the electronic structure of water and ice at finite temperatures remains elusive. Here, we utilize molecular simulations that harness efficient machine-learning potentials and many-body perturbation theory to assess how NQEs impact the electronic bands of water and hexagonal ice. By comparing path-integral and classical simulations, we find that NQEs lead to a larger renormalization of the fundamental gap of ice, compared to that of water, ultimately yielding similar bandgaps in the two systems, consistent with experimental estimates. Our calculations suggest that the increased quantum mechanical delocalization of protons in ice, relative to water, is a key factor leading to the enhancement of NQEs on the electronic structure of ice.
Collapse
Affiliation(s)
- Margaret
L. Berrens
- Department
of Chemistry, University of California Davis, One Shields Ave.. Davis, California 95616, United States
| | - Arpan Kundu
- Pritzker
School of Molecular Engineering, University
of Chicago, Chicago, Illinois 60637, United States
| | - Marcos F. Calegari Andrade
- Quantum
Simulations Group, Materials Science Division, Lawrence Livermore National Laboratory, Livermore, California 94550-5507, United States
| | - Tuan Anh Pham
- Quantum
Simulations Group, Materials Science Division, Lawrence Livermore National Laboratory, Livermore, California 94550-5507, United States
| | - Giulia Galli
- Pritzker
School of Molecular Engineering, University
of Chicago, Chicago, Illinois 60637, United States
- Department
of Chemistry, University of Chicago, Chicago, Illinois 60637, United States
- Materials
Science Division and Center for Molecular Engineering, Argonne National Laboratory, Lemont, Illinois 60439, United States
| | - Davide Donadio
- Department
of Chemistry, University of California Davis, One Shields Ave.. Davis, California 95616, United States
| |
Collapse
|
7
|
Berger E, Niemelä J, Lampela O, Juffer AH, Komsa HP. Raman Spectra of Amino Acids and Peptides from Machine Learning Polarizabilities. J Chem Inf Model 2024; 64:4601-4612. [PMID: 38829726 DOI: 10.1021/acs.jcim.4c00077] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/05/2024]
Abstract
Raman spectroscopy is an important tool in the study of vibrational properties and composition of molecules, peptides, and even proteins. Raman spectra can be simulated based on the change of the electronic polarizability with vibrations, which can nowadays be efficiently obtained via machine learning models trained on first-principles data. However, the transferability of the models trained on small molecules to larger structures is unclear, and direct training on large structures is prohibitively expensive. In this work, we first train two machine learning models to predict the polarizabilities of all 20 amino acids. Both models are carefully benchmarked and compared to density functional theory (DFT) calculations, with the neural network method being found to offer better transferability. By combination of machine learning models with classical force field molecular dynamics, Raman spectra of all amino acids are also obtained and investigated, showing good agreement with experiments. The models are further extended to small peptides. We find that adding structures containing peptide bonds to the training set greatly improves predictions, even for peptides not included in training sets.
Collapse
Affiliation(s)
- Ethan Berger
- Microelectronics Research Unit, Faculty of Information Technology and Electrical Engineering, University of Oulu, P.O. Box 4500, Oulu FIN-90014, Finland
| | - Juha Niemelä
- Faculty of Biochemistry and Molecular Medicine, University of Oulu, Oulu FIN-90014, Finland
| | - Outi Lampela
- Biocenter Oulu and Faculty of Biochemistry and Molecular Medicine, University of Oulu, Oulu FIN-90014, Finland
| | - André H Juffer
- Biocenter Oulu and Faculty of Biochemistry and Molecular Medicine, University of Oulu, Oulu FIN-90014, Finland
| | - Hannu-Pekka Komsa
- Microelectronics Research Unit, Faculty of Information Technology and Electrical Engineering, University of Oulu, P.O. Box 4500, Oulu FIN-90014, Finland
| |
Collapse
|
8
|
de la Puente M, Laage D. Impact of interfacial curvature on molecular properties of aqueous interfaces. J Chem Phys 2024; 160:234504. [PMID: 38888129 DOI: 10.1063/5.0210884] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2024] [Accepted: 05/28/2024] [Indexed: 06/20/2024] Open
Abstract
The curvature of soft interfaces plays a crucial role in determining their mechanical and thermodynamic properties, both at macroscopic and microscopic scales. In the case of air/water interfaces, particular attention has recently focused on water microdroplets, due to their distinctive chemical reactivity. However, the specific impact of curvature on the molecular properties of interfacial water and interfacial reactivity has so far remained elusive. Here, we use molecular dynamics simulations to determine the effect of curvature on a broad range of structural, dynamical, and thermodynamical properties of the interface. For a droplet, a flat interface, and a cavity, we successively examine the structure of the hydrogen-bond network and its relation to vibrational spectroscopy, the dynamics of water translation, rotation, and hydrogen-bond exchanges, and the thermodynamics of ion solvation and ion-pair dissociation. Our simulations show that curvature predominantly impacts the hydrogen-bond structure through the fraction of dangling OH groups and the dynamics of interfacial water molecules. In contrast, curvature has a limited effect on solvation and ion-pair dissociation thermodynamics. For water microdroplets, this suggests that the curvature alone cannot fully account for the distinctive reactivity measured in these systems, which are of great importance for catalysis and atmospheric chemistry.
Collapse
Affiliation(s)
- M de la Puente
- PASTEUR, Department of Chemistry, École Normale Supérieure, PSL University, Sorbonne Université, CNRS, 75005 Paris, France
| | - D Laage
- PASTEUR, Department of Chemistry, École Normale Supérieure, PSL University, Sorbonne Université, CNRS, 75005 Paris, France
| |
Collapse
|
9
|
Kwon H, Calegari Andrade MF, Ardo S, Esposito DV, Pham TA, Ogitsu T. Confinement Effects on Proton Transfer in TiO 2 Nanopores from Machine Learning Potential Molecular Dynamics Simulations. ACS APPLIED MATERIALS & INTERFACES 2024; 16:31687-31695. [PMID: 38840582 DOI: 10.1021/acsami.4c02339] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2024]
Abstract
Improved understanding of proton transfer in nanopores is critical for a wide range of emerging applications, yet experimentally probing mechanisms and energetics of this process remains a significant challenge. To help reveal details of this process, we developed and applied a machine learning potential derived from first-principles calculations to examine water reactivity and proton transfer in TiO2 slit-pores. We find that confinement of water within pores smaller than 0.5 nm imposes strong and complex effects on water reactivity and proton transfer. Although the proton transfer mechanism is similar to that at a TiO2 interface with bulk water, confinement reduces the activation energy of this process, leading to more frequent proton transfer events. This enhanced proton transfer stems from the contraction of oxygen-oxygen distances dictated by the interplay between confinement and hydrophilic interactions. Our simulations also highlight the importance of the surface topology, where faster proton transport is found in the direction where a unique arrangement of surface oxygens enables the formation of an ordered water chain. In a broader context, our study demonstrates that proton transfer in hydrophilic nanopores can be enhanced by controlling pore size, surface chemistry, and topology.
Collapse
Affiliation(s)
- Hyuna Kwon
- Quantum Simulations Group, Materials Science Division, Lawrence Livermore National Laboratory, Livermore, California 94550-5507, United States
| | - Marcos F Calegari Andrade
- Quantum Simulations Group, Materials Science Division, Lawrence Livermore National Laboratory, Livermore, California 94550-5507, United States
| | - Shane Ardo
- Department of Chemistry, Department of Chemical and Biomolecular Engineering, Department of Materials Science and Engineering, University of California, Irvine, California 92697, United States
| | - Daniel V Esposito
- Chemical Engineering Department, Columbia Electrochemical Energy Center, Columbia University, New York, New York 10027, United States
| | - Tuan Anh Pham
- Quantum Simulations Group, Materials Science Division, Lawrence Livermore National Laboratory, Livermore, California 94550-5507, United States
- Laboratory for Energy Applications for the Future, Lawrence Livermore National Laboratory, Livermore, California 94550-5507, United States
| | - Tadashi Ogitsu
- Quantum Simulations Group, Materials Science Division, Lawrence Livermore National Laboratory, Livermore, California 94550-5507, United States
| |
Collapse
|
10
|
Chen Y, Pios SV, Gelin MF, Chen L. Accelerating Molecular Vibrational Spectra Simulations with a Physically Informed Deep Learning Model. J Chem Theory Comput 2024; 20:4703-4710. [PMID: 38825857 DOI: 10.1021/acs.jctc.4c00173] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/04/2024]
Abstract
In recent years, machine learning (ML) surrogate models have emerged as an indispensable tool to accelerate simulations of physical and chemical processes. However, there is still a lack of ML models that can accurately predict molecular vibrational spectra. Here, we present a highly efficient multitask ML surrogate model termed Vibrational Spectra Neural Network (VSpecNN), to accurately calculate infrared (IR) and Raman spectra based on dipole moments and polarizabilities obtained on-the-fly via ML-enhanced molecular dynamics simulations. The methodology is applied to pyrazine, a prototypical polyatomic chromophore. The VSpecNN-predicted energies are well within the chemical accuracy (1 kcal/mol), and the errors for VSpecNN-predicted forces are only half of those obtained from a popular high-performance ML model. Compared to the ab initio reference, the VSpecNN-predicted frequencies of IR and Raman spectra differ only by less than 5.87 cm-1, and the intensities of IR spectra and the depolarization ratios of Raman spectra are well reproduced. The VSpecNN model developed in this work highlights the importance of constructing highly accurate neural network potentials for predicting molecular vibrational spectra.
Collapse
Affiliation(s)
| | | | - Maxim F Gelin
- School of Science, Hangzhou Dianzi University, Hangzhou 310018, China
| | | |
Collapse
|
11
|
Xu N, Rosander P, Schäfer C, Lindgren E, Österbacka N, Fang M, Chen W, He Y, Fan Z, Erhart P. Tensorial Properties via the Neuroevolution Potential Framework: Fast Simulation of Infrared and Raman Spectra. J Chem Theory Comput 2024; 20:3273-3284. [PMID: 38572734 PMCID: PMC11044275 DOI: 10.1021/acs.jctc.3c01343] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2023] [Revised: 03/19/2024] [Accepted: 04/01/2024] [Indexed: 04/05/2024]
Abstract
Infrared and Raman spectroscopy are widely used for the characterization of gases, liquids, and solids, as the spectra contain a wealth of information concerning, in particular, the dynamics of these systems. Atomic scale simulations can be used to predict such spectra but are often severely limited due to high computational cost or the need for strong approximations that limit the application range and reliability. Here, we introduce a machine learning (ML) accelerated approach that addresses these shortcomings and provides a significant performance boost in terms of data and computational efficiency compared with earlier ML schemes. To this end, we generalize the neuroevolution potential approach to enable the prediction of rank one and two tensors to obtain the tensorial neuroevolution potential (TNEP) scheme. We apply the resulting framework to construct models for the dipole moment, polarizability, and susceptibility of molecules, liquids, and solids and show that our approach compares favorably with several ML models from the literature with respect to accuracy and computational efficiency. Finally, we demonstrate the application of the TNEP approach to the prediction of infrared and Raman spectra of liquid water, a molecule (PTAF-), and a prototypical perovskite with strong anharmonicity (BaZrO3). The TNEP approach is implemented in the free and open source software package gpumd, which makes this methodology readily available to the scientific community.
Collapse
Affiliation(s)
- Nan Xu
- Institute
of Zhejiang University-Quzhou, Quzhou 324000, P. R. China
- College
of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310058, P. R. China
| | - Petter Rosander
- Department
of Physics, Chalmers University of Technology, SE-41296 Gothenburg, Sweden
| | - Christian Schäfer
- Department
of Physics, Chalmers University of Technology, SE-41296 Gothenburg, Sweden
| | - Eric Lindgren
- Department
of Physics, Chalmers University of Technology, SE-41296 Gothenburg, Sweden
| | - Nicklas Österbacka
- Department
of Physics, Chalmers University of Technology, SE-41296 Gothenburg, Sweden
| | - Mandi Fang
- Institute
of Zhejiang University-Quzhou, Quzhou 324000, P. R. China
- College
of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310058, P. R. China
| | - Wei Chen
- State
Key Laboratory of Multiphase Complex Systems, Institute of Process
Engineering, Chinese Academy of Sciences, Beijing 100190, P. R. China
| | - Yi He
- Institute
of Zhejiang University-Quzhou, Quzhou 324000, P. R. China
- College
of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310058, P. R. China
- Department
of Chemical Engineering, University of Washington, Seattle, Washington 98195, United States
| | - Zheyong Fan
- College
of Physical Science and Technology, Bohai
University, Jinzhou 121013, P. R. China
| | - Paul Erhart
- Department
of Physics, Chalmers University of Technology, SE-41296 Gothenburg, Sweden
| |
Collapse
|
12
|
Zhang S, Mei Y, Liu J, Liu Z, Tian Y. Alkyne-tagged SERS nanoprobe for understanding Cu + and Cu 2+ conversion in cuproptosis processes. Nat Commun 2024; 15:3246. [PMID: 38622137 PMCID: PMC11018805 DOI: 10.1038/s41467-024-47549-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2023] [Accepted: 03/29/2024] [Indexed: 04/17/2024] Open
Abstract
Simultaneously quantifying mitochondrial Cu+ and Cu2+ levels is crucial for evaluating the molecular mechanisms of copper accumulation-involved pathological processes. Here, a series of molecules containing various diacetylene derivatives as Raman reporters are designed and synthesized, and the alkyne-tagged SERS probe is created for determination Cu+ and Cu2+ with high selectivity and sensitivity. The developed SERS probe generates well-separated distinguishable Raman fingerprint peaks with built-in corrections in the cellular silent region, resulting in accurate quantification of Cu+ and Cu2+. The present probe demonstrates high tempo-spatial resolution for real-time imaging and simultaneously quantifying mitochondrial Cu+ and Cu2+ with long-term stability benefiting from the probe assembly with designed Au-C≡C groups. Using this powerful tool, it is found that mitochondrial Cu+ and Cu2+ increase during ischemia are associated with breakdown of proteins containing copper as well as conversion of Cu+ and Cu2+. Meanwhile, we observe that parts of Cu+ and Cu2+ are transported out of neurons by ATPase. More importantly, cuproptosis in neurons is found including the oxidative stress process caused by the conversion of Cu+ to Cu2+, which dominates at the early stage (<9 h), and subsequent proteotoxic stress. Both oxidative and proteotoxic stresses contribute to neuronal death.
Collapse
Affiliation(s)
- Sihan Zhang
- State Key Laboratory of Molecular & Process Engineering, School of Chemistry and Molecular Engineering, East China Normal University, Dongchuan Road 500, Shanghai, China
| | - Yuxiao Mei
- State Key Laboratory of Molecular & Process Engineering, School of Chemistry and Molecular Engineering, East China Normal University, Dongchuan Road 500, Shanghai, China
| | - Jiaqi Liu
- State Key Laboratory of Molecular & Process Engineering, School of Chemistry and Molecular Engineering, East China Normal University, Dongchuan Road 500, Shanghai, China
| | - Zhichao Liu
- State Key Laboratory of Molecular & Process Engineering, School of Chemistry and Molecular Engineering, East China Normal University, Dongchuan Road 500, Shanghai, China.
| | - Yang Tian
- State Key Laboratory of Molecular & Process Engineering, School of Chemistry and Molecular Engineering, East China Normal University, Dongchuan Road 500, Shanghai, China.
| |
Collapse
|
13
|
de la Puente M, Gomez A, Laage D. Neural Network-Based Sum-Frequency Generation Spectra of Pure and Acidified Water Interfaces with Air. J Phys Chem Lett 2024; 15:3096-3102. [PMID: 38470065 DOI: 10.1021/acs.jpclett.4c00113] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/13/2024]
Abstract
The affinity of hydronium ions (H3O+) for the air-water interface is a crucial question in environmental chemistry. While sum-frequency generation (SFG) spectroscopy has been instrumental in indicating the preference of H3O+ for the interface, key questions persist regarding the molecular origin of the SFG spectral changes in acidified water. Here we combine nanosecond long neural network (NN) reactive simulations of pure and acidified water slabs with NN predictions of molecular dipoles and polarizabilities to calculate SFG spectra of long reactive trajectories including proton transfer events. Our simulations show that H3O+ ions cause two distinct changes in phase-resolved SFG spectra: first, a low-frequency tail due to the vibrations of H3O+ and its first hydration shell, analogous to the bulk proton continuum, and second, an enhanced hydrogen-bonded band due to the ion-induced static field polarizing molecules in deeper layers. Our calculations confirm that changes in the SFG spectra of acidic solutions are caused by hydronium ions preferentially residing at the interface.
Collapse
Affiliation(s)
- Miguel de la Puente
- PASTEUR, Department of Chemistry, École Normale Supérieur, PSL University, Sorbonne Université, CNRS, 75005 Paris, France
| | - Axel Gomez
- PASTEUR, Department of Chemistry, École Normale Supérieur, PSL University, Sorbonne Université, CNRS, 75005 Paris, France
| | - Damien Laage
- PASTEUR, Department of Chemistry, École Normale Supérieur, PSL University, Sorbonne Université, CNRS, 75005 Paris, France
| |
Collapse
|
14
|
Kapil V, Kovács DP, Csányi G, Michaelides A. First-principles spectroscopy of aqueous interfaces using machine-learned electronic and quantum nuclear effects. Faraday Discuss 2024; 249:50-68. [PMID: 37799072 PMCID: PMC10845015 DOI: 10.1039/d3fd00113j] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2023] [Accepted: 07/18/2023] [Indexed: 10/07/2023]
Abstract
Vibrational spectroscopy is a powerful approach to visualising interfacial phenomena. However, extracting structural and dynamical information from vibrational spectra is a challenge that requires first-principles simulations, including non-Condon and quantum nuclear effects. We address this challenge by developing a machine-learning enhanced first-principles framework to speed up predictive modelling of infrared, Raman, and sum-frequency generation spectra. Our approach uses machine learning potentials that encode quantum nuclear effects to generate quantum trajectories using simple molecular dynamics efficiently. In addition, we reformulate bulk and interfacial selection rules to express them unambiguously in terms of the derivatives of polarisation and polarisabilities of the whole system and predict these derivatives efficiently using fully-differentiable machine learning models of dielectric response tensors. We demonstrate our framework's performance by predicting the IR, Raman, and sum-frequency generation spectra of liquid water, ice and the water-air interface by achieving near quantitative agreement with experiments at nearly the same computational efficiency as pure classical methods. Finally, to aid the experimental discovery of new phases of nanoconfined water, we predict the temperature-dependent vibrational spectra of monolayer water across the solid-hexatic-liquid phases transition.
Collapse
Affiliation(s)
- Venkat Kapil
- Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, UK.
| | | | - Gábor Csányi
- Engineering Laboratory, University of Cambridge, Cambridge, CB2 1PZ, UK
| | - Angelos Michaelides
- Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, UK.
| |
Collapse
|
15
|
Ye Z, Gygi F, Galli G. Raman Spectra of Electrified Si-Water Interfaces: First-Principles Simulations. J Phys Chem Lett 2024; 15:51-58. [PMID: 38128587 DOI: 10.1021/acs.jpclett.3c03122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2023]
Abstract
We investigate the Raman spectra of liquid water in contact with a semiconductor surface using first-principles molecular dynamics simulations. We focus on a hydrogenated silicon-water interface and compute the Raman spectra from time correlation functions of the polarizability. We establish a relationship between Raman spectral signatures and structural properties of the liquid at the interface, and we identify the vibrational impacts of an applied electric field. We show that negative bias leads to a reduction of the number of hydrogen bonds (HBs) formed between the surface and the topmost water layer and an enhancement of the HB interactions between water molecules. Instead, positive bias leads to an enhancement of both the HB interactions between water and the surface and between water molecules, creating a semi-ordered interfacial layer. Our work provides molecular-level insights into electrified semiconductor/water interfaces and the identification of specific structural features through Raman spectroscopy.
Collapse
Affiliation(s)
- Zifan Ye
- Pritzker School of Molecular Engineering, The University of Chicago, Chicago, Illinois 60637, United States
| | - Francois Gygi
- Department of Computer Science, University of California, Davis, Davis, California 95616, United States
| | - Giulia Galli
- Pritzker School of Molecular Engineering, The University of Chicago, Chicago, Illinois 60637, United States
- Department of Chemistry, The University of Chicago, Chicago, Illinois 60637, United States
- Materials Science Division, Argonne National Laboratory, Lemont, Illinois 60439, United States
| |
Collapse
|
16
|
LaCour RA, Heindel JP, Head-Gordon T. Predicting the Raman Spectra of Liquid Water with a Monomer-Field Model. J Phys Chem Lett 2023; 14:11742-11749. [PMID: 38116782 DOI: 10.1021/acs.jpclett.3c02873] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2023]
Abstract
The Raman spectrum of liquid water is quite complex, reflecting its strong sensitivity to the local environment of the individual waters. The OH-stretch region of the spectrum, which captures the influence of hydrogen bonding, has only just begun to be unraveled. Here we develop a model for predicting the Raman spectra of the OH-stretch region by considering how local electric fields distort the energy surface of each water monomer. We find that our model is capable of reproducing the bimodal nature of the main peak, with the shoulder at 3250 cm-1 resulting almost entirely from Fermi resonance. Furthermore, we capture the temperature and polarization dependence of the shoulder, which has proven to be difficult to obtain with previous methods, and analyze the origin of this dependence. We expect our model to be generally useful for understanding and predicting how Raman spectra change under different conditions and with different probe reporters beyond water.
Collapse
Affiliation(s)
- R Allen LaCour
- Kenneth S. Pitzer Theory Center and Department of Chemistry, University of California, Berkeley, California 94720, United States
- Chemical Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States
| | - Joseph P Heindel
- Kenneth S. Pitzer Theory Center and Department of Chemistry, University of California, Berkeley, California 94720, United States
- Chemical Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States
| | - Teresa Head-Gordon
- Kenneth S. Pitzer Theory Center and Department of Chemistry, University of California, Berkeley, California 94720, United States
- Chemical Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States
- Departments of Bioengineering and Chemical and Biomolecular Engineering, University of California, Berkeley, California 94720, United States
| |
Collapse
|
17
|
Shen H, Shen X, Wu Z. Simulating the isotropic Raman spectra of O-H stretching mode in liquid H 2O based on a machine learning potential: the influence of vibrational couplings. Phys Chem Chem Phys 2023; 25:28180-28188. [PMID: 37819214 DOI: 10.1039/d3cp03035k] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/13/2023]
Abstract
In this study, we trained a deep potential (DP) for H2O, an accurate machine learning (ML) potential. We performed molecular dynamics (MD) simulations of liquid water using the DP model (or DeePMD simulations). Our results showed that the DP model exhibits DFT-level accuracy, and the DeePMD simulation is a promising approach for modeling the structural properties of liquid water. Based on the DeePMD simulation trajectories, we calculated the isotropic Raman spectra of the O-H stretching mode using the surface-specific velocity-velocity correlation function (ssVVCF), showing that the DeePMD/ssVVCF approach can correctly capture the bimodal characteristics of the experimental Raman spectra, with one peak located near 3400 cm-1 and the other near 3250 cm-1. The success of the DeePMD/ssVVCF approach should be credited to (1) the DFT-level accuracy of the DP model for H2O, (2) the ssVVCF formulation considering the coupling between vibrational modes, and (3) non-Condon effects. Furthermore, the DeePMD simulations revealed that the anharmonic interactions between the coupled water molecules in the first and second hydration shells should play an essential role in the strong mixing of the H-O-H bending mode and the O-H stretching mode, leading to the delocalization of the O-H stretching band. In particular, increasing the strength of hydrogen bonds would enhance the bend-stretch coupling, leading to the red-shifting of the O-H vibrational spectra and the increase in the intensity of the shoulder around 3250 cm-1.
Collapse
Affiliation(s)
- Hujun Shen
- Guizhou Provincial Key Laboratory of Computational Nano-Material Science, Guizhou Education University, Guiyang 550018, China.
| | - Xu Shen
- National Center of Technology Innovation for Intelligent Design and Numerical Control, Huazhong University of Science and Technology, Wuhan, 430074, China
| | - Zhenhua Wu
- Department of Big Data and Artificial Intelligence, Guizhou Vocational Technology College of Electronics & Information, Kaili, 556000, China
| |
Collapse
|
18
|
Zhang Y, Jiang B. Universal machine learning for the response of atomistic systems to external fields. Nat Commun 2023; 14:6424. [PMID: 37827998 PMCID: PMC10570356 DOI: 10.1038/s41467-023-42148-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2023] [Accepted: 10/01/2023] [Indexed: 10/14/2023] Open
Abstract
Machine learned interatomic interaction potentials have enabled efficient and accurate molecular simulations of closed systems. However, external fields, which can greatly change the chemical structure and/or reactivity, have been seldom included in current machine learning models. This work proposes a universal field-induced recursively embedded atom neural network (FIREANN) model, which integrates a pseudo field vector-dependent feature into atomic descriptors to represent system-field interactions with rigorous rotational equivariance. This "all-in-one" approach correlates various response properties like dipole moment and polarizability with the field-dependent potential energy in a single model, very suitable for spectroscopic and dynamics simulations in molecular and periodic systems in the presence of electric fields. Especially for periodic systems, we find that FIREANN can overcome the intrinsic multiple-value issue of the polarization by training atomic forces only. These results validate the universality and capability of the FIREANN method for efficient first-principles modeling of complicated systems in strong external fields.
Collapse
Affiliation(s)
- Yaolong Zhang
- Key Laboratory of Precision and Intelligent Chemistry, Department of Chemical Physics, Key Laboratory of Surface and Interface Chemistry and Energy Catalysis of Anhui Higher Education Institutes, University of Science and Technology of China, Hefei, Anhui, 230026, China
- École Polytechnique Fédérale de Lausanne, 1015, Lausanne, Switzerland
| | - Bin Jiang
- Key Laboratory of Precision and Intelligent Chemistry, Department of Chemical Physics, Key Laboratory of Surface and Interface Chemistry and Energy Catalysis of Anhui Higher Education Institutes, University of Science and Technology of China, Hefei, Anhui, 230026, China.
- Hefei National Laboratory, University of Science and Technology of China, Hefei, 230088, China.
| |
Collapse
|
19
|
Zhao D, Zhao Y, He X, Li Y, Ayers PW, Liu S. Accurate and Efficient Prediction of Post-Hartree-Fock Polarizabilities of Condensed-Phase Systems. J Chem Theory Comput 2023; 19:6461-6470. [PMID: 37676647 DOI: 10.1021/acs.jctc.3c00646] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/08/2023]
Abstract
To accurately and efficiently predict the molecular response properties (such as polarizability) at post-Hartree-Fock levels for condensed-phase systems under periodic boundary conditions (PBC) is still an unaccomplished and ongoing task. We demonstrate that static isotropic polarizabilities can be cost-effectively predicted at post-Hartree-Fock levels by combining the linear-scaling generalized energy-based fragmentation (GEBF) and information-theoretic approach (ITA) quantities. In PBC-GEBF, the total molecular polarizability of an extended system is obtained as a linear combination of the corresponding quantities of a series of small embedded subsystems of several monomers. Here, we show that in the PBC-GEBF-ITA framework, one can obtain the molecular polarizabilities and establish linear relations to ITA quantities. Once these relations are established for smaller subsystems, one can predict the polarizabilities of larger subsystems directly from the molecular wavefunction (or electron density) via ITA quantities. Alternatively, one can determine the total molecular polarizability via a linear combination equation in PBC-GEBF. We have corroborated that this newly proposed PBC-GEBF-ITA protocol is much more efficient than the original PBC-GEBF approach but is not much less accurate and that this conclusion holds for both many-body perturbation theory and the coupled cluster calculations. Good efficiency and transferability of the PBC-GEBF-ITA protocol are demonstrated for periodic systems with several hundred atoms in a unit cell.
Collapse
Affiliation(s)
- Dongbo Zhao
- Institute of Biomedical Research, Yunnan University, Kunming 650500, P. R. China
| | - Yilin Zhao
- Department of Chemistry and Chemical Biology, McMaster University, Hamilton, ON L8S 4M1, Canada
| | - Xin He
- Qingdao Institute for Theoretical and Computational Sciences, Shandong University, Qingdao 266237, P. R. China
| | - Yunzhi Li
- School of Chemistry and Chemical Engineering, Linyi University, Linyi 276000, P. R. China
| | - Paul W Ayers
- Department of Chemistry and Chemical Biology, McMaster University, Hamilton, ON L8S 4M1, Canada
| | - Shubin Liu
- Research Computing Center, University of North Carolina, Chapel Hill, North Carolina 27599-3420, United States
- Department of Chemistry, University of North Carolina, Chapel Hill, North Carolina 27599-3290, United States
| |
Collapse
|
20
|
Litman Y, Lan J, Nagata Y, Wilkins DM. Fully First-Principles Surface Spectroscopy with Machine Learning. J Phys Chem Lett 2023; 14:8175-8182. [PMID: 37671886 PMCID: PMC10510433 DOI: 10.1021/acs.jpclett.3c01989] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Accepted: 08/29/2023] [Indexed: 09/07/2023]
Abstract
Our current understanding of the structure and dynamics of aqueous interfaces at the molecular level has grown substantially due to the continuous development of surface-specific spectroscopies, such as vibrational sum-frequency generation (VSFG). As in other vibrational spectroscopies, we must turn to atomistic simulations to extract all of the information encoded in the VSFG spectra. The high computational cost associated with existing methods means that they have limitations in representing systems with complex electronic structure or in achieving statistical convergence. In this work, we combine high-dimensional neural network interatomic potentials and symmetry-adapted Gaussian process regression to overcome these constraints. We show that it is possible to model VSFG signals with fully ab initio accuracy using machine learning and illustrate the versatility of our approach on the water/air interface. Our strategy allows us to identify the main sources of theoretical inaccuracy and establish a clear pathway toward the modeling of surface-sensitive spectroscopy of complex interfaces.
Collapse
Affiliation(s)
- Yair Litman
- Yusuf
Hamied Department of Chemistry, University
of Cambridge, Lensfield Road, Cambridge CB2 1EW, U.K.
- Max
Planck Institute for Polymer Research, Ackermannweg 10, 55128 Mainz, Germany
| | - Jinggang Lan
- Department
of Chemistry, New York University, New York, New York 10003, United States
- Simons
Center for Computational Physical Chemistry at New York University, New York, New York 10003, United States
| | - Yuki Nagata
- Max
Planck Institute for Polymer Research, Ackermannweg 10, 55128 Mainz, Germany
| | - David M. Wilkins
- Centre
for Quantum Materials and Technologies School of Mathematics and Physics, Queen’s University Belfast, Belfast BT7 1NN, Northern Ireland, United Kingdom
| |
Collapse
|
21
|
Zeng J, Zhang D, Lu D, Mo P, Li Z, Chen Y, Rynik M, Huang L, Li Z, Shi S, Wang Y, Ye H, Tuo P, Yang J, Ding Y, Li Y, Tisi D, Zeng Q, Bao H, Xia Y, Huang J, Muraoka K, Wang Y, Chang J, Yuan F, Bore SL, Cai C, Lin Y, Wang B, Xu J, Zhu JX, Luo C, Zhang Y, Goodall REA, Liang W, Singh AK, Yao S, Zhang J, Wentzcovitch R, Han J, Liu J, Jia W, York DM, E W, Car R, Zhang L, Wang H. DeePMD-kit v2: A software package for deep potential models. J Chem Phys 2023; 159:054801. [PMID: 37526163 PMCID: PMC10445636 DOI: 10.1063/5.0155600] [Citation(s) in RCA: 41] [Impact Index Per Article: 41.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2023] [Accepted: 07/03/2023] [Indexed: 08/02/2023] Open
Abstract
DeePMD-kit is a powerful open-source software package that facilitates molecular dynamics simulations using machine learning potentials known as Deep Potential (DP) models. This package, which was released in 2017, has been widely used in the fields of physics, chemistry, biology, and material science for studying atomistic systems. The current version of DeePMD-kit offers numerous advanced features, such as DeepPot-SE, attention-based and hybrid descriptors, the ability to fit tensile properties, type embedding, model deviation, DP-range correction, DP long range, graphics processing unit support for customized operators, model compression, non-von Neumann molecular dynamics, and improved usability, including documentation, compiled binary packages, graphical user interfaces, and application programming interfaces. This article presents an overview of the current major version of the DeePMD-kit package, highlighting its features and technical details. Additionally, this article presents a comprehensive procedure for conducting molecular dynamics as a representative application, benchmarks the accuracy and efficiency of different models, and discusses ongoing developments.
Collapse
Affiliation(s)
- Jinzhe Zeng
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey 08854, USA
| | | | - Denghui Lu
- HEDPS, CAPT, College of Engineering, Peking University, Beijing 100871, People’s Republic of China
| | - Pinghui Mo
- College of Electrical and Information Engineering, Hunan University, Changsha, People’s Republic of China
| | - Zeyu Li
- Yuanpei College, Peking University, Beijing 100871, People’s Republic of China
| | - Yixiao Chen
- Program in Applied and Computational Mathematics, Princeton University, Princeton, New Jersey 08540, USA
| | - Marián Rynik
- Department of Experimental Physics, Comenius University, Mlynská Dolina F2, 842 48 Bratislava, Slovakia
| | - Li’ang Huang
- Center for Quantum Information, Institute for Interdisciplinary Information Sciences, Tsinghua University, Beijing 100084, People’s Republic of China
| | | | - Shaochen Shi
- ByteDance Research, Zhonghang Plaza, No. 43, North 3rd Ring West Road, Haidian District, Beijing, People’s Republic of China
| | | | - Haotian Ye
- Yuanpei College, Peking University, Beijing 100871, People’s Republic of China
| | - Ping Tuo
- AI for Science Institute, Beijing 100080, People’s Republic of China
| | - Jiabin Yang
- Baidu, Inc., Beijing, People’s Republic of China
| | | | - Yifan Li
- Department of Chemistry, Princeton University, Princeton, New Jersey 08544, USA
| | | | - Qiyu Zeng
- Department of Physics, National University of Defense Technology, Changsha, Hunan 410073, People’s Republic of China
| | | | - Yu Xia
- ByteDance Research, Zhonghang Plaza, No. 43, North 3rd Ring West Road, Haidian District, Beijing, People’s Republic of China
| | | | - Koki Muraoka
- Department of Chemical System Engineering, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-8656, Japan
| | - Yibo Wang
- DP Technology, Beijing 100080, People’s Republic of China
| | | | - Fengbo Yuan
- DP Technology, Beijing 100080, People’s Republic of China
| | - Sigbjørn Løland Bore
- Hylleraas Centre for Quantum Molecular Sciences and Department of Chemistry, University of Oslo, P.O. Box 1033 Blindern, 0315 Oslo, Norway
| | | | - Yinnian Lin
- Wangxuan Institute of Computer Technology, Peking University, Beijing 100871, People’s Republic of China
| | - Bo Wang
- Shanghai Engineering Research Center of Molecular Therapeutics and New Drug Development, Shanghai Key Laboratory of Green Chemistry and Chemical Process, School of Chemistry and Molecular Engineering, East China Normal University, Shanghai 200062, People’s Republic of China
| | - Jiayan Xu
- School of Chemistry and Chemical Engineering, Queen’s University Belfast, Belfast BT9 5AG, United Kingdom
| | - Jia-Xin Zhu
- State Key Laboratory of Physical Chemistry of Solid Surfaces, iChEM, College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, People’s Republic of China
| | - Chenxing Luo
- Department of Applied Physics and Applied Mathematics, Columbia University, New York, New York 10027, USA
| | - Yuzhi Zhang
- DP Technology, Beijing 100080, People’s Republic of China
| | | | - Wenshuo Liang
- DP Technology, Beijing 100080, People’s Republic of China
| | - Anurag Kumar Singh
- Department of Data Science, Indian Institute of Technology, Palakkad, Kerala, India
| | - Sikai Yao
- DP Technology, Beijing 100080, People’s Republic of China
| | - Jingchao Zhang
- NVIDIA AI Technology Center (NVAITC), Santa Clara, California 95051, USA
| | | | - Jiequn Han
- Center for Computational Mathematics, Flatiron Institute, New York, New York 10010, USA
| | - Jie Liu
- College of Electrical and Information Engineering, Hunan University, Changsha, People’s Republic of China
| | | | - Darrin M. York
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey 08854, USA
| | | | - Roberto Car
- Department of Chemistry, Princeton University, Princeton, New Jersey 08544, USA
| | | | - Han Wang
- Author to whom correspondence should be addressed:
| |
Collapse
|
22
|
Kabylda A, Vassilev-Galindo V, Chmiela S, Poltavsky I, Tkatchenko A. Efficient interatomic descriptors for accurate machine learning force fields of extended molecules. Nat Commun 2023; 14:3562. [PMID: 37322039 DOI: 10.1038/s41467-023-39214-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2022] [Accepted: 05/17/2023] [Indexed: 06/17/2023] Open
Abstract
Machine learning force fields (MLFFs) are gradually evolving towards enabling molecular dynamics simulations of molecules and materials with ab initio accuracy but at a small fraction of the computational cost. However, several challenges remain to be addressed to enable predictive MLFF simulations of realistic molecules, including: (1) developing efficient descriptors for non-local interatomic interactions, which are essential to capture long-range molecular fluctuations, and (2) reducing the dimensionality of the descriptors to enhance the applicability and interpretability of MLFFs. Here we propose an automatized approach to substantially reduce the number of interatomic descriptor features while preserving the accuracy and increasing the efficiency of MLFFs. To simultaneously address the two stated challenges, we illustrate our approach on the example of the global GDML MLFF. We found that non-local features (atoms separated by as far as 15 Å in studied systems) are crucial to retain the overall accuracy of the MLFF for peptides, DNA base pairs, fatty acids, and supramolecular complexes. Interestingly, the number of required non-local features in the reduced descriptors becomes comparable to the number of local interatomic features (those below 5 Å). These results pave the way to constructing global molecular MLFFs whose cost increases linearly, instead of quadratically, with system size.
Collapse
Affiliation(s)
- Adil Kabylda
- Department of Physics and Materials Science, University of Luxembourg, L-1511, Luxembourg City, Luxembourg
| | - Valentin Vassilev-Galindo
- Department of Physics and Materials Science, University of Luxembourg, L-1511, Luxembourg City, Luxembourg
| | - Stefan Chmiela
- Machine Learning Group, Technische Universität Berlin, 10587, Berlin, Germany
- BIFOLD - Berlin Institute for the Foundations of Learning and Data, 10587, Berlin, Germany
| | - Igor Poltavsky
- Department of Physics and Materials Science, University of Luxembourg, L-1511, Luxembourg City, Luxembourg
| | - Alexandre Tkatchenko
- Department of Physics and Materials Science, University of Luxembourg, L-1511, Luxembourg City, Luxembourg.
| |
Collapse
|
23
|
Calegari Andrade MF, Pham TA. Probing Confinement Effects on the Infrared Spectra of Water with Deep Potential Molecular Dynamics Simulations. J Phys Chem Lett 2023:5560-5566. [PMID: 37294927 DOI: 10.1021/acs.jpclett.3c01054] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
The hydrogen-bond network of confined water is expected to deviate from that of the bulk liquid, yet probing these deviations remains a significant challenge. In this work, we combine large-scale molecular dynamics simulations with machine learning potential derived from first-principles calculations to examine the hydrogen bonding of water confined in carbon nanotubes (CNTs). We computed and compared the infrared spectrum (IR) of confined water to existing experiments to elucidate confinement effects. For CNTs with diameters >1.2 nm, we find that confinement imposes a monotonic effect on the hydrogen-bond network and on the IR spectrum of water. In contrast, confinement below 1.2 nm CNT diameter affects the water structure in a complex fashion, leading to a strong directional dependence of hydrogen bonding that varies nonlinearly with the CNT diameter. When integrated with existing IR measurements, our simulations provide a new interpretation for the IR spectrum of water confined in CNTs, pointing to previously unreported aspects of hydrogen bonding in this system. This work also offers a general platform for simulating water in CNTs with quantum accuracy on time and length scales beyond the reach of conventional first-principles approaches.
Collapse
Affiliation(s)
- Marcos F Calegari Andrade
- Quantum Simulations Group, Materials Science Division, Lawrence Livermore National Laboratory, Livermore, California 94550-5507, United States
- Laboratory for Energy Applications for the Future, Lawrence Livermore National Laboratory, Livermore, California 94550-5507, United States
| | - Tuan Anh Pham
- Quantum Simulations Group, Materials Science Division, Lawrence Livermore National Laboratory, Livermore, California 94550-5507, United States
- Laboratory for Energy Applications for the Future, Lawrence Livermore National Laboratory, Livermore, California 94550-5507, United States
| |
Collapse
|
24
|
Inoue K, Litman Y, Wilkins DM, Nagata Y, Okuno M. Is Unified Understanding of Vibrational Coupling of Water Possible? Hyper-Raman Measurement and Machine Learning Spectra. J Phys Chem Lett 2023; 14:3063-3068. [PMID: 36947156 DOI: 10.1021/acs.jpclett.3c00398] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
The impact of the vibrational coupling of the OH stretch mode on the spectra differs significantly between IR and Raman spectra of water. Unified understanding of the vibrational couplings is not yet achieved. By using a different class of vibrational spectroscopy, hyper-Raman (HR) spectroscopy, together with machine-learning-assisted HR spectra calculation, we examine the impact of the vibrational couplings of water through the comparison of isotopically diluted H2O and pure H2O. We found that the isotopic dilution reduces the HR bandwidths, but the impact of the vibrational coupling is smaller than in the IR and parallel-polarized Raman. Machine learning HR spectra indicate that the intermolecular coupling plays a major role in broadening the bandwidth, while the intramolecular coupling is negligibly small, which is consistent with the IR and Raman spectra. Our result clearly demonstrates a limited impact of the intramolecular vibration, independent of the selection rules of vibrational spectroscopies.
Collapse
Affiliation(s)
- Kazuki Inoue
- Department of Basic Science, Graduate School of Arts and Sciences, The University of Tokyo, 3-8-1 Komaba, Meguro, Tokyo 153-8902, Japan
| | - Yair Litman
- Max Planck Institute for Polymer Research, Ackermannweg 10, 55128 Mainz, Germany
- Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - David M Wilkins
- Atomistic Simulation Centre, School of Mathematics and Physics, Queen's University Belfast, Belfast BT7 1NN, Northern Ireland, United Kingdom
| | - Yuki Nagata
- Max Planck Institute for Polymer Research, Ackermannweg 10, 55128 Mainz, Germany
| | - Masanari Okuno
- Department of Basic Science, Graduate School of Arts and Sciences, The University of Tokyo, 3-8-1 Komaba, Meguro, Tokyo 153-8902, Japan
| |
Collapse
|
25
|
Kondratyuk N, Ryltsev R, Ankudinov V, Chtchelkatchev N. First-principles calculations of the viscosity in multicomponent metallic melts: Al-Cu-Ni as a test case. J Mol Liq 2023. [DOI: 10.1016/j.molliq.2023.121751] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/03/2023]
|
26
|
Feng C, Xi J, Zhang Y, Jiang B, Zhou Y. Accurate and Interpretable Dipole Interaction Model-Based Machine Learning for Molecular Polarizability. J Chem Theory Comput 2023; 19:1207-1217. [PMID: 36753749 DOI: 10.1021/acs.jctc.2c01094] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/10/2023]
Abstract
Polarizabilities play significant roles in describing dispersive and inductive interactions of the atom and molecular systems. However, an accurate prediction of molecular polarizabilities from first principles is computationally prohibitive. Although physical models or statistical machine learning models have been proposed, either a lack of accurate description of local chemical environments or demanding a large number of samples for training has limited their practical applications. In this study, we combine a physically inspired dipole interaction model and an accurate neural network method for predicting the polarizability tensors of molecules. With the local chemical environment precisely described and the requirement of rotational covariance naturally fulfilled, this hybrid model is proven to give an accurate molecular polarizability prediction, essentially reducing the number of training samples. The atomic polarizabilities are physically interpretable and transferable to larger molecules unseen in the training set. This promising method may find its wide range of applications, such as spectroscopic simulations and the construction of polarizable force fields.
Collapse
Affiliation(s)
- Chaoqiang Feng
- Anhui Key Laboratory of Optoelectric Materials Science and Technology, Department of Physics, Anhui Normal University, Wuhu, Anhui 241000, China.,Hefei National Research Center for Physical Sciences at the Microscale, Department of Chemical Physics, Key Laboratory of Surface and Interface Chemistry and Energy Catalysis of Anhui Higher Education Institutes, University of Science and Technology of China, Hefei, Anhui 230026, China
| | - Jin Xi
- Anhui Key Laboratory of Optoelectric Materials Science and Technology, Department of Physics, Anhui Normal University, Wuhu, Anhui 241000, China
| | - Yaolong Zhang
- Hefei National Research Center for Physical Sciences at the Microscale, Department of Chemical Physics, Key Laboratory of Surface and Interface Chemistry and Energy Catalysis of Anhui Higher Education Institutes, University of Science and Technology of China, Hefei, Anhui 230026, China
| | - Bin Jiang
- Hefei National Research Center for Physical Sciences at the Microscale, Department of Chemical Physics, Key Laboratory of Surface and Interface Chemistry and Energy Catalysis of Anhui Higher Education Institutes, University of Science and Technology of China, Hefei, Anhui 230026, China
| | - Yong Zhou
- Anhui Key Laboratory of Optoelectric Materials Science and Technology, Department of Physics, Anhui Normal University, Wuhu, Anhui 241000, China
| |
Collapse
|
27
|
Zhai Y, Caruso A, Bore SL, Luo Z, Paesani F. A "short blanket" dilemma for a state-of-the-art neural network potential for water: Reproducing experimental properties or the physics of the underlying many-body interactions? J Chem Phys 2023; 158:084111. [PMID: 36859071 DOI: 10.1063/5.0142843] [Citation(s) in RCA: 20] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
Deep neural network (DNN) potentials have recently gained popularity in computer simulations of a wide range of molecular systems, from liquids to materials. In this study, we explore the possibility of combining the computational efficiency of the DeePMD framework and the demonstrated accuracy of the MB-pol data-driven, many-body potential to train a DNN potential for large-scale simulations of water across its phase diagram. We find that the DNN potential is able to reliably reproduce the MB-pol results for liquid water, but provides a less accurate description of the vapor-liquid equilibrium properties. This shortcoming is traced back to the inability of the DNN potential to correctly represent many-body interactions. An attempt to explicitly include information about many-body effects results in a new DNN potential that exhibits the opposite performance, being able to correctly reproduce the MB-pol vapor-liquid equilibrium properties, but losing accuracy in the description of the liquid properties. These results suggest that DeePMD-based DNN potentials are not able to correctly "learn" and, consequently, represent many-body interactions, which implies that DNN potentials may have limited ability to predict the properties for state points that are not explicitly included in the training process. The computational efficiency of the DeePMD framework can still be exploited to train DNN potentials on data-driven many-body potentials, which can thus enable large-scale, "chemically accurate" simulations of various molecular systems, with the caveat that the target state points must have been adequately sampled by the reference data-driven many-body potential in order to guarantee a faithful representation of the associated properties.
Collapse
Affiliation(s)
- Yaoguang Zhai
- Department of Computer Science and Engineering, University of California San Diego, La Jolla, California 92093, USA
| | - Alessandro Caruso
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California 92093, USA
| | - Sigbjørn Løland Bore
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California 92093, USA
| | - Zhishang Luo
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California 92093, USA
| | - Francesco Paesani
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California 92093, USA
| |
Collapse
|
28
|
Shanavas Rasheeda D, Martín Santa Daría A, Schröder B, Mátyus E, Behler J. High-dimensional neural network potentials for accurate vibrational frequencies: the formic acid dimer benchmark. Phys Chem Chem Phys 2022; 24:29381-29392. [PMID: 36459127 DOI: 10.1039/d2cp03893e] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
In recent years, machine learning potentials (MLP) for atomistic simulations have attracted a lot of attention in chemistry and materials science. Many new approaches have been developed with the primary aim to transfer the accuracy of electronic structure calculations to large condensed systems containing thousands of atoms. In spite of these advances, the reliability of modern MLPs in reproducing the subtle details of the multi-dimensional potential-energy surface is still difficult to assess for such systems. On the other hand, moderately sized systems enabling the application of tools for thorough and systematic quality-control are nowadays rarely investigated. In this work we use benchmark-quality harmonic and anharmonic vibrational frequencies as a sensitive probe for the validation of high-dimensional neural network potentials. For the case of the formic acid dimer, a frequently studied model system for which stringent spectroscopic data became recently available, we show that high-quality frequencies can be obtained from state-of-the-art calculations in excellent agreement with coupled cluster theory and experimental data.
Collapse
Affiliation(s)
- Dilshana Shanavas Rasheeda
- Universität Göttingen, Institut für Physikalische Chemie, Theoretische Chemie, Tammannstraβe 6, 37077 Göttingen, Germany.
| | - Alberto Martín Santa Daría
- ELTE, Eötvös Loránd University, Institute of Chemistry, Pázmány Péter sétány 1/A, 1117 Budapest, Hungary
| | - Benjamin Schröder
- Universität Göttingen, Institut für Physikalische Chemie, Tammannstraβe 6, 37077 Göttingen, Germany
| | - Edit Mátyus
- ELTE, Eötvös Loránd University, Institute of Chemistry, Pázmány Péter sétány 1/A, 1117 Budapest, Hungary
| | - Jörg Behler
- Universität Göttingen, Institut für Physikalische Chemie, Theoretische Chemie, Tammannstraβe 6, 37077 Göttingen, Germany.
| |
Collapse
|
29
|
Zhang Y, Lin Q, Jiang B. Atomistic neural network representations for chemical dynamics simulations of molecular, condensed phase, and interfacial systems: Efficiency, representability, and generalization. WIRES COMPUTATIONAL MOLECULAR SCIENCE 2022. [DOI: 10.1002/wcms.1645] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Affiliation(s)
- Yaolong Zhang
- Department of Chemical Physics, School of Chemistry and Materials Science, Key Laboratory of Surface and Interface Chemistry and Energy Catalysis of Anhui Higher Education Institutes University of Science and Technology of China Hefei Anhui China
| | - Qidong Lin
- Department of Chemical Physics, School of Chemistry and Materials Science, Key Laboratory of Surface and Interface Chemistry and Energy Catalysis of Anhui Higher Education Institutes University of Science and Technology of China Hefei Anhui China
| | - Bin Jiang
- Department of Chemical Physics, School of Chemistry and Materials Science, Key Laboratory of Surface and Interface Chemistry and Energy Catalysis of Anhui Higher Education Institutes University of Science and Technology of China Hefei Anhui China
| |
Collapse
|
30
|
Zhang Z, Li Y, Li Y. Prediction approach of larch wood density from visible-near-infrared spectroscopy based on parameter calibrating and transfer learning. FRONTIERS IN PLANT SCIENCE 2022; 13:1006292. [PMID: 36267936 PMCID: PMC9577256 DOI: 10.3389/fpls.2022.1006292] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Accepted: 09/20/2022] [Indexed: 06/16/2023]
Abstract
Wood density, as a key indicator to measure wood properties, is of weighty significance in enhancing wood utilization and modifying wood properties in sustainable forest management. Visible-near-infrared (Vis-NIR) spectroscopy provides a feasible and efficient solution for obtaining wood density by the advantages of its efficiency and non-destructiveness. However, the spectral responses are different in wood products with different moisture content conditions, and changes in external factors may cause the regression model to fail. Although some calibration transfer methods and convolutional neural network (CNN)-based deep transfer learning methods have been proposed, the generalization ability and prediction accuracy of the models still need to be improved. For the prediction problem of Vis-NIR wood density in different moisture contents, a deep transfer learning hybrid method with automatic calibration capability (Resnet1D-SVR-TrAdaBoost.R2) was proposed in this study. The disadvantage of overfitting was avoided when CNN processes small sample data, which considered the complex exterior factors in actual production to enhance feature extraction and migration between samples. Density prediction of the method was performed on a larch dataset with different moisture content conditions, and the hybrid method was found to achieve the best prediction results under the calibration samples with different target domain calibration samples and moisture contents, and the performance of models was better than that of the traditional calibration transfer and migration learning methods. In particular, the hybrid model has achieved an improvement of about 0.1 in both R 2 and root mean square error (RMSE) values compared to the support vector regression model transferred by piecewise direct standardization method (SVR+PDS), which has the best performance among traditional calibration methods. To further ascertain the generalizability of the hybrid model, the model was validated with samples collected from mixed moisture contents as the target domain. Various experiments demonstrated that the Resnet1D-SVR-TrAdaBoost.R2 model could predict larch wood density with a high generalization ability and accuracy effectively but was computation consuming. It showed the potential to be extended to predict other metrics of wood.
Collapse
Affiliation(s)
- Zheyu Zhang
- College of Engineering and Technology, Northeast Forestry University, Harbin, China
| | - Yaoxiang Li
- College of Engineering and Technology, Northeast Forestry University, Harbin, China
| | - Ying Li
- College of Energy and Transportation Engineering, Inner Mongolia Agricultural University, Hohhot, China
| |
Collapse
|
31
|
Abstract
Molecular simulations have provided valuable insight into the microscopic mechanisms underlying homogeneous ice nucleation. While empirical models have been used extensively to study this phenomenon, simulations based on first-principles calculations have so far proven prohibitively expensive. Here, we circumvent this difficulty by using an efficient machine-learning model trained on density-functional theory energies and forces. We compute nucleation rates at atmospheric pressure, over a broad range of supercoolings, using the seeding technique and systems of up to hundreds of thousands of atoms simulated with ab initio accuracy. The key quantity provided by the seeding technique is the size of the critical cluster (i.e., a size such that the cluster has equal probabilities of growing or melting at the given supersaturation), which is used together with the equations of classical nucleation theory to compute nucleation rates. We find that nucleation rates for our model at moderate supercoolings are in good agreement with experimental measurements within the error of our calculation. We also study the impact of properties such as the thermodynamic driving force, interfacial free energy, and stacking disorder on the calculated rates.
Collapse
|
32
|
Lu D, Jiang W, Chen Y, Zhang L, Jia W, Wang H, Chen M. DP Compress: A Model Compression Scheme for Generating Efficient Deep Potential Models. J Chem Theory Comput 2022; 18:5559-5567. [PMID: 35926122 DOI: 10.1021/acs.jctc.2c00102] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Machine-learning-based interatomic potential energy surface (PES) models are revolutionizing the field of molecular modeling. However, although much faster than electronic structure schemes, these models suffer from costly computations via deep neural networks to predict the energy and atomic forces, resulting in lower running efficiency as compared to the typical empirical force fields. Herein, we report a model compression scheme for boosting the performance of the Deep Potential (DP) model, a deep learning-based PES model. This scheme, we call DP Compress, is an efficient postprocessing step after the training of DP models (DP Train). DP Compress combines several DP-specific compression techniques, which typically speed up DP-based molecular dynamics simulations by an order of magnitude faster and consume an order of magnitude less memory. We demonstrate that DP Compress is sufficiently accurate by testing a variety of physical properties of Cu, H2O, and Al-Cu-Mg systems. DP Compress applies to both CPU and GPU machines and is publicly available online.
Collapse
Affiliation(s)
- Denghui Lu
- HEDPS, CAPT, College of Engineering, Peking University, Beijing 100871, P. R. China
| | - Wanrun Jiang
- Songshan Lake Materials Laboratory, Dongguan, Guangdong 523808, P. R. China.,Institute of Physics, Chinese Academy of Sciences, Beijing 100190, P. R. China
| | - Yixiao Chen
- Program in Applied and Computational Mathematics, Princeton University, Princeton, New Jersey 08544, United States
| | - Linfeng Zhang
- Beijing Institute of Big Data Research, Beijing 100871, P. R. China
| | - Weile Jia
- Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, P. R. China.,University of Chinese Academy of Sciences, Beijing 100049, P. R. China
| | - Han Wang
- HEDPS, CAPT, College of Engineering, Peking University, Beijing 100871, P. R. China.,Laboratory of Computational Physics, Institute of Applied Physics and Computational Mathematics, Fenghao East Road 2, Beijing 100094, P. R. China
| | - Mohan Chen
- HEDPS, CAPT, College of Engineering, Peking University, Beijing 100871, P. R. China
| |
Collapse
|
33
|
Omodemi O, Kaledin M, Kaledin AL. Permutationally invariant polynomial representation of polarizability tensor surfaces for linear regression analysis. J Comput Chem 2022; 43:1495-1503. [PMID: 35737590 DOI: 10.1002/jcc.26952] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2022] [Accepted: 06/09/2022] [Indexed: 11/07/2022]
Abstract
A linearly parameterized functional form for a Cartesian representation of molecular dipole polarizability tensor surfaces (PTS) is described. The proposed expression for the PTS is a linearization of the recently reported power series ansatz of the original Applequist model, which by construction is non-linear in parameter space. This new approach possesses (i) a unique solution to the least-squares fitting problem; (ii) a low level of the computational complexity of the resulting linear regression procedure, comparable to those of the potential energy and dipole moment surfaces; and (iii) a competitive level of accuracy compared to the non-linear PTS model. Calculations of CH4 PTS, with polarizabilities fitted to 9000 training set points with the energies up to 14,000 cm-1 show an impressive level of accuracy of the linear PTS model obtained with ~1600 parameters: ~1% versus 0.3% RMSE for the non-linear vs. linear model on a test set of 1000 configurations.
Collapse
Affiliation(s)
- Oluwaseun Omodemi
- Department of Chemistry & Biochemistry, Kennesaw State University, Kennesaw, Georgia, USA
| | - Martina Kaledin
- Department of Chemistry & Biochemistry, Kennesaw State University, Kennesaw, Georgia, USA
| | - Alexey L Kaledin
- Cherry L. Emerson Center for Scientific Computation and Department of Chemistry, Emory University, Atlanta, Georgia, USA
| |
Collapse
|
34
|
Liang W, Lu G, Yu J. Machine Learning Accelerates Molten Salt Simulations: Thermal Conductivity of MgCl
2
‐NaCl Eutectic. ADVANCED THEORY AND SIMULATIONS 2022. [DOI: 10.1002/adts.202200206] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Affiliation(s)
- Wenshuo Liang
- School of Resources and Environmental Engineering East China University of Science and Technology Shanghai 200237 China
- National Engineering Research Center for Integrated Utilization of Salt Lake Resource East China University of Science and Technology Shanghai 200237 China
| | - Guimin Lu
- School of Resources and Environmental Engineering East China University of Science and Technology Shanghai 200237 China
- National Engineering Research Center for Integrated Utilization of Salt Lake Resource East China University of Science and Technology Shanghai 200237 China
| | - Jianguo Yu
- School of Resources and Environmental Engineering East China University of Science and Technology Shanghai 200237 China
- National Engineering Research Center for Integrated Utilization of Salt Lake Resource East China University of Science and Technology Shanghai 200237 China
| |
Collapse
|
35
|
Investigation on the local structure and properties of molten Li2CO3-K2CO3 binary salts by machine learning potentials. J Mol Liq 2022. [DOI: 10.1016/j.molliq.2022.118979] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
36
|
Wöhl J, Kopp WA, Yevlakhovych I, Bahr L, Koß HJ, Leonhard K. Completely Computational Model Setup for Spectroscopic Techniques: The Ab Initio Molecular Dynamics Indirect Hard Modeling Approach. J Phys Chem A 2022; 126:2845-2853. [PMID: 35476427 DOI: 10.1021/acs.jpca.2c01061] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The spectroscopic quantification of mixture compositions usually requires pure compounds and mixtures of known compositions for calibration. Since they are not always available, methods to fill such gaps have evolved, which are, however, not generally applicable. Therefore, calibration can be extremely challenging, especially when multiple unstable species, for example, intermediates, exist in a system. This study presents a new calibration approach that uses ab initio molecular dynamics (AIMD)-simulated spectra to set up and calibrate models for the physics-based spectral analysis method indirect hard modeling (IHM). To demonstrate our approach called AIMD-IHM, we analyze Raman spectra of ternary hydrogen-bonding mixtures of acetone, methanol, and ethanol. The derived AIMD-IHM pure-component models and calibration coefficients are in good agreement with conventionally generated experimental results. The method yields compositions with prediction errors of less than 5% without any experimental calibration input. Our approach can be extended, in principle, to infrared and NMR spectroscopy and allows for the analysis of systems that were hitherto inaccessible to quantitative spectroscopic analysis.
Collapse
Affiliation(s)
- Justus Wöhl
- Institute of Technical Thermodynamics, RWTH Aachen University, 52062 Aachen, Germany
| | - Wassja A Kopp
- Institute of Technical Thermodynamics, RWTH Aachen University, 52062 Aachen, Germany
| | - Iryna Yevlakhovych
- Institute of Technical Thermodynamics, RWTH Aachen University, 52062 Aachen, Germany
| | - Leo Bahr
- Institute of Technical Thermodynamics, RWTH Aachen University, 52062 Aachen, Germany
| | - Hans-Jürgen Koß
- Institute of Technical Thermodynamics, RWTH Aachen University, 52062 Aachen, Germany
| | - Kai Leonhard
- Institute of Technical Thermodynamics, RWTH Aachen University, 52062 Aachen, Germany
| |
Collapse
|
37
|
Ren H, Zhang Q, Wang Z, Zhang G, Liu H, Guo W, Mukamel S, Jiang J. Machine learning recognition of protein secondary structures based on two-dimensional spectroscopic descriptors. Proc Natl Acad Sci U S A 2022; 119:e2202713119. [PMID: 35476517 PMCID: PMC9171355 DOI: 10.1073/pnas.2202713119] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2022] [Accepted: 03/28/2022] [Indexed: 11/29/2022] Open
Abstract
Protein secondary structure discrimination is crucial for understanding their biological function. It is not generally possible to invert spectroscopic data to yield the structure. We present a machine learning protocol which uses two-dimensional UV (2DUV) spectra as pattern recognition descriptors, aiming at automated protein secondary structure determination from spectroscopic features. Accurate secondary structure recognition is obtained for homologous (97%) and nonhomologous (91%) protein segments, randomly selected from simulated model datasets. The advantage of 2DUV descriptors over one-dimensional linear absorption and circular dichroism spectra lies in the cross-peak information that reflects interactions between local regions of the protein. Thanks to their ultrafast (∼200 fs) nature, 2DUV measurements can be used in the future to probe conformational variations in the course of protein dynamics.
Collapse
Affiliation(s)
- Hao Ren
- School of Materials Science and Engineering, China University of Petroleum (East China), Qingdao 266580, Shandong, China
| | - Qian Zhang
- School of Materials Science and Engineering, China University of Petroleum (East China), Qingdao 266580, Shandong, China
| | - Zhengjie Wang
- School of Materials Science and Engineering, China University of Petroleum (East China), Qingdao 266580, Shandong, China
| | - Guozhen Zhang
- School of Chemistry and Materials Science, University of Science and Technology of China, Hefei 230026, Anhui, China
| | - Hongzhang Liu
- School of Materials Science and Engineering, China University of Petroleum (East China), Qingdao 266580, Shandong, China
| | - Wenyue Guo
- School of Materials Science and Engineering, China University of Petroleum (East China), Qingdao 266580, Shandong, China
| | - Shaul Mukamel
- Department of Chemistry and Physics & Astronomy, University of California, Irvine, CA 92697
| | - Jun Jiang
- School of Chemistry and Materials Science, University of Science and Technology of China, Hefei 230026, Anhui, China
| |
Collapse
|
38
|
Zhang L, Wang H, Muniz MC, Panagiotopoulos AZ, Car R, E W. A deep potential model with long-range electrostatic interactions. J Chem Phys 2022; 156:124107. [DOI: 10.1063/5.0083669] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Machine learning models for the potential energy of multi-atomic systems, such as the deep potential (DP) model, make molecular simulations with the accuracy of quantum mechanical density functional theory possible at a cost only moderately higher than that of empirical force fields. However, the majority of these models lack explicit long-range interactions and fail to describe properties that derive from the Coulombic tail of the forces. To overcome this limitation, we extend the DP model by approximating the long-range electrostatic interaction between ions (nuclei + core electrons) and valence electrons with that of distributions of spherical Gaussian charges located at ionic and electronic sites. The latter are rigorously defined in terms of the centers of the maximally localized Wannier distributions, whose dependence on the local atomic environment is modeled accurately by a deep neural network. In the DP long-range (DPLR) model, the electrostatic energy of the Gaussian charge system is added to short-range interactions that are represented as in the standard DP model. The resulting potential energy surface is smooth and possesses analytical forces and virial. Missing effects in the standard DP scheme are recovered, improving on accuracy and predictive power. By including long-range electrostatics, DPLR correctly extrapolates to large systems the potential energy surface learned from quantum mechanical calculations on smaller systems. We illustrate the approach with three examples: the potential energy profile of the water dimer, the free energy of interaction of a water molecule with a liquid water slab, and the phonon dispersion curves of the NaCl crystal.
Collapse
Affiliation(s)
| | - Han Wang
- Laboratory of Computational Physics, Institute of Applied Physics and Computational Mathematics, Fenghao East Road 2, Beijing 100094, People’s Republic of China
- HEDPS, CAPT, College of Engineering, Peking University, Beijing 100871, People’s Republic of China
| | - Maria Carolina Muniz
- Department of Chemical and Biological Engineering, Princeton University, Princeton, New Jersey 08544, USA
| | | | - Roberto Car
- Department of Chemistry, Department of Physics, Program in Applied and Computational Mathematics, Princeton Institute for the Science and Technology of Materials, Princeton University, Princeton, New Jersey 08544, USA
| | - Weinan E
- School of Mathematical Sciences, Peking University, Beijing 100871, People’s Republic of China
- AI for Science Institute, Beijing, People’s Republic of China
- Department of Mathematics and Program in Applied and Computational Mathematics, Princeton University, Princeton, New Jersey 08544, USA
| |
Collapse
|
39
|
Zhang Y, Xia J, Jiang B. REANN: A PyTorch-based end-to-end multi-functional deep neural network package for molecular, reactive, and periodic systems. J Chem Phys 2022; 156:114801. [DOI: 10.1063/5.0080766] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
In this work, we present a general purpose deep neural network package for representing energies, forces, dipole moments, and polarizabilities of atomistic systems. This so-called recursively embedded atom neural network model takes advantages of both the physically inspired atomic descriptor based neural networks and the message-passing based neural networks. Implemented in the PyTorch framework, the training process is parallelized on both the central processing unit and the graphics processing unit with high efficiency and low memory in which all hyperparameters can be optimized automatically. We demonstrate the state-of-the-art accuracy, high efficiency, scalability, and universality of this package by learning not only energies (with or without forces) but also dipole moment vectors and polarizability tensors in various molecular, reactive, and periodic systems. An interface between a trained model and LAMMPs is provided for large scale molecular dynamics simulations. We hope that this open-source toolbox will allow for future method development and applications of machine learned potential energy surfaces and quantum-chemical properties of molecules, reactions, and materials.
Collapse
Affiliation(s)
- Yaolong Zhang
- School of Chemistry and Materials Science, Department of Chemical Physics, Key Laboratory of Surface and Interface Chemistry and Energy Catalysis of Anhui Higher Education Institutes, University of Science and Technology of China, Hefei, Anhui 230026, China
| | - Junfan Xia
- School of Chemistry and Materials Science, Department of Chemical Physics, Key Laboratory of Surface and Interface Chemistry and Energy Catalysis of Anhui Higher Education Institutes, University of Science and Technology of China, Hefei, Anhui 230026, China
| | - Bin Jiang
- School of Chemistry and Materials Science, Department of Chemical Physics, Key Laboratory of Surface and Interface Chemistry and Energy Catalysis of Anhui Higher Education Institutes, University of Science and Technology of China, Hefei, Anhui 230026, China
| |
Collapse
|
40
|
Ryltsev R, Chtchelkatchev N. Deep machine learning potentials for multicomponent metallic melts: Development, predictability and compositional transferability. J Mol Liq 2022. [DOI: 10.1016/j.molliq.2021.118181] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
41
|
Zhao L, Zhang J, Zhang Y, Ye S, Zhang G, Chen X, Jiang B, Jiang J. Accurate Machine Learning Prediction of Protein Circular Dichroism Spectra with Embedded Density Descriptors. JACS AU 2021; 1:2377-2384. [PMID: 34977905 PMCID: PMC8715543 DOI: 10.1021/jacsau.1c00449] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/08/2021] [Indexed: 05/08/2023]
Abstract
A data-driven approach to simulate circular dichroism (CD) spectra is appealing for fast protein secondary structure determination, yet the challenge of predicting electric and magnetic transition dipole moments poses a substantial barrier for the goal. To address this problem, we designed a new machine learning (ML) protocol in which ordinary pure geometry-based descriptors are replaced with alternative embedded density descriptors and electric and magnetic transition dipole moments are successfully predicted with an accuracy comparable to first-principle calculation. The ML model is able to not only simulate protein CD spectra nearly 4 orders of magnitude faster than conventional first-principle simulation but also obtain CD spectra in good agreement with experiments. Finally, we predicted a series of CD spectra of the Trp-cage protein associated with continuous changes of protein configuration along its folding path, showing the potential of our ML model for supporting real-time CD spectroscopy study of protein dynamics.
Collapse
Affiliation(s)
- Luyuan Zhao
- Hefei
National Laboratory for Physical Sciences at the Microscale, Collaborative
Innovation Center of Chemistry for Energy Materials, School of Chemistry
and Materials Science, University of Science
and Technology of China, Hefei, Anhui 230026, P. R. China
| | - Jinxiao Zhang
- Guangxi
Key Laboratory of Electrochemical and Magneto-chemical Functional
Materials, College of Chemistry and Bioengineering, Guilin University of Technology, Guilin 541006, P. R. China
| | - Yaolong Zhang
- Hefei
National Laboratory for Physical Sciences at the Microscale, Collaborative
Innovation Center of Chemistry for Energy Materials, School of Chemistry
and Materials Science, University of Science
and Technology of China, Hefei, Anhui 230026, P. R. China
| | - Sheng Ye
- School
of Artificial Intelligence, Anhui University, Hefei, Anhui 230601, P. R. China
| | - Guozhen Zhang
- Hefei
National Laboratory for Physical Sciences at the Microscale, Collaborative
Innovation Center of Chemistry for Energy Materials, School of Chemistry
and Materials Science, University of Science
and Technology of China, Hefei, Anhui 230026, P. R. China
| | - Xin Chen
- Gusu
Laboratory of Materials, Suzhou, Jiangsu 215123, P. R. China
| | - Bin Jiang
- Hefei
National Laboratory for Physical Sciences at the Microscale, Collaborative
Innovation Center of Chemistry for Energy Materials, School of Chemistry
and Materials Science, University of Science
and Technology of China, Hefei, Anhui 230026, P. R. China
| | - Jun Jiang
- Hefei
National Laboratory for Physical Sciences at the Microscale, Collaborative
Innovation Center of Chemistry for Energy Materials, School of Chemistry
and Materials Science, University of Science
and Technology of China, Hefei, Anhui 230026, P. R. China
| |
Collapse
|
42
|
Penkov N. Antibodies Processed Using High Dilution Technology Distantly Change Structural Properties of IFNγ Aqueous Solution. Pharmaceutics 2021; 13:1864. [PMID: 34834279 PMCID: PMC8618336 DOI: 10.3390/pharmaceutics13111864] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2021] [Revised: 10/29/2021] [Accepted: 11/01/2021] [Indexed: 11/16/2022] Open
Abstract
Terahertz spectroscopy allows for the analysis of vibrations corresponding to the large-scale structural movements and collective dynamics of hydrogen-bonded water molecules. Previously, differences had been detected in the emission spectra of interferon-gamma (IFNγ) solutions surrounded by extremely diluted solutions of either IFNγ or antibodies to IFNγ without direct contact compared to a control. Here we aimed to analyse the structural properties of water in a sample of an aqueous solution of IFNγ via terahertz time-domain spectroscopy (THz-TDS). Tubes with the IFNγ solution were immersed in fluidised lactose saturated with test samples (dilutions of antibodies to IFNγ or control) and incubated at 37 °C for 1, 1.5-2, 2.5-3, or 3.5-4 h. Fluidised lactose was chosen since it is an excipient in the manufacture of drugs based on diluted antibodies to IFNγ. After incubation, spectra were recorded within a wavenumber range of 10 to 110 cm-1 with a resolution of 4 cm-1. Lactose saturated with dilutions of antibodies to IFNγ (incubated for more than 2.5 h) changed the structural properties of an IFNγ aqueous solution without direct contact compared to the control. Terahertz spectra revealed stronger intermolecular hydrogen bonds and an increase in the relaxation time of free and weakly bound water molecules. The methodology developed on the basis of THz-TDS could potentially be applied to quality control of pharmaceuticals based on extremely diluted antibodies.
Collapse
Affiliation(s)
- Nikita Penkov
- Laboratory of Optical and Spectral Analysis Methods, Institute of Cell Biophysics RAS, Federal Research Center "Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences", 142290 Pushchino, Russia
| |
Collapse
|
43
|
Shi Y, Doyle CC, Beck TL. Condensed Phase Water Molecular Multipole Moments from Deep Neural Network Models Trained on Ab Initio Simulation Data. J Phys Chem Lett 2021; 12:10310-10317. [PMID: 34662132 DOI: 10.1021/acs.jpclett.1c02328] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Ionic solvation phenomena in liquids involve intense interactions in the inner solvation shell. For interactions beyond the first shell, the ion-solvent interaction energies result from the sum of many smaller-magnitude contributions that can still include polarization effects. Deep neural network (DNN) methods have recently found wide application in developing efficient molecular models that maintain near-quantum accuracy. Here we extend the DeePMD-kit code to produce accurate molecular multipole moments in the bulk and near interfaces. The new method is validated by comparing the DNN moments with those generated by ab initio simulations. The moments are used to compute the electrostatic potential at the center of a molecular-sized hydrophobic cavity in water. The results show that the fields produced by the DNN models are in quantitative agreement with the AIMD-derived values. These efficient methods will open the door to more accurate solvation models for large solutes such as proteins.
Collapse
Affiliation(s)
- Yu Shi
- Department of Chemistry, University of Cincinnati, Cincinnati, Ohio 45221-0172, United States
| | - Carrie C Doyle
- Department of Chemistry, University of Cincinnati, Cincinnati, Ohio 45221-0172, United States
| | - Thomas L Beck
- Department of Chemistry, University of Cincinnati, Cincinnati, Ohio 45221-0172, United States
| |
Collapse
|
44
|
Gastegger M, Schütt KT, Müller KR. Machine learning of solvent effects on molecular spectra and reactions. Chem Sci 2021; 12:11473-11483. [PMID: 34567501 PMCID: PMC8409491 DOI: 10.1039/d1sc02742e] [Citation(s) in RCA: 45] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2021] [Accepted: 07/22/2021] [Indexed: 01/13/2023] Open
Abstract
Fast and accurate simulation of complex chemical systems in environments such as solutions is a long standing challenge in theoretical chemistry. In recent years, machine learning has extended the boundaries of quantum chemistry by providing highly accurate and efficient surrogate models of electronic structure theory, which previously have been out of reach for conventional approaches. Those models have long been restricted to closed molecular systems without accounting for environmental influences, such as external electric and magnetic fields or solvent effects. Here, we introduce the deep neural network FieldSchNet for modeling the interaction of molecules with arbitrary external fields. FieldSchNet offers access to a wealth of molecular response properties, enabling it to simulate a wide range of molecular spectra, such as infrared, Raman and nuclear magnetic resonance. Beyond that, it is able to describe implicit and explicit molecular environments, operating as a polarizable continuum model for solvation or in a quantum mechanics/molecular mechanics setup. We employ FieldSchNet to study the influence of solvent effects on molecular spectra and a Claisen rearrangement reaction. Based on these results, we use FieldSchNet to design an external environment capable of lowering the activation barrier of the rearrangement reaction significantly, demonstrating promising venues for inverse chemical design.
Collapse
Affiliation(s)
- Michael Gastegger
- Machine Learning Group, Technische Universität Berlin 10587 Berlin Germany
| | - Kristof T Schütt
- Machine Learning Group, Technische Universität Berlin 10587 Berlin Germany
- Berlin Institute for the Foundations of Learning and Data 10587 Berlin Germany
| | - Klaus-Robert Müller
- Machine Learning Group, Technische Universität Berlin 10587 Berlin Germany
- Berlin Institute for the Foundations of Learning and Data 10587 Berlin Germany
- Department of Artificial Intelligence, Korea University Anam-dong, Seongbuk-gu Seoul 02841 Korea
- Max-Planck-Institut für Informatik 66123 Saarbrücken Germany
| |
Collapse
|
45
|
Deringer VL, Bartók AP, Bernstein N, Wilkins DM, Ceriotti M, Csányi G. Gaussian Process Regression for Materials and Molecules. Chem Rev 2021; 121:10073-10141. [PMID: 34398616 PMCID: PMC8391963 DOI: 10.1021/acs.chemrev.1c00022] [Citation(s) in RCA: 232] [Impact Index Per Article: 77.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2021] [Indexed: 12/18/2022]
Abstract
We provide an introduction to Gaussian process regression (GPR) machine-learning methods in computational materials science and chemistry. The focus of the present review is on the regression of atomistic properties: in particular, on the construction of interatomic potentials, or force fields, in the Gaussian Approximation Potential (GAP) framework; beyond this, we also discuss the fitting of arbitrary scalar, vectorial, and tensorial quantities. Methodological aspects of reference data generation, representation, and regression, as well as the question of how a data-driven model may be validated, are reviewed and critically discussed. A survey of applications to a variety of research questions in chemistry and materials science illustrates the rapid growth in the field. A vision is outlined for the development of the methodology in the years to come.
Collapse
Affiliation(s)
- Volker L. Deringer
- Department
of Chemistry, Inorganic Chemistry Laboratory, University of Oxford, Oxford OX1 3QR, United Kingdom
| | - Albert P. Bartók
- Department
of Physics and Warwick Centre for Predictive Modelling, School of
Engineering, University of Warwick, Coventry CV4 7AL, United Kingdom
| | - Noam Bernstein
- Center
for Computational Materials Science, U.S.
Naval Research Laboratory, Washington D.C. 20375, United States
| | - David M. Wilkins
- Atomistic
Simulation Centre, School of Mathematics and Physics, Queen’s University Belfast, Belfast BT7 1NN, Northern Ireland, United Kingdom
| | - Michele Ceriotti
- Laboratory
of Computational Science and Modeling, IMX, École Polytechnique Fédérale de Lausanne, Lausanne 1015, Switzerland
- National
Centre for Computational Design and Discovery of Novel Materials (MARVEL), École Polytechnique Fédérale
de Lausanne, Lausanne, Switzerland
| | - Gábor Csányi
- Engineering
Laboratory, University of Cambridge, Cambridge CB2 1PZ, United Kingdom
| |
Collapse
|
46
|
Unke O, Chmiela S, Sauceda HE, Gastegger M, Poltavsky I, Schütt KT, Tkatchenko A, Müller KR. Machine Learning Force Fields. Chem Rev 2021; 121:10142-10186. [PMID: 33705118 PMCID: PMC8391964 DOI: 10.1021/acs.chemrev.0c01111] [Citation(s) in RCA: 404] [Impact Index Per Article: 134.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2020] [Indexed: 12/27/2022]
Abstract
In recent years, the use of machine learning (ML) in computational chemistry has enabled numerous advances previously out of reach due to the computational complexity of traditional electronic-structure methods. One of the most promising applications is the construction of ML-based force fields (FFs), with the aim to narrow the gap between the accuracy of ab initio methods and the efficiency of classical FFs. The key idea is to learn the statistical relation between chemical structure and potential energy without relying on a preconceived notion of fixed chemical bonds or knowledge about the relevant interactions. Such universal ML approximations are in principle only limited by the quality and quantity of the reference data used to train them. This review gives an overview of applications of ML-FFs and the chemical insights that can be obtained from them. The core concepts underlying ML-FFs are described in detail, and a step-by-step guide for constructing and testing them from scratch is given. The text concludes with a discussion of the challenges that remain to be overcome by the next generation of ML-FFs.
Collapse
Affiliation(s)
- Oliver
T. Unke
- Machine
Learning Group, Technische Universität
Berlin, 10587 Berlin, Germany
- DFG
Cluster of Excellence “Unifying Systems in Catalysis”
(UniSysCat), Technische Universität Berlin, 10623 Berlin, Germany
| | - Stefan Chmiela
- Machine
Learning Group, Technische Universität
Berlin, 10587 Berlin, Germany
| | - Huziel E. Sauceda
- Machine
Learning Group, Technische Universität
Berlin, 10587 Berlin, Germany
- BASLEARN,
BASF-TU Joint Lab, Technische Universität
Berlin, 10587 Berlin, Germany
| | - Michael Gastegger
- Machine
Learning Group, Technische Universität
Berlin, 10587 Berlin, Germany
- DFG
Cluster of Excellence “Unifying Systems in Catalysis”
(UniSysCat), Technische Universität Berlin, 10623 Berlin, Germany
- BASLEARN,
BASF-TU Joint Lab, Technische Universität
Berlin, 10587 Berlin, Germany
| | - Igor Poltavsky
- Department
of Physics and Materials Science, University
of Luxembourg, L-1511 Luxembourg City, Luxembourg
| | - Kristof T. Schütt
- Machine
Learning Group, Technische Universität
Berlin, 10587 Berlin, Germany
| | - Alexandre Tkatchenko
- Department
of Physics and Materials Science, University
of Luxembourg, L-1511 Luxembourg City, Luxembourg
| | - Klaus-Robert Müller
- Machine
Learning Group, Technische Universität
Berlin, 10587 Berlin, Germany
- BIFOLD−Berlin
Institute for the Foundations of Learning and Data, Berlin, Germany
- Department
of Artificial Intelligence, Korea University, Anam-dong, Seongbuk-gu, Seoul 02841, Korea
- Max Planck
Institute for Informatics, Stuhlsatzenhausweg, 66123 Saarbrücken, Germany
- Google
Research, Brain Team, Berlin, Germany
| |
Collapse
|
47
|
Westermayr J, Gastegger M, Schütt KT, Maurer RJ. Perspective on integrating machine learning into computational chemistry and materials science. J Chem Phys 2021; 154:230903. [PMID: 34241249 DOI: 10.1063/5.0047760] [Citation(s) in RCA: 67] [Impact Index Per Article: 22.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
Abstract
Machine learning (ML) methods are being used in almost every conceivable area of electronic structure theory and molecular simulation. In particular, ML has become firmly established in the construction of high-dimensional interatomic potentials. Not a day goes by without another proof of principle being published on how ML methods can represent and predict quantum mechanical properties-be they observable, such as molecular polarizabilities, or not, such as atomic charges. As ML is becoming pervasive in electronic structure theory and molecular simulation, we provide an overview of how atomistic computational modeling is being transformed by the incorporation of ML approaches. From the perspective of the practitioner in the field, we assess how common workflows to predict structure, dynamics, and spectroscopy are affected by ML. Finally, we discuss how a tighter and lasting integration of ML methods with computational chemistry and materials science can be achieved and what it will mean for research practice, software development, and postgraduate training.
Collapse
Affiliation(s)
- Julia Westermayr
- Department of Chemistry, University of Warwick, Gibbet Hill Road, Coventry CV4 7AL, United Kingdom
| | - Michael Gastegger
- Machine Learning Group, Technische Universität Berlin, 10587 Berlin, Germany
| | - Kristof T Schütt
- Machine Learning Group, Technische Universität Berlin, 10587 Berlin, Germany
| | - Reinhard J Maurer
- Department of Chemistry, University of Warwick, Gibbet Hill Road, Coventry CV4 7AL, United Kingdom
| |
Collapse
|
48
|
Zhang L, Wang H, Car R, E W. Phase Diagram of a Deep Potential Water Model. PHYSICAL REVIEW LETTERS 2021; 126:236001. [PMID: 34170175 DOI: 10.1103/physrevlett.126.236001] [Citation(s) in RCA: 91] [Impact Index Per Article: 30.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/11/2021] [Accepted: 04/28/2021] [Indexed: 06/13/2023]
Abstract
Using the Deep Potential methodology, we construct a model that reproduces accurately the potential energy surface of the SCAN approximation of density functional theory for water, from low temperature and pressure to about 2400 K and 50 GPa, excluding the vapor stability region. The computational efficiency of the model makes it possible to predict its phase diagram using molecular dynamics. Satisfactory overall agreement with experimental results is obtained. The fluid phases, molecular and ionic, and all the stable ice polymorphs, ordered and disordered, are predicted correctly, with the exception of ice III and XV that are stable in experiments, but metastable in the model. The evolution of the atomic dynamics upon heating, as ice VII transforms first into ice VII^{''} and then into an ionic fluid, reveals that molecular dissociation and breaking of the ice rules coexist with strong covalent fluctuations, explaining why only partial ionization was inferred in experiments.
Collapse
Affiliation(s)
- Linfeng Zhang
- Program in Applied and Computational Mathematics, Princeton University, Princeton, New Jersey 08544, USA
| | - Han Wang
- Laboratory of Computational Physics, Institute of Applied Physics and Computational Mathematics, Fenghao East Road 2, Beijing 100094, People's Republic of China
| | - Roberto Car
- Department of Chemistry, Department of Physics, Program in Applied and Computational Mathematics, Princeton Institute for the Science and Technology of Materials, Princeton University, Princeton, New Jersey 08544, USA
| | - Weinan E
- Department of Mathematics and Program in Applied and Computational Mathematics, Princeton University, Princeton, New Jersey 08544, USA and Beijing Institute of Big Data Research, Beijing 100871, People's Republic of China
| |
Collapse
|
49
|
Krishnamoorthy A, Nomura KI, Baradwaj N, Shimamura K, Rajak P, Mishra A, Fukushima S, Shimojo F, Kalia R, Nakano A, Vashishta P. Dielectric Constant of Liquid Water Determined with Neural Network Quantum Molecular Dynamics. PHYSICAL REVIEW LETTERS 2021; 126:216403. [PMID: 34114857 DOI: 10.1103/physrevlett.126.216403] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/24/2020] [Accepted: 03/30/2021] [Indexed: 06/12/2023]
Abstract
The static dielectric constant ϵ_{0} and its temperature dependence for liquid water is investigated using neural network quantum molecular dynamics (NNQMD). We compute the exact dielectric constant in canonical ensemble from NNQMD trajectories using fluctuations in macroscopic polarization computed from maximally localized Wannier functions (MLWF). Two deep neural networks are constructed. The first, NNQMD, is trained on QMD configurations for liquid water under a variety of temperature and density conditions to learn potential energy surface and forces and then perform molecular dynamics simulations. The second network, NNMLWF, is trained to predict locations of MLWF of individual molecules using the atomic configurations from NNQMD. Training data for both the neural networks is produced using a highly accurate quantum-mechanical method, DFT-SCAN that yields an excellent description of liquid water. We produce 280×10^{6} configurations of water at 7 temperatures using NNQMD and predict MLWF centers using NNMLWF to compute the polarization fluctuations. The length of trajectories needed for a converged value of the dielectric constant at 0°C is found to be 20 ns (40×10^{6} configurations with 0.5 fs time step). The computed dielectric constants for 0, 15, 30, 45, 60, 75, and 90°C are in good agreement with experiments. Our scalable scheme to compute dielectric constants with quantum accuracy is also applicable to other polar molecular liquids.
Collapse
Affiliation(s)
- Aravind Krishnamoorthy
- Collaboratory for Advanced Computing and Simulations, Department of Chemical Engineering and Materials Science, Department of Physics & Astronomy, and Department of Computer Science, University of Southern California, Los Angeles, California 90089, USA
| | - Ken-Ichi Nomura
- Collaboratory for Advanced Computing and Simulations, Department of Chemical Engineering and Materials Science, Department of Physics & Astronomy, and Department of Computer Science, University of Southern California, Los Angeles, California 90089, USA
| | - Nitish Baradwaj
- Collaboratory for Advanced Computing and Simulations, Department of Chemical Engineering and Materials Science, Department of Physics & Astronomy, and Department of Computer Science, University of Southern California, Los Angeles, California 90089, USA
| | - Kohei Shimamura
- Department of Physics, Kumamoto University, Kumamoto 860-8555, Japan
| | - Pankaj Rajak
- Argonne National Laboratory, Lemont, Illinois 60439, USA
| | - Ankit Mishra
- Collaboratory for Advanced Computing and Simulations, Department of Chemical Engineering and Materials Science, Department of Physics & Astronomy, and Department of Computer Science, University of Southern California, Los Angeles, California 90089, USA
| | - Shogo Fukushima
- Department of Physics, Kumamoto University, Kumamoto 860-8555, Japan
| | - Fuyuki Shimojo
- Department of Physics, Kumamoto University, Kumamoto 860-8555, Japan
| | - Rajiv Kalia
- Collaboratory for Advanced Computing and Simulations, Department of Chemical Engineering and Materials Science, Department of Physics & Astronomy, and Department of Computer Science, University of Southern California, Los Angeles, California 90089, USA
| | - Aiichiro Nakano
- Collaboratory for Advanced Computing and Simulations, Department of Chemical Engineering and Materials Science, Department of Physics & Astronomy, and Department of Computer Science, University of Southern California, Los Angeles, California 90089, USA
| | - Priya Vashishta
- Collaboratory for Advanced Computing and Simulations, Department of Chemical Engineering and Materials Science, Department of Physics & Astronomy, and Department of Computer Science, University of Southern California, Los Angeles, California 90089, USA
| |
Collapse
|
50
|
Liu X, Zhang L, Liu J. Machine learning phase space quantum dynamics approaches. J Chem Phys 2021; 154:184104. [PMID: 34241027 DOI: 10.1063/5.0046689] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Derived from phase space expressions of the quantum Liouville theorem, equilibrium continuity dynamics is a category of trajectory-based phase space dynamics methods, which satisfies the two critical fundamental criteria: conservation of the quantum Boltzmann distribution for the thermal equilibrium system and being exact for any thermal correlation functions (even of nonlinear operators) in the classical and harmonic limits. The effective force and effective mass matrix are important elements in the equations of motion of equilibrium continuity dynamics, where only the zeroth term of an exact series expansion of the phase space propagator is involved. We introduce a machine learning approach for fitting these elements in quantum phase space, leading to a much more efficient integration of the equations of motion. Proof-of-concept applications to realistic molecules demonstrate that machine learning phase space dynamics approaches are possible as well as competent in producing reasonably accurate results with a modest computation effort.
Collapse
Affiliation(s)
- Xinzijian Liu
- Beijing National Laboratory for Molecular Sciences, Institute of Theoretical and Computational Chemistry, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
| | - Linfeng Zhang
- Beijing Institute of Big Data Research, Beijing 100871, China
| | - Jian Liu
- Beijing National Laboratory for Molecular Sciences, Institute of Theoretical and Computational Chemistry, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
| |
Collapse
|