1
|
Berger E, Niemelä J, Lampela O, Juffer AH, Komsa HP. Raman Spectra of Amino Acids and Peptides from Machine Learning Polarizabilities. J Chem Inf Model 2024; 64:4601-4612. [PMID: 38829726 DOI: 10.1021/acs.jcim.4c00077] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/05/2024]
Abstract
Raman spectroscopy is an important tool in the study of vibrational properties and composition of molecules, peptides, and even proteins. Raman spectra can be simulated based on the change of the electronic polarizability with vibrations, which can nowadays be efficiently obtained via machine learning models trained on first-principles data. However, the transferability of the models trained on small molecules to larger structures is unclear, and direct training on large structures is prohibitively expensive. In this work, we first train two machine learning models to predict the polarizabilities of all 20 amino acids. Both models are carefully benchmarked and compared to density functional theory (DFT) calculations, with the neural network method being found to offer better transferability. By combination of machine learning models with classical force field molecular dynamics, Raman spectra of all amino acids are also obtained and investigated, showing good agreement with experiments. The models are further extended to small peptides. We find that adding structures containing peptide bonds to the training set greatly improves predictions, even for peptides not included in training sets.
Collapse
Affiliation(s)
- Ethan Berger
- Microelectronics Research Unit, Faculty of Information Technology and Electrical Engineering, University of Oulu, P.O. Box 4500, Oulu FIN-90014, Finland
| | - Juha Niemelä
- Faculty of Biochemistry and Molecular Medicine, University of Oulu, Oulu FIN-90014, Finland
| | - Outi Lampela
- Biocenter Oulu and Faculty of Biochemistry and Molecular Medicine, University of Oulu, Oulu FIN-90014, Finland
| | - André H Juffer
- Biocenter Oulu and Faculty of Biochemistry and Molecular Medicine, University of Oulu, Oulu FIN-90014, Finland
| | - Hannu-Pekka Komsa
- Microelectronics Research Unit, Faculty of Information Technology and Electrical Engineering, University of Oulu, P.O. Box 4500, Oulu FIN-90014, Finland
| |
Collapse
|
2
|
Jana A, Shepherd S, Litman Y, Wilkins DM. Learning Electronic Polarizations in Aqueous Systems. J Chem Inf Model 2024; 64:4426-4435. [PMID: 38804973 PMCID: PMC11167596 DOI: 10.1021/acs.jcim.4c00421] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2024] [Revised: 05/10/2024] [Accepted: 05/14/2024] [Indexed: 05/29/2024]
Abstract
The polarization of periodically repeating systems is a discontinuous function of the atomic positions, a fact which seems at first to stymie attempts at their statistical learning. Two approaches to build models for bulk polarizations are compared: one in which a simple point charge model is used to preprocess the raw polarization to give a learning target that is a smooth function of atomic positions and the total polarization is learned as a sum of atom-centered dipoles and one in which instead the average position of Wannier centers around atoms is predicted. For a range of bulk aqueous systems, both of these methods perform perform comparatively well, with the former being slightly better but often requiring an extra effort to find a suitable point charge model. As a challenging test, we also analyze the performance of the models at the air-water interface. In this case, while the Wannier center approach delivers accurate predictions without further modifications, the preprocessing method requires augmentation with information from isolated water molecules to reach similar accuracy. Finally, we present a simple protocol to preprocess the polarizations in a data-driven way using a small number of derivatives calculated at a much lower level of theory, thus overcoming the need to find point charge models without appreciably increasing the computation cost. We believe that the training strategies presented here help the construction of accurate polarization models required for the study of the dielectric properties of realistic complex bulk systems and interfaces with ab initio accuracy.
Collapse
Affiliation(s)
- Arnab Jana
- Centre
for Quantum Materials and Technologies, School of Mathematics and
Physics, Queen’s University Belfast, Belfast BT7 1NN, U.K.
| | - Sam Shepherd
- Centre
for Quantum Materials and Technologies, School of Mathematics and
Physics, Queen’s University Belfast, Belfast BT7 1NN, U.K.
| | - Yair Litman
- Yusuf
Hamied Department of Chemistry, University
of Cambridge, Lensfield Road, Cambridge CB2 1EW, U.K.
| | - David M. Wilkins
- Centre
for Quantum Materials and Technologies, School of Mathematics and
Physics, Queen’s University Belfast, Belfast BT7 1NN, U.K.
| |
Collapse
|
3
|
Althorpe SC. Path Integral Simulations of Condensed-Phase Vibrational Spectroscopy. Annu Rev Phys Chem 2024; 75:397-420. [PMID: 38941531 DOI: 10.1146/annurev-physchem-090722-124705] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/30/2024]
Abstract
Recent theoretical and algorithmic developments have improved the accuracy with which path integral dynamics methods can include nuclear quantum effects in simulations of condensed-phase vibrational spectra. Such methods are now understood to be approximations to the delocalized classical Matsubara dynamics of smooth Feynman paths, which dominate the dynamics of systems such as liquid water at room temperature. Focusing mainly on simulations of liquid water and hexagonal ice, we explain how the recently developed quasicentroid molecular dynamics (QCMD), fast-QCMD, and temperature-elevated path integral coarse-graining simulations (Te PIGS) methods generate classical dynamics on potentials of mean force obtained by averaging over quantum thermal fluctuations. These new methods give very close agreement with one another, and the Te PIGS method has recently yielded excellent agreement with experimentally measured vibrational spectra for liquid water, ice, and the liquid-air interface. We also discuss the limitations of such methods.
Collapse
Affiliation(s)
- Stuart C Althorpe
- Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge, United Kingdom;
| |
Collapse
|
4
|
Xu N, Rosander P, Schäfer C, Lindgren E, Österbacka N, Fang M, Chen W, He Y, Fan Z, Erhart P. Tensorial Properties via the Neuroevolution Potential Framework: Fast Simulation of Infrared and Raman Spectra. J Chem Theory Comput 2024; 20:3273-3284. [PMID: 38572734 PMCID: PMC11044275 DOI: 10.1021/acs.jctc.3c01343] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2023] [Revised: 03/19/2024] [Accepted: 04/01/2024] [Indexed: 04/05/2024]
Abstract
Infrared and Raman spectroscopy are widely used for the characterization of gases, liquids, and solids, as the spectra contain a wealth of information concerning, in particular, the dynamics of these systems. Atomic scale simulations can be used to predict such spectra but are often severely limited due to high computational cost or the need for strong approximations that limit the application range and reliability. Here, we introduce a machine learning (ML) accelerated approach that addresses these shortcomings and provides a significant performance boost in terms of data and computational efficiency compared with earlier ML schemes. To this end, we generalize the neuroevolution potential approach to enable the prediction of rank one and two tensors to obtain the tensorial neuroevolution potential (TNEP) scheme. We apply the resulting framework to construct models for the dipole moment, polarizability, and susceptibility of molecules, liquids, and solids and show that our approach compares favorably with several ML models from the literature with respect to accuracy and computational efficiency. Finally, we demonstrate the application of the TNEP approach to the prediction of infrared and Raman spectra of liquid water, a molecule (PTAF-), and a prototypical perovskite with strong anharmonicity (BaZrO3). The TNEP approach is implemented in the free and open source software package gpumd, which makes this methodology readily available to the scientific community.
Collapse
Affiliation(s)
- Nan Xu
- Institute
of Zhejiang University-Quzhou, Quzhou 324000, P. R. China
- College
of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310058, P. R. China
| | - Petter Rosander
- Department
of Physics, Chalmers University of Technology, SE-41296 Gothenburg, Sweden
| | - Christian Schäfer
- Department
of Physics, Chalmers University of Technology, SE-41296 Gothenburg, Sweden
| | - Eric Lindgren
- Department
of Physics, Chalmers University of Technology, SE-41296 Gothenburg, Sweden
| | - Nicklas Österbacka
- Department
of Physics, Chalmers University of Technology, SE-41296 Gothenburg, Sweden
| | - Mandi Fang
- Institute
of Zhejiang University-Quzhou, Quzhou 324000, P. R. China
- College
of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310058, P. R. China
| | - Wei Chen
- State
Key Laboratory of Multiphase Complex Systems, Institute of Process
Engineering, Chinese Academy of Sciences, Beijing 100190, P. R. China
| | - Yi He
- Institute
of Zhejiang University-Quzhou, Quzhou 324000, P. R. China
- College
of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310058, P. R. China
- Department
of Chemical Engineering, University of Washington, Seattle, Washington 98195, United States
| | - Zheyong Fan
- College
of Physical Science and Technology, Bohai
University, Jinzhou 121013, P. R. China
| | - Paul Erhart
- Department
of Physics, Chalmers University of Technology, SE-41296 Gothenburg, Sweden
| |
Collapse
|
5
|
Montero de Hijes P, Dellago C, Jinnouchi R, Schmiedmayer B, Kresse G. Comparing machine learning potentials for water: Kernel-based regression and Behler-Parrinello neural networks. J Chem Phys 2024; 160:114107. [PMID: 38506284 DOI: 10.1063/5.0197105] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2024] [Accepted: 03/03/2024] [Indexed: 03/21/2024] Open
Abstract
In this paper, we investigate the performance of different machine learning potentials (MLPs) in predicting key thermodynamic properties of water using RPBE + D3. Specifically, we scrutinize kernel-based regression and high-dimensional neural networks trained on a highly accurate dataset consisting of about 1500 structures, as well as a smaller dataset, about half the size, obtained using only on-the-fly learning. This study reveals that despite minor differences between the MLPs, their agreement on observables such as the diffusion constant and pair-correlation functions is excellent, especially for the large training dataset. Variations in the predicted density isobars, albeit somewhat larger, are also acceptable, particularly given the errors inherent to approximate density functional theory. Overall, this study emphasizes the relevance of the database over the fitting method. Finally, this study underscores the limitations of root mean square errors and the need for comprehensive testing, advocating the use of multiple MLPs for enhanced certainty, particularly when simulating complex thermodynamic properties that may not be fully captured by simpler tests.
Collapse
Affiliation(s)
- Pablo Montero de Hijes
- University of Vienna, Faculty of Physics, Kolingasse 14, A-1090 Vienna, Austria
- University of Vienna, Faculty of Earth Sciences, Geography and Astronomy, Josef-Holaubuek-Platz 2, 1090 Vienna, Austria
| | - Christoph Dellago
- University of Vienna, Faculty of Physics, Kolingasse 14, A-1090 Vienna, Austria
| | - Ryosuke Jinnouchi
- Toyota Central R&D Labs., Inc., 41-1 Yokomichi, Nagakute, Aichi 480-1192, Japan
| | | | - Georg Kresse
- University of Vienna, Faculty of Physics, Kolingasse 14, A-1090 Vienna, Austria
- VASP Software GmbH, Berggasse 21, A-1090 Vienna, Austria
| |
Collapse
|
6
|
Kapil V, Kovács DP, Csányi G, Michaelides A. First-principles spectroscopy of aqueous interfaces using machine-learned electronic and quantum nuclear effects. Faraday Discuss 2024; 249:50-68. [PMID: 37799072 PMCID: PMC10845015 DOI: 10.1039/d3fd00113j] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2023] [Accepted: 07/18/2023] [Indexed: 10/07/2023]
Abstract
Vibrational spectroscopy is a powerful approach to visualising interfacial phenomena. However, extracting structural and dynamical information from vibrational spectra is a challenge that requires first-principles simulations, including non-Condon and quantum nuclear effects. We address this challenge by developing a machine-learning enhanced first-principles framework to speed up predictive modelling of infrared, Raman, and sum-frequency generation spectra. Our approach uses machine learning potentials that encode quantum nuclear effects to generate quantum trajectories using simple molecular dynamics efficiently. In addition, we reformulate bulk and interfacial selection rules to express them unambiguously in terms of the derivatives of polarisation and polarisabilities of the whole system and predict these derivatives efficiently using fully-differentiable machine learning models of dielectric response tensors. We demonstrate our framework's performance by predicting the IR, Raman, and sum-frequency generation spectra of liquid water, ice and the water-air interface by achieving near quantitative agreement with experiments at nearly the same computational efficiency as pure classical methods. Finally, to aid the experimental discovery of new phases of nanoconfined water, we predict the temperature-dependent vibrational spectra of monolayer water across the solid-hexatic-liquid phases transition.
Collapse
Affiliation(s)
- Venkat Kapil
- Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, UK.
| | | | - Gábor Csányi
- Engineering Laboratory, University of Cambridge, Cambridge, CB2 1PZ, UK
| | - Angelos Michaelides
- Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, UK.
| |
Collapse
|
7
|
LaCour RA, Heindel JP, Head-Gordon T. Predicting the Raman Spectra of Liquid Water with a Monomer-Field Model. J Phys Chem Lett 2023; 14:11742-11749. [PMID: 38116782 DOI: 10.1021/acs.jpclett.3c02873] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2023]
Abstract
The Raman spectrum of liquid water is quite complex, reflecting its strong sensitivity to the local environment of the individual waters. The OH-stretch region of the spectrum, which captures the influence of hydrogen bonding, has only just begun to be unraveled. Here we develop a model for predicting the Raman spectra of the OH-stretch region by considering how local electric fields distort the energy surface of each water monomer. We find that our model is capable of reproducing the bimodal nature of the main peak, with the shoulder at 3250 cm-1 resulting almost entirely from Fermi resonance. Furthermore, we capture the temperature and polarization dependence of the shoulder, which has proven to be difficult to obtain with previous methods, and analyze the origin of this dependence. We expect our model to be generally useful for understanding and predicting how Raman spectra change under different conditions and with different probe reporters beyond water.
Collapse
Affiliation(s)
- R Allen LaCour
- Kenneth S. Pitzer Theory Center and Department of Chemistry, University of California, Berkeley, California 94720, United States
- Chemical Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States
| | - Joseph P Heindel
- Kenneth S. Pitzer Theory Center and Department of Chemistry, University of California, Berkeley, California 94720, United States
- Chemical Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States
| | - Teresa Head-Gordon
- Kenneth S. Pitzer Theory Center and Department of Chemistry, University of California, Berkeley, California 94720, United States
- Chemical Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States
- Departments of Bioengineering and Chemical and Biomolecular Engineering, University of California, Berkeley, California 94720, United States
| |
Collapse
|
8
|
Inoue K, Litman Y, Wilkins DM, Nagata Y, Okuno M. Is Unified Understanding of Vibrational Coupling of Water Possible? Hyper-Raman Measurement and Machine Learning Spectra. J Phys Chem Lett 2023; 14:3063-3068. [PMID: 36947156 DOI: 10.1021/acs.jpclett.3c00398] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
The impact of the vibrational coupling of the OH stretch mode on the spectra differs significantly between IR and Raman spectra of water. Unified understanding of the vibrational couplings is not yet achieved. By using a different class of vibrational spectroscopy, hyper-Raman (HR) spectroscopy, together with machine-learning-assisted HR spectra calculation, we examine the impact of the vibrational couplings of water through the comparison of isotopically diluted H2O and pure H2O. We found that the isotopic dilution reduces the HR bandwidths, but the impact of the vibrational coupling is smaller than in the IR and parallel-polarized Raman. Machine learning HR spectra indicate that the intermolecular coupling plays a major role in broadening the bandwidth, while the intramolecular coupling is negligibly small, which is consistent with the IR and Raman spectra. Our result clearly demonstrates a limited impact of the intramolecular vibration, independent of the selection rules of vibrational spectroscopies.
Collapse
Affiliation(s)
- Kazuki Inoue
- Department of Basic Science, Graduate School of Arts and Sciences, The University of Tokyo, 3-8-1 Komaba, Meguro, Tokyo 153-8902, Japan
| | - Yair Litman
- Max Planck Institute for Polymer Research, Ackermannweg 10, 55128 Mainz, Germany
- Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - David M Wilkins
- Atomistic Simulation Centre, School of Mathematics and Physics, Queen's University Belfast, Belfast BT7 1NN, Northern Ireland, United Kingdom
| | - Yuki Nagata
- Max Planck Institute for Polymer Research, Ackermannweg 10, 55128 Mainz, Germany
| | - Masanari Okuno
- Department of Basic Science, Graduate School of Arts and Sciences, The University of Tokyo, 3-8-1 Komaba, Meguro, Tokyo 153-8902, Japan
| |
Collapse
|
9
|
Schienbein P. Spectroscopy from Machine Learning by Accurately Representing the Atomic Polar Tensor. J Chem Theory Comput 2023; 19:705-712. [PMID: 36695707 PMCID: PMC9933433 DOI: 10.1021/acs.jctc.2c00788] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
Vibrational spectroscopy is a key technique to elucidate microscopic structure and dynamics. Without the aid of theoretical approaches, it is, however, often difficult to understand such spectra at a microscopic level. Ab initio molecular dynamics has repeatedly proved to be suitable for this purpose; however, the computational cost can be daunting. Here, the E(3)-equivariant neural network e3nn is used to fit the atomic polar tensor of liquid water a posteriori on top of existing molecular dynamics simulations. Notably, the introduced methodology is general and thus transferable to any other system as well. The target property is most fundamental and gives access to the IR spectrum, and more importantly, it is a highly powerful tool to directly assign IR spectral features to nuclear motion─a connection which has been pursued in the past but only using severe approximations due to the prohibitive computational cost. The herein introduced methodology overcomes this bottleneck. To benchmark the machine learning model, the IR spectrum of liquid water is calculated, indeed showing excellent agreement with the explicit reference calculation. In conclusion, the presented methodology gives a new route to calculate accurate IR spectra from molecular dynamics simulations and will facilitate the understanding of such spectra on a microscopic level.
Collapse
|
10
|
Shiga M. Path integral Brownian chain molecular dynamics: A simple approximation of quantum vibrational dynamics. J Comput Chem 2022; 43:1864-1879. [PMID: 36094104 DOI: 10.1002/jcc.26989] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2022] [Revised: 08/06/2022] [Accepted: 08/09/2022] [Indexed: 11/05/2022]
Abstract
An approximate approach to quantum vibrational dynamics, "Brownian chain molecular dynamics (BCMD)," is proposed to alleviate the chain resonance and curvature problems in the imaginary time-based path integral (PI) simulation. Here the non-centroid velocity is randomized at each step when solving the equation of motion of path integral molecular dynamics. This leads to a combination of the Newton equation and the overdamped Langevin equation for the centroid and non-centroid variables, respectively. BCMD shares the basic properties of other PI approaches such as centroid and ring polymer molecular dynamics: It gives the correct Kubo-transformed correlation function at short times, conserves the time symmetry, has the correct high-temperature/classical limits, gives exactly the position and velocity autocorrelations of harmonic oscillator systems, and does not have the zero-point leakage problem. Numerical tests were done on simple molecular models and liquid water. On-the-fly ab initio BCMD simulations were performed for the protonated water cluster, H 5 O 2 + $$ {\mathrm{H}}_5{\mathrm{O}}_2^{+} $$ , and its isotopologue, D 5 O 2 + $$ {\mathrm{D}}_5{\mathrm{O}}_2^{+} $$ .
Collapse
Affiliation(s)
- Motoyuki Shiga
- Center for Computational Science and e-Systems, Japan Atomic Energy Agency, Chiba, Japan
| |
Collapse
|
11
|
Simulation of Nuclear Quantum Effects in Condensed Matter Systems via Quantum Baths. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12094756] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/10/2022]
Abstract
This paper reviews methods that aim at simulating nuclear quantum effects (NQEs) using generalized thermal baths. Generalized (or quantum) baths simulate statistical quantum features, and in particular zero-point energy effects, through non-Markovian stochastic dynamics. They make use of generalized Langevin Equations (GLEs), in which the quantum Bose–Einstein energy distribution is enforced by tuning the random and friction forces, while the system degrees of freedom remain classical. Although these baths have been formally justified only for harmonic oscillators, they perform well for several systems, while keeping the cost of the simulations comparable to the classical ones. We review the formal properties and main characteristics of classical and quantum GLEs, in relation with the fluctuation–dissipation theorems. Then, we describe the quantum thermostat and quantum thermal bath, the two generalized baths currently most used, providing several examples of applications for condensed matter systems, including the calculation of vibrational spectra. The most important drawback of these methods, zero-point energy leakage, is discussed in detail with the help of model systems, and a recently proposed scheme to monitor and mitigate or eliminate it—the adaptive quantum thermal bath—is summarised. This approach considerably extends the domain of application of generalized baths, leading, for instance, to the successful simulation of liquid water, where a subtle interplay of NQEs is at play. The paper concludes by overviewing further development opportunities and open challenges of generalized baths.
Collapse
|
12
|
Kurapothula PKJ, Shepherd S, Wilkins DM. Hydrogen-bonding and nuclear quantum effects in clays. J Chem Phys 2022; 156:084702. [DOI: 10.1063/5.0083075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Hydrogen bonds are of paramount importance in the chemistry of clays, mediating the interaction between the clay surface and water, and for some materials between separate layers. It is well-established that the accuracy of a computational model for clays depends on the level of theory at which the electronic structure is treated. However, for hydrogen-bonded systems, the motion of light H nuclei on the electronic potential energy surface is often affected by quantum delocalization. Using path integral molecular dynamics, we show that nuclear quantum effects lead to a relatively small change in the structure of clays, but one that is comparable to the variation incurred by treating the clay at different levels of electronic structure theory. Accounting for quantum effects weakens the hydrogen bonds in clays, with H-bonds between different layers of the clay affected more than those within the same layer; this is ascribed to the fact that the confinement of an H atom inside a layer is independent of its participation in hydrogen-bonding. More importantly, the weakening of hydrogen bonds by nuclear quantum effects causes changes in the vibrational spectra of these systems, significantly shifting the O–H stretching peaks and meaning that in order to fully understand these spectra by computational modeling, both electronic and nuclear quantum effects must be included. We show that after reparameterization of the popular clay forcefield CLAYFF, the O–H stretching region of their vibrational spectra better matches the experimental one, with no detriment to the model’s agreement with other experimental properties.
Collapse
Affiliation(s)
- Pawan K. J. Kurapothula
- Atomistic Simulation Centre, School of Mathematics and Physics, Queen’s University Belfast, Belfast BT7 1NN, Northern Ireland, United Kingdom
| | - Sam Shepherd
- Atomistic Simulation Centre, School of Mathematics and Physics, Queen’s University Belfast, Belfast BT7 1NN, Northern Ireland, United Kingdom
| | - David M. Wilkins
- Atomistic Simulation Centre, School of Mathematics and Physics, Queen’s University Belfast, Belfast BT7 1NN, Northern Ireland, United Kingdom
| |
Collapse
|
13
|
Zhang X, Zhou S, Leonik FM, Wang L, Kuroda DG. Quantum mechanical effects in acid–base chemistry. Chem Sci 2022; 13:6998-7006. [PMID: 35774178 PMCID: PMC9200130 DOI: 10.1039/d2sc01784a] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2022] [Accepted: 05/17/2022] [Indexed: 01/04/2023] Open
Abstract
Acid–base chemistry has immense importance for explaining and predicting the chemical products formed by an acid and a base when mixed together. However, the traditional chemistry theories used to describe acid–base reactions do not take into account the effect arising from the quantum mechanical nature of the acidic hydrogen shuttling potential and its dependence on the acid base distance. Here, infrared and NMR spectroscopies, in combination with first principles simulations, are performed to demonstrate that quantum mechanical effects, including electronic and nuclear quantum effects, play an essential role in defining the acid–base chemistry when 1-methylimidazole and acetic acid are mixed together. In particular, it is observed that the acid and the base interact to form a complex containing a strong hydrogen bond, in which the acidic hydrogen atom is neither close to the acid nor to the base, but delocalized between them. In addition, the delocalization of the acidic hydrogen atom in the complex leads to characteristic IR and NMR signatures. The presence of a hydrogen delocalized state in this simple system challenges the conventional knowledge of acid–base chemistry and opens up new avenues for designing materials in which specific properties produced by the hydrogen delocalized state can be harvested. Acid-based theories do not consider the quantum mechanical nature of the acidic hydrogen shuttling potential. Here, it is demonstrated that this particularity is needed to explain the formation acid-base complex with a delocalized acidic hydrogen.![]()
Collapse
Affiliation(s)
- Xiaoliu Zhang
- Department of Chemistry, Louisiana State University, Baton Rouge, Louisiana 70803, USA
| | - Shengmin Zhou
- Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey 08854, USA
| | - Fedra M. Leonik
- Department of Chemistry, Louisiana State University, Baton Rouge, Louisiana 70803, USA
| | - Lu Wang
- Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey 08854, USA
| | - Daniel G. Kuroda
- Department of Chemistry, Louisiana State University, Baton Rouge, Louisiana 70803, USA
| |
Collapse
|
14
|
Shepherd S, Lan J, Wilkins DM, Kapil V. Efficient Quantum Vibrational Spectroscopy of Water with High-Order Path Integrals: From Bulk to Interfaces. J Phys Chem Lett 2021; 12:9108-9114. [PMID: 34523941 DOI: 10.1021/acs.jpclett.1c02574] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Vibrational spectroscopy is key in probing the interplay between the structure and dynamics of aqueous systems. To map different regions of experimental spectra to the microscopic structure of a system, it is important to combine them with first-principles atomistic simulations that incorporate the quantum nature of nuclei. Here we show that the large cost of calculating the quantum vibrational spectra of aqueous systems can be dramatically reduced compared with standard path integral methods by using approximate quantum dynamics based on high-order path integrals. Together with state-of-the-art machine-learned electronic properties, our approach gives an excellent description not only of the infrared and Raman spectra of bulk water but also of the 2D correlation and the more challenging sum-frequency generation spectra of the water-air interface. This paves the way for understanding complex interfaces such as water encapsulated between or in contact with hydrophobic and hydrophilic materials through robust and inexpensive surface-sensitive and multidimensional spectra with first-principles accuracy.
Collapse
Affiliation(s)
- Sam Shepherd
- Atomistic Simulation Centre, School of Mathematics and Physics, Queen's University Belfast, Belfast BT7 1NN, Northern Ireland, United Kingdom
| | - Jinggang Lan
- Department of Chemistry, University of Zürich, Zürich 8057, Switzerland
| | - David M Wilkins
- Atomistic Simulation Centre, School of Mathematics and Physics, Queen's University Belfast, Belfast BT7 1NN, Northern Ireland, United Kingdom
| | - Venkat Kapil
- Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW,United Kingdom
| |
Collapse
|
15
|
Benson RL, Althorpe SC. On the "Matsubara heating" of overtone intensities and Fermi splittings. J Chem Phys 2021; 155:104107. [PMID: 34525826 DOI: 10.1063/5.0056829] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Classical molecular dynamics (MD) and imaginary-time path-integral dynamics methods underestimate the infrared absorption intensities of overtone and combination bands by typically an order of magnitude. Plé et al. [J. Chem. Phys. 155, 2863 (2021)] have shown that this is because such methods fail to describe the coupling of the centroid to the Matsubara dynamics of the fluctuation modes; classical first-order perturbation theory (PT) applied to the Matsubara dynamics is sufficient to recover most of the lost intensity in simple models and gives identical results to quantum (Rayleigh-Schrödinger) PT. Here, we show numerically that the results of this analysis can be used as post-processing correction factors, which can be applied to realistic (classical MD or path-integral dynamics) simulations of infrared spectra. We find that the correction factors recover most of the lost intensity in the overtone and combination bands of gas-phase water and ammonia and much of it for liquid water. We then re-derive and confirm the earlier PT analysis by applying canonical PT to Matsubara dynamics, which has the advantage of avoiding secular terms and gives a simple picture of the perturbed Matsubara dynamics in terms of action-angle variables. Collectively, these variables "Matsubara heat" the amplitudes of the overtone and combination vibrations of the centroid to what they would be in a classical system with the oscillators (of frequency Ωi) held at their quantum effective temperatures [of ℏΩi coth(βℏΩi/2)/2kB]. Numerical calculations show that a similar neglect of "Matsubara heating" causes path-integral methods to underestimate Fermi resonance splittings.
Collapse
Affiliation(s)
- Raz L Benson
- Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - Stuart C Althorpe
- Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| |
Collapse
|
16
|
Deringer VL, Bartók AP, Bernstein N, Wilkins DM, Ceriotti M, Csányi G. Gaussian Process Regression for Materials and Molecules. Chem Rev 2021; 121:10073-10141. [PMID: 34398616 PMCID: PMC8391963 DOI: 10.1021/acs.chemrev.1c00022] [Citation(s) in RCA: 215] [Impact Index Per Article: 71.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2021] [Indexed: 12/18/2022]
Abstract
We provide an introduction to Gaussian process regression (GPR) machine-learning methods in computational materials science and chemistry. The focus of the present review is on the regression of atomistic properties: in particular, on the construction of interatomic potentials, or force fields, in the Gaussian Approximation Potential (GAP) framework; beyond this, we also discuss the fitting of arbitrary scalar, vectorial, and tensorial quantities. Methodological aspects of reference data generation, representation, and regression, as well as the question of how a data-driven model may be validated, are reviewed and critically discussed. A survey of applications to a variety of research questions in chemistry and materials science illustrates the rapid growth in the field. A vision is outlined for the development of the methodology in the years to come.
Collapse
Affiliation(s)
- Volker L. Deringer
- Department
of Chemistry, Inorganic Chemistry Laboratory, University of Oxford, Oxford OX1 3QR, United Kingdom
| | - Albert P. Bartók
- Department
of Physics and Warwick Centre for Predictive Modelling, School of
Engineering, University of Warwick, Coventry CV4 7AL, United Kingdom
| | - Noam Bernstein
- Center
for Computational Materials Science, U.S.
Naval Research Laboratory, Washington D.C. 20375, United States
| | - David M. Wilkins
- Atomistic
Simulation Centre, School of Mathematics and Physics, Queen’s University Belfast, Belfast BT7 1NN, Northern Ireland, United Kingdom
| | - Michele Ceriotti
- Laboratory
of Computational Science and Modeling, IMX, École Polytechnique Fédérale de Lausanne, Lausanne 1015, Switzerland
- National
Centre for Computational Design and Discovery of Novel Materials (MARVEL), École Polytechnique Fédérale
de Lausanne, Lausanne, Switzerland
| | - Gábor Csányi
- Engineering
Laboratory, University of Cambridge, Cambridge CB2 1PZ, United Kingdom
| |
Collapse
|
17
|
Musil F, Grisafi A, Bartók AP, Ortner C, Csányi G, Ceriotti M. Physics-Inspired Structural Representations for Molecules and Materials. Chem Rev 2021; 121:9759-9815. [PMID: 34310133 DOI: 10.1021/acs.chemrev.1c00021] [Citation(s) in RCA: 135] [Impact Index Per Article: 45.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
The first step in the construction of a regression model or a data-driven analysis, aiming to predict or elucidate the relationship between the atomic-scale structure of matter and its properties, involves transforming the Cartesian coordinates of the atoms into a suitable representation. The development of atomic-scale representations has played, and continues to play, a central role in the success of machine-learning methods for chemistry and materials science. This review summarizes the current understanding of the nature and characteristics of the most commonly used structural and chemical descriptions of atomistic structures, highlighting the deep underlying connections between different frameworks and the ideas that lead to computationally efficient and universally applicable models. It emphasizes the link between properties, structures, their physical chemistry, and their mathematical description, provides examples of recent applications to a diverse set of chemical and materials science problems, and outlines the open questions and the most promising research directions in the field.
Collapse
Affiliation(s)
- Felix Musil
- Laboratory of Computational Science and Modeling, IMX, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland.,National Centre for Computational Design and Discovery of Novel Materials (MARVEL), École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Andrea Grisafi
- Laboratory of Computational Science and Modeling, IMX, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Albert P Bartók
- Department of Physics and Warwick Centre for Predictive Modelling, School of Engineering, University of Warwick, Coventry CV4 7AL, United Kingdom
| | - Christoph Ortner
- University of British Columbia, Vancouver, British Columbia V6T 1Z2, Canada
| | - Gábor Csányi
- Engineering Laboratory, University of Cambridge, Trumpington Street, Cambridge CB2 1PZ, United Kingdom
| | - Michele Ceriotti
- Laboratory of Computational Science and Modeling, IMX, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland.,National Centre for Computational Design and Discovery of Novel Materials (MARVEL), École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| |
Collapse
|
18
|
Morawietz T, Artrith N. Machine learning-accelerated quantum mechanics-based atomistic simulations for industrial applications. J Comput Aided Mol Des 2021; 35:557-586. [PMID: 33034008 PMCID: PMC8018928 DOI: 10.1007/s10822-020-00346-6] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2020] [Accepted: 09/26/2020] [Indexed: 01/13/2023]
Abstract
Atomistic simulations have become an invaluable tool for industrial applications ranging from the optimization of protein-ligand interactions for drug discovery to the design of new materials for energy applications. Here we review recent advances in the use of machine learning (ML) methods for accelerated simulations based on a quantum mechanical (QM) description of the system. We show how recent progress in ML methods has dramatically extended the applicability range of conventional QM-based simulations, allowing to calculate industrially relevant properties with enhanced accuracy, at reduced computational cost, and for length and time scales that would have otherwise not been accessible. We illustrate the benefits of ML-accelerated atomistic simulations for industrial R&D processes by showcasing relevant applications from two very different areas, drug discovery (pharmaceuticals) and energy materials. Writing from the perspective of both a molecular and a materials modeling scientist, this review aims to provide a unified picture of the impact of ML-accelerated atomistic simulations on the pharmaceutical, chemical, and materials industries and gives an outlook on the exciting opportunities that could emerge in the future.
Collapse
Affiliation(s)
- Tobias Morawietz
- Bayer AG, Pharmaceuticals, R&D, Digital Technologies, Computational Molecular Design, 42096 Wuppertal, Germany
| | - Nongnuch Artrith
- Department of Chemical Engineering, Columbia University, New York, NY 10027 USA
| |
Collapse
|
19
|
Imbalzano G, Zhuang Y, Kapil V, Rossi K, Engel EA, Grasselli F, Ceriotti M. Uncertainty estimation for molecular dynamics and sampling. J Chem Phys 2021; 154:074102. [DOI: 10.1063/5.0036522] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Affiliation(s)
- Giulio Imbalzano
- Laboratory of Computational Science and Modeling, IMX, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Yongbin Zhuang
- State Key Laboratory of Physical Chemistry of Solid Surfaces, Collaborative Innovation Center of Chemistry for Energy Materials, Xiamen University, Xiamen 361005, China
| | - Venkat Kapil
- Laboratory of Computational Science and Modeling, IMX, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
- Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - Kevin Rossi
- Laboratory of Computational Science and Modeling, IMX, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
- Laboratory of Nanochemistry for Energy, ISIC, École Polytechnique Fédérale de Lausanne, 1950 Sion, Switzerland
| | - Edgar A. Engel
- Laboratory of Computational Science and Modeling, IMX, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Federico Grasselli
- Laboratory of Computational Science and Modeling, IMX, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Michele Ceriotti
- Laboratory of Computational Science and Modeling, IMX, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| |
Collapse
|
20
|
Gkeka P, Stoltz G, Barati Farimani A, Belkacemi Z, Ceriotti M, Chodera JD, Dinner AR, Ferguson AL, Maillet JB, Minoux H, Peter C, Pietrucci F, Silveira A, Tkatchenko A, Trstanova Z, Wiewiora R, Lelièvre T. Machine Learning Force Fields and Coarse-Grained Variables in Molecular Dynamics: Application to Materials and Biological Systems. J Chem Theory Comput 2020; 16:4757-4775. [PMID: 32559068 PMCID: PMC8312194 DOI: 10.1021/acs.jctc.0c00355] [Citation(s) in RCA: 82] [Impact Index Per Article: 20.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Machine learning encompasses tools and algorithms that are now becoming popular in almost all scientific and technological fields. This is true for molecular dynamics as well, where machine learning offers promises of extracting valuable information from the enormous amounts of data generated by simulation of complex systems. We provide here a review of our current understanding of goals, benefits, and limitations of machine learning techniques for computational studies on atomistic systems, focusing on the construction of empirical force fields from ab initio databases and the determination of reaction coordinates for free energy computation and enhanced sampling.
Collapse
Affiliation(s)
- Paraskevi Gkeka
- Integrated Drug Discovery, Sanofi R&D, 91385 Chilly-Mazarin, France
| | - Gabriel Stoltz
- CERMICS, Ecole des Ponts, Marne-la-Vallée, France
- Matherials Project-Team, Inria Paris, 75012 Paris, France
| | | | - Zineb Belkacemi
- Integrated Drug Discovery, Sanofi R&D, 91385 Chilly-Mazarin, France
- CERMICS, Ecole des Ponts, Marne-la-Vallée, France
| | - Michele Ceriotti
- Laboratory of Computational Science and Modelling, Institute of Materials, École Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
| | - John D Chodera
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, New York 10065, United States
| | - Aaron R Dinner
- Department of Chemistry, The University of Chicago, Chicago, Illinois 60637, United States
| | - Andrew L Ferguson
- Pritzker School of Molecular Engineering, University of Chicago, 5640 South Ellis Avenue, Chicago, Illinois 60637, United States
| | | | - Hervé Minoux
- Integrated Drug Discovery, Sanofi R&D, 94403 Vitry-sur-Seine, France
| | | | - Fabio Pietrucci
- UMR CNRS 7590, MNHN, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, Sorbonne Université, 75005 Paris, France
| | - Ana Silveira
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, New York 10065, United States
| | - Alexandre Tkatchenko
- Department of Physics and Materials Science, University of Luxembourg, L-1511 Luxembourg City, Luxembourg
| | - Zofia Trstanova
- School of Mathematics, The University of Edinburgh, Edinburgh EH9 3FD, U.K
| | - Rafal Wiewiora
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, New York 10065, United States
| | - Tony Lelièvre
- CERMICS, Ecole des Ponts, Marne-la-Vallée, France
- Matherials Project-Team, Inria Paris, 75012 Paris, France
| |
Collapse
|
21
|
Veit M, Wilkins DM, Yang Y, DiStasio RA, Ceriotti M. Predicting molecular dipole moments by combining atomic partial charges and atomic dipoles. J Chem Phys 2020; 153:024113. [DOI: 10.1063/5.0009106] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Affiliation(s)
- Max Veit
- Laboratory of Computational Science and Modeling, IMX, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - David M. Wilkins
- Laboratory of Computational Science and Modeling, IMX, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Yang Yang
- Department of Chemistry and Chemical Biology, Cornell University, Ithaca, New York 14853, USA
| | - Robert A. DiStasio
- Department of Chemistry and Chemical Biology, Cornell University, Ithaca, New York 14853, USA
| | - Michele Ceriotti
- Laboratory of Computational Science and Modeling, IMX, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| |
Collapse
|