1
|
Zhang X, Li C, Ye HZ, Berkelbach TC, Chan GKL. Performant automatic differentiation of local coupled cluster theories: Response properties and ab initio molecular dynamics. J Chem Phys 2024; 161:014109. [PMID: 38949583 DOI: 10.1063/5.0212274] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2024] [Accepted: 06/12/2024] [Indexed: 07/02/2024] Open
Abstract
In this work, we introduce a differentiable implementation of the local natural orbital coupled cluster (LNO-CC) method within the automatic differentiation framework of the PySCFAD package. The implementation is comprehensively tuned for enhanced performance, which enables the calculation of first-order static response properties on medium-sized molecular systems using coupled cluster theory with single, double, and perturbative triple excitations [CCSD(T)]. We evaluate the accuracy of our method by benchmarking it against the canonical CCSD(T) reference for nuclear gradients, dipole moments, and geometry optimizations. In addition, we demonstrate the possibility of property calculations for chemically interesting systems through the computation of bond orders and Mössbauer spectroscopy parameters for a [NiFe]-hydrogenase active site model, along with the simulation of infrared spectra via ab initio LNO-CC molecular dynamics for a protonated water hexamer.
Collapse
Affiliation(s)
- Xing Zhang
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, USA
| | - Chenghan Li
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, USA
| | - Hong-Zhou Ye
- Department of Chemistry, Columbia University, New York, New York 10027, USA
| | | | - Garnet Kin-Lic Chan
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, USA
| |
Collapse
|
2
|
Tao Y, Giese TJ, Ekesan Ş, Zeng J, Aradi B, Hourahine B, Aktulga HM, Götz AW, Merz KM, York DM. Amber free energy tools: Interoperable software for free energy simulations using generalized quantum mechanical/molecular mechanical and machine learning potentials. J Chem Phys 2024; 160:224104. [PMID: 38856060 DOI: 10.1063/5.0211276] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2024] [Accepted: 05/15/2024] [Indexed: 06/11/2024] Open
Abstract
We report the development and testing of new integrated cyberinfrastructure for performing free energy simulations with generalized hybrid quantum mechanical/molecular mechanical (QM/MM) and machine learning potentials (MLPs) in Amber. The Sander molecular dynamics program has been extended to leverage fast, density-functional tight-binding models implemented in the DFTB+ and xTB packages, and an interface to the DeePMD-kit software enables the use of MLPs. The software is integrated through application program interfaces that circumvent the need to perform "system calls" and enable the incorporation of long-range Ewald electrostatics into the external software's self-consistent field procedure. The infrastructure provides access to QM/MM models that may serve as the foundation for QM/MM-ΔMLP potentials, which supplement the semiempirical QM/MM model with a MLP correction trained to reproduce ab initio QM/MM energies and forces. Efficient optimization of minimum free energy pathways is enabled through a new surface-accelerated finite-temperature string method implemented in the FE-ToolKit package. Furthermore, we interfaced Sander with the i-PI software by implementing the socket communication protocol used in the i-PI client-server model. The new interface with i-PI allows for the treatment of nuclear quantum effects with semiempirical QM/MM-ΔMLP models. The modular interoperable software is demonstrated on proton transfer reactions in guanine-thymine mispairs in a B-form deoxyribonucleic acid helix. The current work represents a considerable advance in the development of modular software for performing free energy simulations of chemical reactions that are important in a wide range of applications.
Collapse
Affiliation(s)
- Yujun Tao
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey 08854, USA
| | - Timothy J Giese
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey 08854, USA
| | - Şölen Ekesan
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey 08854, USA
| | - Jinzhe Zeng
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey 08854, USA
| | - Bálint Aradi
- Bremen Center for Computational Materials Science, University of Bremen, D-28334 Bremen, Germany
| | - Ben Hourahine
- SUPA, Department of Physics, University of Strathclyde, Glasgow G4 0NG, United Kingdom
| | - Hasan Metin Aktulga
- Department of Chemistry, Michigan State University, East Lansing, Michigan 48824, USA
| | - Andreas W Götz
- San Diego Supercomputer Center, University of California San Diego, La Jolla, California 92093, USA
| | - Kenneth M Merz
- Department of Chemistry, Michigan State University, East Lansing, Michigan 48824, USA
| | - Darrin M York
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey 08854, USA
| |
Collapse
|
3
|
Zheng J, Frisch MJ. Multiple-time scale integration method based on an interpolated potential energy surface for ab initio path integral molecular dynamics. J Chem Phys 2024; 160:144111. [PMID: 38597307 DOI: 10.1063/5.0196634] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Accepted: 03/25/2024] [Indexed: 04/11/2024] Open
Abstract
A new multiple-time scale integration method is presented that propagates ab initio path integral molecular dynamics (PIMD). This method uses a large time step to generate an approximate geometrical configuration whose energy and gradient are evaluated at the level of an ab initio method, and then, a more precise integration scheme, e.g., the Bulirsch-Stoer method or velocity Verlet integration with a smaller time step, is used to integrate from the previous step using the computationally efficient interpolated potential energy surface constructed from two consecutive points. This method makes the integration of PIMD more efficient and accurate compared with the velocity Verlet integration. A Nosé-Hoover chain thermostat combined with this new multiple-time scale method has good energy conservation even with a large time step, which is usually challenging in velocity Verlet integration for PIMD due to the very small chain mass when a large number of beads are used. The new method is used to calculate infrared spectra and free energy profiles to demonstrate its accuracy and capabilities.
Collapse
Affiliation(s)
- Jingjing Zheng
- Gaussian, Inc., 340 Quinnipiac St. Bldg. 40, Wallingford, Connecticut 06492, USA
| | - Michael J Frisch
- Gaussian, Inc., 340 Quinnipiac St. Bldg. 40, Wallingford, Connecticut 06492, USA
| |
Collapse
|
4
|
Loose T, Sahrmann PG, Qu TS, Voth GA. Coarse-Graining with Equivariant Neural Networks: A Path Toward Accurate and Data-Efficient Models. J Phys Chem B 2023; 127:10564-10572. [PMID: 38033234 PMCID: PMC10726966 DOI: 10.1021/acs.jpcb.3c05928] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Revised: 10/30/2023] [Accepted: 11/09/2023] [Indexed: 12/02/2023]
Abstract
Machine learning has recently entered into the mainstream of coarse-grained (CG) molecular modeling and simulation. While a variety of methods for incorporating deep learning into these models exist, many of them involve training neural networks to act directly as the CG force field. This has several benefits of which the most significant is accuracy. Neural networks can inherently incorporate multibody effects during the calculation of CG forces, and a well-trained neural network force field outperforms pairwise basis sets generated from essentially any methodology. However, this comes at a significant cost. First, these models are typically slower than pairwise force fields, even when accounting for specialized hardware, which accelerates the training and integration of such networks. The second and the focus of this paper is the need for a considerable amount of data to train such force fields. It is common to use 10s of microseconds of molecular dynamics data to train a single CG model, which approaches the point of eliminating the CG model's usefulness in the first place. As we investigate in this work, this "data-hunger" trap from neural networks for predicting molecular energies and forces can be remediated in part by incorporating equivariant convolutional operations. We demonstrate that, for CG water, networks that incorporate equivariant convolutional operations can produce functional models using data sets as small as a single frame of reference data, while networks without these operations cannot.
Collapse
Affiliation(s)
| | | | - Thomas S. Qu
- Department of Chemistry,
Chicago Center for Theoretical Chemistry, James Franck Institute,
and Institute for Biophysical Dynamics, The University of Chicago, Chicago, Illinois 60637, United States
| | - Gregory A. Voth
- Department of Chemistry,
Chicago Center for Theoretical Chemistry, James Franck Institute,
and Institute for Biophysical Dynamics, The University of Chicago, Chicago, Illinois 60637, United States
| |
Collapse
|
5
|
Popov A. Electronic structure of small metastable GAS-Phase boron clusters formed in a helium buffer GAS. Chem Phys 2023. [DOI: 10.1016/j.chemphys.2023.111896] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/31/2023]
|
6
|
Bocus M, Goeminne R, Lamaire A, Cools-Ceuppens M, Verstraelen T, Van Speybroeck V. Nuclear quantum effects on zeolite proton hopping kinetics explored with machine learning potentials and path integral molecular dynamics. Nat Commun 2023; 14:1008. [PMID: 36823162 PMCID: PMC9950054 DOI: 10.1038/s41467-023-36666-y] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Accepted: 02/10/2023] [Indexed: 02/25/2023] Open
Abstract
Proton hopping is a key reactive process within zeolite catalysis. However, the accurate determination of its kinetics poses major challenges both for theoreticians and experimentalists. Nuclear quantum effects (NQEs) are known to influence the structure and dynamics of protons, but their rigorous inclusion through the path integral molecular dynamics (PIMD) formalism was so far beyond reach for zeolite catalyzed processes due to the excessive computational cost of evaluating all forces and energies at the Density Functional Theory (DFT) level. Herein, we overcome this limitation by training first a reactive machine learning potential (MLP) that can reproduce with high fidelity the DFT potential energy surface of proton hopping around the first Al coordination sphere in the H-CHA zeolite. The MLP offers an immense computational speedup, enabling us to derive accurate reaction kinetics beyond standard transition state theory for the proton hopping reaction. Overall, more than 0.6 μs of simulation time was needed, which is far beyond reach of any standard DFT approach. NQEs are found to significantly impact the proton hopping kinetics up to ~473 K. Moreover, PIMD simulations with deuterium can be performed without any additional training to compute kinetic isotope effects over a broad range of temperatures.
Collapse
Affiliation(s)
- Massimo Bocus
- Center for Molecular Modeling, Ghent University, Technologiepark 46, 9052, Zwijnaarde, Belgium
| | - Ruben Goeminne
- Center for Molecular Modeling, Ghent University, Technologiepark 46, 9052, Zwijnaarde, Belgium
| | - Aran Lamaire
- Center for Molecular Modeling, Ghent University, Technologiepark 46, 9052, Zwijnaarde, Belgium
| | - Maarten Cools-Ceuppens
- Center for Molecular Modeling, Ghent University, Technologiepark 46, 9052, Zwijnaarde, Belgium
| | - Toon Verstraelen
- Center for Molecular Modeling, Ghent University, Technologiepark 46, 9052, Zwijnaarde, Belgium
| | | |
Collapse
|
7
|
Lamaire A, Cools-Ceuppens M, Bocus M, Verstraelen T, Van Speybroeck V. Quantum Free Energy Profiles for Molecular Proton Transfers. J Chem Theory Comput 2022; 19:18-24. [PMID: 36563337 PMCID: PMC9835831 DOI: 10.1021/acs.jctc.2c00874] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Although many molecular dynamics simulations treat the atomic nuclei as classical particles, an adequate description of nuclear quantum effects (NQEs) is indispensable when studying proton transfer reactions. Herein, quantum free energy profiles are constructed for three typical proton transfers, which properly take NQEs into account using the path integral formalism. The computational cost of the simulations is kept tractable by deriving machine learning potentials. It is shown that the classical and quasi-classical centroid free energy profiles of the proton transfers deviate substantially from the exact quantum free energy profile.
Collapse
|
8
|
Snyder R, Kim B, Pan X, Shao Y, Pu J. Facilitating ab initio QM/MM free energy simulations by Gaussian process regression with derivative observations. Phys Chem Chem Phys 2022; 24:25134-25143. [PMID: 36222412 PMCID: PMC11095978 DOI: 10.1039/d2cp02820d] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
In combined quantum mechanical and molecular mechanical (QM/MM) free energy simulations, how to synthesize the accuracy of ab initio (AI) methods with the speed of semiempirical (SE) methods for a cost-effective QM treatment remains a long-standing challenge. In this work, we present a machine-learning-facilitated method for obtaining AI/MM-quality free energy profiles through efficient SE/MM simulations. In particular, we use Gaussian process regression (GPR) to learn the energy and force corrections needed for SE/MM to match with AI/MM results during molecular dynamics simulations. Force matching is enabled in our model by including energy derivatives into the observational targets through the extended-kernel formalism. We demonstrate the effectiveness of this method on the solution-phase SN2 Menshutkin reaction using AM1/MM and B3LYP/6-31+G(d,p)/MM as the base and target levels, respectively. Trained on only 80 configurations sampled along the minimum free energy path (MFEP), the resulting GPR model reduces the average energy error in AM1/MM from 18.2 to 5.8 kcal mol-1 for the 4000-sample testing set with the average force error on the QM atoms decreased from 14.6 to 3.7 kcal mol-1 Å-1. Free energy sampling with the GPR corrections applied (AM1-GPR/MM) produces a free energy barrier of 14.4 kcal mol-1 and a reaction free energy of -34.1 kcal mol-1, in closer agreement with the AI/MM benchmarks and experimental results.
Collapse
Affiliation(s)
- Ryan Snyder
- Department of Chemistry and Chemical Biology, Indiana University-Purdue University Indianapolis, 402 N. Blackford St., Indianapolis, IN 46202, USA.
| | - Bryant Kim
- Department of Chemistry and Chemical Biology, Indiana University-Purdue University Indianapolis, 402 N. Blackford St., Indianapolis, IN 46202, USA.
| | - Xiaoliang Pan
- Department of Chemistry and Biochemistry, University of Oklahoma, 101 Stephenson Pkwy, Norman, OK 73019, USA.
| | - Yihan Shao
- Department of Chemistry and Biochemistry, University of Oklahoma, 101 Stephenson Pkwy, Norman, OK 73019, USA.
| | - Jingzhi Pu
- Department of Chemistry and Chemical Biology, Indiana University-Purdue University Indianapolis, 402 N. Blackford St., Indianapolis, IN 46202, USA.
| |
Collapse
|
9
|
Loose T, Sahrmann PG, Voth GA. Centroid Molecular Dynamics Can Be Greatly Accelerated through Neural Network Learned Centroid Forces Derived from Path Integral Molecular Dynamics. J Chem Theory Comput 2022; 18:5856-5863. [PMID: 36103576 PMCID: PMC9558744 DOI: 10.1021/acs.jctc.2c00706] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2022] [Indexed: 11/29/2022]
Abstract
For nearly the past 30 years, centroid molecular dynamics (CMD) has proven to be a viable classical-like phase space formulation for the calculation of quantum dynamical properties. However, calculation of the centroid effective force remains a significant computational cost and limits the ability of CMD to be an efficient approach to study condensed phase quantum dynamics. In this paper, we introduce a neural network-based methodology for first learning the centroid effective force from path integral molecular dynamics data, which is subsequently used as an effective force field to evolve the centroids directly with the CMD algorithm. This method, called machine-learned centroid molecular dynamics (ML-CMD), is faster and far less costly than both standard "on the fly" CMD and ring polymer molecular dynamics (RPMD). The training aspect of ML-CMD is also straightforwardly implemented utilizing the DeepMD software kit. ML-CMD is then applied to two model systems to illustrate the approach: liquid para-hydrogen and water. The results show comparable accuracy to both CMD and RPMD in the estimation of quantum dynamical properties, including the self-diffusion constant and velocity time correlation function, but with significantly reduced overall computational cost.
Collapse
Affiliation(s)
- Timothy
D. Loose
- Department of Chemistry,
Chicago Center for Theoretical Chemistry, James Franck Institute,
and Institute for Biophysical Dynamics, The University of Chicago, Chicago, Illinois 60637, United States
| | - Patrick G. Sahrmann
- Department of Chemistry,
Chicago Center for Theoretical Chemistry, James Franck Institute,
and Institute for Biophysical Dynamics, The University of Chicago, Chicago, Illinois 60637, United States
| | - Gregory A. Voth
- Department of Chemistry,
Chicago Center for Theoretical Chemistry, James Franck Institute,
and Institute for Biophysical Dynamics, The University of Chicago, Chicago, Illinois 60637, United States
| |
Collapse
|
10
|
Coupled Multiphysics Modelling of Sensors for Chemical, Biomedical, and Environmental Applications with Focus on Smart Materials and Low-Dimensional Nanostructures. CHEMOSENSORS 2022; 10:157. [PMID: 35909810 PMCID: PMC9171916 DOI: 10.3390/chemosensors10050157] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Accepted: 04/22/2022] [Indexed: 12/20/2022]
Abstract
Low-dimensional nanostructures have many advantages when used in sensors compared to the traditional bulk materials, in particular in their sensitivity and specificity. In such nanostructures, the motion of carriers can be confined from one, two, or all three spatial dimensions, leading to their unique properties. New advancements in nanosensors, based on low-dimensional nanostructures, permit their functioning at scales comparable with biological processes and natural systems, allowing their efficient functionalization with chemical and biological molecules. In this article, we provide details of such sensors, focusing on their several important classes, as well as the issues of their designs based on mathematical and computational models covering a range of scales. Such multiscale models require state-of-the-art techniques for their solutions, and we provide an overview of the associated numerical methodologies and approaches in this context. We emphasize the importance of accounting for coupling between different physical fields such as thermal, electromechanical, and magnetic, as well as of additional nonlinear and nonlocal effects which can be salient features of new applications and sensor designs. Our special attention is given to nanowires and nanotubes which are well suited for nanosensor designs and applications, being able to carry a double functionality, as transducers and the media to transmit the signal. One of the key properties of these nanostructures is an enhancement in sensitivity resulting from their high surface-to-volume ratio, which leads to their geometry-dependant properties. This dependency requires careful consideration at the modelling stage, and we provide further details on this issue. Another important class of sensors analyzed here is pertinent to sensor and actuator technologies based on smart materials. The modelling of such materials in their dynamics-enabled applications represents a significant challenge as we have to deal with strongly nonlinear coupled problems, accounting for dynamic interactions between different physical fields and microstructure evolution. Among other classes, important in novel sensor applications, we have given our special attention to heterostructures and nucleic acid based nanostructures. In terms of the application areas, we have focused on chemical and biomedical fields, as well as on green energy and environmentally-friendly technologies where the efficient designs and opportune deployments of sensors are both urgent and compelling.
Collapse
|