1
|
Li J, Vindel-Zandbergen P, Li J, Felker PM, Bačić Z. HF Trimer: A New Full-Dimensional Potential Energy Surface and Rigorous 12D Quantum Calculations of Vibrational States. J Phys Chem A 2024; 128:9707-9720. [PMID: 39484697 DOI: 10.1021/acs.jpca.4c03771] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2024]
Abstract
HF trimer, as the smallest and the lightest cyclic hydrogen-bonded (HB) cluster, has long been a favorite prototype system for spectroscopic and theoretical investigations of the structure, energetics, spectroscopy, and dynamics of hydrogen-bond networks. Recently, rigorous quantum 12D calculations of the coupled intra- and intermolecular vibrations of this fundamental HB trimer (J. Chem. Phys. 2023, 158, 234109) were performed, employing an older ab initio-based many-body potential energy surface (PES). While the theoretical results were found to be in reasonably good agreement with the available spectroscopic data, it was also evident that it is highly desirable to develop a more accurate 12D PES of HF trimer. Motivated by this, here we report a new, and the first fully ab initio 12D PES of this paradigmatic system. Approximately 42,540 geometries were sampled and calculated at the level of CCSD(T)-F12a/AVTZ. The permutationally invariant polynomial-neural network based Δ-machine learning approach (J. Phys. Chem. Lett. 2022, 13, 4729) was employed to perform cost-efficient calculations of the basis-set-superposition error (BSSE) correction. By strategically selecting data points, this approach facilitated the construction of a high-precision PES with BSSE correction, while requiring only a minimal number of BSSE value computations. The fitting error of the final PES is only 0.035 kcal/mol. To assess its performance, the 12D fully coupled quantum calculations of excited intra- and intermolecular vibrational states of HF trimer are carried out using the rigorous methodology developed by us earlier. The results are found to be in a significantly better agreement with the available spectroscopic data than those obtained with the previously existing semiempirical 12D PES.
Collapse
Affiliation(s)
- Jia Li
- School of Chemistry and Chemical Engineering & Chongqing Key Laboratory of Chemical Theory and Mechanism, Chongqing University, Chongqing 401331, China
| | - Patricia Vindel-Zandbergen
- Department of Chemistry, New York University, New York, New York 10003, United States
- Simons Center for Computational Physical Chemistry, New York University, New York, New York 10003, United States
| | - Jun Li
- School of Chemistry and Chemical Engineering & Chongqing Key Laboratory of Chemical Theory and Mechanism, Chongqing University, Chongqing 401331, China
| | - Peter M Felker
- Department of Chemistry and Biochemistry, University of California, Los Angeles, California 90095-1569, United States
| | - Zlatko Bačić
- Department of Chemistry, New York University, New York, New York 10003, United States
- Simons Center for Computational Physical Chemistry, New York University, New York, New York 10003, United States
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai, 3663 Zhongshan Road North, Shanghai 200062, China
| |
Collapse
|
2
|
Jäger S, Khatri J, Meyer P, Henkel S, Schwaab G, Nandi A, Pandey P, Barlow KR, Perkins MA, Tschumper GS, Bowman JM, van der Avoird A, Havenith M. On the nature of hydrogen bonding in the H 2S dimer. Nat Commun 2024; 15:9540. [PMID: 39500885 PMCID: PMC11538508 DOI: 10.1038/s41467-024-53444-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2024] [Accepted: 10/09/2024] [Indexed: 11/08/2024] Open
Abstract
Hydrogen bonding is a central concept in chemistry and biochemistry, and so it continues to attract intense study. Here, we examine hydrogen bonding in the H2S dimer, in comparison with the well-studied water dimer, in unprecedented detail. We record a mass-selected IR spectrum of the H2S dimer in superfluid helium nanodroplets. We are able to resolve a rotational substructure in each of the three distinct bands and, based on it, assign these to vibration-rotation-tunneling transitions of a single intramolecular vibration. With the use of high-level potential and dipole-moment surfaces we compute the vibration-rotation-tunneling dynamics and far-infrared spectrum with rigorous quantum methods. Intramolecular mode Vibrational Self-Consistent-Field and Configuration-Interaction calculations provide the frequencies and intensities of the four SH-stretch modes, with a focus on the most intense, the donor bound SH mode which yields the experimentally observed bands. We show that the intermolecular modes in the H2S dimer are substantially more delocalized and more strongly mixed than in the water dimer. The less directional nature of the hydrogen bonding can be quantified in terms of weaker electrostatic and more important dispersion interactions. The present study reconciles all previous spectroscopic data, and serves as a sensitive test for the potential and dipole-moment surfaces.
Collapse
Affiliation(s)
- Svenja Jäger
- Department of Physical Chemistry II, Ruhr University Bochum, 44801, Bochum, Germany
| | - Jai Khatri
- Department of Physical Chemistry II, Ruhr University Bochum, 44801, Bochum, Germany
| | - Philipp Meyer
- Department of Physical Chemistry II, Ruhr University Bochum, 44801, Bochum, Germany
| | - Stefan Henkel
- Department of Physical Chemistry II, Ruhr University Bochum, 44801, Bochum, Germany
| | - Gerhard Schwaab
- Department of Physical Chemistry II, Ruhr University Bochum, 44801, Bochum, Germany
| | - Apurba Nandi
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, GA, 30322, USA
| | - Priyanka Pandey
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, GA, 30322, USA
| | - Kayleigh R Barlow
- Department of Chemistry and Biochemistry, University of Mississippi, University, MS, 38677-1848, USA
| | - Morgan A Perkins
- Department of Chemistry and Biochemistry, University of Mississippi, University, MS, 38677-1848, USA
| | - Gregory S Tschumper
- Department of Chemistry and Biochemistry, University of Mississippi, University, MS, 38677-1848, USA
| | - Joel M Bowman
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, GA, 30322, USA.
| | - Ad van der Avoird
- Theoretical Chemistry, Institute for Molecules and Materials, Radboud University, Heyendaalseweg 135, 6525 AJ, Nijmegen, Netherlands.
| | - Martina Havenith
- Department of Physical Chemistry II, Ruhr University Bochum, 44801, Bochum, Germany.
| |
Collapse
|
3
|
Nandi A, Pandey P, Houston PL, Qu C, Yu Q, Conte R, Tkatchenko A, Bowman JM. Δ-Machine Learning to Elevate DFT-Based Potentials and a Force Field to the CCSD( T) Level Illustrated for Ethanol. J Chem Theory Comput 2024; 20:8807-8819. [PMID: 39361051 PMCID: PMC11500277 DOI: 10.1021/acs.jctc.4c00977] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2024] [Revised: 09/17/2024] [Accepted: 09/18/2024] [Indexed: 10/23/2024]
Abstract
Progress in machine learning has facilitated the development of potentials that offer both the accuracy of first-principles techniques and vast increases in the speed of evaluation. Recently, Δ-machine learning has been used to elevate the quality of a potential energy surface (PES) based on low-level, e.g., density functional theory (DFT) energies and gradients to close to the gold-standard coupled cluster level of accuracy. We have demonstrated the success of this approach for molecules, ranging in size from H3O+ to 15-atom acetyl-acetone and tropolone. These were all done using the B3LYP functional. Here, we investigate the generality of this approach for the PBE, M06, M06-2X, and PBE0 + MBD functionals, using ethanol as the example molecule. Linear regression with permutationally invariant polynomials is used to fit both low-level and correction PESs. These PESs are employed for standard RMSE analysis for training and test data sets, and then general fidelity tests such as energetics of stationary points, normal-mode frequencies, and torsional potentials are examined. We achieve similar improvements in all cases. Interestingly, we obtained significant improvement over DFT gradients where coupled cluster gradients were not used to correct the low-level PES. Finally, we present some results for correcting a recent molecular mechanics force field for ethanol and comment on the possible generality of this approach.
Collapse
Affiliation(s)
- Apurba Nandi
- Department
of Physics and Materials Science, University
of Luxembourg, L-1511 Luxembourg City, Luxembourg
| | - Priyanka Pandey
- Department
of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
| | - Paul L. Houston
- Department
of Chemistry and Chemical Biology, Cornell
University, Ithaca, New York 14853, United States
- Department
of Chemistry and Biochemistry, Georgia Institute
of Technology, Atlanta, Georgia 30332, United States
| | - Chen Qu
- Independent
Researcher, Toronto, Ontario M9B0E3, Canada
| | - Qi Yu
- Department
of Chemistry, Fudan University, Shanghai 200438, P. R. China
| | - Riccardo Conte
- Dipartimento
di Chimica, Università degli Studi
di Milano, via Golgi 19, 20133 Milano, Italy
| | - Alexandre Tkatchenko
- Department
of Physics and Materials Science, University
of Luxembourg, L-1511 Luxembourg City, Luxembourg
| | - Joel M. Bowman
- Department
of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
| |
Collapse
|
4
|
Gutierrez-Cardenas J, Gibbas BD, Whitaker K, Kaledin M, Kaledin AL. A Low-Order Permutationally Invariant Polynomial Approach to Learning Potential Energy Surfaces Using the Bond-Order Charge-Density Matrix: Application to C n Clusters for n = 3-10, 20. J Phys Chem A 2024; 128:7703-7713. [PMID: 39205486 PMCID: PMC11407436 DOI: 10.1021/acs.jpca.4c04281] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/04/2024]
Abstract
A representation for learning potential energy surfaces (PESs) in terms of permutationally invariant polynomials (PIPs) using the Hartree-Fock expression for electronic energy is proposed. Our approach is based on the one-electron core Hamiltonian weighted by the configuration-dependent elements of the bond-order charge density matrix (CDM). While the previously reported model used an s-function Gaussian basis for the CDM, the present formulation is expanded with p-functions, which are crucial for describing chemical bonding. Detailed results are demonstrated on linear and cyclic Cn clusters (n = 3-10) trained on extensive B3LYP/aug-cc-pVTZ data. The described method facilitates PES learning by reducing the root mean squared error (RMSE) by a factor of 5 relative to the s-function formulation and by a factor of 20 relative to the conventional PIP approach. This is equivalent to using CDM and an sp basis with a PIP of order M to achieve the same RMSE as with the conventional method with a PIP of order M + 2. Implications for large-scale problems are discussed using the case of the PES of the C20 fullerene in full permutational symmetry.
Collapse
Affiliation(s)
- Jose Gutierrez-Cardenas
- Department of Chemistry & Biochemistry, Kennesaw State University, 370 Paulding Ave NW ,Box#1203,Kennesaw 30144, Georgia
| | - Benjamin D Gibbas
- Department of Chemistry & Biochemistry, Kennesaw State University, 370 Paulding Ave NW ,Box#1203,Kennesaw 30144, Georgia
| | - Kyle Whitaker
- Department of Chemistry & Biochemistry, Kennesaw State University, 370 Paulding Ave NW ,Box#1203,Kennesaw 30144, Georgia
| | - Martina Kaledin
- Department of Chemistry & Biochemistry, Kennesaw State University, 370 Paulding Ave NW ,Box#1203,Kennesaw 30144, Georgia
| | - Alexey L Kaledin
- Cherry L. Emerson Center for Scientific Computation and Department of Chemistry, Emory University, 1515 Dickey Drive ,Atlanta 30322, Georgia
| |
Collapse
|
5
|
Schatz GC, Wodtke AM, Yang X. Spiers Memorial Lecture: New directions in molecular scattering. Faraday Discuss 2024; 251:9-62. [PMID: 38764350 DOI: 10.1039/d4fd00015c] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/21/2024]
Abstract
The field of molecular scattering is reviewed as it pertains to gas-gas as well as gas-surface chemical reaction dynamics. We emphasize the importance of collaboration of experiment and theory, from which new directions of research are being pursued on increasingly complex problems. We review both experimental and theoretical advances that provide the modern toolbox available to molecular-scattering studies. We distinguish between two classes of work. The first involves simple systems and uses experiment to validate theory so that from the validated theory, one may learn far more than could ever be measured in the laboratory. The second class involves problems of great complexity that would be difficult or impossible to understand without a partnership of experiment and theory. Key topics covered in this review include crossed-beams reactive scattering and scattering at extremely low energies, where quantum effects dominate. They also include scattering from surfaces, reactive scattering and kinetics at surfaces, and scattering work done at liquid surfaces. The review closes with thoughts on future promising directions of research.
Collapse
Affiliation(s)
- George C Schatz
- Dept of Chemistry, Northwestern University, Evanston, Illinois 60208, USA
| | - Alec M Wodtke
- Institute for Physical Chemistry, Georg August University, Goettingen, Germany
- Max Planck Institute for Multidisciplinary Natural Sciences, Goettingen, Germany.
- International Center for the Advanced Studies of Energy Conversion, Georg August University, Goettingen, Germany
| | - Xueming Yang
- Dalian Institute for Chemical Physics, Chinese Academy of Sciences, Dalian, China
- Department of Chemistry, College of Science, Southern University of Science and Technology, Shenzhen, China
| |
Collapse
|
6
|
Aldossary A, Campos-Gonzalez-Angulo JA, Pablo-García S, Leong SX, Rajaonson EM, Thiede L, Tom G, Wang A, Avagliano D, Aspuru-Guzik A. In Silico Chemical Experiments in the Age of AI: From Quantum Chemistry to Machine Learning and Back. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2024; 36:e2402369. [PMID: 38794859 DOI: 10.1002/adma.202402369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/15/2024] [Revised: 04/28/2024] [Indexed: 05/26/2024]
Abstract
Computational chemistry is an indispensable tool for understanding molecules and predicting chemical properties. However, traditional computational methods face significant challenges due to the difficulty of solving the Schrödinger equations and the increasing computational cost with the size of the molecular system. In response, there has been a surge of interest in leveraging artificial intelligence (AI) and machine learning (ML) techniques to in silico experiments. Integrating AI and ML into computational chemistry increases the scalability and speed of the exploration of chemical space. However, challenges remain, particularly regarding the reproducibility and transferability of ML models. This review highlights the evolution of ML in learning from, complementing, or replacing traditional computational chemistry for energy and property predictions. Starting from models trained entirely on numerical data, a journey set forth toward the ideal model incorporating or learning the physical laws of quantum mechanics. This paper also reviews existing computational methods and ML models and their intertwining, outlines a roadmap for future research, and identifies areas for improvement and innovation. Ultimately, the goal is to develop AI architectures capable of predicting accurate and transferable solutions to the Schrödinger equation, thereby revolutionizing in silico experiments within chemistry and materials science.
Collapse
Affiliation(s)
- Abdulrahman Aldossary
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
| | | | - Sergio Pablo-García
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- Department of Computer Science, University of Toronto, 40 St. George Street, Toronto, ON, M5S 2E4, Canada
| | - Shi Xuan Leong
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
| | - Ella Miray Rajaonson
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- Vector Institute for Artificial Intelligence, 661 University Ave. Suite 710, Toronto, ON, M5G 1M1, Canada
| | - Luca Thiede
- Department of Computer Science, University of Toronto, 40 St. George Street, Toronto, ON, M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, 661 University Ave. Suite 710, Toronto, ON, M5G 1M1, Canada
| | - Gary Tom
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- Vector Institute for Artificial Intelligence, 661 University Ave. Suite 710, Toronto, ON, M5G 1M1, Canada
| | - Andrew Wang
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
| | - Davide Avagliano
- Chimie ParisTech, PSL University, CNRS, Institute of Chemistry for Life and Health Sciences (iCLeHS UMR 8060), Paris, F-75005, France
| | - Alán Aspuru-Guzik
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- Department of Computer Science, University of Toronto, 40 St. George Street, Toronto, ON, M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, 661 University Ave. Suite 710, Toronto, ON, M5G 1M1, Canada
- Department of Materials Science & Engineering, University of Toronto, 184 College St., Toronto, ON, M5S 3E4, Canada
- Department of Chemical Engineering & Applied Chemistry, University of Toronto, 200 College St., Toronto, ON, M5S 3E5, Canada
- Lebovic Fellow, Canadian Institute for Advanced Research (CIFAR), 66118 University Ave., Toronto, M5G 1M1, Canada
- Acceleration Consortium, 80 St George St, Toronto, M5S 3H6, Canada
| |
Collapse
|
7
|
Murakami T, Takahashi S, Kikuma Y, Takayanagi T. Theoretical Study of the Thermal Rate Coefficients of the H 3+ + C 2H 4 Reaction: Dynamics Study on a Full-Dimensional Potential Energy Surface. Molecules 2024; 29:2789. [PMID: 38930853 PMCID: PMC11206701 DOI: 10.3390/molecules29122789] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2024] [Revised: 06/08/2024] [Accepted: 06/09/2024] [Indexed: 06/28/2024] Open
Abstract
Ion-molecular reactions play a significant role in molecular evolution within the interstellar medium. In this study, the entrance channel reaction, H3+ + C2H4 → H2 + C2H5+, was investigated using classical molecular dynamic (classical MD) and ring polymer molecular dynamic (RPMD) simulation techniques. We developed an analytical potential energy surface function with a permutationally invariant polynomial basis, specifically employing the monomial symmetrized approach. Our dynamic simulations reproduced the rate coefficient of 300 K for H3+ + C2H4 → H2 + C2H5+, aligning reasonably well with the values in the kinetic database commonly utilized in astrochemistry. The thermal rate coefficients obtained using both the classical MD and RPMD techniques exhibited an increase from 100 K to 300 K as the temperature rose. Additionally, we analyzed the excess energy distribution of the C2H5+ fragment with respect to temperature to investigate the indirect reaction pathway of C2H5+ → H2 + C2H3+. This result suggests that the indirect reaction pathway of C2H5+ → H2 + C2H3+ holds minor significance, although the distribution highly depends on the collisional temperature.
Collapse
Affiliation(s)
- Tatsuhiro Murakami
- Department of Chemistry, Saitama University, Shimo-Okubo 255, Sakura-ku, Saitama City 338-8570, Japan; (S.T.); (Y.K.)
- Department of Materials & Life Sciences, Faculty of Science & Technology, Sophia University, 7-1 Kioicho, Chiyoda-ku, Tokyo 102-8554, Japan
| | - Soma Takahashi
- Department of Chemistry, Saitama University, Shimo-Okubo 255, Sakura-ku, Saitama City 338-8570, Japan; (S.T.); (Y.K.)
| | - Yuya Kikuma
- Department of Chemistry, Saitama University, Shimo-Okubo 255, Sakura-ku, Saitama City 338-8570, Japan; (S.T.); (Y.K.)
| | - Toshiyuki Takayanagi
- Department of Chemistry, Saitama University, Shimo-Okubo 255, Sakura-ku, Saitama City 338-8570, Japan; (S.T.); (Y.K.)
| |
Collapse
|
8
|
Selloni A. Aqueous Titania Interfaces. Annu Rev Phys Chem 2024; 75:47-65. [PMID: 38271659 DOI: 10.1146/annurev-physchem-090722-015957] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2024]
Abstract
Water-metal oxide interfaces are central to many phenomena and applications, ranging from material corrosion and dissolution to photoelectrochemistry and bioengineering. In particular, the discovery of photocatalytic water splitting on TiO2 has motivated intensive studies of water-TiO2 interfaces for decades. So far, a broad understanding of the interaction of water vapor with several TiO2 surfaces has been obtained. However, much less is known about liquid water-TiO2 interfaces, which are more relevant to many practical applications. Probing these complex systems at the molecular level is experimentally challenging and is sometimes possible only through computational studies. This review summarizes recent advances in the atomistic understanding, mostly through computational simulations, of the structure and dynamics of interfacial water on TiO2 surfaces. The main focus is on the nature, molecular or dissociated, of water in direct contact with low-index defect-free crystalline surfaces. The hydroxyls resulting from water dissociation are essential in the photooxidation of water and critically affect the surface chemistry of TiO2.
Collapse
Affiliation(s)
- Annabella Selloni
- Department of Chemistry, Princeton University, Princeton, New Jersey, USA;
| |
Collapse
|
9
|
Houston PL, Qu C, Yu Q, Pandey P, Conte R, Nandi A, Bowman JM, Kukolich SG. Formic Acid-Ammonia Heterodimer: A New Δ-Machine Learning CCSD(T)-Level Potential Energy Surface Allows Investigation of the Double Proton Transfer. J Chem Theory Comput 2024; 20:1821-1828. [PMID: 38382541 DOI: 10.1021/acs.jctc.3c01273] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/23/2024]
Abstract
The formic acid-ammonia dimer is an important example of a hydrogen-bonded complex in which a double proton transfer can occur. Its microwave spectrum has recently been reported and rotational constants and quadrupole coupling constants were determined. Calculated estimates of the double-well barrier and the internal barriers to rotation were also reported. Here, we report a full-dimensional potential energy surface (PES) for this complex, using two closely related Δ-machine learning methods to bring it to the CCSD(T) level of accuracy. The PES dissociates smoothly and accurately. Using a 2d quantum model the ground vibrational-state tunneling splitting is estimated to be less than 10-4 cm-1. The dipole moment along the intrinsic reaction coordinate is calculated along with a Mullikan charge analysis and supports the mildly ionic character of the minimum and strongly ionic character at the double-well barrier.
Collapse
Affiliation(s)
- Paul L Houston
- Department of Chemistry and Chemical Biology, Cornell University, Ithaca, New York 14853, U.S.A. and Department of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Chen Qu
- Independent Researcher, Toronto, Ontario M9B0E3, Canada
| | - Qi Yu
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
| | - Priyanka Pandey
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
| | - Riccardo Conte
- Dipartimento di Chimica, Università Degli Studi di Milano, Via Golgi 19, Milano 20133, Italy
| | - Apurba Nandi
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
- Department of Physics and Materials Science, University of Luxembourg, Luxembourg City L-1511, Luxembourg
| | - Joel M Bowman
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
| | - Stephen G Kukolich
- Department of Chemistry and Biochemistry, University of Arizona, 1306 E. University Avenue, Tucson, Arizona 85721, United States
| |
Collapse
|
10
|
Iyengar SS, Ricard TC, Zhu X. Reformulation of All ONIOM-Type Molecular Fragmentation Approaches and Many-Body Theories Using Graph-Theory-Based Projection Operators: Applications to Dynamics, Molecular Potential Surfaces, Machine Learning, and Quantum Computing. J Phys Chem A 2024; 128:466-478. [PMID: 38180503 DOI: 10.1021/acs.jpca.3c05630] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2024]
Abstract
We present a graph-theory-based reformulation of all ONIOM-based molecular fragmentation methods. We discuss applications to (a) accurate post-Hartree-Fock AIMD that can be conducted at DFT cost for medium-sized systems, (b) hybrid DFT condensed-phase studies at the cost of pure density functionals, (c) reduced cost on-the-fly large basis gas-phase AIMD and condensed-phase studies, (d) post-Hartree-Fock-level potential surfaces at DFT cost to obtain quantum nuclear effects, and (e) novel transfer machine learning protocols derived from these measures. Additionally, in previous work, the unifying strategy discussed here has been used to construct new quantum computing algorithms. Thus, we conclude that this reformulation is robust and accurate.
Collapse
Affiliation(s)
- Srinivasan S Iyengar
- Department of Chemistry, Department of Physics, and the Indiana University Quantum Science and Engineering Center (IU-QSEC), Indiana University, 800 E. Kirkwood Avenue, Bloomington, Indiana 47405, United States
| | - Timothy C Ricard
- Department of Chemistry, Department of Physics, and the Indiana University Quantum Science and Engineering Center (IU-QSEC), Indiana University, 800 E. Kirkwood Avenue, Bloomington, Indiana 47405, United States
| | - Xiao Zhu
- Department of Chemistry, Department of Physics, and the Indiana University Quantum Science and Engineering Center (IU-QSEC), Indiana University, 800 E. Kirkwood Avenue, Bloomington, Indiana 47405, United States
| |
Collapse
|
11
|
Lin HH, Wang CI, Yang CH, Secario MK, Hsu CP. Two-Step Machine Learning Approach for Charge-Transfer Coupling with Structurally Diverse Data. J Phys Chem A 2024; 128:271-280. [PMID: 38157315 DOI: 10.1021/acs.jpca.3c04524] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2024]
Abstract
Electronic coupling is important in determining charge-transfer rates and dynamics. Coupling strength is sensitive to both intermolecular, e.g., orientation or distance, and intramolecular degrees of freedom. Hence, it is challenging to build an accurate machine learning model to predict electronic coupling of molecular pairs, especially for those derived from the amorphous phase, for which intermolecular configurations are much more diverse than those derived from crystals. In this work, we devise a new prediction algorithm that employs two consecutive KRR models. The first model predicts molecular orbitals (MOs) from structural variation for each fragment, and coupling is further predicted by using the overlap integral included in a second model. With our two-step procedure, we achieved mean absolute errors of 0.27 meV for an ethylene dimer and 1.99 meV for a naphthalene pair, much improved accuracy amounting to 14-fold and 3-fold error reductions, respectively. In addition, MOs from the first model can also be the starting point to obtain other quantum chemical properties from atomistic structures. This approach is also compatible with a MO predictor with sufficient accuracy.
Collapse
Affiliation(s)
- Hung-Hsuan Lin
- Institute of Chemistry, Academia Sinica, 128 Section 2 Academia Road, Nankang, Taipei 115, Taiwan
- Molecular Science and Digital Innovation Center, Genetics Generation Advancement Corp, No. 28, Ln. 36, Xinhu First Rd., Neihu, Taipei 114, Taiwan
| | - Chun-I Wang
- Institute of Chemistry, Academia Sinica, 128 Section 2 Academia Road, Nankang, Taipei 115, Taiwan
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Chou-Hsun Yang
- Institute of Chemistry, Academia Sinica, 128 Section 2 Academia Road, Nankang, Taipei 115, Taiwan
| | - Muhammad Khari Secario
- Institute of Chemistry, Academia Sinica, 128 Section 2 Academia Road, Nankang, Taipei 115, Taiwan
- Taiwan International Graduate Program on Sustainable Chemical Science & Technology, Academia Sinica Institute of Chemistry, 128 Academia Road Sec.2, Nankang, Taipei 115, Taiwan
- Department of Applied Chemistry, National Yang Ming Chiao Tung University, 1001 University Road, Hsinchu 300, Taiwan
| | - Chao-Ping Hsu
- Institute of Chemistry, Academia Sinica, 128 Section 2 Academia Road, Nankang, Taipei 115, Taiwan
- Division of Physics, National Center for Theoretical Sciences, 1, Section 4, Roosevelt Road, Taipei 106, Taiwan
| |
Collapse
|
12
|
Kumar S, Jing X, Pask JE, Medford AJ, Suryanarayana P. Kohn-Sham accuracy from orbital-free density functional theory via Δ-machine learning. J Chem Phys 2023; 159:244106. [PMID: 38147461 DOI: 10.1063/5.0180541] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Accepted: 11/30/2023] [Indexed: 12/28/2023] Open
Abstract
We present a Δ-machine learning model for obtaining Kohn-Sham accuracy from orbital-free density functional theory (DFT) calculations. In particular, we employ a machine-learned force field (MLFF) scheme based on the kernel method to capture the difference between Kohn-Sham and orbital-free DFT energies/forces. We implement this model in the context of on-the-fly molecular dynamics simulations and study its accuracy, performance, and sensitivity to parameters for representative systems. We find that the formalism not only improves the accuracy of Thomas-Fermi-von Weizsäcker orbital-free energies and forces by more than two orders of magnitude but is also more accurate than MLFFs based solely on Kohn-Sham DFT while being more efficient and less sensitive to model parameters. We apply the framework to study the structure of molten Al0.88Si0.12, the results suggesting no aggregation of Si atoms, in agreement with a previous Kohn-Sham study performed at an order of magnitude smaller length and time scales.
Collapse
Affiliation(s)
- Shashikant Kumar
- College of Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332, USA
| | - Xin Jing
- College of Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332, USA
- College of Computing, Georgia Institute of Technology, Atlanta, Georgia 30332, USA
| | - John E Pask
- Physics Division, Lawrence Livermore National Laboratory, Livermore, California 94550, USA
| | - Andrew J Medford
- College of Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332, USA
| | - Phanish Suryanarayana
- College of Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332, USA
- College of Computing, Georgia Institute of Technology, Atlanta, Georgia 30332, USA
| |
Collapse
|
13
|
Ricard TC, Zhu X, Iyengar SS. Capturing Weak Interactions in Surface Adsorbate Systems at Coupled Cluster Accuracy: A Graph-Theoretic Molecular Fragmentation Approach Improved through Machine Learning. J Chem Theory Comput 2023. [PMID: 38019639 DOI: 10.1021/acs.jctc.3c00955] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2023]
Abstract
The accurate and efficient study of the interactions of organic matter with the surface of water is critical to a wide range of applications. For example, environmental studies have found that acidic polyfluorinated alkyl substances, especially perfluorooctanoic acid (PFOA), have spread throughout the environment and bioaccumulate into human populations residing near contaminated watersheds, leading to many systemic maladies. Thus, the study of the interactions of PFOA with water surfaces became important for the mitigation of their activity as pollutants and threats to public health. However, theoretical study of the interactions of such organic adsorbates on the surface of water, and their bulk concerted properties, often necessitates the use of ab initio methods to properly incorporate the long-range electronic properties that govern these extended systems. Notable theoretical treatments of "on-water" reactions thus far have employed hybrid DFT and semilocal DFT, but the interactions involved are weak interactions that may be best described using post-Hartree-Fock theory. Here, we aim to demonstrate the utility of a graph-theoretic approach to molecular fragmentation that accurately captures the critical "weak" interactions while maintaining an efficient ab initio treatment of the long-range periodic interactions that underpin the physics of extended systems. We apply this graph-theoretical treatment to study PFOA on the surface of water as a model system for the study of weak interactions seen in the wide range of surface interactions and reactions. The approach divides a system into a set of vertices, that are then connected through edges, faces, and higher order graph theoretic objects known as simplexes, to represent a collection of locally interacting subsystems. These subsystems are then used to construct ab initio molecular dynamics simulations and for computing multidimensional potential energy surfaces. To further improve the computational efficiency of our graph theoretic fragmentation method, we use a recently developed transfer learning protocol to construct the full system potential energy from a family of neural networks each designed to accurately model the behavior of individual simplexes. We use a unique multidimensional clustering algorithm, based on the k-means clustering methodology, to define our training space for each separate simplex. These models are used to extrapolate the energies for molecular dynamics trajectories at PFOA water interfaces, at less than one-tenth the cost as compared to a regular molecular fragmentation-based dynamics calculation with excellent agreement with couple cluster level of full system potential energies.
Collapse
Affiliation(s)
- Timothy C Ricard
- Department of Chemistry and Department of Physics, Indiana University, 800 E. Kirkwood Avenue, Bloomington, Indiana 47405, United States
| | - Xiao Zhu
- Department of Chemistry and Department of Physics, Indiana University, 800 E. Kirkwood Avenue, Bloomington, Indiana 47405, United States
| | - Srinivasan S Iyengar
- Department of Chemistry and Department of Physics, Indiana University, 800 E. Kirkwood Avenue, Bloomington, Indiana 47405, United States
| |
Collapse
|
14
|
Vinod V, Maity S, Zaspel P, Kleinekathöfer U. Multifidelity Machine Learning for Molecular Excitation Energies. J Chem Theory Comput 2023; 19:7658-7670. [PMID: 37862054 DOI: 10.1021/acs.jctc.3c00882] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2023]
Abstract
The accurate but fast calculation of molecular excited states is still a very challenging topic. For many applications, detailed knowledge of the energy funnel in larger molecular aggregates is of key importance, requiring highly accurate excitation energies. To this end, machine learning techniques can be a very useful tool, though the cost of generating highly accurate training data sets still remains a severe challenge. To overcome this hurdle, this work proposes the use of multifidelity machine learning where very little training data from high accuracies is combined with cheaper and less accurate data to achieve the accuracy of the costlier level. In the present study, the approach is employed to predict vertical excitation energies to the first excited state for three molecules of increasing size, namely, benzene, naphthalene, and anthracene. The energies are trained and tested for conformations stemming from classical molecular dynamics and density functional based tight-binding simulations. It can be shown that the multifidelity machine learning model can achieve the same accuracy as a machine learning model built only on high-cost training data while expending a much lower computational effort to generate the data. The numerical gain observed in these benchmark test calculations was over a factor of 30 but certainly can be much higher for high-accuracy data.
Collapse
Affiliation(s)
- Vivin Vinod
- School of Mathematics and Natural Science, University of Wuppertal, Wuppertal 42119, Germany
- School of Computer Science and Engineering, Constructor University, Campus Ring 1, Bremen 28759, Germany
| | - Sayan Maity
- School of Science, Constructor University, Campus Ring 1, Bremen 28759, Germany
| | - Peter Zaspel
- School of Mathematics and Natural Science, University of Wuppertal, Wuppertal 42119, Germany
- School of Computer Science and Engineering, Constructor University, Campus Ring 1, Bremen 28759, Germany
| | | |
Collapse
|
15
|
Broderick DR, Herbert JM. Scalable generalized screening for high-order terms in the many-body expansion: Algorithm, open-source implementation, and demonstration. J Chem Phys 2023; 159:174801. [PMID: 37921253 DOI: 10.1063/5.0174293] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2023] [Accepted: 10/16/2023] [Indexed: 11/04/2023] Open
Abstract
The many-body expansion lies at the heart of numerous fragment-based methods that are intended to sidestep the nonlinear scaling of ab initio quantum chemistry, making electronic structure calculations feasible in large systems. In principle, inclusion of higher-order n-body terms ought to improve the accuracy in a controllable way, but unfavorable combinatorics often defeats this in practice and applications with n ≥ 4 are rare. Here, we outline an algorithm to overcome this combinatorial bottleneck, based on a bottom-up approach to energy-based screening. This is implemented within a new open-source software application ("Fragme∩t"), which is integrated with a lightweight semi-empirical method that is used to cull subsystems, attenuating the combinatorial growth of higher-order terms in the graph that is used to manage the calculations. This facilitates applications of unprecedented size, and we report four-body calculations in (H2O)64 clusters that afford relative energies within 0.1 kcal/mol/monomer of the supersystem result using less than 10% of the unique subsystems. We also report n-body calculations in (H2O)20 clusters up to n = 8, at which point the expansion terminates naturally due to screening. These are the largest n-body calculations reported to date using ab initio electronic structure theory, and they confirm that high-order n-body terms are mostly artifacts of basis-set superposition error.
Collapse
Affiliation(s)
- Dustin R Broderick
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, Ohio 43210, USA
| | - John M Herbert
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, Ohio 43210, USA
| |
Collapse
|
16
|
Liu Y, Guo H. A Gaussian Process Based Δ-Machine Learning Approach to Reactive Potential Energy Surfaces. J Phys Chem A 2023; 127:8765-8772. [PMID: 37815868 DOI: 10.1021/acs.jpca.3c05318] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/12/2023]
Abstract
The Gaussian process (GP) is an efficient nonparametric machine learning (ML) method. A distinct advantage of the GP is its ability to provide an estimate of statistical uncertainties. This is particularly useful in constructing high-dimensional potential energy surfaces (PESs) from ab initio data as it offers an optimal way to add new geometries to reduce the overall error. In this work, GP is employed in the context of Δ-machine learning (Δ-ML), in which a correction PES to an inaccurate low-level PES is constructed using a small number of high-level ab initio calculations. This new method is tested in three prototypical reactive systems, namely, the H + H2 → H + H2, OH + H2 → H2O + H, and H + CH4 → H2 + CH3 reactions. The results show that the GP-based Δ-ML approach is more efficient than its direct application in constructing high-level PESs. We also compare the new method to a previously proposed neural-network-based Δ-ML approach [Liu and Li J. Phys. Chem. Lett. 2022, 13, 4729-4738]. The results indicate that the two Δ-ML methods have comparable efficiencies in constructing accurate PESs.
Collapse
Affiliation(s)
- Yang Liu
- Department of Chemistry and Chemical Biology, University of New Mexico, Albuquerque, New Mexico 87131, United States
| | - Hua Guo
- Department of Chemistry and Chemical Biology, University of New Mexico, Albuquerque, New Mexico 87131, United States
| |
Collapse
|
17
|
Yu Q, Qu C, Houston PL, Nandi A, Pandey P, Conte R, Bowman JM. A Status Report on "Gold Standard" Machine-Learned Potentials for Water. J Phys Chem Lett 2023; 14:8077-8087. [PMID: 37656898 PMCID: PMC10510435 DOI: 10.1021/acs.jpclett.3c01791] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2023] [Accepted: 08/28/2023] [Indexed: 09/03/2023]
Abstract
Owing to the central importance of water to life as well as its unusual properties, potentials for water have been the subject of extensive research over the past 50 years. Recently, five potentials based on different machine learning approaches have been reported that are at or near the "gold standard" CCSD(T) level of theory. The development of such high-level potentials enables efficient and accurate simulations of water systems using classical and quantum dynamical approaches. This Perspective serves as a status report of these potentials, focusing on their methodology and applications to water systems across different phases. Their performances on the energies of gas phase water clusters, as well as condensed phase structural and dynamical properties, are discussed.
Collapse
Affiliation(s)
- Qi Yu
- Department
of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
| | - Chen Qu
- Independent
Researcher, Toronto, Ontario M9B 0E3, Canada
| | - Paul L. Houston
- Department
of Chemistry and Chemical Biology, Cornell
University, Ithaca, New York 14853, United States
- Department of Chemistry
and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Apurba Nandi
- Department
of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
- Department
of Physics and Materials Science, University
of Luxembourg, L-1511, Luxembourg City, Luxembourg
| | - Priyanka Pandey
- Department
of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
| | - Riccardo Conte
- Dipartimento
di Chimica, Università degli Studi
di Milano, via Golgi 19, 20133 Milano, Italy
| | - Joel M. Bowman
- Department
of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
| |
Collapse
|
18
|
Hashem Y, Foust K, Kaledin M, Kaledin AL. Fitting Potential Energy Surfaces by Learning the Charge Density Matrix with Permutationally Invariant Polynomials. J Chem Theory Comput 2023; 19:5690-5700. [PMID: 37561135 PMCID: PMC10501011 DOI: 10.1021/acs.jctc.3c00586] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2023] [Indexed: 08/11/2023]
Abstract
The electronic energy in the Hartree-Fock (HF) theory is the trace of the product of the charge density matrix (CDM) with the one-electron and two-electron matrices represented in an atomic orbital basis, where the two-electron matrix is also a function of the same CDM. In this work, we examine a formalism of analytic representation of a generic molecular potential energy surface (PES) as a sum of a linearly parameterized HF and a correction term, the latter formally representing the electron correlation energy, also linearly parameterized, by expressing the elements of CDM using permutationally invariant polynomials (PIPs). We show on a variety of numerical examples, ranging from exemplary two-electron systems HeH+ and H3+ to the more challenging cases of methanium (CH5+) fragmentation and high-energy tautomerization of formamide to formimidic acid that such a formulation requires significantly fewer, 10-20% of PIPs, to accomplish the same accuracy of the fit as the conventional representation at practically the same computational cost.
Collapse
Affiliation(s)
- Younos Hashem
- Department
of Chemistry & Biochemistry, Kennesaw
State University, 370 Paulding Ave NW, Box # 1203, Kennesaw 30144, Georgia
| | - Katheryn Foust
- Department
of Chemistry & Biochemistry, Kennesaw
State University, 370 Paulding Ave NW, Box # 1203, Kennesaw 30144, Georgia
| | - Martina Kaledin
- Department
of Chemistry & Biochemistry, Kennesaw
State University, 370 Paulding Ave NW, Box # 1203, Kennesaw 30144, Georgia
| | - Alexey L. Kaledin
- Cherry
L. Emerson Center for Scientific Computation and Department of Chemistry, Emory University, 1515 Dickey Drive, Atlanta 30322, Georgia
| |
Collapse
|
19
|
Riera M, Knight C, Bull-Vulpe EF, Zhu X, Agnew H, Smith DGA, Simmonett AC, Paesani F. MBX: A many-body energy and force calculator for data-driven many-body simulations. J Chem Phys 2023; 159:054802. [PMID: 37526156 PMCID: PMC10550339 DOI: 10.1063/5.0156036] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Accepted: 07/11/2023] [Indexed: 08/02/2023] Open
Abstract
Many-Body eXpansion (MBX) is a C++ library that implements many-body potential energy functions (PEFs) within the "many-body energy" (MB-nrg) formalism. MB-nrg PEFs integrate an underlying polarizable model with explicit machine-learned representations of many-body interactions to achieve chemical accuracy from the gas to the condensed phases. MBX can be employed either as a stand-alone package or as an energy/force engine that can be integrated with generic software for molecular dynamics and Monte Carlo simulations. MBX is parallelized internally using Open Multi-Processing and can utilize Message Passing Interface when available in interfaced molecular simulation software. MBX enables classical and quantum molecular simulations with MB-nrg PEFs, as well as hybrid simulations that combine conventional force fields and MB-nrg PEFs, for diverse systems ranging from small gas-phase clusters to aqueous solutions and molecular fluids to biomolecular systems and metal-organic frameworks.
Collapse
Affiliation(s)
- Marc Riera
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California 92093, USA
| | - Christopher Knight
- Argonne National Laboratory, Computational Science Division, Lemont, Illinois 60439, USA
| | - Ethan F. Bull-Vulpe
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California 92093, USA
| | - Xuanyu Zhu
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California 92093, USA
| | - Henry Agnew
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California 92093, USA
| | | | - Andrew C. Simmonett
- Laboratory of Computational Biology, National Heart, Lung and Blood Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | | |
Collapse
|
20
|
Liu B, Liu L, Qin X, Liu Y, Yang R, Mo X, Qin C, Liang C, Yao S. Effect of Substituents on Molecular Reactivity during Lignin Oxidation by Chlorine Dioxide: A Density Functional Theory Study. Int J Mol Sci 2023; 24:11809. [PMID: 37511570 PMCID: PMC10380563 DOI: 10.3390/ijms241411809] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2023] [Revised: 07/14/2023] [Accepted: 07/18/2023] [Indexed: 07/30/2023] Open
Abstract
Lignin is a polymer with a complex structure. It is widely present in lignocellulosic biomass, and it has a variety of functional group substituents and linkage forms. Especially during the oxidation reaction, the positioning effect of the different substituents of the benzene ring leads to differences in lignin reactivity. The position of the benzene ring branched chain with respect to methoxy is important. The study of the effect of benzene substituents on the oxidation reaction's activity is still an unfinished task. In this study, density functional theory (DFT) and the m062x/6-311+g (d) basis set were used. Differences in the processes of phenolic oxygen intermediates formed by phenolic lignin structures (with different substituents) with chlorine dioxide during the chlorine dioxide reaction were investigated. Six phenolic lignin model species with different structures were selected. Bond energies, electrostatic potentials, atomic charges, Fukui functions and double descriptors of lignin model substances and reaction energy barriers are compared. The effects of benzene ring branched chains and methoxy on the mechanism of chlorine dioxide oxidation of lignin were revealed systematically. The results showed that the substituents with shorter branched chains and strong electron-absorbing ability were more stable. Lignin is not easily susceptible to the effects of chlorine dioxide. The substituents with longer branched chains have a significant effect on the flow of electron clouds. The results demonstrate that chlorine dioxide can affect the electron arrangement around the molecule, which directly affects the electrophilic activity of the molecule. The electron-absorbing effect of methoxy leads to a low dissociation energy of the phenolic hydroxyl group. Electrophilic reagents are more likely to attack this reaction site. In addition, the stabilizing effect of methoxy on the molecular structure of lignin was also found.
Collapse
Affiliation(s)
- Baojie Liu
- Guangxi Key Laboratory of Clean Pulp & Papermaking and Pollution Control, School of Light Industrial and Food Engineering, Guangxi University, Nanning 530004, China
| | - Lu Liu
- Guangxi Key Laboratory of Clean Pulp & Papermaking and Pollution Control, School of Light Industrial and Food Engineering, Guangxi University, Nanning 530004, China
| | - Xin Qin
- Guangxi Key Laboratory of Clean Pulp & Papermaking and Pollution Control, School of Light Industrial and Food Engineering, Guangxi University, Nanning 530004, China
| | - Yi Liu
- Guangxi Key Laboratory of Clean Pulp & Papermaking and Pollution Control, School of Light Industrial and Food Engineering, Guangxi University, Nanning 530004, China
| | - Rui Yang
- Guangxi Key Laboratory of Clean Pulp & Papermaking and Pollution Control, School of Light Industrial and Food Engineering, Guangxi University, Nanning 530004, China
| | - Xiaorong Mo
- Guangxi Key Laboratory of Clean Pulp & Papermaking and Pollution Control, School of Light Industrial and Food Engineering, Guangxi University, Nanning 530004, China
| | - Chengrong Qin
- Guangxi Key Laboratory of Clean Pulp & Papermaking and Pollution Control, School of Light Industrial and Food Engineering, Guangxi University, Nanning 530004, China
| | - Chen Liang
- Guangxi Key Laboratory of Clean Pulp & Papermaking and Pollution Control, School of Light Industrial and Food Engineering, Guangxi University, Nanning 530004, China
| | - Shuangquan Yao
- Guangxi Key Laboratory of Clean Pulp & Papermaking and Pollution Control, School of Light Industrial and Food Engineering, Guangxi University, Nanning 530004, China
| |
Collapse
|
21
|
Dandu NK, Ward L, Assary RS, Redfern PC, Curtiss LA. Accurate Prediction of Adiabatic Ionization Potentials of Organic Molecules using Quantum Chemistry Assisted Machine Learning. J Phys Chem A 2023. [PMID: 37406209 DOI: 10.1021/acs.jpca.3c00823] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/07/2023]
Abstract
In previous work (Dandu et al., J. Phys. Chem. A, 2022, 126, 4528-4536), we were successful in predicting accurate atomization energies of organic molecules using machine learning (ML) models, obtaining an accuracy as low as 0.1 kcal/mol compared to the G4MP2 method. In this work, we extend the use of these ML models to adiabatic ionization potentials on data sets of energies generated using quantum chemical calculations. Atomic specific corrections that were found to improve atomization energies from quantum chemical calculations have also been used in this study to improve ionization potentials. The quantum chemical calculations were performed on 3405 molecules containing eight or fewer non-hydrogen atoms derived from the QM9 data set, using the B3LYP functional with the 6-31G(2df,p) basis set for optimization. Low-fidelity IPs for these structures were obtained using two density functional methods: B3LYP/6-31+G(2df,p) and ωB97XD/6-311+G(3df,2p). Highly accurate G4MP2 calculations were performed on these optimized structures to obtain high-fidelity IPs to use in ML models based on the low-fidelity IPs. Our best performing ML methods gave IPs of organic molecules within a mean absolute deviation of 0.035 eV from the G4MP2 IPs for the whole data set. This work demonstrates that ML predictions assisted by quantum chemical calculations can be used to successfully predict IPs of organic molecules for use in high throughput screening.
Collapse
Affiliation(s)
- Naveen K Dandu
- Materials Science Division, Argonne National Laboratory, Lemont, Illinois 60439, United States
- Joint Center for Energy Storage Research (JCESR), Argonne National Laboratory, Lemont, Illinois 60439, United States
- Chemical Engineering Department, University of Illinois-Chicago, Chicago, Illinois 60608, United States
| | - Logan Ward
- Joint Center for Energy Storage Research (JCESR), Argonne National Laboratory, Lemont, Illinois 60439, United States
- Data Science and Learning Division, Argonne National Laboratory, Lemont, Illinois 60439, United States
| | - Rajeev S Assary
- Materials Science Division, Argonne National Laboratory, Lemont, Illinois 60439, United States
- Joint Center for Energy Storage Research (JCESR), Argonne National Laboratory, Lemont, Illinois 60439, United States
| | - Paul C Redfern
- Materials Science Division, Argonne National Laboratory, Lemont, Illinois 60439, United States
| | - Larry A Curtiss
- Materials Science Division, Argonne National Laboratory, Lemont, Illinois 60439, United States
- Joint Center for Energy Storage Research (JCESR), Argonne National Laboratory, Lemont, Illinois 60439, United States
| |
Collapse
|
22
|
Staub R, Gantzer P, Harabuchi Y, Maeda S, Varnek A. Challenges for Kinetics Predictions via Neural Network Potentials: A Wilkinson's Catalyst Case. Molecules 2023; 28:molecules28114477. [PMID: 37298952 DOI: 10.3390/molecules28114477] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2023] [Revised: 05/23/2023] [Accepted: 05/26/2023] [Indexed: 06/12/2023] Open
Abstract
Ab initio kinetic studies are important to understand and design novel chemical reactions. While the Artificial Force Induced Reaction (AFIR) method provides a convenient and efficient framework for kinetic studies, accurate explorations of reaction path networks incur high computational costs. In this article, we are investigating the applicability of Neural Network Potentials (NNP) to accelerate such studies. For this purpose, we are reporting a novel theoretical study of ethylene hydrogenation with a transition metal complex inspired by Wilkinson's catalyst, using the AFIR method. The resulting reaction path network was analyzed by the Generative Topographic Mapping method. The network's geometries were then used to train a state-of-the-art NNP model, to replace expensive ab initio calculations with fast NNP predictions during the search. This procedure was applied to run the first NNP-powered reaction path network exploration using the AFIR method. We discovered that such explorations are particularly challenging for general purpose NNP models, and we identified the underlying limitations. In addition, we are proposing to overcome these challenges by complementing NNP models with fast semiempirical predictions. The proposed solution offers a generally applicable framework, laying the foundations to further accelerate ab initio kinetic studies with Machine Learning Force Fields, and ultimately explore larger systems that are currently inaccessible.
Collapse
Affiliation(s)
- Ruben Staub
- Institute for Chemical Reaction Design and Discovery (WPI-ICReDD), Hokkaido University, Kita 21, Nishi 10, Kita-ku, Sapporo 001-0021, Japan
| | - Philippe Gantzer
- Institute for Chemical Reaction Design and Discovery (WPI-ICReDD), Hokkaido University, Kita 21, Nishi 10, Kita-ku, Sapporo 001-0021, Japan
| | - Yu Harabuchi
- Institute for Chemical Reaction Design and Discovery (WPI-ICReDD), Hokkaido University, Kita 21, Nishi 10, Kita-ku, Sapporo 001-0021, Japan
- Japan Science and Technology Agency (JST), ERATO Maeda Artificial Intelligence in Chemical Reaction Design and Discovery Project, Kita 10, Nishi 8, Kita-ku, Sapporo 060-0810, Japan
- Department of Chemistry, Faculty of Science, Hokkaido University, Kita 10, Nishi 8, Kita-ku, Sapporo 060-0810, Japan
| | - Satoshi Maeda
- Institute for Chemical Reaction Design and Discovery (WPI-ICReDD), Hokkaido University, Kita 21, Nishi 10, Kita-ku, Sapporo 001-0021, Japan
- Japan Science and Technology Agency (JST), ERATO Maeda Artificial Intelligence in Chemical Reaction Design and Discovery Project, Kita 10, Nishi 8, Kita-ku, Sapporo 060-0810, Japan
- Department of Chemistry, Faculty of Science, Hokkaido University, Kita 10, Nishi 8, Kita-ku, Sapporo 060-0810, Japan
- Research and Services Division of Materials Data and Integrated System (MaDIS), National Institute for Materials Science (NIMS), 1-1 Namiki, Tsukuba 305-0044, Japan
| | - Alexandre Varnek
- Institute for Chemical Reaction Design and Discovery (WPI-ICReDD), Hokkaido University, Kita 21, Nishi 10, Kita-ku, Sapporo 001-0021, Japan
- Laboratory of Chemoinformatics, UMR 7140, CNRS, University of Strasbourg, 67081 Strasbourg, France
| |
Collapse
|
23
|
Murakami T, Ibuki S, Hashimoto Y, Kikuma Y, Takayanagi T. Dynamics study of the post-transition-state-bifurcation process of the (HCOOH)H + → CO + H 3O +/HCO + + H 2O dissociation: application of machine-learning techniques. Phys Chem Chem Phys 2023; 25:14016-14027. [PMID: 37161528 DOI: 10.1039/d3cp00252g] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
The process of protonated formic acid dissociating from the transition state was studied using ring-polymer molecular dynamics (RPMD), classical MD, and quasi-classical trajectory (QCT) simulations. Temperature had a strong influence on the branching fractions for the HCO+ + H2O and CO + H3O+ dissociation channels. The RPMD and classical MD simulations showed similar behavior, but the QCT dynamics were significantly different owing to the excess energies in the quasi-classical trajectories. Machine-learning analysis identified several key features in the phase information of the vibrational motions at the transition state. We found that the initial configuration and momentum of a hydrogen atom connected to a carbon atom and the shrinking coordinate of the CO bond at the transition state play a role in the dynamics of HCO+ + H2O production.
Collapse
Affiliation(s)
- Tatsuhiro Murakami
- Department of Chemistry, Saitama University, Shimo-Okubo 255, Sakura-ku, Saitama City, Saitama, 338-8570, Japan.
- Department of Materials & Life Sciences, Faculty of Science & Technology, Sophia University, 7-1 Kioicho, Chiyoda-ku, Tokyo, 102-8554, Japan
| | - Shunichi Ibuki
- Department of Chemistry, Saitama University, Shimo-Okubo 255, Sakura-ku, Saitama City, Saitama, 338-8570, Japan.
| | - Yu Hashimoto
- Department of Chemistry, Saitama University, Shimo-Okubo 255, Sakura-ku, Saitama City, Saitama, 338-8570, Japan.
| | - Yuya Kikuma
- Department of Chemistry, Saitama University, Shimo-Okubo 255, Sakura-ku, Saitama City, Saitama, 338-8570, Japan.
| | - Toshiyuki Takayanagi
- Department of Chemistry, Saitama University, Shimo-Okubo 255, Sakura-ku, Saitama City, Saitama, 338-8570, Japan.
| |
Collapse
|
24
|
Heindel JP, Herman KM, Xantheas SS. Many-Body Effects in Aqueous Systems: Synergies Between Interaction Analysis Techniques and Force Field Development. Annu Rev Phys Chem 2023; 74:337-360. [PMID: 37093659 DOI: 10.1146/annurev-physchem-062422-023532] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/25/2023]
Abstract
Interaction analysis techniques, including the many-body expansion (MBE), symmetry-adapted perturbation theory, and energy decomposition analysis, allow for an intuitive understanding of complex molecular interactions. We review these methods by first providing a historical context for the study of many-body interactions and discussing how nonadditivities emerge from Hamiltonians containing strictly pairwise-additive interactions. We then elaborate on the synergy between these interaction analysis techniques and the development of advanced force fields aimed at accurately reproducing the Born-Oppenheimer potential energy surface. In particular, we focus on ab initio-based force fields that aim to explicitly reproduce many-body terms and are fitted to high-level electronic structure results. These force fields generally incorporate many-body effects through (a) parameterization of distributed multipoles, (b) explicit fitting of the MBE, (c) inclusion of many-atom features in a neural network, and (d) coarse-graining of many-body terms into an effective two-body term. We also discuss the emerging use of the MBE to improve the accuracy and speed of ab initio molecular dynamics.
Collapse
Affiliation(s)
- Joseph P Heindel
- Department of Chemistry, University of Washington, Seattle, Washington, USA
| | - Kristina M Herman
- Department of Chemistry, University of Washington, Seattle, Washington, USA
| | - Sotiris S Xantheas
- Department of Chemistry, University of Washington, Seattle, Washington, USA
- Advanced Computing, Mathematics and Data Division, Pacific Northwest National Laboratory, Richland, Washington, USA; ,
| |
Collapse
|
25
|
Schneider M, Born D, Kästner J, Rauhut G. Positioning of grid points for spanning potential energy surfaces-How much effort is really needed? J Chem Phys 2023; 158:144118. [PMID: 37061506 DOI: 10.1063/5.0146020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/17/2023] Open
Abstract
The positions of grid points for representing a multidimensional potential energy surface (PES) have a non-negligible impact on its accuracy and the associated computational effort for its generation. Six different positioning schemes were studied for PESs represented by n-mode expansions as needed for the accurate calculation of anharmonic vibrational frequencies by means of vibrational configuration interaction theory. A static approach, which has successfully been used in many applications, and five adaptive schemes based on Gaussian process regression have been investigated with respect to the number of necessary grid points and the accuracy of the fundamental modes for a small set of test molecules. A comparison with a related, more sophisticated, and consistent approach by Christiansen et al. is provided. The impact of the positions of the ab initio grid points is discussed for multilevel PESs, for which the computational effort of the individual electronic structure calculations decreases for increasing orders of the n-mode expansion. As a result of that, the ultimate goal is not the maximal reduction of grid points but rather the computational cost, which is not directly related.
Collapse
Affiliation(s)
- Moritz Schneider
- Institute for Theoretical Chemistry, University of Stuttgart, Pfaffenwaldring 55, 70569 Stuttgart, Germany
| | - Daniel Born
- Institute for Theoretical Chemistry, University of Stuttgart, Pfaffenwaldring 55, 70569 Stuttgart, Germany
| | - Johannes Kästner
- Institute for Theoretical Chemistry, University of Stuttgart, Pfaffenwaldring 55, 70569 Stuttgart, Germany
| | - Guntram Rauhut
- Institute for Theoretical Chemistry, University of Stuttgart, Pfaffenwaldring 55, 70569 Stuttgart, Germany
| |
Collapse
|
26
|
Low K, Coote ML, Izgorodina EI. Accurate Prediction of Three-Body Intermolecular Interactions via Electron Deformation Density-Based Machine Learning. J Chem Theory Comput 2023; 19:1466-1475. [PMID: 36787280 DOI: 10.1021/acs.jctc.2c00984] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/15/2023]
Abstract
This work extends the electron deformation density-based descriptor, originally developed in the electron deformation density-based interaction energy machine learning (EDDIE-ML) algorithm to predict dimer interaction energies, to the prediction of three-body interactions in trimers. Using a sequential learning process to select the training data, the resulting Gaussian process regression (GPR) model predicts the three-body interaction energy within 0.2 kcal mol-1 of the SRS-MP2/cc-pVTZ reference values for the 3B69 and S22-3 trimer data sets. A hybrid kernel function is introduced, which combines contributions from the average and individual atomic environments, allowing the total trimer interaction energy to be predicted in addition to the three-body contribution using the same descriptor. To extend the range and diversity of trimer interaction energies available in the literature, a new data set based on a protein-ligand crystal structure is introduced, consisting of 509 structures of a central ligand with two protein fragments. Benchmark calculations are provided for the new data set, which contains significantly larger molecular interactions than current databases in the literature in addition to charged fragments. Compared to density funtional theory (DFT)- and wavefunction-based methods for calculating the three-body interaction energy, our model makes predictions in a significantly shorter time frame by reducing the number of required SCF calculations from 7 to 4 performed at the PBE0 level of theory, showcasing the utility and efficiency of our Δ-ML method particularly when applied to larger systems.
Collapse
Affiliation(s)
- Kaycee Low
- Monash Computational Chemistry Group, School of Chemistry, Monash University, Clayton, Victoria 3800, Australia
| | - Michelle L Coote
- Institute for Nanoscale Science and Technology, College of Science and Engineering, Flinders University, Bedford Park, South Australia 5042, Australia
| | - Ekaterina I Izgorodina
- Monash Computational Chemistry Group, School of Chemistry, Monash University, Clayton, Victoria 3800, Australia
| |
Collapse
|
27
|
Huang B, von Lilienfeld OA, Krogel JT, Benali A. Toward DMC Accuracy Across Chemical Space with Scalable Δ-QML. J Chem Theory Comput 2023; 19:1711-1721. [PMID: 36857531 DOI: 10.1021/acs.jctc.2c01058] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/03/2023]
Abstract
In the past decade, quantum diffusion Monte Carlo (DMC) has been demonstrated to successfully predict the energetics and properties of a wide range of molecules and solids by numerically solving the electronic many-body Schrödinger equation. With O(N3) scaling with the number of electrons N, DMC has the potential to be a reference method for larger systems that are not accessible to more traditional methods such as CCSD(T). Assessing the accuracy of DMC for smaller molecules becomes the stepping stone in making the method a reference for larger systems. We show that when coupled with quantum machine learning (QML)-based surrogate methods, the computational burden can be alleviated such that quantum Monte Carlo (QMC) shows clear potential to undergird the formation of high-quality descriptions across chemical space. We discuss three crucial approximations necessary to accomplish this: the fixed-node approximation, universal and accurate references for chemical bond dissociation energies, and scalable minimal amons-set-based QML (AQML) models. Numerical evidence presented includes converged DMC results for over 1000 small organic molecules with up to five heavy atoms used as amons and 50 medium-sized organic molecules with nine heavy atoms to validate the AQML predictions. Numerical evidence collected for Δ-AQML models suggests that already modestly sized QMC training data sets of amons suffice to predict total energies with near chemical accuracy throughout chemical space.
Collapse
Affiliation(s)
- Bing Huang
- Faculty of Physics, University of Vienna, Kolingasse 14-16, 1090 Vienna, Austria
| | - O Anatole von Lilienfeld
- Departments of Chemistry, Materials Science and Engineering, and Physics, University of Toronto, St. George Campus, Toronto, ON M5S 3H6, Canada.,Machine Learning Group, Technische Universität Berlin and Institute for the Foundations of Learning and Data, 10587 Berlin, Germany.,Vector Institute for Artificial Intelligence, Toronto, ON M5S 1M1, Canada
| | - Jaron T Krogel
- Materials Science and Technology Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831, United States
| | - Anouar Benali
- Computational Science Division, Argonne National Laboratory, Argonne, Illinois 60439, United States
| |
Collapse
|
28
|
Li H, Jiao Y, Davey K, Qiao SZ. Data-Driven Machine Learning for Understanding Surface Structures of Heterogeneous Catalysts. Angew Chem Int Ed Engl 2023; 62:e202216383. [PMID: 36509704 DOI: 10.1002/anie.202216383] [Citation(s) in RCA: 15] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Revised: 12/11/2022] [Accepted: 12/12/2022] [Indexed: 12/15/2022]
Abstract
The design of heterogeneous catalysts is necessarily surface-focused, generally achieved via optimization of adsorption energy and microkinetic modelling. A prerequisite is to ensure the adsorption energy is physically meaningful is the stable existence of the conceived active-site structure on the surface. The development of improved understanding of the catalyst surface, however, is challenging practically because of the complex nature of dynamic surface formation and evolution under in-situ reactions. We propose therefore data-driven machine-learning (ML) approaches as a solution. In this Minireview we summarize recent progress in using machine-learning to search and predict (meta)stable structures, assist operando simulation under reaction conditions and micro-environments, and critically analyze experimental characterization data. We conclude that ML will become the new norm to lower costs associated with discovery and design of optimal heterogeneous catalysts.
Collapse
Affiliation(s)
- Haobo Li
- School of Chemical Engineering and Advanced Materials, The University of Adelaide, Adelaide, SA 5005, Australia
| | - Yan Jiao
- School of Chemical Engineering and Advanced Materials, The University of Adelaide, Adelaide, SA 5005, Australia
| | - Kenneth Davey
- School of Chemical Engineering and Advanced Materials, The University of Adelaide, Adelaide, SA 5005, Australia
| | - Shi-Zhang Qiao
- School of Chemical Engineering and Advanced Materials, The University of Adelaide, Adelaide, SA 5005, Australia
| |
Collapse
|
29
|
Käser S, Vazquez-Salazar LI, Meuwly M, Töpfer K. Neural network potentials for chemistry: concepts, applications and prospects. DIGITAL DISCOVERY 2023; 2:28-58. [PMID: 36798879 PMCID: PMC9923808 DOI: 10.1039/d2dd00102k] [Citation(s) in RCA: 17] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/23/2022] [Accepted: 12/20/2022] [Indexed: 12/24/2022]
Abstract
Artificial Neural Networks (NN) are already heavily involved in methods and applications for frequent tasks in the field of computational chemistry such as representation of potential energy surfaces (PES) and spectroscopic predictions. This perspective provides an overview of the foundations of neural network-based full-dimensional potential energy surfaces, their architectures, underlying concepts, their representation and applications to chemical systems. Methods for data generation and training procedures for PES construction are discussed and means for error assessment and refinement through transfer learning are presented. A selection of recent results illustrates the latest improvements regarding accuracy of PES representations and system size limitations in dynamics simulations, but also NN application enabling direct prediction of physical results without dynamics simulations. The aim is to provide an overview for the current state-of-the-art NN approaches in computational chemistry and also to point out the current challenges in enhancing reliability and applicability of NN methods on a larger scale.
Collapse
Affiliation(s)
- Silvan Käser
- Department of Chemistry, University of Basel Klingelbergstrasse 80 CH-4056 Basel Switzerland
| | | | - Markus Meuwly
- Department of Chemistry, University of Basel Klingelbergstrasse 80 CH-4056 Basel Switzerland
| | - Kai Töpfer
- Department of Chemistry, University of Basel Klingelbergstrasse 80 CH-4056 Basel Switzerland
| |
Collapse
|
30
|
Houston PL, Qu C, Yu Q, Conte R, Nandi A, Li JK, Bowman JM. PESPIP: Software to fit complex molecular and many-body potential energy surfaces with permutationally invariant polynomials. J Chem Phys 2023; 158:044109. [PMID: 36725524 DOI: 10.1063/5.0134442] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
We wish to describe a potential energy surface by using a basis of permutationally invariant polynomials whose coefficients will be determined by numerical regression so as to smoothly fit a dataset of electronic energies as well as, perhaps, gradients. The polynomials will be powers of transformed internuclear distances, usually either Morse variables, exp(-ri,j/λ), where λ is a constant range hyperparameter, or reciprocals of the distances, 1/ri,j. The question we address is how to create the most efficient basis, including (a) which polynomials to keep or discard, (b) how many polynomials will be needed, (c) how to make sure the polynomials correctly reproduce the zero interaction at a large distance, (d) how to ensure special symmetries, and (e) how to calculate gradients efficiently. This article discusses how these questions can be answered by using a set of programs to choose and manipulate the polynomials as well as to write efficient Fortran programs for the calculation of energies and gradients. A user-friendly interface for access to monomial symmetrization approach results is also described. The software for these programs is now publicly available.
Collapse
Affiliation(s)
- Paul L Houston
- Department of Chemistry and Chemical Biology, Cornell University, Ithaca, New York 14853, USA and Department of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332, USA
| | - Chen Qu
- Independent Researcher, Toronto, Ontario M9B0E3, Canada
| | - Qi Yu
- Department of Chemistry, Yale University, New Haven, Connecticut 06520, USA
| | - Riccardo Conte
- Dipartimento di Chimica, Università Degli Studi di Milano, Via Golgi 19, 20133 Milano, Italy
| | - Apurba Nandi
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, USA
| | - Jeffrey K Li
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, USA
| | - Joel M Bowman
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, USA
| |
Collapse
|
31
|
Bowman JM, Qu C, Conte R, Nandi A, Houston PL, Yu Q. Δ-Machine Learned Potential Energy Surfaces and Force Fields. J Chem Theory Comput 2023; 19:1-17. [PMID: 36527383 DOI: 10.1021/acs.jctc.2c01034] [Citation(s) in RCA: 30] [Impact Index Per Article: 30.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
There has been great progress in developing machine-learned potential energy surfaces (PESs) for molecules and clusters with more than 10 atoms. Unfortunately, this number of atoms generally limits the level of electronic structure theory to less than the "gold standard" CCSD(T) level. Indeed, for the well-known MD17 dataset for molecules with 9-20 atoms, all of the energies and forces were obtained with DFT calculations (PBE). This Perspective is focused on a Δ-machine learning method that we recently proposed and applied to bring DFT-based PESs to close to CCSD(T) accuracy. This is demonstrated for hydronium, N-methylacetamide, acetyl acetone, and ethanol. For 15-atom tropolone, it appears that special approaches (e.g., molecular tailoring, local CCSD(T)) are needed to obtain the CCSD(T) energies. A new aspect of this approach is the extension of Δ-machine learning to force fields. The approach is based on many-body corrections to polarizable force field potentials. This is examined in detail using the TTM2.1 water potential. The corrections make use of our recent CCSD(T) datasets for 2-b, 3-b, and 4-b interactions for water. These datasets were used to develop a new fully ab initio potential for water, termed q-AQUA.
Collapse
Affiliation(s)
- Joel M Bowman
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
| | - Chen Qu
- Independent Researcher, Toronto, Canada 66777
| | - Riccardo Conte
- Dipartimento di Chimica, Università Degli Studi di Milano, via Golgi 19, 20133 Milano, Italy
| | - Apurba Nandi
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
| | - Paul L Houston
- Department of Chemistry and Chemical Biology, Cornell University, Ithaca, New York 14853, United States.,Department of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Qi Yu
- Department of Chemistry, Yale University, New Haven, Connecticut 06520, United States
| |
Collapse
|
32
|
Murakami T, Iida R, Hashimoto Y, Takahashi Y, Takahashi S, Takayanagi T. Ring-Polymer Molecular Dynamics and Kinetics for the H – + C 2H 2 → H 2 + C 2H – Reaction Using the Full-Dimensional Potential Energy Surface. J Phys Chem A 2022; 126:9244-9258. [DOI: 10.1021/acs.jpca.2c05851] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Affiliation(s)
- Tatsuhiro Murakami
- Department of Chemistry, Saitama University, Shimo-Okubo 255, Sakura-ku, Saitama City, Saitama338-8570, Japan
- Department of Materials & Life Sciences, Faculty of Science & Technology, Sophia University, 7-1 Kioicho, Chiyoda-ku, Tokyo102-8554, Japan
| | - Ryusei Iida
- Department of Chemistry, Saitama University, Shimo-Okubo 255, Sakura-ku, Saitama City, Saitama338-8570, Japan
| | - Yu Hashimoto
- Department of Chemistry, Saitama University, Shimo-Okubo 255, Sakura-ku, Saitama City, Saitama338-8570, Japan
| | - Yukinobu Takahashi
- Department of Chemistry, Saitama University, Shimo-Okubo 255, Sakura-ku, Saitama City, Saitama338-8570, Japan
| | - Soma Takahashi
- Department of Chemistry, Saitama University, Shimo-Okubo 255, Sakura-ku, Saitama City, Saitama338-8570, Japan
| | - Toshiyuki Takayanagi
- Department of Chemistry, Saitama University, Shimo-Okubo 255, Sakura-ku, Saitama City, Saitama338-8570, Japan
| |
Collapse
|
33
|
Naturally-meaningful and efficient descriptors: machine learning of material properties based on robust one-shot ab initio descriptors. J Cheminform 2022; 14:78. [PMID: 36348412 PMCID: PMC9644534 DOI: 10.1186/s13321-022-00658-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2022] [Accepted: 10/01/2022] [Indexed: 11/09/2022] Open
Abstract
Establishing a data-driven pipeline for the discovery of novel materials requires the engineering of material features that can be feasibly calculated and can be applied to predict a material’s target properties. Here we propose a new class of descriptors for describing crystal structures, which we term Robust One-Shot Ab initio (ROSA) descriptors. ROSA is computationally cheap and is shown to accurately predict a range of material properties. These simple and intuitive class of descriptors are generated from the energetics of a material at a low level of theory using an incomplete ab initio calculation. We demonstrate how the incorporation of ROSA descriptors in ML-based property prediction leads to accurate predictions over a wide range of crystals, amorphized crystals, metal–organic frameworks and molecules. We believe that the low computational cost and ease of use of these descriptors will significantly improve ML-based predictions.
Collapse
|
34
|
Käser S, Richardson JO, Meuwly M. Transfer Learning for Affordable and High-Quality Tunneling Splittings from Instanton Calculations. J Chem Theory Comput 2022; 18:6840-6850. [DOI: 10.1021/acs.jctc.2c00790] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Silvan Käser
- Department of Chemistry, University of Basel, Klingelbergstrasse 80, CH-4056 Basel, Switzerland
| | | | - Markus Meuwly
- Department of Chemistry, University of Basel, Klingelbergstrasse 80, CH-4056 Basel, Switzerland
| |
Collapse
|
35
|
Conte R, Nandi A, Qu C, Yu Q, Houston PL, Bowman JM. Semiclassical and VSCF/VCI Calculations of the Vibrational Energies of trans- and gauche-Ethanol Using a CCSD(T) Potential Energy Surface. J Phys Chem A 2022; 126:7709-7718. [PMID: 36240438 DOI: 10.1021/acs.jpca.2c06322] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
A recent full-dimensional Δ-Machine learning potential energy surface (PES) for ethanol is employed in semiclassical and vibrational self-consistent field (VSCF) and virtual-state configuration interaction (VCI) calculations, using MULTIMODE, to determine the anharmonic vibrational frequencies of vibration for both the trans and gauche conformers of ethanol. Both semiclassical and VSCF/VCI energies agree well with the experimental data. We find significant mixing between the VSCF basis states due to Fermi resonances between bending and stretching modes. The same effects are also accurately described by the full-dimensional semiclassical calculations. These are the first high-level anharmonic calculations using a PES, in particular a "gold-standard" CCSD(T) one.
Collapse
Affiliation(s)
- Riccardo Conte
- Dipartimento di Chimica, Università degli Studi di Milano, via Golgi 19, 20133 Milano, Italy
| | - Apurba Nandi
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
| | - Chen Qu
- Independent Researcher, Toronto, Ontario M9B0E3, Canada
| | - Qi Yu
- Department of Chemistry Yale University, New Haven, Connecticut 06520, United States
| | - Paul L Houston
- Department of Chemistry and Chemical Biology, Cornell University, Ithaca, New York 14853, United States.,Department of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Joel M Bowman
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
| |
Collapse
|
36
|
Lunghi A, Sanvito S. Computational design of magnetic molecules and their environment using quantum chemistry, machine learning and multiscale simulations. Nat Rev Chem 2022; 6:761-781. [PMID: 37118096 DOI: 10.1038/s41570-022-00424-3] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/15/2022] [Indexed: 11/09/2022]
Abstract
Having served as a playground for fundamental studies on the physics of d and f electrons for almost a century, magnetic molecules are now becoming increasingly important for technological applications, such as magnetic resonance, data storage, spintronics and quantum information. All of these applications require the preservation and control of spins in time, an ability hampered by the interaction with the environment, namely with other spins, conduction electrons, molecular vibrations and electromagnetic fields. Thus, the design of a novel magnetic molecule with tailored properties is a formidable task, which does not only concern its electronic structures but also calls for a deep understanding of the interaction among all the degrees of freedom at play. This Review describes how state-of-the-art ab initio computational methods, combined with data-driven approaches to materials modelling, can be integrated into a fully multiscale strategy capable of defining design rules for magnetic molecules.
Collapse
|
37
|
Kuntz D, Wilson AK. Machine learning, artificial intelligence, and chemistry: how smart algorithms are reshaping simulation and the laboratory. PURE APPL CHEM 2022. [DOI: 10.1515/pac-2022-0202] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Abstract
Machine learning and artificial intelligence are increasingly gaining in prominence through image analysis, language processing, and automation, to name a few applications. Machine learning is also making profound changes in chemistry. From revisiting decades-old analytical techniques for the purpose of creating better calibration curves, to assisting and accelerating traditional in silico simulations, to automating entire scientific workflows, to being used as an approach to deduce underlying physics of unexplained chemical phenomena, machine learning and artificial intelligence are reshaping chemistry, accelerating scientific discovery, and yielding new insights. This review provides an overview of machine learning and artificial intelligence from a chemist’s perspective and focuses on a number of examples of the use of these approaches in computational chemistry and in the laboratory.
Collapse
Affiliation(s)
- David Kuntz
- Department of Chemistry , University of North Texas , Denton , TX 76201 , USA
| | - Angela K. Wilson
- Department of Chemistry , Michigan State University , East Lansing , MI 48824 , USA
| |
Collapse
|
38
|
Zhu X, Iyengar SS. Graph Theoretic Molecular Fragmentation for Multidimensional Potential Energy Surfaces Yield an Adaptive and General Transfer Machine Learning Protocol. J Chem Theory Comput 2022; 18:5125-5144. [PMID: 35994592 DOI: 10.1021/acs.jctc.1c01241] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Over a series of publications we have introduced a graph-theoretic description for molecular fragmentation. Here, a system is divided into a set of nodes, or vertices, that are then connected through edges, faces, and higher-order simplexes to represent a collection of spatially overlapping and locally interacting subsystems. Each such subsystem is treated at two levels of electronic structure theory, and the result is used to construct many-body expansions that are then embedded within an ONIOM-scheme. These expansions converge rapidly with many-body order (or graphical rank) of subsystems and have been previously used for ab initio molecular dynamics (AIMD) calculations and for computing multidimensional potential energy surfaces. Specifically, in all these cases we have shown that CCSD and MP2 level AIMD trajectories and potential surfaces may be obtained at density functional theory cost. The approach has been demonstrated for gas-phase studies, for condensed phase electronic structure, and also for basis set extrapolation-based AIMD. Recently, this approach has also been used to derive new quantum-computing algorithms that enormously reduce the quantum circuit depth in a circuit-based computation of correlated electronic structure. In this publication, we introduce (a) a family of neural networks that act in parallel to represent, efficiently, the post-Hartree-Fock electronic structure energy contributions for all simplexes (fragments), and (b) a new k-means-based tessellation strategy to glean training data for high-dimensional molecular spaces and minimize the extent of training needed to construct this family of neural networks. The approach is particularly useful when coupled cluster accuracy is desired and when fragment sizes grow in order to capture nonlocal interactions accurately. The unique multidimensional k-means tessellation/clustering algorithm used to determine our training data for all fragments is shown to be extremely efficient and reduces the needed training to only 10% of data for all fragments to obtain accurate neural networks for each fragment. These fully connected dense neural networks are then used to extrapolate the potential energy surface for all molecular fragments, and these are then combined as per our graph-theoretic procedure to transfer the learning process to a full system energy for the entire AIMD trajectory at less than one-tenth the cost as compared to a regular fragmentation-based AIMD calculation.
Collapse
Affiliation(s)
- Xiao Zhu
- Department of Chemistry and Department of Physics, Indiana University, 800 E. Kirkwood Avenue, Bloomington 47405, Indiana, United States
| | - Srinivasan S Iyengar
- Department of Chemistry and Department of Physics, Indiana University, 800 E. Kirkwood Avenue, Bloomington 47405, Indiana, United States
| |
Collapse
|
39
|
Nandi A, Conte R, Qu C, Houston PL, Yu Q, Bowman JM. Quantum Calculations on a New CCSD(T) Machine-Learned Potential Energy Surface Reveal the Leaky Nature of Gas-Phase Trans and Gauche Ethanol Conformers. J Chem Theory Comput 2022; 18:5527-5538. [PMID: 35951990 PMCID: PMC9476654 DOI: 10.1021/acs.jctc.2c00760] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
![]()
Ethanol is a molecule of fundamental interest in combustion,
astrochemistry,
and condensed phase as a solvent. It is characterized by two methyl
rotors and trans (anti) and gauche conformers, which are known to be very close in energy.
Here we show that based on rigorous quantum calculations of the vibrational
zero-point state, using a new ab initio potential
energy surface (PES), the ground state resembles the trans conformer, but substantial delocalization to the gauche conformer is present. This explains experimental issues about identification
and isolation of the two conformers. This “leak” effect
is partially quenched when deuterating the OH group, which further
demonstrates the need for a quantum mechanical approach. Diffusion
Monte Carlo and full-dimensional semiclassical dynamics calculations
are employed. The new PES is obtained by means of a Δ-machine
learning approach starting from a pre-existing low level density functional
theory surface. This surface is brought to the CCSD(T) level of theory
using a relatively small number of ab initio CCSD(T)
energies. Agreement between the corrected PES and direct ab
initio results for standard tests is excellent. One- and
two-dimensional discrete variable representation calculations focusing
on the trans–gauche torsional
motion are also reported, in reasonable agreement with experiment.
Collapse
Affiliation(s)
- Apurba Nandi
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
| | - Riccardo Conte
- Dipartimento di Chimica, Università Degli Studi di Milano, via Golgi 19, 20133 Milano, Italy
| | - Chen Qu
- Independent Researcher, Toronto 66777, Canada
| | - Paul L Houston
- Department of Chemistry and Chemical Biology, Cornell University, Ithaca, New York 14853, United States.,Department of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Qi Yu
- Department of Chemistry, Yale University, New Haven, Connecticut 06520, United States
| | - Joel M Bowman
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
| |
Collapse
|
40
|
Ruth M, Gerbig D, Schreiner PR. Machine Learning of Coupled Cluster (T)-Energy Corrections via Delta (Δ)-Learning. J Chem Theory Comput 2022; 18:4846-4855. [PMID: 35816588 DOI: 10.1021/acs.jctc.2c00501] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Accurate thermochemistry is essential in many chemical disciplines, such as astro-, atmospheric, or combustion chemistry. These areas often involve fleetingly existent intermediates whose thermochemistry is difficult to assess. Whenever direct calorimetric experiments are infeasible, accurate computational estimates of relative molecular energies are required. However, high-level computations, often using coupled cluster theory, are generally resource-intensive. To expedite the process using machine learning techniques, we generated a database of energies for small organic molecules at the CCSD(T)/cc-pVDZ, CCSD(T)/aug-cc-pVDZ, and CCSD(T)/cc-pVTZ levels of theory. Leveraging the power of deep learning by employing graph neural networks, we are able to predict the effect of perturbatively included triples (T), that is, the difference between CCSD and CCSD(T) energies, with a mean absolute error of 0.25, 0.25, and 0.28 kcal mol-1 (R2 of 0.998, 0.997, and 0.998) with the cc-pVDZ, aug-cc-pVDZ, and cc-pVTZ basis sets, respectively. Our models were further validated by application to three validation sets taken from the S22 Database as well as to a selection of known theoretically challenging cases.
Collapse
Affiliation(s)
- Marcel Ruth
- Institute of Organic Chemistry, Justus Liebig University, Heinrich-Buff-Ring 17, 35392 Giessen, Germany
| | - Dennis Gerbig
- Institute of Organic Chemistry, Justus Liebig University, Heinrich-Buff-Ring 17, 35392 Giessen, Germany
| | - Peter R Schreiner
- Institute of Organic Chemistry, Justus Liebig University, Heinrich-Buff-Ring 17, 35392 Giessen, Germany
| |
Collapse
|
41
|
Nguyen TH, Nguyen LH, Truong TN. Application of Machine Learning in Developing Quantitative Structure-Property Relationship for Electronic Properties of Polyaromatic Compounds. ACS OMEGA 2022; 7:22879-22888. [PMID: 35811887 PMCID: PMC9261278 DOI: 10.1021/acsomega.2c02650] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/28/2022] [Accepted: 05/25/2022] [Indexed: 06/15/2023]
Abstract
The degree of π orbital overlap (DPO) model has been demonstrated to be an excellent quantitative structure-property relationship (QSPR) that can map two-dimensional structural information of polycyclic aromatic hydrocarbons (PAHs) and thienoacenes to their electronic properties, namely, band gaps, electron affinities, and ionization potentials. However, the model suffers from significant limitations that narrow its applications due to inefficient manual procedures in parameter optimization and descriptor formulation. In this work, we developed a machine learning (ML)-based method for efficiently optimizing DPO parameters and proposed a truncated DPO descriptor, which is simple enough that can be automatically extracted from simplified molecular-input line-entry system strings of PAHs and thienoacenes. Compared with the result from our previous studies, the ML-based methodology can optimize DPO parameters with four times fewer data, while it can achieve the same level of accuracy in predictions of the mentioned electronic properties to within 0.1 eV. The truncated DPO model also has similar accuracy to the full DPO model. Consequently, the ML-based DPO approach coupled with the truncated DPO model enables new possibilities for developing automatic pipelines for high-throughput screening and investigating new QSPR for new chemical classes.
Collapse
Affiliation(s)
- Tuan H Nguyen
- Institute for Computational Science and Technology, Ho Chi Minh City 700000, Vietnam
| | - Lam H Nguyen
- Institute for Computational Science and Technology, Ho Chi Minh City 700000, Vietnam
| | - Thanh N Truong
- Department of Chemistry, University of Utah, Salt Lake City, Utah 84112, United States
| |
Collapse
|
42
|
Lu F, Cheng L, DiRisio RJ, Finney JM, Boyer MA, Moonkaen P, Sun J, Lee SJR, Deustua JE, Miller TF, McCoy AB. Fast Near Ab Initio Potential Energy Surfaces Using Machine Learning. J Phys Chem A 2022; 126:4013-4024. [PMID: 35715227 DOI: 10.1021/acs.jpca.2c02243] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
A machine-learning based approach for evaluating potential energies for quantum mechanical studies of properties of the ground and excited vibrational states of small molecules is developed. This approach uses the molecular-orbital-based machine learning (MOB-ML) method to generate electronic energies with the accuracy of CCSD(T) calculations at the same cost as a Hartree-Fock calculation. To further reduce the computational cost of the potential energy evaluations without sacrificing the CCSD(T) level accuracy, GPU-accelerated Neural Network Potential Energy Surfaces (NN-PES) are trained to geometries and energies that are collected from small-scale Diffusion Monte Carlo (DMC) simulations, which are run using energies evaluated using the MOB-ML model. The combined NN+(MOB-ML) approach is used in variational calculations of the ground and low-lying vibrational excited states of water and in DMC calculations of the ground states of water, CH5+, and its deuterated analogues. For both of these molecules, comparisons are made to the results obtained using potentials that were fit to much larger sets of electronic energies than were required to train the MOB-ML models. The NN+(MOB-ML) approach is also used to obtain a potential surface for C2H5+, which is a carbocation with a nonclassical equilibrium structure for which there is currently no available potential surface. This potential is used to explore the CH stretching vibrations, focusing on those of the bridging hydrogen atom. For both CH5+ and C2H5+ the MOB-ML model is trained using geometries that were sampled from an AIMD trajectory, which was run at 350 K. By comparison, the structures sampled in the ground state calculations can have energies that are as much as ten times larger than those used to train the MOB-ML model. For water a higher temperature AIMD trajectory is needed to obtain accurate results due to the smaller thermal energy. A second MOB-ML model for C2H5+ was developed with additional higher energy structures in the training set. The two models are found to provide nearly identical descriptions of the ground state of C2H5+.
Collapse
Affiliation(s)
- Fenris Lu
- Department of Chemistry, University of Washington, Seattle, Washington 98195, United States
| | - Lixue Cheng
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, United States
| | - Ryan J DiRisio
- Department of Chemistry, University of Washington, Seattle, Washington 98195, United States
| | - Jacob M Finney
- Department of Chemistry, University of Washington, Seattle, Washington 98195, United States
| | - Mark A Boyer
- Department of Chemistry, University of Washington, Seattle, Washington 98195, United States
| | - Pattarapon Moonkaen
- Department of Chemistry, University of Washington, Seattle, Washington 98195, United States
| | - Jiace Sun
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, United States
| | - Sebastian J R Lee
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, United States
| | - J Emiliano Deustua
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, United States
| | - Thomas F Miller
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, United States
| | - Anne B McCoy
- Department of Chemistry, University of Washington, Seattle, Washington 98195, United States
| |
Collapse
|
43
|
Liu Y, Li J. Permutation-Invariant-Polynomial Neural-Network-Based Δ-Machine Learning Approach: A Case for the HO 2 Self-Reaction and Its Dynamics Study. J Phys Chem Lett 2022; 13:4729-4738. [PMID: 35609295 DOI: 10.1021/acs.jpclett.2c01064] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Δ-machine learning, or the hierarchical construction scheme, is a highly cost-effective method, as only a small number of high-level ab initio energies are required to improve a potential energy surface (PES) fit to a large number of low-level points. However, there is no efficient and systematic way to select as few points as possible from the low-level data set. We here propose a permutation-invariant-polynomial neural-network (PIP-NN)-based Δ-machine learning approach to construct full-dimensional accurate PESs of complicated reactions efficiently. Particularly, the high flexibility of the NN is exploited to efficiently sample points from the low-level data set. This approach is applied to the challenging case of a HO2 self-reaction with a large configuration space. Only 14% of the DFT data set is used to successfully bring a newly fitted DFT PES to the UCCSD(T)-F12a/AVTZ quality. Then, the quasiclassical trajectory (QCT) calculations are performed to study its dynamics, particularly the mode specificity.
Collapse
Affiliation(s)
- Yang Liu
- School of Chemistry and Chemical Engineering & Chongqing Key Laboratory of Theoretical and Computational Chemistry, Chongqing University, Chongqing 401331, China
| | - Jun Li
- School of Chemistry and Chemical Engineering & Chongqing Key Laboratory of Theoretical and Computational Chemistry, Chongqing University, Chongqing 401331, China
| |
Collapse
|
44
|
Czernek J, Brus J, Czerneková V. A computational inspection of the dissociation energy of mid-sized organic dimers. J Chem Phys 2022; 156:204303. [DOI: 10.1063/5.0093557] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
The gas-phase value of the dissociation energy ( D0) is a key parameter employed in both experimental and theoretical descriptions of noncovalent complexes. The D0 data were obtained for a set of mid-sized organic dimers in their global minima which was located using geometry optimizations that applied ample basis sets together with either the conventional second-order Møller–Plesset (MP2) method or several dispersion-corrected density-functional theory (DFT-D) schemes. The harmonic vibrational zero-point (VZP) and deformation energies from the MP2 calculations were combined with electronic energies from the coupled cluster theory with singles, doubles, and iterative triples [CCSD(T)] extrapolated to the complete basis set (CBS) limit to estimate D0 with the aim of inspecting values that were most recently measured, and an analogous comparison was performed using the DFT-D data. In at least one case (namely, for the aniline⋯methane cluster), the D0 estimate that employed the CCSD(T)/CBS energies differed from experiment in the way that could not be explained by a possible deficiency in the VZP contribution. Curiously, one of the DFT-D schemes (namely, the B3LYP-D3/def2-QZVPPD) was able to reproduce all measured D0 values to within 1.0 kJ/mol from experimental error bars. These findings show the need for further measurements and computations of some of the complexes. In order to facilitate such studies, the physical nature of intermolecular interactions in the investigated dimers was analyzed by means of the DFT-based symmetry-adapted perturbation theory.
Collapse
Affiliation(s)
- Jiří Czernek
- Institute of Macromolecular Chemistry, Czech Academy of Sciences, Heyrovsky Square 2, 162 06 Praha 6, The Czech Republic
| | - Jiří Brus
- Institute of Macromolecular Chemistry, Czech Academy of Sciences, Heyrovsky Square 2, 162 06 Praha 6, The Czech Republic
| | - Vladimíra Czerneková
- Institute of Physics, Czech Academy of Sciences, Na Slovance 2, 182 21 Praha 8, The Czech Republic
| |
Collapse
|
45
|
Atz K, Isert C, Böcker MNA, Jiménez-Luna J, Schneider G. Δ-Quantum machine-learning for medicinal chemistry. Phys Chem Chem Phys 2022; 24:10775-10783. [PMID: 35470831 PMCID: PMC9093086 DOI: 10.1039/d2cp00834c] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2022] [Accepted: 04/05/2022] [Indexed: 11/21/2022]
Abstract
Many molecular design tasks benefit from fast and accurate calculations of quantum-mechanical (QM) properties. However, the computational cost of QM methods applied to drug-like molecules currently renders large-scale applications of quantum chemistry challenging. Aiming to mitigate this problem, we developed DelFTa, an open-source toolbox for the prediction of electronic properties of drug-like molecules at the density functional (DFT) level of theory, using Δ-machine-learning. Δ-Learning corrects the prediction error (Δ) of a fast but inaccurate property calculation. DelFTa employs state-of-the-art three-dimensional message-passing neural networks trained on a large dataset of QM properties. It provides access to a wide array of quantum observables on the molecular, atomic and bond levels by predicting approximations to DFT values from a low-cost semiempirical baseline. Δ-Learning outperformed its direct-learning counterpart for most of the considered QM endpoints. The results suggest that predictions for non-covalent intra- and intermolecular interactions can be extrapolated to larger biomolecular systems. The software is fully open-sourced and features documented command-line and Python APIs.
Collapse
Affiliation(s)
- Kenneth Atz
- ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 4, 8093 Zurich, Switzerland.
| | - Clemens Isert
- ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 4, 8093 Zurich, Switzerland.
| | - Markus N A Böcker
- ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 4, 8093 Zurich, Switzerland.
| | - José Jiménez-Luna
- ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 4, 8093 Zurich, Switzerland.
- Department of Medicinal Chemistry, Boehringer Ingelheim Pharma GmbH & Co. KG, Birkendorfer Straße 65, 88397 Biberach an der Riss, Germany
| | - Gisbert Schneider
- ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 4, 8093 Zurich, Switzerland.
- ETH Singapore SEC Ltd., 1 CREATE Way, #06-01 CREATE Tower, Singapore 138602, Singapore
| |
Collapse
|
46
|
Verma S, Rivera M, Scanlon DO, Walsh A. Machine learned calibrations to high-throughput molecular excited state calculations. J Chem Phys 2022; 156:134116. [PMID: 35395896 DOI: 10.1063/5.0084535] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
Understanding the excited state properties of molecules provides insight into how they interact with light. These interactions can be exploited to design compounds for photochemical applications, including enhanced spectral conversion of light to increase the efficiency of photovoltaic cells. While chemical discovery is time- and resource-intensive experimentally, computational chemistry can be used to screen large-scale databases for molecules of interest in a procedure known as high-throughput virtual screening. The first step usually involves a high-speed but low-accuracy method to screen large numbers of molecules (potentially millions), so only the best candidates are evaluated with expensive methods. However, use of a coarse first-pass screening method can potentially result in high false positive or false negative rates. Therefore, this study uses machine learning to calibrate a high-throughput technique [eXtended Tight Binding based simplified Tamm-Dancoff approximation (xTB-sTDA)] against a higher accuracy one (time-dependent density functional theory). Testing the calibration model shows an approximately sixfold decrease in the error in-domain and an approximately threefold decrease in the out-of-domain. The resulting mean absolute error of ∼0.14 eV is in line with previous work in machine learning calibrations and out-performs previous work in linear calibration of xTB-sTDA. We then apply the calibration model to screen a 250k molecule database and map inaccuracies of xTB-sTDA in chemical space. We also show generalizability of the workflow by calibrating against a higher-level technique (CC2), yielding a similarly low error. Overall, this work demonstrates that machine learning can be used to develop a cost-effective and accurate method for large-scale excited state screening, enabling accelerated molecular discovery across a variety of disciplines.
Collapse
Affiliation(s)
- Shomik Verma
- Department of Materials, Imperial College London, Exhibition Road, London SW7 2AZ, United Kingdom
| | - Miguel Rivera
- Department of Chemistry, University College London, 20 Gordon Street, London WC1H 0AJ, United Kingdom
| | - David O Scanlon
- Department of Chemistry and Thomas Young Centre, University College London, 20 Gordon Street, London WC1H 0AJ, United Kingdom
| | - Aron Walsh
- Department of Materials, Imperial College London, Exhibition Road, London SW7 2AZ, United Kingdom
| |
Collapse
|
47
|
Pernot P. The long road to calibrated prediction uncertainty in computational chemistry. J Chem Phys 2022; 156:114109. [DOI: 10.1063/5.0084302] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
Abstract
Uncertainty quantification (UQ) in computational chemistry (CC) is still in its infancy. Very few CC methods are designed to provide a confidence level on their predictions, and most users still rely improperly on the mean absolute error as an accuracy metric. The development of reliable UQ methods is essential, notably for CC to be used confidently in industrial processes. A review of the CC-UQ literature shows that there is no common standard procedure to report or validate prediction uncertainty. I consider here analysis tools using concepts (calibration and sharpness) developed in meteorology and machine learning for the validation of probabilistic forecasters. These tools are adapted to CC-UQ and applied to datasets of prediction uncertainties provided by composite methods, Bayesian ensembles methods, and machine learning and a posteriori statistical methods.
Collapse
Affiliation(s)
- Pascal Pernot
- Institut de Chimie Physique, UMR8000 CNRS, Université Paris-Saclay, 91405 Orsay, France
| |
Collapse
|
48
|
Meuwly M. Atomistic Simulations for Reactions and Vibrational Spectroscopy in the Era of Machine Learning─ Quo Vadis?. J Phys Chem B 2022; 126:2155-2167. [PMID: 35286087 DOI: 10.1021/acs.jpcb.2c00212] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Atomistic simulations using accurate energy functions can provide molecular-level insight into functional motions of molecules in the gas and in the condensed phase. This Perspective delineates the present status of the field from the efforts of others and some of our own work and discusses open questions and future prospects. The combination of physics-based long-range representations using multipolar charge distributions and kernel representations for the bonded interactions is shown to provide realistic models for the exploration of the infrared spectroscopy of molecules in solution. For reactions, empirical models connecting dedicated energy functions for the reactant and product states allow statistically meaningful sampling of conformational space whereas machine-learned energy functions are superior in accuracy. The future combination of physics-based models with machine-learning techniques and integration into all-purpose molecular simulation software provides a unique opportunity to bring such dynamics simulations closer to reality.
Collapse
Affiliation(s)
- Markus Meuwly
- Department of Chemistry, University of Basel, Klingelbergstrasse 80, 4056 Basel, Switzerland
| |
Collapse
|
49
|
Khire SS, Gurav ND, Nandi A, Gadre SR. Enabling Rapid and Accurate Construction of CCSD(T)-Level Potential Energy Surface of Large Molecules Using Molecular Tailoring Approach. J Phys Chem A 2022; 126:1458-1464. [PMID: 35170973 DOI: 10.1021/acs.jpca.2c00025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The construction of a potential energy surface (PES) of even a medium-sized molecule employing correlated theory, such as CCSD(T), is arduous due to the high computational cost involved. The present study reports the possibility of efficiently constructing such a PES of molecules containing up to 15 atoms and 550 basis functions by employing the fragment-based molecular tailoring approach (MTA) on off-the-shelf hardware. The MTA energies at the CCSD(T)/aug-cc-pVTZ level for several geometries of three test molecules, viz., acetylacetone, N-methylacetamide, and tropolone, are reported. These energies are in excellent agreement with their full calculation counterparts with a time advantage factor of 3-5. The energy barrier from the ground to transition state is also accurately captured. Further, we demonstrate the accuracy and efficiency of MTA for estimating the energy gradients at the CCSD(T) level. As a further application of our MTA methodology, the energies of acetylacetone at ∼430 geometries are computed at the CCSD(T)/aug-cc-pVTZ level and used for generating a Δ-machine learning (Δ-ML) PES. This leads to the H-transfer barrier of 3.02 kcal/mol, well in agreement with the benchmarked barrier of 3.19 kcal/mol. The fidelity of this Δ-ML PES is examined by geometry optimization and normal mode frequency calculations of global minima and saddle point geometries. We trust that the present work is a major development for the rapid and accurate construction of PES at the CCSD(T) level for molecules containing up to 20 atoms and 600 basis functions using off-the-shelf hardware.
Collapse
Affiliation(s)
- Subodh S Khire
- Department of Scientific Computing, Modelling and Simulation, Savitribai Phule Pune University, Pune 411 007, India
| | - Nalini D Gurav
- Department of Scientific Computing, Modelling and Simulation, Savitribai Phule Pune University, Pune 411 007, India
| | - Apurba Nandi
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
| | - Shridhar R Gadre
- Department of Scientific Computing, Modelling and Simulation, Savitribai Phule Pune University, Pune 411 007, India
| |
Collapse
|
50
|
Gupta AK, Raghavachari K. Three-Dimensional Convolutional Neural Networks Utilizing Molecular Topological Features for Accurate Atomization Energy Predictions. J Chem Theory Comput 2022; 18:2132-2143. [PMID: 35226496 DOI: 10.1021/acs.jctc.1c00504] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Deep learning methods provide a novel way to establish a correlation between two quantities. In this context, computer vision techniques such as three-dimensional (3D)-convolutional neural networks become a natural choice to associate a molecular property with its structure due to the inherent 3D nature of a molecule. However, traditional 3D input data structures are intrinsically sparse in nature, which tend to induce instabilities during the learning process, which in turn may lead to underfitted results. To address this deficiency, in this project, we propose to use quantum-chemically derived molecular topological features, namely, localized orbital locator and electron localization function, as molecular descriptors, which provide a relatively denser input representation in a 3D space. Such topological features provide a detailed picture of the atomic and electronic configuration and interatomic interactions in the molecule and hence are ideal for predicting properties that are highly dependent on the physical or electronic structure of the molecule. Herein, we demonstrate the efficacy of our proposed model by applying it to the task of predicting atomization energies for the QM9-G4MP2 data set, which contains ∼134k molecules. Furthermore, we incorporated the Δ-machine learning approach into our model, which enabled us to reach beyond benchmark accuracy levels (∼1.0 kJ mol-1). As a result, we consistently obtain impressive mean absolute errors of the order 0.1 kcal mol-1 (∼0.42 kJ mol-1) versus the G4(MP2) theory using relatively modest models, which could potentially be improved further in a systematic manner using additional compute resources.
Collapse
Affiliation(s)
- Ankur Kumar Gupta
- Department of Chemistry, Indiana University, Bloomington, Indiana 47405, United States
| | - Krishnan Raghavachari
- Department of Chemistry, Indiana University, Bloomington, Indiana 47405, United States
| |
Collapse
|