1
|
Cao Y, Balduf T, Beachy MD, Bennett MC, Bochevarov AD, Chien A, Dub PA, Dyall KG, Furness JW, Halls MD, Hughes TF, Jacobson LD, Kwak HS, Levine DS, Mainz DT, Moore KB, Svensson M, Videla PE, Watson MA, Friesner RA. Quantum chemical package Jaguar: A survey of recent developments and unique features. J Chem Phys 2024; 161:052502. [PMID: 39092934 DOI: 10.1063/5.0213317] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2024] [Accepted: 07/12/2024] [Indexed: 08/04/2024] Open
Abstract
This paper is dedicated to the quantum chemical package Jaguar, which is commercial software developed and distributed by Schrödinger, Inc. We discuss Jaguar's scientific features that are relevant to chemical research as well as describe those aspects of the program that are pertinent to the user interface, the organization of the computer code, and its maintenance and testing. Among the scientific topics that feature prominently in this paper are the quantum chemical methods grounded in the pseudospectral approach. A number of multistep workflows dependent on Jaguar are covered: prediction of protonation equilibria in aqueous solutions (particularly calculations of tautomeric stability and pKa), reactivity predictions based on automated transition state search, assembly of Boltzmann-averaged spectra such as vibrational and electronic circular dichroism, as well as nuclear magnetic resonance. Discussed also are quantum chemical calculations that are oriented toward materials science applications, in particular, prediction of properties of optoelectronic materials and organic semiconductors, and molecular catalyst design. The topic of treatment of conformations inevitably comes up in real world research projects and is considered as part of all the workflows mentioned above. In addition, we examine the role of machine learning methods in quantum chemical calculations performed by Jaguar, from auxiliary functions that return the approximate calculation runtime in a user interface, to prediction of actual molecular properties. The current work is second in a series of reviews of Jaguar, the first having been published more than ten years ago. Thus, this paper serves as a rare milestone on the path that is being traversed by Jaguar's development in more than thirty years of its existence.
Collapse
Affiliation(s)
- Yixiang Cao
- Schrödinger, Inc., 1540 Broadway, Floor 24, New York, New York 10036, USA
| | - Ty Balduf
- Schrödinger, Inc., 1540 Broadway, Floor 24, New York, New York 10036, USA
| | - Michael D Beachy
- Schrödinger, Inc., 1540 Broadway, Floor 24, New York, New York 10036, USA
| | - M Chandler Bennett
- Schrödinger, Inc., 1540 Broadway, Floor 24, New York, New York 10036, USA
| | - Art D Bochevarov
- Schrödinger, Inc., 1540 Broadway, Floor 24, New York, New York 10036, USA
| | - Alan Chien
- Schrödinger, Inc., 1540 Broadway, Floor 24, New York, New York 10036, USA
| | - Pavel A Dub
- Schrödinger, Inc., 9868 Scranton Road, Suite 3200, San Diego, California 92121, USA
| | - Kenneth G Dyall
- Schrödinger, Inc., 101 SW Main St., Suite 1300, Portland, Oregon 97204, USA
| | - James W Furness
- Schrödinger, Inc., 1540 Broadway, Floor 24, New York, New York 10036, USA
| | - Mathew D Halls
- Schrödinger, Inc., 9868 Scranton Road, Suite 3200, San Diego, California 92121, USA
| | - Thomas F Hughes
- Schrödinger, Inc., 1540 Broadway, Floor 24, New York, New York 10036, USA
| | - Leif D Jacobson
- Schrödinger, Inc., 101 SW Main St., Suite 1300, Portland, Oregon 97204, USA
| | - H Shaun Kwak
- Schrödinger, Inc., 101 SW Main St., Suite 1300, Portland, Oregon 97204, USA
| | - Daniel S Levine
- Schrödinger, Inc., 1540 Broadway, Floor 24, New York, New York 10036, USA
| | - Daniel T Mainz
- Schrödinger, Inc., 1540 Broadway, Floor 24, New York, New York 10036, USA
| | - Kevin B Moore
- Schrödinger, Inc., 1540 Broadway, Floor 24, New York, New York 10036, USA
| | - Mats Svensson
- Schrödinger, Inc., 1540 Broadway, Floor 24, New York, New York 10036, USA
| | - Pablo E Videla
- Schrödinger, Inc., 1540 Broadway, Floor 24, New York, New York 10036, USA
| | - Mark A Watson
- Schrödinger, Inc., 1540 Broadway, Floor 24, New York, New York 10036, USA
| | - Richard A Friesner
- Department of Chemistry, Columbia University, 3000 Broadway, New York, New York 10027, USA
| |
Collapse
|
2
|
Duan Y, Niu T, Wang J, Cieplak P, Luo R. PCMRESP: A Method for Polarizable Force Field Parameter Development and Transferability of the Polarizable Gaussian Multipole Models Across Multiple Solvents. J Chem Theory Comput 2024; 20:2820-2829. [PMID: 38502776 PMCID: PMC11008095 DOI: 10.1021/acs.jctc.4c00064] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2024] [Revised: 02/27/2024] [Accepted: 03/01/2024] [Indexed: 03/21/2024]
Abstract
The transferability of force field parameters is a crucial aspect of high-quality force fields. Previous investigations have affirmed the transferability of electrostatic parameters derived from polarizable Gaussian multipole models (pGMs) when applied to water oligomer clusters, polypeptides across various conformations, and different sequences. In this study, we introduce PCMRESP, a novel method for electrostatic parametrization in solution, intended for the development of polarizable force fields. We utilized this method to assess the transferability of three models: a fixed charge model and two variants of pGM models. Our analysis involved testing these models on 377 small molecules and 100 tetra-peptides in five representative dielectric environments: gas, diethyl ether, dichloroethane, acetone, and water. Our findings reveal that the inclusion of atomic polarization significantly enhances transferability and the incorporation of permanent atomic dipoles, in the form of covalent bond dipoles, leads to further improvements. Moreover, our tests on dual-solvent strategies demonstrate consistent transferability for all three models, underscoring the robustness of the dual-solvent approach. In contrast, an evaluation of the traditional HF/6-31G* method indicates poor transferability for the pGM-ind and pGM-perm models, suggesting the limitations of this conventional approach.
Collapse
Affiliation(s)
- Yong Duan
- UC
Davis Genome Center and Department of Biomedical Engineering, University of California, Davis, One Shields Avenue, Davis, California 95616, United States
| | - Taoyu Niu
- Department
of Pharmaceutical Sciences and Computational Chemical Genomics Screening
Center, School of Pharmacy, University of
Pittsburgh, Pittsburgh, Pennsylvania 15261, United States
| | - Junmei Wang
- Department
of Pharmaceutical Sciences and Computational Chemical Genomics Screening
Center, School of Pharmacy, University of
Pittsburgh, Pittsburgh, Pennsylvania 15261, United States
| | - Piotr Cieplak
- SBP
Medical Discovery Institute, 10901 North Torrey Pines Road, La Jolla, California 92037, United States
| | - Ray Luo
- Departments
of Molecular Biology and Biochemistry, Chemical and Biomolecular Engineering,
Materials Science and Engineering, and Biomedical Engineering, University of California, Irvine. Irvine, California 92697, United States
| |
Collapse
|
3
|
Tian Z, Zhang S, Chern GW. Machine learning for structure-property mapping of Ising models: Scalability and limitations. Phys Rev E 2023; 108:065304. [PMID: 38243546 DOI: 10.1103/physreve.108.065304] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2023] [Accepted: 11/27/2023] [Indexed: 01/21/2024]
Abstract
We present a scalable machine learning (ML) framework for predicting intensive properties and particularly classifying phases of Ising models. Scalability and transferability are central to the unprecedented computational efficiency of ML methods. In general, linear-scaling computation can be achieved through the divide-and-conquer approach, and the locality of physical properties is key to partitioning the system into subdomains that can be solved separately. Based on the locality assumption, ML model is developed for the prediction of intensive properties of a finite-size block. Predictions of large-scale systems can then be obtained by averaging results of the ML model from randomly sampled blocks of the system. We show that the applicability of this approach depends on whether the block-size of the ML model is greater than the characteristic length scale of the system. In particular, in the case of phase identification across a critical point, the accuracy of the ML prediction is limited by the diverging correlation length. We obtain an intriguing scaling relation between the prediction accuracy and the ratio of ML block size over the spin-spin correlation length. Implications for practical applications are also discussed. While the two-dimensional Ising model is used to demonstrate the proposed approach, the ML framework can be generalized to other many-body or condensed-matter systems.
Collapse
Affiliation(s)
- Zhongzheng Tian
- Department of Physics, University of Virginia, Charlottesville, Virginia 22904, USA
| | - Sheng Zhang
- Department of Physics, University of Virginia, Charlottesville, Virginia 22904, USA
| | - Gia-Wei Chern
- Department of Physics, University of Virginia, Charlottesville, Virginia 22904, USA
| |
Collapse
|
4
|
Thürlemann M, Riniker S. Hybrid classical/machine-learning force fields for the accurate description of molecular condensed-phase systems. Chem Sci 2023; 14:12661-12675. [PMID: 38020395 PMCID: PMC10646964 DOI: 10.1039/d3sc04317g] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2023] [Accepted: 10/24/2023] [Indexed: 12/01/2023] Open
Abstract
Electronic structure methods offer in principle accurate predictions of molecular properties, however, their applicability is limited by computational costs. Empirical methods are cheaper, but come with inherent approximations and are dependent on the quality and quantity of training data. The rise of machine learning (ML) force fields (FFs) exacerbates limitations related to training data even further, especially for condensed-phase systems for which the generation of large and high-quality training datasets is difficult. Here, we propose a hybrid ML/classical FF model that is parametrized exclusively on high-quality ab initio data of dimers and monomers in vacuum but is transferable to condensed-phase systems. The proposed hybrid model combines our previous ML-parametrized classical model with ML corrections for situations where classical approximations break down, thus combining the robustness and efficiency of classical FFs with the flexibility of ML. Extensive validation on benchmarking datasets and experimental condensed-phase data, including organic liquids and small-molecule crystal structures, showcases how the proposed approach may promote FF development and unlock the full potential of classical FFs.
Collapse
Affiliation(s)
- Moritz Thürlemann
- Department of Chemistry and Applied Biosciences, ETH Zürich Vladimir-Prelog-Weg 2 Zürich 8093 Switzerland
| | - Sereina Riniker
- Department of Chemistry and Applied Biosciences, ETH Zürich Vladimir-Prelog-Weg 2 Zürich 8093 Switzerland
| |
Collapse
|
5
|
Spronk SA, Glick ZL, Metcalf DP, Sherrill CD, Cheney DL. A quantum chemical interaction energy dataset for accurately modeling protein-ligand interactions. Sci Data 2023; 10:619. [PMID: 37699937 PMCID: PMC10497680 DOI: 10.1038/s41597-023-02443-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2023] [Accepted: 08/03/2023] [Indexed: 09/14/2023] Open
Abstract
Fast and accurate calculation of intermolecular interaction energies is desirable for understanding many chemical and biological processes, including the binding of small molecules to proteins. The Splinter ["Symmetry-adapted perturbation theory (SAPT0) protein-ligand interaction"] dataset has been created to facilitate the development and improvement of methods for performing such calculations. Molecular fragments representing commonly found substructures in proteins and small-molecule ligands were paired into >9000 unique dimers, assembled into numerous configurations using an approach designed to adequately cover the breadth of the dimers' potential energy surfaces while enhancing sampling in favorable regions. ~1.5 million configurations of these dimers were randomly generated, and a structurally diverse subset of these were minimized to obtain an additional ~80 thousand local and global minima. For all >1.6 million configurations, SAPT0 calculations were performed with two basis sets to complete the dataset. It is expected that Splinter will be a useful benchmark dataset for training and testing various methods for the calculation of intermolecular interaction energies.
Collapse
Affiliation(s)
- Steven A Spronk
- Molecular Structure and Design, Bristol Myers Squibb Company, P. O. Box 5400, Princeton, NJ, 08543, USA.
| | - Zachary L Glick
- Center for Computational Molecular Science and Technology, School of Chemistry and Biochemistry, and School of Computational Science and Engineering, Georgia Institute of Technology, Atlanta, GA, 30332-0400, USA
| | - Derek P Metcalf
- Center for Computational Molecular Science and Technology, School of Chemistry and Biochemistry, and School of Computational Science and Engineering, Georgia Institute of Technology, Atlanta, GA, 30332-0400, USA
| | - C David Sherrill
- Center for Computational Molecular Science and Technology, School of Chemistry and Biochemistry, and School of Computational Science and Engineering, Georgia Institute of Technology, Atlanta, GA, 30332-0400, USA.
| | - Daniel L Cheney
- Molecular Structure and Design, Bristol Myers Squibb Company, P. O. Box 5400, Princeton, NJ, 08543, USA
| |
Collapse
|
6
|
Low K, Coote ML, Izgorodina EI. Accurate Prediction of Three-Body Intermolecular Interactions via Electron Deformation Density-Based Machine Learning. J Chem Theory Comput 2023; 19:1466-1475. [PMID: 36787280 DOI: 10.1021/acs.jctc.2c00984] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/15/2023]
Abstract
This work extends the electron deformation density-based descriptor, originally developed in the electron deformation density-based interaction energy machine learning (EDDIE-ML) algorithm to predict dimer interaction energies, to the prediction of three-body interactions in trimers. Using a sequential learning process to select the training data, the resulting Gaussian process regression (GPR) model predicts the three-body interaction energy within 0.2 kcal mol-1 of the SRS-MP2/cc-pVTZ reference values for the 3B69 and S22-3 trimer data sets. A hybrid kernel function is introduced, which combines contributions from the average and individual atomic environments, allowing the total trimer interaction energy to be predicted in addition to the three-body contribution using the same descriptor. To extend the range and diversity of trimer interaction energies available in the literature, a new data set based on a protein-ligand crystal structure is introduced, consisting of 509 structures of a central ligand with two protein fragments. Benchmark calculations are provided for the new data set, which contains significantly larger molecular interactions than current databases in the literature in addition to charged fragments. Compared to density funtional theory (DFT)- and wavefunction-based methods for calculating the three-body interaction energy, our model makes predictions in a significantly shorter time frame by reducing the number of required SCF calculations from 7 to 4 performed at the PBE0 level of theory, showcasing the utility and efficiency of our Δ-ML method particularly when applied to larger systems.
Collapse
Affiliation(s)
- Kaycee Low
- Monash Computational Chemistry Group, School of Chemistry, Monash University, Clayton, Victoria 3800, Australia
| | - Michelle L Coote
- Institute for Nanoscale Science and Technology, College of Science and Engineering, Flinders University, Bedford Park, South Australia 5042, Australia
| | - Ekaterina I Izgorodina
- Monash Computational Chemistry Group, School of Chemistry, Monash University, Clayton, Victoria 3800, Australia
| |
Collapse
|
7
|
Wang D, Li W, Dong X, Li H, Hu L. TFRegNCI: Interpretable Noncovalent Interaction Correction Multimodal Based on Transformer Encoder Fusion. J Chem Inf Model 2023; 63:782-793. [PMID: 36652718 DOI: 10.1021/acs.jcim.2c01283] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
The interpretability is an important issue for end-to-end learning models. Motivated by computer vision algorithms, an interpretable noncovalent interaction (NCI) correction multimodal (TFRegNCI) is proposed for NCI prediction. TFRegNCI is based on RegNet feature extraction and a transformer encoder fusion strategy. RegNet is a network design paradigm that mainly focuses on local features. Meanwhile, the Vision Transformer is also leveraged for feature extraction, because it can capture global features better than RegNet while lowering the computational cost. Using a transformer encoder as the fusion strategy rather than multilayer perceptron can enhance model performance, due to its emphasis on important features with less parameters. Therefore, the proposed TFRegNCI achieved high accurate prediction (mean absolute error of ∼0.1 kcal/mol) comparing with the coupled cluster single double (triple) (CCSD(T)) benchmark. To further improve the model efficiency, TFRegNCI applies two-dimensional (2D) inputs transformed from three-dimensional (3D) electron density cubes, which saves time (30%), while the model accuracy remains. To improve model interpretability, a visualization module, Gradient-weighted Regression Activation Mapping (Grad-RAM) has been embedded. Grad-RAM is promoted from the classification algorithm, Gradient-weighted Class Activation Mapping, to perform feature visualization for the regression task. With Grad-RAM, the visual location map for features in deep learning models can be displayed. The feature map visualizations suggest that the 2D model has the similar performance as the 3D model, because of equally effective feature extractions from electron density. Moreover, the valid feature region on the location map by the 3D model is consistent with the NCIPLOT NCI isosurface. It is confirmed that the model does extract significant features related to the NCI interaction. The interpretable analyses are carried out through molecular orbital contribution on effective features. Thereby, the proposed model is likely to be a promising tool to reveal some essential information on NCIs, with regard to the level of electronic theory.
Collapse
Affiliation(s)
- Donghan Wang
- School of Information Science and Technology, Northeast Normal University, Changchun130117, China
| | - Wenze Li
- College of Computer and Information Engineering, Henan Normal University, Henan, Xinxiang453007, China
| | - Xu Dong
- School of Information Science and Technology, Northeast Normal University, Changchun130117, China
| | - Hongzhi Li
- School of Information Science and Technology, Northeast Normal University, Changchun130117, China
| | - LiHong Hu
- School of Information Science and Technology, Northeast Normal University, Changchun130117, China
| |
Collapse
|
8
|
Thürlemann M, Böselt L, Riniker S. Regularized by Physics: Graph Neural Network Parametrized Potentials for the Description of Intermolecular Interactions. J Chem Theory Comput 2023; 19:562-579. [PMID: 36633918 PMCID: PMC9878731 DOI: 10.1021/acs.jctc.2c00661] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2022] [Indexed: 01/13/2023]
Abstract
Simulations of molecular systems using electronic structure methods are still not feasible for many systems of biological importance. As a result, empirical methods such as force fields (FF) have become an established tool for the simulation of large and complex molecular systems. The parametrization of FF is, however, time-consuming and has traditionally been based on experimental data. Recent years have therefore seen increasing efforts to automatize FF parametrization or to replace FF with machine-learning (ML) based potentials. Here, we propose an alternative strategy to parametrize FF, which makes use of ML and gradient-descent based optimization while retaining a functional form founded in physics. Using a predefined functional form is shown to enable interpretability, robustness, and efficient simulations of large systems over long time scales. To demonstrate the strength of the proposed method, a fixed-charge and a polarizable model are trained on ab initio potential-energy surfaces. Given only information about the constituting elements, the molecular topology, and reference potential energies, the models successfully learn to assign atom types and corresponding FF parameters from scratch. The resulting models and parameters are validated on a wide range of experimentally and computationally derived properties of systems including dimers, pure liquids, and molecular crystals.
Collapse
Affiliation(s)
- Moritz Thürlemann
- Laboratory of Physical Chemistry, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| | - Lennard Böselt
- Laboratory of Physical Chemistry, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| | - Sereina Riniker
- Laboratory of Physical Chemistry, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| |
Collapse
|
9
|
Fedik N, Zubatyuk R, Kulichenko M, Lubbers N, Smith JS, Nebgen B, Messerly R, Li YW, Boldyrev AI, Barros K, Isayev O, Tretiak S. Extending machine learning beyond interatomic potentials for predicting molecular properties. Nat Rev Chem 2022; 6:653-672. [PMID: 37117713 DOI: 10.1038/s41570-022-00416-3] [Citation(s) in RCA: 25] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/15/2022] [Indexed: 11/09/2022]
Abstract
Machine learning (ML) is becoming a method of choice for modelling complex chemical processes and materials. ML provides a surrogate model trained on a reference dataset that can be used to establish a relationship between a molecular structure and its chemical properties. This Review highlights developments in the use of ML to evaluate chemical properties such as partial atomic charges, dipole moments, spin and electron densities, and chemical bonding, as well as to obtain a reduced quantum-mechanical description. We overview several modern neural network architectures, their predictive capabilities, generality and transferability, and illustrate their applicability to various chemical properties. We emphasize that learned molecular representations resemble quantum-mechanical analogues, demonstrating the ability of the models to capture the underlying physics. We also discuss how ML models can describe non-local quantum effects. Finally, we conclude by compiling a list of available ML toolboxes, summarizing the unresolved challenges and presenting an outlook for future development. The observed trends demonstrate that this field is evolving towards physics-based models augmented by ML, which is accompanied by the development of new methods and the rapid growth of user-friendly ML frameworks for chemistry.
Collapse
|
10
|
Nagy PR, Gyevi-Nagy L, Lőrincz BD, Kállay M. Pursuing the basis set limit of CCSD(T) non-covalent interaction energies for medium-sized complexes: case study on the S66 compilation. Mol Phys 2022. [DOI: 10.1080/00268976.2022.2109526] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/15/2022]
Affiliation(s)
- Péter R. Nagy
- Faculty of Chemical Technology and Biotechnology, Department of Physical Chemistry and Materials Science, Budapest University of Technology and Economics, Budapest, Hungary
- ELKH-BME Quantum Chemistry Research Group, Budapest, Hungary
| | - László Gyevi-Nagy
- Faculty of Chemical Technology and Biotechnology, Department of Physical Chemistry and Materials Science, Budapest University of Technology and Economics, Budapest, Hungary
- ELKH-BME Quantum Chemistry Research Group, Budapest, Hungary
| | - Balázs D. Lőrincz
- Faculty of Chemical Technology and Biotechnology, Department of Physical Chemistry and Materials Science, Budapest University of Technology and Economics, Budapest, Hungary
- ELKH-BME Quantum Chemistry Research Group, Budapest, Hungary
| | - Mihály Kállay
- Faculty of Chemical Technology and Biotechnology, Department of Physical Chemistry and Materials Science, Budapest University of Technology and Economics, Budapest, Hungary
- ELKH-BME Quantum Chemistry Research Group, Budapest, Hungary
| |
Collapse
|
11
|
Cheng L, Sun J, Miller TF. Accurate Molecular-Orbital-Based Machine Learning Energies via Unsupervised Clustering of Chemical Space. J Chem Theory Comput 2022; 18:4826-4835. [PMID: 35858242 DOI: 10.1021/acs.jctc.2c00396] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
We introduce an unsupervised clustering algorithm to improve training efficiency and accuracy in predicting energies using molecular-orbital-based machine learning (MOB-ML). This work determines clusters via the Gaussian mixture model (GMM) in an entirely automatic manner and simplifies an earlier supervised clustering approach [ J. Chem. Theory Comput. 2019, 15, 6668] by eliminating both the necessity for user-specified parameters and the training of an additional classifier. Unsupervised clustering results from GMM have the advantages of accurately reproducing chemically intuitive groupings of frontier molecular orbitals and exhibiting improved performance with an increasing number of training examples. The resulting clusters from supervised or unsupervised clustering are further combined with scalable Gaussian process regression (GPR) or linear regression (LR) to learn molecular energies accurately by generating a local regression model in each cluster. Among all four combinations of regressors and clustering methods, GMM combined with scalable exact GPR (GMM/GPR) is the most efficient training protocol for MOB-ML. The numerical tests of molecular energy learning on thermalized data sets of drug-like molecules demonstrate the improved accuracy, transferability, and learning efficiency of GMM/GPR over other training protocols for MOB-ML, i.e., supervised regression clustering combined with GPR (RC/GPR) and GPR without clustering. GMM/GPR also provides the best molecular energy predictions compared with ones from the literature on the same benchmark data sets. With a lower scaling, GMM/GPR has a 10.4-fold speedup in wall-clock training time compared with scalable exact GPR with a training size of 6500 QM7b-T molecules.
Collapse
Affiliation(s)
- Lixue Cheng
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, United States
| | - Jiace Sun
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, United States
| | - Thomas F Miller
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, United States
| |
Collapse
|
12
|
Villot C, Ballesteros F, Wang D, Lao KU. Coupled Cluster Benchmarking of Large Noncovalent Complexes in L7 and S12L as Well as the C 60 Dimer, DNA-Ellipticine, and HIV-Indinavir. J Phys Chem A 2022; 126:4326-4341. [PMID: 35766331 DOI: 10.1021/acs.jpca.2c01421] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
In this work, we report the benchmark binding energies of the seven complexes within the L7 data set, six host-guest complexes from the S12L data set, a C60 dimer, the DNA-ellipticine intercalation complex, and the largest system of the study, the HIV-indinavir system, which contained 343 atoms or 139 heavy atoms. The high-quality values reported were obtained via a focal point method that relies on the canonical form of second-order Møller-Plesset theory and the domain-based local pair natural orbital scheme for the coupled cluster with single double and perturbative triple excitations [DLPNO-CCSD(T)] extrapolated to the complete basis set (CBS) limit. The results in this work not only corroborate but also improve upon some previous benchmark values for large noncovalent complexes albeit at a relatively steep cost. Although local CCSD(T) and the largely successful fixed-node diffusion Monte Carlo (FN-DMC) have been shown to generally agree for small- to medium-size systems, a discrepancy in their reported binding energy values arises for large complexes, where the magnitude of the disagreement is a definite cause for concern. For example, the largest deviation in the L7 data set was 2.8 kcal/mol (∼10%) on the low end in C3GC. Such a deviation only grows worse in the S12L set, which showed a difference of up to 10.4 kcal/mol (∼25%) by a conservative estimation in buckycatcher-C60. The DNA-ellipticine complex also generated a disagreement of 4.4 kcal/mol (∼10%) between both state-of-the-art methods. The disagreement between local CCSD(T) and FN-DMC in large noncovalent complexes shows that it is urgently needed to have the canonical CCSD(T), the Monte Carlo CCSD(T), or the full configuration interaction quantum Monte Carlo approaches available to large systems on the hundred-atom scale to solve this dilemma. In addition, the performances of cheaper popular computational methods were assessed for the studied complexes with respect to DLPNO-CCSD(T)/CBS. r2SCAN-3c, B97M-V, and PBE0+D4 work well in large noncovalent complexes in this work, and GFN2-xTB performs well in π-π stacking complexes. B97M-V is the most reliable computationally efficient approach to predicting noncovalent interactions for large complexes, being the only one to have binding errors within the so-called 1 kcal/mol "chemical accuracy". The benchmark interaction energies of these host-guest complexes, molecular materials, and biological systems with electronic and medicinal implications provide crucial reference data for the improvement of current and future lower-cost methods.
Collapse
Affiliation(s)
- Corentin Villot
- Department of Chemistry, Virginia Commonwealth University, Richmond, Virginia 23284 United States
| | - Francisco Ballesteros
- Department of Chemistry, Virginia Commonwealth University, Richmond, Virginia 23284 United States
| | - Danyang Wang
- Department of Chemistry, Virginia Commonwealth University, Richmond, Virginia 23284 United States
| | - Ka Un Lao
- Department of Chemistry, Virginia Commonwealth University, Richmond, Virginia 23284 United States
| |
Collapse
|
13
|
Lee S, Ermanis K, Goodman JM. MolE8: finding DFT potential energy surface minima values from force-field optimised organic molecules with new machine learning representations. Chem Sci 2022; 13:7204-7214. [PMID: 35799803 PMCID: PMC9214916 DOI: 10.1039/d1sc06324c] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2021] [Accepted: 05/23/2022] [Indexed: 11/21/2022] Open
Abstract
The use of machine learning techniques in computational chemistry has gained significant momentum since large molecular databases are now readily available. Predictions of molecular properties using machine learning have advantages over the traditional quantum mechanics calculations because they can be cheaper computationally without losing the accuracy. We present a new extrapolatable and explainable molecular representation based on bonds, angles and dihedrals that can be used to train machine learning models. The trained models can accurately predict the electronic energy and the free energy of small organic molecules with atom types C, H N and O, with a mean absolute error of 1.2 kcal mol-1. The models can be extrapolated to larger organic molecules with an average error of less than 3.7 kcal mol-1 for 10 or fewer heavy atoms, which represent a chemical space two orders of magnitude larger. The rapid energy predictions of multiple molecules, up to 7 times faster than previous ML models of similar accuracy, has been achieved by sampling geometries around the potential energy surface minima. Therefore, the input geometries do not have to be located precisely on the minima and we show that accurate density functional theory energy predictions can be made from force-field optimised geometries with a mean absolute error 2.5 kcal mol-1.
Collapse
Affiliation(s)
- Sanha Lee
- Yusuf Hamied Department of Chemistry, University of Cambridge Lensfield Road Cambridge CB2 1EW UK
| | | | - Jonathan M Goodman
- Yusuf Hamied Department of Chemistry, University of Cambridge Lensfield Road Cambridge CB2 1EW UK
| |
Collapse
|
14
|
Huang HH, Wang YS, Chao SD. A Minimum Quantum Chemistry CCSD(T)/CBS Data Set of Dimeric Interaction Energies for Small Organic Functional Groups: Heterodimers. ACS OMEGA 2022; 7:20059-20080. [PMID: 35722020 PMCID: PMC9201891 DOI: 10.1021/acsomega.2c01888] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/28/2022] [Accepted: 05/04/2022] [Indexed: 06/15/2023]
Abstract
We extend our previous quantum chemistry calculations of interaction energies for 31 homodimers of small organic functional groups (the SOFG-31 data set) by including 239 heterodimers with monomers selected within the SOFG-31 data set, thus resulting in the SOFG-31+239 data set. The minimum-level theoretical scheme contains (1) the basis set superposition error corrected supermolecule (BSSE-SM) approach for intermolecular interactions; (2) the second-order Møller-Plesset perturbation theory (MP2) with the Dunning's aug-cc-pVXZ (X = D, T, Q) basis sets for the geometry optimization and correlation energy calculations; and (3) the single-point energy calculations with the coupled cluster with single, double, and perturbative triple excitations method at the complete basis set limit [CCSD(T)/CBS] using the well-tested extrapolation methods for the MP2 energy calibrations. In addition, we have performed a parallel series of energy decomposition calculations based on the symmetry adapted perturbation theory (SAPT) in order to gain chemical insights. That the above procedure cannot be further reduced has been proven to be very crucial for constructing reliable data sets of interaction energies. The calculated CCSD(T)/CBS interaction energy data can serve as a benchmark for testing or training less accurate but more efficient calculation methods, such as the electronic density functional theory. As an application, we employ a segmental SAPT model previously developed for the SOFG-31 data set to predict binding energies of large heterodimer complexes. These model energy "quanta" can be used in coarse-grained molecular dynamics simulations by avoiding large-scale calculations.
Collapse
Affiliation(s)
- Hsing-Hsiang Huang
- Institute
of Applied Mechanics, National Taiwan University, Taipei 10617, Taiwan R.O.C.
| | - Yi-Siang Wang
- School
of Chemistry & Biochemistry, Georgia
Institute of Technology, Atlanta, Georgia 30332, United States
| | - Sheng D. Chao
- Institute
of Applied Mechanics, National Taiwan University, Taipei 10617, Taiwan R.O.C.
| |
Collapse
|
15
|
Beran GJO, Greenwell C, Rezac J. Spin-component-scaled and dispersion-corrected second-order Møller-Plesset perturbation theory: A path toward chemical accuracy. Phys Chem Chem Phys 2022; 24:3695-3712. [DOI: 10.1039/d1cp04922d] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Second-order Moller-Plesset perturbation theory (MP2) provides a valuable alternative to density functional theory for modeing problems in organic and biological chemistry. However, MP2 suffers from known limitations in the description...
Collapse
|
16
|
Christensen AS, Sirumalla SK, Qiao Z, O'Connor MB, Smith DGA, Ding F, Bygrave PJ, Anandkumar A, Welborn M, Manby FR, Miller TF. OrbNet Denali: A machine learning potential for biological and organic chemistry with semi-empirical cost and DFT accuracy. J Chem Phys 2021; 155:204103. [PMID: 34852495 DOI: 10.1063/5.0061990] [Citation(s) in RCA: 32] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
We present OrbNet Denali, a machine learning model for an electronic structure that is designed as a drop-in replacement for ground-state density functional theory (DFT) energy calculations. The model is a message-passing graph neural network that uses symmetry-adapted atomic orbital features from a low-cost quantum calculation to predict the energy of a molecule. OrbNet Denali is trained on a vast dataset of 2.3 × 106 DFT calculations on molecules and geometries. This dataset covers the most common elements in biochemistry and organic chemistry (H, Li, B, C, N, O, F, Na, Mg, Si, P, S, Cl, K, Ca, Br, and I) and charged molecules. OrbNet Denali is demonstrated on several well-established benchmark datasets, and we find that it provides accuracy that is on par with modern DFT methods while offering a speedup of up to three orders of magnitude. For the GMTKN55 benchmark set, OrbNet Denali achieves WTMAD-1 and WTMAD-2 scores of 7.19 and 9.84, on par with modern DFT functionals. For several GMTKN55 subsets, which contain chemical problems that are not present in the training set, OrbNet Denali produces a mean absolute error comparable to those of DFT methods. For the Hutchison conformer benchmark set, OrbNet Denali has a median correlation coefficient of R2 = 0.90 compared to the reference DLPNO-CCSD(T) calculation and R2 = 0.97 compared to the method used to generate the training data (ωB97X-D3/def2-TZVP), exceeding the performance of any other method with a similar cost. Similarly, the model reaches chemical accuracy for non-covalent interactions in the S66x10 dataset. For torsional profiles, OrbNet Denali reproduces the torsion profiles of ωB97X-D3/def2-TZVP with an average mean absolute error of 0.12 kcal/mol for the potential energy surfaces of the diverse fragments in the TorsionNet500 dataset.
Collapse
Affiliation(s)
| | | | - Zhuoran Qiao
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, USA
| | | | | | - Feizhi Ding
- Entos, Inc., Los Angeles, California 90027, USA
| | | | - Animashree Anandkumar
- Division of Engineering and Applied Sciences, California Institute of Technology, Pasadena, California 91125, USA
| | | | | | | |
Collapse
|
17
|
Zhang P, Chern GW. Arrested Phase Separation in Double-Exchange Models: Large-Scale Simulation Enabled by Machine Learning. PHYSICAL REVIEW LETTERS 2021; 127:146401. [PMID: 34652181 DOI: 10.1103/physrevlett.127.146401] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/07/2021] [Accepted: 09/08/2021] [Indexed: 06/13/2023]
Abstract
We present large-scale dynamical simulations of electronic phase separation in the single-band double-exchange model based on deep-learning neural-network potentials trained from small-size exact diagonalization solutions. We uncover an intriguing correlation-induced freezing behavior as doped holes are segregated from half filled insulating background during equilibration. While the aggregation of holes is stabilized by the formation of ferromagnetic clusters through Hund's coupling between charge carriers and local magnetic moments, this stabilization also creates confining potentials for holes when antiferromagnetic spin-spin correlation is well developed in the background. The dramatically reduced mobility of the self-trapped holes prematurely disrupts further growth of the ferromagnetic clusters, leading to an arrested phase separation. Implications of our findings for phase separation dynamics in materials that exhibit colossal magnetoresistance effect are discussed.
Collapse
Affiliation(s)
- Puhan Zhang
- Department of Physics, University of Virginia, Charlottesville, Virginia 22904, USA
| | - Gia-Wei Chern
- Department of Physics, University of Virginia, Charlottesville, Virginia 22904, USA
| |
Collapse
|
18
|
Keith JA, Vassilev-Galindo V, Cheng B, Chmiela S, Gastegger M, Müller KR, Tkatchenko A. Combining Machine Learning and Computational Chemistry for Predictive Insights Into Chemical Systems. Chem Rev 2021; 121:9816-9872. [PMID: 34232033 PMCID: PMC8391798 DOI: 10.1021/acs.chemrev.1c00107] [Citation(s) in RCA: 190] [Impact Index Per Article: 63.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2021] [Indexed: 12/23/2022]
Abstract
Machine learning models are poised to make a transformative impact on chemical sciences by dramatically accelerating computational algorithms and amplifying insights available from computational chemistry methods. However, achieving this requires a confluence and coaction of expertise in computer science and physical sciences. This Review is written for new and experienced researchers working at the intersection of both fields. We first provide concise tutorials of computational chemistry and machine learning methods, showing how insights involving both can be achieved. We follow with a critical review of noteworthy applications that demonstrate how computational chemistry and machine learning can be used together to provide insightful (and useful) predictions in molecular and materials modeling, retrosyntheses, catalysis, and drug design.
Collapse
Affiliation(s)
- John A. Keith
- Department
of Chemical and Petroleum Engineering Swanson School of Engineering, University of Pittsburgh, Pittsburgh, Pennsylvania 15261, United States
| | - Valentin Vassilev-Galindo
- Department
of Physics and Materials Science, University
of Luxembourg, L-1511 Luxembourg City, Luxembourg
| | - Bingqing Cheng
- Accelerate
Programme for Scientific Discovery, Department
of Computer Science and Technology, 15 J. J. Thomson Avenue, Cambridge CB3 0FD, United Kingdom
| | - Stefan Chmiela
- Department
of Software Engineering and Theoretical Computer Science, Technische Universität Berlin, 10587, Berlin, Germany
| | - Michael Gastegger
- Department
of Software Engineering and Theoretical Computer Science, Technische Universität Berlin, 10587, Berlin, Germany
| | - Klaus-Robert Müller
- Machine
Learning Group, Technische Universität
Berlin, 10587, Berlin, Germany
- Department
of Artificial Intelligence, Korea University, Anam-dong, Seongbuk-gu, Seoul, 02841, Korea
- Max-Planck-Institut für Informatik, 66123 Saarbrücken, Germany
- Google Research, Brain Team, 10117 Berlin, Germany
| | - Alexandre Tkatchenko
- Department
of Physics and Materials Science, University
of Luxembourg, L-1511 Luxembourg City, Luxembourg
| |
Collapse
|
19
|
Abstract
Abstract
Theoretical and computational chemistry aims to develop chemical theory and to apply numerical computation and simulation to reveal the mechanism behind complex chemical phenomena via quantum theory and statistical mechanics. Computation is the third pillar of scientific research together with theory and experiment. Computation enables scientists to test, discover, and build models/theories of the corresponding chemical phenomena. Theoretical and computational chemistry has been advanced to a new era due to the development of high-performance computational facilities and artificial intelligence approaches. The tendency to merge electronic structural theory with quantum chemical dynamics and statistical mechanics is of increasing interest because of the rapid development of on-the-fly dynamic simulations for complex systems plus low-scaling electronic structural theory. Another challenging issue lies in the transition from order to disorder, from thermodynamics to dynamics, and from equilibrium to non-equilibrium. Despite an increasingly rapid emergence of advances in computational power, detailed criteria for databases, effective data sharing strategies, and deep learning workflows have yet to be developed. Here, we outline some challenges and limitations of the current artificial intelligence approaches with an outlook on the potential future directions for chemistry in the big data era.
Collapse
|
20
|
Daas T, Fabiano E, Della Sala F, Gori-Giorgi P, Vuckovic S. Noncovalent Interactions from Models for the Møller-Plesset Adiabatic Connection. J Phys Chem Lett 2021; 12:4867-4875. [PMID: 34003655 PMCID: PMC8280728 DOI: 10.1021/acs.jpclett.1c01157] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2021] [Accepted: 05/13/2021] [Indexed: 05/08/2023]
Abstract
Given the omnipresence of noncovalent interactions (NCIs), their accurate simulations are of crucial importance across various scientific disciplines. Here we construct accurate models for the description of NCIs by an interpolation along the Møller-Plesset adiabatic connection (MP AC). Our interpolation approximates the correlation energy, by recovering MP2 at small coupling strengths and the correct large-coupling strength expansion of the MP AC, recently shown to be a functional of the Hartree-Fock density. Our models are size consistent for fragments with nondegenerate ground states, have the same cost as double hybrids, and require no dispersion corrections to capture NCIs accurately. These interpolations greatly reduce large MP2 errors for typical π-stacking complexes (e.g., benzene-pyridine dimers) and for the L7 data set. They are also competitive with state-of-the-art dispersion enhanced functionals and can even significantly outperform them for a variety of data sets, such as CT7 and L7.
Collapse
Affiliation(s)
- Timothy
J. Daas
- Department
of Chemistry & Pharmaceutical Sciences and Amsterdam Institute
of Molecular and Life Sciences (AIMMS), Faculty of Science, Vrije Universiteit, De Boelelaan 1083, 1081HV Amsterdam, The Netherlands
| | - Eduardo Fabiano
- Institute
for Microelectronics and Microsystems (CNR-IMM), Via Monteroni, Campus Unisalento, 73100 Lecce, Italy
- Center
for Biomolecular Nanotechnologies, Istituto
Italiano di Tecnologia, Via Barsanti 14, 73010 Arnesano (LE), Italy
| | - Fabio Della Sala
- Institute
for Microelectronics and Microsystems (CNR-IMM), Via Monteroni, Campus Unisalento, 73100 Lecce, Italy
- Center
for Biomolecular Nanotechnologies, Istituto
Italiano di Tecnologia, Via Barsanti 14, 73010 Arnesano (LE), Italy
| | - Paola Gori-Giorgi
- Department
of Chemistry & Pharmaceutical Sciences and Amsterdam Institute
of Molecular and Life Sciences (AIMMS), Faculty of Science, Vrije Universiteit, De Boelelaan 1083, 1081HV Amsterdam, The Netherlands
| | - Stefan Vuckovic
- Department
of Chemistry & Pharmaceutical Sciences and Amsterdam Institute
of Molecular and Life Sciences (AIMMS), Faculty of Science, Vrije Universiteit, De Boelelaan 1083, 1081HV Amsterdam, The Netherlands
- Physical
and Theoretical Chemistry, University of
Saarland, 66123 Saarbrücken, Germany
- Department
of Chemistry, University of California, Irvine, California 92697, United States
| |
Collapse
|
21
|
Xu P, Sattasathuchana T, Guidez E, Webb SP, Montgomery K, Yasini H, Pedreira IFM, Gordon MS. Computation of host-guest binding free energies with a new quantum mechanics based mining minima algorithm. J Chem Phys 2021; 154:104122. [PMID: 33722015 PMCID: PMC7955858 DOI: 10.1063/5.0040759] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2020] [Accepted: 02/11/2021] [Indexed: 11/14/2022] Open
Abstract
A new method called QM-VM2 is presented that efficiently combines statistical mechanics with quantum mechanical (QM) energy potentials in order to calculate noncovalent binding free energies of host-guest systems. QM-VM2 efficiently couples the use of semi-empirical QM (SEQM) energies and geometry optimizations with an underlying molecular mechanics (MM) based conformational search, to find low SEQM energy minima, and allows for processing of these minima at higher levels of ab initio QM theory. A progressive geometry optimization scheme is introduced as a means to increase conformational sampling efficiency. The newly implemented QM-VM2 is used to compute the binding free energies of the host molecule cucurbit[7]uril and a set of 15 guest molecules. The results are presented along with comparisons to experimentally determined binding affinities. For the full set of 15 host-guest complexes, which have a range of formal charges from +1 to +3, SEQM-VM2 based binding free energies show poor correlation with experiment, whereas for the ten +1 complexes only, a significant correlation (R2 = 0.8) is achieved. SEQM-VM2 generation of conformers followed by single-point ab initio QM calculations at the dispersion corrected restricted Hartree-Fock-D3(BJ) and TPSS-D3(BJ) levels of theory, as post-processing corrections, yields a reasonable correlation with experiment for the full set of host-guest complexes (R2 = 0.6 and R2 = 0.7, respectively) and an excellent correlation for the +1 formal charge set (R2 = 1.0 and R2 = 0.9, respectively), as long as a sufficiently large basis set (triple-zeta quality) is employed. The importance of the inclusion of configurational entropy, even at the MM level, for the achievement of good correlation with experiment was demonstrated by comparing the calculated ΔE values with experiment and finding a considerably poorer correlation with experiment than for the calculated free energy ΔE - TΔS. For the complete set of host-guest systems with the range of formal charges, it was observed that the deviation of the predicted binding free energy from experiment correlates somewhat with the net charge of the systems. This observation leads to a simple empirical interpolation scheme to improve the linear regression of the full set.
Collapse
Affiliation(s)
- Peng Xu
- Department of Chemistry, Iowa State University, Ames, Iowa 50014, USA
| | | | - Emilie Guidez
- Department of Chemistry, University of Colorado Denver, Denver, Colorado 80204, USA
| | - Simon P. Webb
- VeraChem LLC, 12850 Middlebrook Rd. Ste 205, Germantown, Maryland 20874-5244, USA
| | | | - Hussna Yasini
- Department of Chemistry, University of Colorado Denver, Denver, Colorado 80204, USA
| | - Iara F. M. Pedreira
- Department of Chemistry, University of Colorado Denver, Denver, Colorado 80204, USA
| | - Mark S. Gordon
- Department of Chemistry, Iowa State University, Ames, Iowa 50014, USA
| |
Collapse
|
22
|
Husch T, Sun J, Cheng L, Lee SJR, Miller TF. Improved accuracy and transferability of molecular-orbital-based machine learning: Organics, transition-metal complexes, non-covalent interactions, and transition states. J Chem Phys 2021; 154:064108. [DOI: 10.1063/5.0032362] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
Affiliation(s)
- Tamara Husch
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, USA
| | - Jiace Sun
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, USA
| | - Lixue Cheng
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, USA
| | - Sebastian J. R. Lee
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, USA
| | - Thomas F. Miller
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, USA
| |
Collapse
|
23
|
Donchev AG, Taube AG, Decolvenaere E, Hargus C, McGibbon RT, Law KH, Gregersen BA, Li JL, Palmo K, Siva K, Bergdorf M, Klepeis JL, Shaw DE. Quantum chemical benchmark databases of gold-standard dimer interaction energies. Sci Data 2021; 8:55. [PMID: 33568655 PMCID: PMC7876112 DOI: 10.1038/s41597-021-00833-x] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2020] [Accepted: 12/14/2020] [Indexed: 12/11/2022] Open
Abstract
Advances in computational chemistry create an ongoing need for larger and higher-quality datasets that characterize noncovalent molecular interactions. We present three benchmark collections of quantum mechanical data, covering approximately 3,700 distinct types of interacting molecule pairs. The first collection, which we refer to as DES370K, contains interaction energies for more than 370,000 dimer geometries. These were computed using the coupled-cluster method with single, double, and perturbative triple excitations [CCSD(T)], which is widely regarded as the gold-standard method in electronic structure theory. Our second benchmark collection, a core representative subset of DES370K called DES15K, is intended for more computationally demanding applications of the data. Finally, DES5M, our third collection, comprises interaction energies for nearly 5,000,000 dimer geometries; these were calculated using SNS-MP2, a machine learning approach that provides results with accuracy comparable to that of our coupled-cluster training data. These datasets may prove useful in the development of density functionals, empirically corrected wavefunction-based approaches, semi-empirical methods, force fields, and models trained using machine learning methods.
Collapse
Affiliation(s)
| | | | | | - Cory Hargus
- D. E. Shaw Research, New York, NY, 10036, USA
| | | | - Ka-Hei Law
- D. E. Shaw Research, New York, NY, 10036, USA
| | | | - Je-Luen Li
- D. E. Shaw Research, New York, NY, 10036, USA
| | - Kim Palmo
- D. E. Shaw Research, New York, NY, 10036, USA
| | | | | | | | - David E Shaw
- D. E. Shaw Research, New York, NY, 10036, USA. .,Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY, 10032, USA.
| |
Collapse
|
24
|
Gyevi-Nagy L, Kállay M, Nagy PR. Accurate Reduced-Cost CCSD(T) Energies: Parallel Implementation, Benchmarks, and Large-Scale Applications. J Chem Theory Comput 2021; 17:860-878. [PMID: 33400527 PMCID: PMC7884001 DOI: 10.1021/acs.jctc.0c01077] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2020] [Indexed: 11/28/2022]
Abstract
The accurate and systematically improvable frozen natural orbital (FNO) and natural auxiliary function (NAF) cost-reducing approaches are combined with our recent coupled-cluster singles, doubles, and perturbative triples [CCSD(T)] implementations. Both of the closed- and open-shell FNO-CCSD(T) codes benefit from OpenMP parallelism, completely or partially integral-direct density-fitting algorithms, checkpointing, and hand-optimized, memory- and operation count effective implementations exploiting all permutational symmetries. The closed-shell CCSD(T) code requires negligible disk I/O and network bandwidth, is MPI/OpenMP parallel, and exhibits outstanding peak performance utilization of 50-70% up to hundreds of cores. Conservative FNO and NAF truncation thresholds benchmarked for challenging reaction, atomization, and ionization energies of both closed- and open-shell species are shown to maintain 1 kJ/mol accuracy against canonical CCSD(T) for systems of 31-43 atoms even with large basis sets. The cost reduction of up to an order of magnitude achieved extends the reach of FNO-CCSD(T) to systems of 50-75 atoms (up to 2124 atomic orbitals) with triple- and quadruple-ζ basis sets, which is unprecedented without local approximations. Consequently, a considerably larger portion of the chemical compound space can now be covered by the practically "gold standard" quality FNO-CCSD(T) method using affordable resources and about a week of wall time. Large-scale applications are presented for organocatalytic and transition-metal reactions as well as noncovalent interactions. Possible applications for benchmarking local CCSD(T) methods, as well as for the accuracy assessment or parametrization of less complete models, for example, density functional approximations or machine learning potentials, are also outlined.
Collapse
Affiliation(s)
- László Gyevi-Nagy
- Department of Physical Chemistry and
Materials Science, Budapest University of
Technology and Economics, P.O. Box 91, H-1521 Budapest, Hungary
| | - Mihály Kállay
- Department of Physical Chemistry and
Materials Science, Budapest University of
Technology and Economics, P.O. Box 91, H-1521 Budapest, Hungary
| | - Péter R. Nagy
- Department of Physical Chemistry and
Materials Science, Budapest University of
Technology and Economics, P.O. Box 91, H-1521 Budapest, Hungary
| |
Collapse
|
25
|
Kenouche S, Belkadi A, Djebaili R, Melkemi N. High regioselectivity in the amination reaction of isoquinolinequinone derivatives using conceptual DFT and NCI analysis. J Mol Graph Model 2021; 104:107828. [PMID: 33444977 DOI: 10.1016/j.jmgm.2020.107828] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2020] [Revised: 12/17/2020] [Accepted: 12/21/2020] [Indexed: 11/27/2022]
Abstract
DFT-derived reactivity descriptors and Non-Covalent interaction (NCI) analysis were performed to rationalize the regioselectivity in the amination reaction of some isoquinolinequinone derivatives. Statistical analysis was performed to assess robustness of atomic charges to the basis set. Various electronic population schemes including Mulliken population analysis (MPA), electrostatic method (ChelpG), Hirshfeld population analysis (HPA) and Natural population analysis (NPA) have been considered. The results revealed that NPA was the most efficient for this purpose. NCI study using the reduced density gradient (RDG) was performed for revealing weak interactions. Domains defined by isosurfaces of RDG have been integrated to quantitatively study the strength of weak interactions and their stabilities have also been examined. Steric hindrance caused by coordination of ethanol with the neighboring carbonyl prevents the nucleophilic attack on C-6 and therefore leads to preferential C-7 substitution. The quantitative study of NCI clearly demonstrates that the hydrogen bond of carbonyl (2) is more stabilizing. Consequently, the high polarity of hydrogen bond on this carbonyl may explain the high electrophilicity of C-5 compared to C-8. Our work proved that the difference in local reactivity, as well as the steric hindrance are the key elements explaining the high regioselectivity exhibited by the amination reaction.
Collapse
Affiliation(s)
- S Kenouche
- Research Team: Modeling of Chemical Systems Using Quantum Calculations, LCA Laboratory, University Mohamed Khider of Biskra, 07000, Biskra, Algeria.
| | - A Belkadi
- Research Team: Computational and Pharmaceutical Chemistry, LMCE Laboratory, University Mohamed Khider of Biskra, 07000, Biskra, Algeria
| | - R Djebaili
- Research Team: Computational and Pharmaceutical Chemistry, LMCE Laboratory, University Mohamed Khider of Biskra, 07000, Biskra, Algeria
| | - N Melkemi
- Research Team: Computational and Pharmaceutical Chemistry, LMCE Laboratory, University Mohamed Khider of Biskra, 07000, Biskra, Algeria
| |
Collapse
|
26
|
Tripathi MK, Ramanathan V. Conformational stability and structural analysis of methanethiol clusters: a revisit. RSC Adv 2021; 11:29207-29214. [PMID: 35479559 PMCID: PMC9040644 DOI: 10.1039/d1ra04900c] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2021] [Accepted: 08/18/2021] [Indexed: 01/12/2023] Open
Abstract
B3LYP/cc-pV(D/T/Q)Z and CCSD/cc-pVDZ levels of theory predict three minima for both dimers and trimers of methanethiol. Predictions at B3LYP/cc-pVDZ corroborates exceedingly well with the earlier reported experimental value but significantly differ from the previous computational predictions. Interaction energy between the molecules decreases with an increase in the size of the basis set for both the dimer and trimer. The dipole moment of methanethiol dimer gets reduced at the B3LYP/cc-pVDZ level of theory relative to all minima configurations, and the same is seen for trimer also. These new predictions are well supported by atoms in molecules (AIM), frontier molecular orbital (FMO), Mulliken charge (MC), and natural bond orbital (NBO) analysis. B3LYP/cc-pV(D/T/Q)Z and CCSD/cc-pVDZ levels of theory predict three minima for both dimers and trimers of methanethiol.![]()
Collapse
Affiliation(s)
- Manish Kumar Tripathi
- Department of Chemistry, Indian Institute of Technology (Banaras Hindu University), Varanasi, Uttar Pradesh, India
| | - Venkatnarayan Ramanathan
- Department of Chemistry, Indian Institute of Technology (Banaras Hindu University), Varanasi, Uttar Pradesh, India
| |
Collapse
|
27
|
Head-Marsden K, Flick J, Ciccarino CJ, Narang P. Quantum Information and Algorithms for Correlated Quantum Matter. Chem Rev 2020; 121:3061-3120. [PMID: 33326218 DOI: 10.1021/acs.chemrev.0c00620] [Citation(s) in RCA: 37] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Discoveries in quantum materials, which are characterized by the strongly quantum-mechanical nature of electrons and atoms, have revealed exotic properties that arise from correlations. It is the promise of quantum materials for quantum information science superimposed with the potential of new computational quantum algorithms to discover new quantum materials that inspires this Review. We anticipate that quantum materials to be discovered and developed in the next years will transform the areas of quantum information processing including communication, storage, and computing. Simultaneously, efforts toward developing new quantum algorithmic approaches for quantum simulation and advanced calculation methods for many-body quantum systems enable major advances toward functional quantum materials and their deployment. The advent of quantum computing brings new possibilities for eliminating the exponential complexity that has stymied simulation of correlated quantum systems on high-performance classical computers. Here, we review new algorithms and computational approaches to predict and understand the behavior of correlated quantum matter. The strongly interdisciplinary nature of the topics covered necessitates a common language to integrate ideas from these fields. We aim to provide this common language while weaving together fields across electronic structure theory, quantum electrodynamics, algorithm design, and open quantum systems. Our Review is timely in presenting the state-of-the-art in the field toward algorithms with nonexponential complexity for correlated quantum matter with applications in grand-challenge problems. Looking to the future, at the intersection of quantum information science and algorithms for correlated quantum matter, we envision seminal advances in predicting many-body quantum states and describing excitonic quantum matter and large-scale entangled states, a better understanding of high-temperature superconductivity, and quantifying open quantum system dynamics.
Collapse
Affiliation(s)
- Kade Head-Marsden
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, Massachusetts 02138, United States
| | - Johannes Flick
- Center for Computational Quantum Physics, Flatiron Institute, New York, New York 10010, United States
| | - Christopher J Ciccarino
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, Massachusetts 02138, United States.,Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts 02138, United States
| | - Prineha Narang
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, Massachusetts 02138, United States
| |
Collapse
|
28
|
Chang YM, Wang YS, Chao SD. A minimum quantum chemistry CCSD(T)/CBS dataset of dimeric interaction energies for small organic functional groups. J Chem Phys 2020; 153:154301. [PMID: 33092384 DOI: 10.1063/5.0019392] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
We have performed a quantum chemistry study on the bonding patterns and interaction energies for 31 dimers of small organic functional groups (dubbed the SOFG-31 dataset), including the alkane-alkene-alkyne (6 + 4 + 4 = 14, AAA) groups, alcohol-aldehyde-ketone (4 + 4 + 3 = 11, AAK) groups, and carboxylic acid-amide (3 + 3 = 6, CAA) groups. The basis set superposition error corrected super-molecule approach using the second order Møller-Plesset perturbation theory (MP2) with the Dunning's aug-cc-pVXZ (X = D, T, Q) basis sets has been employed in the geometry optimization and energy calculations. To calibrate the MP2 calculated interaction energies for these dimeric complexes, we perform single-point calculations with the coupled cluster with single, double, and perturbative triple excitations method at the complete basis set limit [CCSD(T)/CBS] using the well-tested extrapolation methods. In order to gain more physical insights, we also perform a parallel series of energy decomposition calculations based on the symmetry adapted perturbation theory (SAPT). The collection of these CCSD(T)/CBS interaction energy values can serve as a minimum quantum chemistry dataset for testing or training less accurate but more efficient calculation methods. As an application, we further propose a segmental SAPT model based on chemically recognizable segments in a specific functional group. These model interactions can be used to construct coarse-grained force fields for larger molecular systems.
Collapse
Affiliation(s)
- Yu-Ming Chang
- Institute of Applied Mechanics, National Taiwan University, Taipei 10617, Taiwan
| | - Yi-Siang Wang
- Institute of Applied Mechanics, National Taiwan University, Taipei 10617, Taiwan
| | - Sheng D Chao
- Institute of Applied Mechanics, National Taiwan University, Taipei 10617, Taiwan
| |
Collapse
|
29
|
Qiao Z, Welborn M, Anandkumar A, Manby FR, Miller TF. OrbNet: Deep learning for quantum chemistry using symmetry-adapted atomic-orbital features. J Chem Phys 2020; 153:124111. [DOI: 10.1063/5.0021955] [Citation(s) in RCA: 75] [Impact Index Per Article: 18.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Affiliation(s)
- Zhuoran Qiao
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, USA
| | - Matthew Welborn
- Entos, Inc., 4470 W Sunset Blvd., Suite 107 PMB 94758, Los Angeles, California 90027, USA
| | - Animashree Anandkumar
- Division of Engineering and Applied Sciences, California Institute of Technology, Pasadena, California 91125, USA
| | - Frederick R. Manby
- Entos, Inc., 4470 W Sunset Blvd., Suite 107 PMB 94758, Los Angeles, California 90027, USA
| | - Thomas F. Miller
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, USA
- Entos, Inc., 4470 W Sunset Blvd., Suite 107 PMB 94758, Los Angeles, California 90027, USA
| |
Collapse
|
30
|
Glick ZL, Metcalf DP, Koutsoukas A, Spronk SA, Cheney DL, Sherrill CD. AP-Net: An atomic-pairwise neural network for smooth and transferable interaction potentials. J Chem Phys 2020; 153:044112. [DOI: 10.1063/5.0011521] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Affiliation(s)
- Zachary L. Glick
- Center for Computational Molecular Science and Technology, School of Chemistry and Biochemistry, and School of Computational Science and Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332-0400, USA
| | - Derek P. Metcalf
- Center for Computational Molecular Science and Technology, School of Chemistry and Biochemistry, and School of Computational Science and Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332-0400, USA
| | - Alexios Koutsoukas
- Molecular Structure and Design, Bristol Myers Squibb Company, P.O. Box 5400, Princeton, New Jersey 08543, USA
| | - Steven A. Spronk
- Molecular Structure and Design, Bristol Myers Squibb Company, P.O. Box 5400, Princeton, New Jersey 08543, USA
| | - Daniel L. Cheney
- Molecular Structure and Design, Bristol Myers Squibb Company, P.O. Box 5400, Princeton, New Jersey 08543, USA
| | - C. David Sherrill
- Center for Computational Molecular Science and Technology, School of Chemistry and Biochemistry, and School of Computational Science and Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332-0400, USA
| |
Collapse
|
31
|
Smith DGA, Burns LA, Simmonett AC, Parrish RM, Schieber MC, Galvelis R, Kraus P, Kruse H, Di Remigio R, Alenaizan A, James AM, Lehtola S, Misiewicz JP, Scheurer M, Shaw RA, Schriber JB, Xie Y, Glick ZL, Sirianni DA, O’Brien JS, Waldrop JM, Kumar A, Hohenstein EG, Pritchard BP, Brooks BR, Schaefer HF, Sokolov AY, Patkowski K, DePrince AE, Bozkaya U, King RA, Evangelista FA, Turney JM, Crawford TD, Sherrill CD. Psi4 1.4: Open-source software for high-throughput quantum chemistry. J Chem Phys 2020; 152:184108. [PMID: 32414239 PMCID: PMC7228781 DOI: 10.1063/5.0006002] [Citation(s) in RCA: 337] [Impact Index Per Article: 84.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2020] [Accepted: 04/12/2020] [Indexed: 12/13/2022] Open
Abstract
PSI4 is a free and open-source ab initio electronic structure program providing implementations of Hartree-Fock, density functional theory, many-body perturbation theory, configuration interaction, density cumulant theory, symmetry-adapted perturbation theory, and coupled-cluster theory. Most of the methods are quite efficient, thanks to density fitting and multi-core parallelism. The program is a hybrid of C++ and Python, and calculations may be run with very simple text files or using the Python API, facilitating post-processing and complex workflows; method developers also have access to most of PSI4's core functionalities via Python. Job specification may be passed using The Molecular Sciences Software Institute (MolSSI) QCSCHEMA data format, facilitating interoperability. A rewrite of our top-level computation driver, and concomitant adoption of the MolSSI QCARCHIVE INFRASTRUCTURE project, makes the latest version of PSI4 well suited to distributed computation of large numbers of independent tasks. The project has fostered the development of independent software components that may be reused in other quantum chemistry programs.
Collapse
Affiliation(s)
| | - Lori A. Burns
- Center for Computational Molecular Science and
Technology, School of Chemistry and Biochemistry, School of Computational Science and
Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332-0400,
USA
| | - Andrew C. Simmonett
- National Institutes of Health – National Heart,
Lung and Blood Institute, Laboratory of Computational Biology, Bethesda,
Maryland 20892, USA
| | - Robert M. Parrish
- Center for Computational Molecular Science and
Technology, School of Chemistry and Biochemistry, School of Computational Science and
Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332-0400,
USA
| | - Matthew C. Schieber
- Center for Computational Molecular Science and
Technology, School of Chemistry and Biochemistry, School of Computational Science and
Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332-0400,
USA
| | | | - Peter Kraus
- School of Molecular and Life Sciences, Curtin
University, Kent St., Bentley, Perth, Western Australia 6102,
Australia
| | - Holger Kruse
- Institute of Biophysics of the Czech Academy of
Sciences, Královopolská 135, 612 65 Brno, Czech
Republic
| | - Roberto Di Remigio
- Department of Chemistry, Centre for Theoretical
and Computational Chemistry, UiT, The Arctic University of Norway, N-9037
Tromsø, Norway
| | - Asem Alenaizan
- Center for Computational Molecular Science and
Technology, School of Chemistry and Biochemistry, School of Computational Science and
Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332-0400,
USA
| | - Andrew M. James
- Department of Chemistry, Virginia
Tech, Blacksburg, Virginia 24061, USA
| | - Susi Lehtola
- Department of Chemistry, University of
Helsinki, P.O. Box 55 (A. I. Virtasen aukio 1), FI-00014 Helsinki,
Finland
| | - Jonathon P. Misiewicz
- Center for Computational Quantum Chemistry,
University of Georgia, Athens, Georgia 30602, USA
| | - Maximilian Scheurer
- Interdisciplinary Center for Scientific
Computing, Heidelberg University, D-69120 Heidelberg,
Germany
| | - Robert A. Shaw
- ARC Centre of Excellence in Exciton Science,
School of Science, RMIT University, Melbourne, VIC 3000,
Australia
| | - Jeffrey B. Schriber
- Center for Computational Molecular Science and
Technology, School of Chemistry and Biochemistry, School of Computational Science and
Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332-0400,
USA
| | - Yi Xie
- Center for Computational Molecular Science and
Technology, School of Chemistry and Biochemistry, School of Computational Science and
Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332-0400,
USA
| | - Zachary L. Glick
- Center for Computational Molecular Science and
Technology, School of Chemistry and Biochemistry, School of Computational Science and
Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332-0400,
USA
| | - Dominic A. Sirianni
- Center for Computational Molecular Science and
Technology, School of Chemistry and Biochemistry, School of Computational Science and
Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332-0400,
USA
| | - Joseph Senan O’Brien
- Center for Computational Molecular Science and
Technology, School of Chemistry and Biochemistry, School of Computational Science and
Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332-0400,
USA
| | - Jonathan M. Waldrop
- Department of Chemistry and Biochemistry, Auburn
University, Auburn, Alabama 36849, USA
| | - Ashutosh Kumar
- Department of Chemistry, Virginia
Tech, Blacksburg, Virginia 24061, USA
| | - Edward G. Hohenstein
- SLAC National Accelerator Laboratory, Stanford
PULSE Institute, Menlo Park, California 94025,
USA
| | | | - Bernard R. Brooks
- National Institutes of Health – National Heart,
Lung and Blood Institute, Laboratory of Computational Biology, Bethesda,
Maryland 20892, USA
| | - Henry F. Schaefer
- Center for Computational Quantum Chemistry,
University of Georgia, Athens, Georgia 30602, USA
| | - Alexander Yu. Sokolov
- Department of Chemistry and Biochemistry, The
Ohio State University, Columbus, Ohio 43210, USA
| | - Konrad Patkowski
- Department of Chemistry and Biochemistry, Auburn
University, Auburn, Alabama 36849, USA
| | - A. Eugene DePrince
- Department of Chemistry and Biochemistry,
Florida State University, Tallahassee, Florida 32306-4390,
USA
| | - Uğur Bozkaya
- Department of Chemistry, Hacettepe
University, Ankara 06800, Turkey
| | - Rollin A. King
- Department of Chemistry, Bethel
University, St. Paul, Minnesota 55112, USA
| | | | - Justin M. Turney
- Center for Computational Quantum Chemistry,
University of Georgia, Athens, Georgia 30602, USA
| | | | - C. David Sherrill
- Center for Computational Molecular Science and
Technology, School of Chemistry and Biochemistry, School of Computational Science and
Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332-0400,
USA
| |
Collapse
|
32
|
Manzhos S. Machine learning for the solution of the Schrödinger equation. MACHINE LEARNING-SCIENCE AND TECHNOLOGY 2020. [DOI: 10.1088/2632-2153/ab7d30] [Citation(s) in RCA: 36] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
33
|
Fabrizio A, Petraglia R, Corminboeuf C. Balancing Density Functional Theory Interaction Energies in Charged Dimers Precursors to Organic Semiconductors. J Chem Theory Comput 2020; 16:3530-3542. [DOI: 10.1021/acs.jctc.9b01193] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Affiliation(s)
- Alberto Fabrizio
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
- National Centre for Computational Design and Discovery of Novel Materials (MARVEL), École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Riccardo Petraglia
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Clemence Corminboeuf
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
- National Centre for Computational Design and Discovery of Novel Materials (MARVEL), École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| |
Collapse
|
34
|
Fabrizio A, Meyer B, Corminboeuf C. Machine learning models of the energy curvature vs particle number for optimal tuning of long-range corrected functionals. J Chem Phys 2020; 152:154103. [DOI: 10.1063/5.0005039] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023] Open
Affiliation(s)
- Alberto Fabrizio
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
- National Centre for Computational Design and Discovery of Novel Materials (MARVEL), École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Benjamin Meyer
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
- National Centre for Computational Design and Discovery of Novel Materials (MARVEL), École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Clemence Corminboeuf
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
- National Centre for Computational Design and Discovery of Novel Materials (MARVEL), École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| |
Collapse
|
35
|
Abstract
As the quantum chemistry (QC) community embraces machine learning (ML), the number of new methods and applications based on the combination of QC and ML is surging. In this Perspective, a view of the current state of affairs in this new and exciting research field is offered, challenges of using machine learning in quantum chemistry applications are described, and potential future developments are outlined. Specifically, examples of how machine learning is used to improve the accuracy and accelerate quantum chemical research are shown. Generalization and classification of existing techniques are provided to ease the navigation in the sea of literature and to guide researchers entering the field. The emphasis of this Perspective is on supervised machine learning.
Collapse
Affiliation(s)
- Pavlo O Dral
- State Key Laboratory of Physical Chemistry of Solid Surfaces, Fujian Provincial Key Laboratory of Theoretical and Computational Chemistry, Department of Chemistry, and College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China
| |
Collapse
|
36
|
Mezei PD, von Lilienfeld OA. Noncovalent Quantum Machine Learning Corrections to Density Functionals. J Chem Theory Comput 2020; 16:2647-2653. [DOI: 10.1021/acs.jctc.0c00181] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Affiliation(s)
- Pál D. Mezei
- Institute of Physical Chemistry and National Center for Computational Design and Discovery of Novel Materials, Department of Chemistry, University of Basel, 4001 Basel, Switzerland
| | - O. Anatole von Lilienfeld
- Institute of Physical Chemistry and National Center for Computational Design and Discovery of Novel Materials, Department of Chemistry, University of Basel, 4001 Basel, Switzerland
| |
Collapse
|
37
|
Jeong W, Stoneburner SJ, King D, Li R, Walker A, Lindh R, Gagliardi L. Automation of Active Space Selection for Multireference Methods via Machine Learning on Chemical Bond Dissociation. J Chem Theory Comput 2020; 16:2389-2399. [DOI: 10.1021/acs.jctc.9b01297] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Affiliation(s)
- WooSeok Jeong
- Department of Chemistry, Nanoporous Materials Genome Center, Minnesota Supercomputing Institute, and Chemical Theory Center, University of Minnesota, 207 Pleasant Street Southeast, Minneapolis, Minnesota 55455, United States
| | - Samuel J. Stoneburner
- Department of Chemistry, Nanoporous Materials Genome Center, Minnesota Supercomputing Institute, and Chemical Theory Center, University of Minnesota, 207 Pleasant Street Southeast, Minneapolis, Minnesota 55455, United States
| | - Daniel King
- Department of Chemistry, Nanoporous Materials Genome Center, Minnesota Supercomputing Institute, and Chemical Theory Center, University of Minnesota, 207 Pleasant Street Southeast, Minneapolis, Minnesota 55455, United States
| | - Ruye Li
- Department of Chemistry, Nanoporous Materials Genome Center, Minnesota Supercomputing Institute, and Chemical Theory Center, University of Minnesota, 207 Pleasant Street Southeast, Minneapolis, Minnesota 55455, United States
| | - Andrew Walker
- Department of Computer Science and Engineering, University of Minnesota, 200 Union Street Southeast, Minneapolis, Minnesota 55455, United States
| | - Roland Lindh
- Department of Chemistry—BMC, and Uppsala Center for Computational Chemistry—UC3, Uppsala University, 751 23 Uppsala, Sweden
| | - Laura Gagliardi
- Department of Chemistry, Nanoporous Materials Genome Center, Minnesota Supercomputing Institute, and Chemical Theory Center, University of Minnesota, 207 Pleasant Street Southeast, Minneapolis, Minnesota 55455, United States
| |
Collapse
|
38
|
Metcalf DP, Koutsoukas A, Spronk SA, Claus BL, Loughney DA, Johnson SR, Cheney DL, Sherrill CD. Approaches for machine learning intermolecular interaction energies and application to energy components from symmetry adapted perturbation theory. J Chem Phys 2020; 152:074103. [DOI: 10.1063/1.5142636] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Affiliation(s)
- Derek P. Metcalf
- Center for Computational Molecular Science and Technology, School of Chemistry and Biochemistry and School of Computational Science and Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332-0400, USA
| | - Alexios Koutsoukas
- Molecular Structure and Design, Bristol-Myers Squibb Company, P.O. Box 5400, Princeton, New Jersey 08543, USA
| | - Steven A. Spronk
- Molecular Structure and Design, Bristol-Myers Squibb Company, P.O. Box 5400, Princeton, New Jersey 08543, USA
| | - Brian L. Claus
- Molecular Structure and Design, Bristol-Myers Squibb Company, P.O. Box 5400, Princeton, New Jersey 08543, USA
| | - Deborah A. Loughney
- Molecular Structure and Design, Bristol-Myers Squibb Company, P.O. Box 5400, Princeton, New Jersey 08543, USA
| | - Stephen R. Johnson
- Molecular Structure and Design, Bristol-Myers Squibb Company, P.O. Box 5400, Princeton, New Jersey 08543, USA
| | - Daniel L. Cheney
- Molecular Structure and Design, Bristol-Myers Squibb Company, P.O. Box 5400, Princeton, New Jersey 08543, USA
| | - C. David Sherrill
- Center for Computational Molecular Science and Technology, School of Chemistry and Biochemistry and School of Computational Science and Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332-0400, USA
| |
Collapse
|
39
|
|
40
|
Patkowski K. Recent developments in symmetry‐adapted perturbation theory. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2019. [DOI: 10.1002/wcms.1452] [Citation(s) in RCA: 53] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]
Affiliation(s)
- Konrad Patkowski
- Department of Chemistry and Biochemistry Auburn University Auburn Alabama
| |
Collapse
|
41
|
Gyevi-Nagy L, Kállay M, Nagy PR. Integral-Direct and Parallel Implementation of the CCSD(T) Method: Algorithmic Developments and Large-Scale Applications. J Chem Theory Comput 2019; 16:366-384. [DOI: 10.1021/acs.jctc.9b00957] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Affiliation(s)
- László Gyevi-Nagy
- Department of Physical Chemistry and Materials Science, Budapest University of Technology and Economics, P.O. Box 91, H-1521 Budapest, Hungary
| | | | | |
Collapse
|
42
|
Fabrizio A, Grisafi A, Meyer B, Ceriotti M, Corminboeuf C. Electron density learning of non-covalent systems. Chem Sci 2019; 10:9424-9432. [PMID: 32055318 PMCID: PMC6991182 DOI: 10.1039/c9sc02696g] [Citation(s) in RCA: 70] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2019] [Accepted: 09/08/2019] [Indexed: 11/21/2022] Open
Abstract
Chemists continuously harvest the power of non-covalent interactions to control phenomena in both the micro- and macroscopic worlds. From the quantum chemical perspective, the strategies essentially rely upon an in-depth understanding of the physical origin of these interactions, the quantification of their magnitude and their visualization in real-space. The total electron density ρ( r ) represents the simplest yet most comprehensive piece of information available for fully characterizing bonding patterns and non-covalent interactions. The charge density of a molecule can be computed by solving the Schrödinger equation, but this approach becomes rapidly demanding if the electron density has to be evaluated for thousands of different molecules or very large chemical systems, such as peptides and proteins. Here we present a transferable and scalable machine-learning model capable of predicting the total electron density directly from the atomic coordinates. The regression model is used to access qualitative and quantitative insights beyond the underlying ρ( r ) in a diverse ensemble of sidechain-sidechain dimers extracted from the BioFragment database (BFDb). The transferability of the model to more complex chemical systems is demonstrated by predicting and analyzing the electron density of a collection of 8 polypeptides.
Collapse
Affiliation(s)
- Alberto Fabrizio
- Laboratory for Computational Molecular Design , Institute of Chemical Sciences and Engineering , École Polytechnique Fédérale de Lausanne , CH-1015 Lausanne , Switzerland .
- National Centre for Computational Design and Discovery of Novel Materials (MARVEL) , École Polytechnique Fédérale de Lausanne , 1015 Lausanne , Switzerland
| | - Andrea Grisafi
- Laboratory of Computational Science and Modeling , IMX , École Polytechnique Fédérale de Lausanne , 1015 Lausanne , Switzerland
- National Centre for Computational Design and Discovery of Novel Materials (MARVEL) , École Polytechnique Fédérale de Lausanne , 1015 Lausanne , Switzerland
| | - Benjamin Meyer
- Laboratory for Computational Molecular Design , Institute of Chemical Sciences and Engineering , École Polytechnique Fédérale de Lausanne , CH-1015 Lausanne , Switzerland .
- National Centre for Computational Design and Discovery of Novel Materials (MARVEL) , École Polytechnique Fédérale de Lausanne , 1015 Lausanne , Switzerland
| | - Michele Ceriotti
- Laboratory of Computational Science and Modeling , IMX , École Polytechnique Fédérale de Lausanne , 1015 Lausanne , Switzerland
- National Centre for Computational Design and Discovery of Novel Materials (MARVEL) , École Polytechnique Fédérale de Lausanne , 1015 Lausanne , Switzerland
| | - Clemence Corminboeuf
- Laboratory for Computational Molecular Design , Institute of Chemical Sciences and Engineering , École Polytechnique Fédérale de Lausanne , CH-1015 Lausanne , Switzerland .
- National Centre for Computational Design and Discovery of Novel Materials (MARVEL) , École Polytechnique Fédérale de Lausanne , 1015 Lausanne , Switzerland
| |
Collapse
|
43
|
Cheng L, Kovachki NB, Welborn M, Miller TF. Regression Clustering for Improved Accuracy and Training Costs with Molecular-Orbital-Based Machine Learning. J Chem Theory Comput 2019; 15:6668-6677. [DOI: 10.1021/acs.jctc.9b00884] [Citation(s) in RCA: 39] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
Affiliation(s)
- Lixue Cheng
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, United States
| | - Nikola B. Kovachki
- Computing and Mathematical Sciences, California Institute of Technology, Pasadena, California 91125, United States
| | - Matthew Welborn
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, United States
| | - Thomas F. Miller
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, United States
| |
Collapse
|
44
|
Coe JP. Machine Learning Configuration Interaction for ab Initio Potential Energy Curves. J Chem Theory Comput 2019; 15:6179-6189. [DOI: 10.1021/acs.jctc.9b00828] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Affiliation(s)
- Jeremy P. Coe
- Institute of Chemical Sciences, School of Engineering and Physical Sciences, Heriot-Watt University, Edinburgh EH14 4AS, United Kingdom
| |
Collapse
|
45
|
Proppe J, Gugler S, Reiher M. Gaussian Process-Based Refinement of Dispersion Corrections. J Chem Theory Comput 2019; 15:6046-6060. [PMID: 31603673 DOI: 10.1021/acs.jctc.9b00627] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
We employ Gaussian process (GP) regression to adjust for systematic errors in D3-type dispersion corrections. We refer to the associated, statistically improved model as D3-GP. It is trained on differences between interaction energies obtained from PBE-D3(BJ)/ma-def2-QZVPP and DLPNO-CCSD(T)/CBS calculations. We generated a data set containing interaction energies for 1248 molecular dimers, which resemble the dispersion-dominated systems contained in the S66 data set. Our systems represent not only equilibrium structures but also dimers with various relative orientations and conformations at both shorter and longer distances. A reparametrization of the D3(BJ) model based on 66 of these dimers suggests that two of its three empirical parameters, a1 and s8, are zero, whereas a2 = 5.6841 bohr. For the remaining 1182 dimers, we find that this new set of parameters is superior to all previously published D3(BJ) parameter sets. To train our D3-GP model, we engineered two different vectorial representations of (supra-)molecular systems, both derived from the matrix of atom-pairwise D3(BJ) interaction terms: (a) a distance-resolved interaction energy histogram, histD3(BJ), and (b) eigenvalues of the interaction matrix ordered according to their decreasing absolute value, eigD3(BJ). Hence, the GP learns a mapping from D3(BJ) information only, which renders D3-GP-type dispersion corrections comparable to those obtained with the original D3 approach. They improve systematically if the underlying training set is selected carefully. Here, we harness the prediction variance obtained from GP regression to select optimal training sets in an automated fashion. The larger the variance, the more information the corresponding data point may add to the training set. For a given set of molecular systems, variance-based sampling can approximately determine the smallest subset being subjected to reference calculations such that all dispersion corrections for the remaining systems fall below a predefined accuracy threshold. To render the entire D3-GP workflow as efficient as possible, we present an improvement over our variance-based, sequential active-learning scheme [ J. Chem. Theory Comput. 2018 , 14 , 5238 ]. Our refined learning algorithm selects multiple (instead of single) systems that can be subjected to reference calculations simultaneously. We refer to the underlying selection strategy as batchwise variance-based sampling (BVS). BVS-guided active learning is an essential component of our D3-GP workflow, which is implemented in a black-box fashion. Once provided with reference data for new molecular systems, the underlying GP model automatically learns to adapt to these and similar systems. This approach leads overall to a self-improving model (D3-GP) that predicts system-focused and GP-refined D3-type dispersion corrections for any given system of reference data.
Collapse
Affiliation(s)
- Jonny Proppe
- Department of Chemistry , and Department of Computer Science , University of Toronto , Toronto , Ontario M5S , Canada.,Laboratory of Physical Chemistry , ETH Zurich , Vladimir-Prelog-Weg 2 , 8093 Zurich , Switzerland
| | - Stefan Gugler
- Laboratory of Physical Chemistry , ETH Zurich , Vladimir-Prelog-Weg 2 , 8093 Zurich , Switzerland
| | - Markus Reiher
- Laboratory of Physical Chemistry , ETH Zurich , Vladimir-Prelog-Weg 2 , 8093 Zurich , Switzerland
| |
Collapse
|
46
|
Kodrycka M, Patkowski K. Platinum, gold, and silver standards of intermolecular interaction energy calculations. J Chem Phys 2019; 151:070901. [DOI: 10.1063/1.5116151] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Affiliation(s)
- Monika Kodrycka
- Department of Chemistry and Biochemistry, Auburn University, Auburn, Alabama 36849, USA
| | - Konrad Patkowski
- Department of Chemistry and Biochemistry, Auburn University, Auburn, Alabama 36849, USA
| |
Collapse
|
47
|
Townsend J, Vogiatzis KD. Data-Driven Acceleration of the Coupled-Cluster Singles and Doubles Iterative Solver. J Phys Chem Lett 2019; 10:4129-4135. [PMID: 31290671 DOI: 10.1021/acs.jpclett.9b01442] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
Solving the coupled-cluster (CC) equations is a cost-prohibitive process that exhibits poor scaling with system size. These equations are solved by determining the set of amplitudes (t) that minimize the system energy with respect to the coupled-cluster equations at the selected level of truncation. Here, a novel approach to predict the converged coupled-cluster singles and doubles (CCSD) amplitudes, thus the coupled-cluster wave function, is explored by using machine learning and electronic structure properties inherent to the MP2 level. Features are collected from quantum chemical data, such as orbital energies, one-electron Hamiltonian, Coulomb, and exchange terms. The data-driven CCSD (DDCCSD) is not an alchemical method because the actual iterative coupled-cluster equations are solved. However, accurate energetics can also be obtained by bypassing solving the CC equations entirely. Our preliminary data show that it is possible to achieve remarkable speedups in solving the CCSD equations, especially when the correct physics are encoded and used for training of machine learning models.
Collapse
Affiliation(s)
- Jacob Townsend
- Department of Chemistry , University of Tennessee , Knoxville , Tennessee 37996 , United States
| | | |
Collapse
|
48
|
Abstract
Machine learning enables computers to address problems by learning from data. Deep learning is a type of machine learning that uses a hierarchical recombination of features to extract pertinent information and then learn the patterns represented in the data. Over the last eight years, its abilities have increasingly been applied to a wide variety of chemical challenges, from improving computational chemistry to drug and materials design and even synthesis planning. This review aims to explain the concepts of deep learning to chemists from any background and follows this with an overview of the diverse applications demonstrated in the literature. We hope that this will empower the broader chemical community to engage with this burgeoning field and foster the growing movement of deep learning accelerated chemistry.
Collapse
Affiliation(s)
- Adam C Mater
- ARC Centre of Excellence for Electromaterials Science, Research School of Chemistry , Australian National University , Canberra , Australian Capital Territory 2601 , Australia
| | - Michelle L Coote
- ARC Centre of Excellence for Electromaterials Science, Research School of Chemistry , Australian National University , Canberra , Australian Capital Territory 2601 , Australia
| |
Collapse
|
49
|
Cheng L, Welborn M, Christensen AS, Miller TF. A universal density matrix functional from molecular orbital-based machine learning: Transferability across organic molecules. J Chem Phys 2019; 150:131103. [DOI: 10.1063/1.5088393] [Citation(s) in RCA: 70] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Affiliation(s)
- Lixue Cheng
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, USA
| | - Matthew Welborn
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, USA
| | - Anders S. Christensen
- Institute of Physical Chemistry and National Center for Computational Design and Discovery of Novel Materials, Department of Chemistry, University of Basel, Basel, Switzerland
| | - Thomas F. Miller
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, USA
| |
Collapse
|
50
|
Kruse H, Banáš P, Šponer J. Investigations of Stacked DNA Base-Pair Steps: Highly Accurate Stacking Interaction Energies, Energy Decomposition, and Many-Body Stacking Effects. J Chem Theory Comput 2018; 15:95-115. [DOI: 10.1021/acs.jctc.8b00643] [Citation(s) in RCA: 43] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Holger Kruse
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 65 Brno, Czech Republic
| | - Pavel Banáš
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 65 Brno, Czech Republic
- Regional Centre of Advanced Technologies and Materials, Department of Physical Chemistry, Faculty of Science, Palacky University, 17 Listopadu 12, 77146 Olomouc, Czech Republic
| | - Jiřı́ Šponer
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 65 Brno, Czech Republic
- Regional Centre of Advanced Technologies and Materials, Department of Physical Chemistry, Faculty of Science, Palacky University, 17 Listopadu 12, 77146 Olomouc, Czech Republic
| |
Collapse
|