1
|
Sami S, LaCour RA, Heindel JP, Head-Gordon T. Simple and Accurate One-Body Energy and Dipole Moment Surfaces for Water and Beyond. J Phys Chem Lett 2024:6712-6721. [PMID: 38900596 DOI: 10.1021/acs.jpclett.4c00587] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/22/2024]
Abstract
Water is often the testing ground for new, advanced force fields. While advanced functional forms for intermolecular interactions have been integral to the development of accurate water models, less attention has been paid to a transferable model for intramolecular valence terms. In this work, we present a one-body energy and dipole moment surface model, named 1B-UCB, that is simple yet accurate and can be feasibly adapted for both standard and advanced potentials. 1B-UCB for water is comparable in accuracy to those with much more complex functional forms, despite having drastically fewer parameters. The parametrization protocol has been implemented as part of the Q-Force automated workflow and requires only a quantum mechanical Hessian calculation as reference data, hence allowing it to be easily extended to a variety of molecular systems beyond water, which we demonstrate on a selection of small molecules with different symmetries.
Collapse
Affiliation(s)
- Selim Sami
- Kenneth S. Pitzer Theory Center and Department of Chemistry, University of California, Berkeley, Berkeley, California 94720, United States
| | - R Allen LaCour
- Kenneth S. Pitzer Theory Center and Department of Chemistry, University of California, Berkeley, Berkeley, California 94720, United States
- Chemical Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States
| | - Joseph P Heindel
- Kenneth S. Pitzer Theory Center and Department of Chemistry, University of California, Berkeley, Berkeley, California 94720, United States
- Chemical Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States
| | - Teresa Head-Gordon
- Kenneth S. Pitzer Theory Center and Department of Chemistry, University of California, Berkeley, Berkeley, California 94720, United States
- Chemical Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States
- Departments of Bioengineering and Chemical and Biomolecular Engineering, University of California, Berkeley, Berkeley, California 94720, United States
| |
Collapse
|
2
|
Chung JK, Brown ML, Popelier PLA. Transferability of Buckingham Parameters for Short-Range Repulsion between Topological Atoms. J Phys Chem A 2024; 128:4561-4572. [PMID: 38805440 PMCID: PMC11163427 DOI: 10.1021/acs.jpca.4c02048] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2024] [Revised: 05/16/2024] [Accepted: 05/20/2024] [Indexed: 05/30/2024]
Abstract
The repulsive part of the Buckingham potential, with parameters A and B, can be used to model deformation energies and steric energies. Both are calculated using the interacting quantum atom energy decomposition scheme where the latter is obtained from the former by a charge-transfer-based energy correction. These energies relate to short-range interactions, specifically the deformation of electron density and steric hindrance, respectively, when topological atoms approach each other. In this work, we calculate and fit the energies of carbonyl carbon, carbonyl oxygen, and, where possible, amine nitrogen atoms to the repulsive part of the Buckingham potential for 26 molecules. We find that while the steric energies of all atom pairs studied display exponential behavior with respect to distance, some deformation energy data do not. The obtained parameters are shown to be transferable by calculating root-mean-square errors of fitted potentials with respect to energy data of the same atom in, as far as possible, all other molecules from our data set. We observed that 36% and 10% of these errors were smaller than 4 kJ mol-1 for steric and deformation energy, respectively. Thus, we find that steric energy parameters are more transferable than deformation energy parameters. Finally, we speculate about the physical meaning of the A and B parameters and the implications of the strong exponential and exponential-linear piecewise relationships that we observe between them.
Collapse
Affiliation(s)
- Jaiming
J. K. Chung
- Department of Chemistry, The University of Manchester, Oxford Road, Manchester M13 9PL, Great
Britain
| | - Matthew L. Brown
- Department of Chemistry, The University of Manchester, Oxford Road, Manchester M13 9PL, Great
Britain
| | - Paul L. A. Popelier
- Department of Chemistry, The University of Manchester, Oxford Road, Manchester M13 9PL, Great
Britain
| |
Collapse
|
3
|
Manchev YT, Popelier PLA. FFLUX molecular simulations driven by atomic Gaussian process regression models. J Comput Chem 2024; 45:1235-1246. [PMID: 38345165 DOI: 10.1002/jcc.27323] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Revised: 12/14/2023] [Accepted: 01/16/2024] [Indexed: 04/19/2024]
Abstract
Machine learning (ML) force fields are revolutionizing molecular dynamics (MD) simulations as they bypass the computational cost associated with ab initio methods but do not sacrifice accuracy in the process. In this work, the GPyTorch library is used to create Gaussian process regression (GPR) models that are interfaced with the next-generation ML force field FFLUX. These models predict atomic properties of different molecular configurations that appear in a progressing MD simulation. An improved kernel function is utilized to correctly capture the periodicity of the input descriptors. The first FFLUX molecular simulations of ammonia, methanol, and malondialdehyde with the updated kernel are performed. Geometry optimizations with the GPR models result in highly accurate final structures with a maximum root-mean-squared deviation of 0.064 Å and sub-kJ mol-1 total energy predictions. Additionally, the models are tested in 298 K MD simulations with FFLUX to benchmark for robustness. The resulting energy and force predictions throughout the simulation are in excellent agreement with ab initio data for ammonia and methanol but decrease in quality for malondialdehyde due to the increased system complexity. GPR model improvements are discussed, which will ensure the future scalability to larger systems.
Collapse
Affiliation(s)
- Yulian T Manchev
- Department of Chemistry, The University of Manchester, Manchester, Great Britain
| | - Paul L A Popelier
- Department of Chemistry, The University of Manchester, Manchester, Great Britain
| |
Collapse
|
4
|
Gallegos M, Vassilev-Galindo V, Poltavsky I, Martín Pendás Á, Tkatchenko A. Explainable chemical artificial intelligence from accurate machine learning of real-space chemical descriptors. Nat Commun 2024; 15:4345. [PMID: 38773090 DOI: 10.1038/s41467-024-48567-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2023] [Accepted: 04/24/2024] [Indexed: 05/23/2024] Open
Abstract
Machine-learned computational chemistry has led to a paradoxical situation in which molecular properties can be accurately predicted, but they are difficult to interpret. Explainable AI (XAI) tools can be used to analyze complex models, but they are highly dependent on the AI technique and the origin of the reference data. Alternatively, interpretable real-space tools can be employed directly, but they are often expensive to compute. To address this dilemma between explainability and accuracy, we developed SchNet4AIM, a SchNet-based architecture capable of dealing with local one-body (atomic) and two-body (interatomic) descriptors. The performance of SchNet4AIM is tested by predicting a wide collection of real-space quantities ranging from atomic charges and delocalization indices to pairwise interaction energies. The accuracy and speed of SchNet4AIM breaks the bottleneck that has prevented the use of real-space chemical descriptors in complex systems. We show that the group delocalization indices, arising from our physically rigorous atomistic predictions, provide reliable indicators of supramolecular binding events, thus contributing to the development of Explainable Chemical Artificial Intelligence (XCAI) models.
Collapse
Affiliation(s)
- Miguel Gallegos
- Department of Analytical and Physical Chemistry, University of Oviedo, E-33006, Oviedo, Spain
| | | | - Igor Poltavsky
- Department of Physics and Materials Science, University of Luxembourg, L-1511, Luxembourg City, Luxembourg
| | - Ángel Martín Pendás
- Department of Analytical and Physical Chemistry, University of Oviedo, E-33006, Oviedo, Spain.
| | - Alexandre Tkatchenko
- Department of Physics and Materials Science, University of Luxembourg, L-1511, Luxembourg City, Luxembourg.
| |
Collapse
|
5
|
Brown M, Skelton JM, Popelier PLA. Application of the FFLUX Force Field to Molecular Crystals: A Study of Formamide. J Chem Theory Comput 2023; 19:7946-7959. [PMID: 37847867 PMCID: PMC10653110 DOI: 10.1021/acs.jctc.3c00578] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2023] [Indexed: 10/19/2023]
Abstract
In this work, we present the first application of the quantum chemical topology force field FFLUX to the solid state. FFLUX utilizes Gaussian process regression machine learning models trained on data from the interacting quantum atom partitioning scheme to predict atomic energies and flexible multipole moments that change with geometry. Here, the ambient (α) and high-pressure (β) polymorphs of formamide are used as test systems and optimized using FFLUX. Optimizing the structures with increasing multipolar ranks indicates that the lattice parameters of the α phase differ by less than 5% to the experimental structure when multipole moments up to the quadrupole are used. These differences are found to be in line with the dispersion-corrected density functional theory. Lattice dynamics calculations are also found to be possible using FFLUX, yielding harmonic phonon spectra comparable to dispersion-corrected DFT while enabling larger supercells to be considered than is typically possible with first-principles calculations. These promising results indicate that FFLUX can be used to accurately determine properties of molecular solids that are difficult to access using DFT, including the structural dynamics, free energies, and properties at finite temperature.
Collapse
Affiliation(s)
- Matthew
L. Brown
- Department of Chemistry, The University of Manchester, Oxford Road, Manchester M13 9PL, Britain
| | - Jonathan M. Skelton
- Department of Chemistry, The University of Manchester, Oxford Road, Manchester M13 9PL, Britain
| | - Paul L. A. Popelier
- Department of Chemistry, The University of Manchester, Oxford Road, Manchester M13 9PL, Britain
| |
Collapse
|
6
|
Herman KM, Stone AJ, Xantheas SS. Accurate Calculation of Many-Body Energies in Water Clusters Using a Classical Geometry-Dependent Induction Model. J Chem Theory Comput 2023; 19:6805-6815. [PMID: 37703063 DOI: 10.1021/acs.jctc.3c00575] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/14/2023]
Abstract
We incorporate geometry-dependent distributed multipole and polarizability surfaces into an induction model that is used to describe the 3- and 4-body terms of the interaction between water molecules. The moment expansion is carried out up to the hexadecapole with the multipoles distributed on the atom sites. Dipole-dipole, dipole-quadrupole, and quadrupole-quadrupole distributed polarizabilities are used to represent the response of the multipoles to an electric field. We compare the model against two large databases consisting of 43,844 3-body terms and 3,603 4-body terms obtained from high level ab initio calculations previously used to fit the MB-pol and q-AQUA classical interaction potentials for water. The classical induction model with no adjustable parameters reproduces the ab initio 3-/4-body terms contained in these two databases with a root-mean-square error (RMSE) of 0.104/0.058 and a mean-absolute error (MAE) of 0.054/0.026 kcal/mol, respectively. These results are on par with the ones obtained by fitting the same data using over 14,000 (for the 3-body) and 200 (for the 4-body) parameters via Permutationally Invariant Polynomials (PIPs). This demonstrates the accuracy of this physically motivated model in describing the 3- and 4-body terms in the interactions between water molecules with no adjustable parameters. The triple-dipole-dispersion energy, included in the calculation of the 3-body energy, was found to be small but not quite negligible. The model represents a practical, efficient, and transferable approach for obtaining accurate nonadditive interactions for multicomponent systems without the need to perform tens of thousands of high level electronic structure calculations and fitting them with PIPs.
Collapse
Affiliation(s)
- Kristina M Herman
- Department of Chemistry, University of Washington, Seattle, Washington 98185, United States
| | - Anthony J Stone
- Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, U.K
| | - Sotiris S Xantheas
- Department of Chemistry, University of Washington, Seattle, Washington 98185, United States
- Advanced Computing Mathematics and Data Division, Pacific Northwest National Laboratory, 902 Battelle Boulevard, P.O. Box 999, MSIN J7-10, Richland, Washington 99352, United States
| |
Collapse
|
7
|
Tokita AM, Behler J. How to train a neural network potential. J Chem Phys 2023; 159:121501. [PMID: 38127396 DOI: 10.1063/5.0160326] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Accepted: 07/24/2023] [Indexed: 12/23/2023] Open
Abstract
The introduction of modern Machine Learning Potentials (MLPs) has led to a paradigm change in the development of potential energy surfaces for atomistic simulations. By providing efficient access to energies and forces, they allow us to perform large-scale simulations of extended systems, which are not directly accessible by demanding first-principles methods. In these simulations, MLPs can reach the accuracy of electronic structure calculations, provided that they have been properly trained and validated using a suitable set of reference data. Due to their highly flexible functional form, the construction of MLPs has to be done with great care. In this Tutorial, we describe the necessary key steps for training reliable MLPs, from data generation via training to final validation. The procedure, which is illustrated for the example of a high-dimensional neural network potential, is general and applicable to many types of MLPs.
Collapse
Affiliation(s)
- Alea Miako Tokita
- Lehrstuhl für Theoretische Chemie II, Ruhr-Universität Bochum, 44780 Bochum, Germany and Research Center Chemical Sciences and Sustainability, Research Alliance Ruhr, 44780 Bochum, Germany
| | - Jörg Behler
- Lehrstuhl für Theoretische Chemie II, Ruhr-Universität Bochum, 44780 Bochum, Germany and Research Center Chemical Sciences and Sustainability, Research Alliance Ruhr, 44780 Bochum, Germany
| |
Collapse
|
8
|
Triestram L, Falcioni F, Popelier PLA. Interacting Quantum Atoms and Multipolar Electrostatic Study of XH···π Interactions. ACS OMEGA 2023; 8:34844-34851. [PMID: 37779962 PMCID: PMC10535255 DOI: 10.1021/acsomega.3c04149] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/12/2023] [Accepted: 08/31/2023] [Indexed: 10/03/2023]
Abstract
The interaction energies of nine XH···π (X = C, N, and O) benzene-containing van der Waals complexes were analyzed, at the atomic and fragment levels, using QTAIM multipolar electrostatics and the energy partitioning method interacting quantum atoms/fragment (IQA/IQF). These descriptors were paired with the relative energy gradient method, which solidifies the connection between quantum mechanical properties and chemical interpretation. This combination provides a precise understanding, both qualitative and quantitative, of the nature of these interactions, which are ubiquitous in biochemical systems. The formation of the OH···π and NH···π systems is electrostatically driven, with the Qzz component of the quadrupole moment of the benzene carbons interacting with the charges of X and H in XH. There is the unexpectedly intramonomeric role of X-H (X = O, N) where its electrostatic energy helps the formation of the complex and its covalent energy thwarts it. However, the CH···π interaction is governed by exchange-correlation energies, thereby establishing a covalent character, as opposed to the literature's designation as a noncovalent interaction. Moreover, dispersion energy is relevant, statically and in absolute terms, but less relevant compared to other energy components in terms of the formation of the complex. Multipolar electrostatics are similar across all systems.
Collapse
Affiliation(s)
- Lena Triestram
- Department of Chemistry, University
of Manchester, Manchester M13 9PL, Great
Britain
| | - Fabio Falcioni
- Department of Chemistry, University
of Manchester, Manchester M13 9PL, Great
Britain
| | - Paul L. A. Popelier
- Department of Chemistry, University
of Manchester, Manchester M13 9PL, Great
Britain
| |
Collapse
|
9
|
Burn M, Popelier PLA. Gaussian Process Regression Models for Predicting Atomic Energies and Multipole Moments. J Chem Theory Comput 2023; 19:1370-1380. [PMID: 36757024 PMCID: PMC9979601 DOI: 10.1021/acs.jctc.2c00731] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/10/2023]
Abstract
Developing a force field is a difficult task because its design is typically pulled in opposite directions by speed and accuracy. FFLUX breaks this trend by utilizing Gaussian process regression (GPR) to predict, at ab initio accuracy, atomic energies and multipole moments as obtained from the quantum theory of atoms in molecules (QTAIM). This work demonstrates that the in-house FFLUX training pipeline can generate successful GPR models for six representative molecules: peptide-capped glycine and alanine, glucose, paracetamol, aspirin, and ibuprofen. The molecules were sufficiently distorted to represent configurations from an AMBER-GAFF2 molecular dynamics run. All internal degrees of freedom were covered corresponding to 93 dimensions in the case of the largest molecule ibuprofen (33 atoms). Benefiting from active learning, the GPR models contain only about 2000 training points and return largely sub-kcal mol-1 prediction errors for the validation sets. A proof of concept has been reached for transferring the model produced through active learning on one atomic property to that of the remaining atomic properties. The prediction of electrostatic interaction can be assessed at the intermolecular level, and the vast majority of interactions have a root-mean-square error of less than 0.1 kJ mol-1 with a maximum value of ∼1 kJ mol-1 for a glycine and paracetamol dimer.
Collapse
|
10
|
Brown M, Skelton JM, Popelier PLA. Construction of a Gaussian Process Regression Model of Formamide for Use in Molecular Simulations. J Phys Chem A 2023; 127:1702-1714. [PMID: 36756842 PMCID: PMC9969515 DOI: 10.1021/acs.jpca.2c06566] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/10/2023]
Abstract
FFLUX, a novel force field based on quantum chemical topology, can perform molecular dynamics simulations with flexible multipole moments that change with geometry. This is enabled by Gaussian process regression machine learning models, which accurately predict atomic energies and multipole moments up to the hexadecapole. We have constructed a model of the formamide monomer at the B3LYP/aug-cc-pVTZ level of theory capable of sub-kJ mol-1 accuracy, with the maximum prediction error for the molecule being 0.8 kJ mol-1. This model was used in FFLUX simulations along with Lennard-Jones parameters to successfully optimize the geometry of formamide dimers with errors smaller than 0.1 Å compared to those obtained with D3-corrected B3LYP/aug-cc-pVTZ. Comparisons were also made to a force field constructed with static multipole moments and Lennard-Jones parameters. FFLUX recovers the expected energy ranking of dimers compared to the literature, and changes in C═O and C-N bond lengths associated with hydrogen bonding were found to be consistent with density functional theory.
Collapse
|
11
|
Alkorta I, Popelier PLA. Linking the Interatomic Exchange-Correlation Energy to Experimental J-Coupling Constants. J Phys Chem A 2023; 127:468-476. [PMID: 36608277 PMCID: PMC9869393 DOI: 10.1021/acs.jpca.2c07693] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
The main aim of the current work is to find an experimental connection to the interatomic exchange-correlation energy as defined by the energy decomposition method Interacting Quantum Atoms (IQA). A suitable candidate as (essentially) experimental quantity is the nuclear magnetic resonance (NMR) J-coupling constant denoted 3J(H,H'), which a number of previous studies showed to correlate well with QTAIM's delocalization index (DI), which is essentially a bond order. Inspired by Karplus equations, here, we investigate correlations between 3J(H,H') and a relevant dihedral angle in six simple initial compounds of the shape H3C-YHn (Y = C, N, O, Si, P, and S), N-methylacetamide (as prototype of the peptide bond), and five peptide-capped amino acids (Gly, Ala, Val, Ile, and Leu) because of the protein direction of the force field FFLUX. In conclusion, except for methanol, the inter-hydrogen exchange-correlation energy Vxc(H,H') makes the best contact with experiment, through 3J(H,H'), when multiplied with the internuclear distance RHH'.
Collapse
Affiliation(s)
- Ibon Alkorta
- Instituto
de Química Médica (CSIC), Juan de la Cierva, 3, Madrid 28006, Spain
| | - Paul L. A. Popelier
- Department
of Chemistry, University of Manchester, Manchester M13 9PL, U.K.,
| |
Collapse
|
12
|
Duarte LJ, Bruns RE. Energetic Origins of Force Constants: Adding a New Dimension to the Hessian Matrix via Interacting Quantum Atoms. J Phys Chem A 2022; 126:8945-8954. [DOI: 10.1021/acs.jpca.2c05798] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Affiliation(s)
- Leonardo J. Duarte
- Institute of Chemistry, State University of Campinas.Campinas, SP13083-970, Brazil
| | - Roy. E. Bruns
- Institute of Chemistry, State University of Campinas.Campinas, SP13083-970, Brazil
| |
Collapse
|