1
|
Manchev YT, Popelier PLA. FFLUX molecular simulations driven by atomic Gaussian process regression models. J Comput Chem 2024; 45:1235-1246. [PMID: 38345165 DOI: 10.1002/jcc.27323] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Revised: 12/14/2023] [Accepted: 01/16/2024] [Indexed: 04/19/2024]
Abstract
Machine learning (ML) force fields are revolutionizing molecular dynamics (MD) simulations as they bypass the computational cost associated with ab initio methods but do not sacrifice accuracy in the process. In this work, the GPyTorch library is used to create Gaussian process regression (GPR) models that are interfaced with the next-generation ML force field FFLUX. These models predict atomic properties of different molecular configurations that appear in a progressing MD simulation. An improved kernel function is utilized to correctly capture the periodicity of the input descriptors. The first FFLUX molecular simulations of ammonia, methanol, and malondialdehyde with the updated kernel are performed. Geometry optimizations with the GPR models result in highly accurate final structures with a maximum root-mean-squared deviation of 0.064 Å and sub-kJ mol-1 total energy predictions. Additionally, the models are tested in 298 K MD simulations with FFLUX to benchmark for robustness. The resulting energy and force predictions throughout the simulation are in excellent agreement with ab initio data for ammonia and methanol but decrease in quality for malondialdehyde due to the increased system complexity. GPR model improvements are discussed, which will ensure the future scalability to larger systems.
Collapse
Affiliation(s)
- Yulian T Manchev
- Department of Chemistry, The University of Manchester, Manchester, Great Britain
| | - Paul L A Popelier
- Department of Chemistry, The University of Manchester, Manchester, Great Britain
| |
Collapse
|
2
|
Gallegos M, Isamura BK, Popelier PLA, Martín Pendás Á. An Unsupervised Machine Learning Approach for the Automatic Construction of Local Chemical Descriptors. J Chem Inf Model 2024; 64:3059-3079. [PMID: 38498942 DOI: 10.1021/acs.jcim.3c01906] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/20/2024]
Abstract
Condensing the many physical variables defining a chemical system into a fixed-size array poses a significant challenge in the development of chemical Machine Learning (ML). Atom Centered Symmetry Functions (ACSFs) offer an intuitive featurization approach by means of a tedious and labor-intensive selection of tunable parameters. In this work, we implement an unsupervised ML strategy relying on a Gaussian Mixture Model (GMM) to automatically optimize the ACSF parameters. GMMs effortlessly decompose the vastness of the chemical and conformational spaces into well-defined radial and angular clusters, which are then used to build tailor-made ACSFs. The unsupervised exploration of the space has demonstrated general applicability across a diverse range of systems, spanning from various unimolecular landscapes to heterogeneous databases. The impact of the sampling technique and temperature on space exploration is also addressed, highlighting the particularly advantageous role of high-temperature Molecular Dynamics (MD) simulations. The reliability of the resulting features is assessed through the estimation of the atomic charges of a prototypical capped amino acid and a heterogeneous collection of CHON molecules. The automatically constructed ACSFs serve as high-quality descriptors, consistently yielding typical prediction errors below 0.010 electrons bound for the reported atomic charges. Altering the spatial distribution of the functions with respect to the cluster highlights the critical role of symmetry rupture in achieving significantly improved features. More specifically, using two separate functions to describe the lower and upper tails of the cluster results in the best performing models with errors as low as 0.006 electrons. Finally, the effectiveness of finely tuned features was checked across different architectures, unveiling the superior performance of Gaussian Process (GP) models over Feed Forward Neural Networks (FFNNs), particularly in low-data regimes, with nearly a 2-fold increase in prediction quality. Altogether, this approach paves the way toward an easier construction of local chemical descriptors, while providing valuable insights into how radial and angular spaces should be mapped. Finally, this work opens the possibility of encoding many-body information beyond angular terms into upcoming ML features.
Collapse
Affiliation(s)
- Miguel Gallegos
- Department of Analytical and Physical Chemistry, University of Oviedo, Oviedo E-33006, Spain
| | | | - Paul L A Popelier
- Department of Chemistry, The University of Manchester, Oxford Road, Manchester M13 9PL, U.K
| | - Ángel Martín Pendás
- Department of Analytical and Physical Chemistry, University of Oviedo, Oviedo E-33006, Spain
| |
Collapse
|
3
|
Burn M, Popelier PLA. Gaussian Process Regression Models for Predicting Atomic Energies and Multipole Moments. J Chem Theory Comput 2023; 19:1370-1380. [PMID: 36757024 PMCID: PMC9979601 DOI: 10.1021/acs.jctc.2c00731] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/10/2023]
Abstract
Developing a force field is a difficult task because its design is typically pulled in opposite directions by speed and accuracy. FFLUX breaks this trend by utilizing Gaussian process regression (GPR) to predict, at ab initio accuracy, atomic energies and multipole moments as obtained from the quantum theory of atoms in molecules (QTAIM). This work demonstrates that the in-house FFLUX training pipeline can generate successful GPR models for six representative molecules: peptide-capped glycine and alanine, glucose, paracetamol, aspirin, and ibuprofen. The molecules were sufficiently distorted to represent configurations from an AMBER-GAFF2 molecular dynamics run. All internal degrees of freedom were covered corresponding to 93 dimensions in the case of the largest molecule ibuprofen (33 atoms). Benefiting from active learning, the GPR models contain only about 2000 training points and return largely sub-kcal mol-1 prediction errors for the validation sets. A proof of concept has been reached for transferring the model produced through active learning on one atomic property to that of the remaining atomic properties. The prediction of electrostatic interaction can be assessed at the intermolecular level, and the vast majority of interactions have a root-mean-square error of less than 0.1 kJ mol-1 with a maximum value of ∼1 kJ mol-1 for a glycine and paracetamol dimer.
Collapse
|
4
|
Burn MJ, Popelier PLA. Producing chemically accurate atomic Gaussian process regression models by active learning for molecular simulation. J Comput Chem 2022; 43:2084-2098. [PMID: 36165338 PMCID: PMC9828508 DOI: 10.1002/jcc.27006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2022] [Revised: 08/20/2022] [Accepted: 08/24/2022] [Indexed: 01/12/2023]
Abstract
Machine learning is becoming increasingly more important in the field of force field development. Never has it been more vital to have chemically accurate machine learning potentials because force fields become more sophisticated and their applications expand. In this study a method for developing chemically accurate Gaussian process regression models is demonstrated for an increasingly complex set of molecules. This work is an extension to previous work showing the progression of the active learning technique in producing more accurate models in much less CPU time than ever before. The per-atom active learning approach has unlocked the potential to generate chemically accurate models for molecules such as peptide-capped glycine.
Collapse
Affiliation(s)
- Matthew J. Burn
- Manchester Institute of BiotechnologyThe University of ManchesterManchesterUK,Department of ChemistryThe University of ManchesterManchesterUK
| | - Paul L. A. Popelier
- Manchester Institute of BiotechnologyThe University of ManchesterManchesterUK,Department of ChemistryThe University of ManchesterManchesterUK
| |
Collapse
|
5
|
Yuan Y, Yan H, Cui Z, Liu Z, Su W, Zhang R. Quantum Chemical Calculations with Machine Learning for Multipolar Electrostatics Prediction in RNA: An Application to Pentose. J Chem Inf Model 2022; 62:4122-4133. [PMID: 36036609 DOI: 10.1021/acs.jcim.2c00747] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
To develop a realistic electrostatic model that allows for the anisotropy of the atomic electron density, high-rank atomic multipole moments computed by quantum chemical calculations have been studied extensively. However, it is hard to process huge RNA systems only relying on quantum chemical calculations due to its highly computational cost. In this study, we employ five machine learning methods of Gaussian process regression with automatic relevance determination (ARDGPR), Kriging, radial basis function neural networks, Bagging, and generalized regression neural network to predict atomic multipole moments. Atom-atom electrostatic interaction energies are subsequently computed using the predicted atomic multipole moments in the pilot system pentose of RNA. Here, the performance of the five methods is compared in terms of both the multipole moment prediction errors and the electrostatic energy prediction errors. For the predicted high-rank multipole moments of the four elements (O, C, N, and H) in capped pentose, ARDGPR and Kriging consistently outperform the other three methods. Therefore, the multipole moments predicted by the two best methods of ARDGPR and Kriging are then used to predict electrostatic interaction energy of each pentose. Finally, the absolute average energy errors of ARDGPR and Kriging are 1.83 and 4.33 kJ mol-1, respectively. Compared to Kriging, the ARDGPR method achieves a 58% decrease in the absolute average energy error. These satisfactory results demonstrated that the ARDGPR method with the strong feature extraction ability can predict the electrostatic interaction energy of pentose in RNA correctly and reliably.
Collapse
Affiliation(s)
- Yongna Yuan
- School of Information Science & Engineering, Lanzhou University, Lanzhou, China, 730000
| | - Haoqiu Yan
- School of Information Science & Engineering, Lanzhou University, Lanzhou, China, 730000
| | - Zeyang Cui
- School of Information Science & Engineering, Lanzhou University, Lanzhou, China, 730000
| | - Zhenyu Liu
- School of Cyberspace Security, Gansu University of Political Science and Law, Lanzhou, China, 730070
| | - Wei Su
- School of Information Science & Engineering, Lanzhou University, Lanzhou, China, 730000
| | - Ruisheng Zhang
- School of Information Science & Engineering, Lanzhou University, Lanzhou, China, 730000
| |
Collapse
|
6
|
Abstract
We review different models for introducing electric polarization in force fields, with special focus on methods where polarization is modelled at the atomic charge level. While electric polarization has been included in several force fields, the common approach has been to focus on atomic dipole polarizability. Several approaches allow modelling electric polarization by using charge-flow between charge sites instead, but this has been less exploited, despite that atomic charges and charge-flow is expected to be more important than atomic dipoles and dipole polarizability. A number of challenges are required to be solved for charge-flow models to be incorporated into polarizable force fields, for example how to parameterize the models and how to make them computational efficient.
Collapse
Affiliation(s)
- Frank Jensen
- Department of Chemistry, Aarhus University, Denmark.
| |
Collapse
|
7
|
Symons BCB, Bane MK, Popelier PLA. DL_FFLUX: A Parallel, Quantum Chemical Topology Force Field. J Chem Theory Comput 2021; 17:7043-7055. [PMID: 34617748 PMCID: PMC8582247 DOI: 10.1021/acs.jctc.1c00595] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
![]()
DL_FFLUX is a force
field based on quantum chemical topology that
can perform molecular dynamics for flexible molecules endowed with
polarizable atomic multipole moments (up to hexadecapole). Using the
machine learning method kriging (aka Gaussian process regression),
DL_FFLUX has access to atomic properties (energy, charge, dipole moment,
etc.) with quantum mechanical accuracy. Newly optimized and parallelized
using domain decomposition Message Passing Interface (MPI), DL_FFLUX
is now able to deliver this rigorous methodology at scale while still
in reasonable time frames. DL_FFLUX is delivered as an add-on to the
widely distributed molecular dynamics code DL_POLY 4.08. For the systems
studied here (103–105 atoms), DL_FFLUX
is shown to add minimal computational cost to the standard DL_POLY
package. In fact, the optimization of the electrostatics in DL_FFLUX
means that, when high-rank multipole moments are enabled, DL_FFLUX
is up to 1.25× faster than standard DL_POLY. The parallel DL_FFLUX
preserves the quality of the scaling of MPI implementation in standard
DL_POLY. For the first time, it is feasible to use the full capability
of DL_FFLUX to study systems that are large enough to be of real-world
interest. For example, a fully flexible, high-rank polarized (up to
and including quadrupole moments) 1 ns simulation of a system of 10 125
atoms (3375 water molecules) takes 30 h (wall time) on 18 cores.
Collapse
Affiliation(s)
- Benjamin C B Symons
- Manchester Institute of Biotechnology (MIB), 131 Princess Street, Manchester M1 7DN, Great Britain.,Department of Chemistry, University of Manchester, Oxford Road, Manchester M13 9PL, Great Britain
| | - Michael K Bane
- High End Compute LTD, 23 Welby Street, Manchester M13 0EL, Great Britainhttps://highendcompute.co.uk.,Department of Computing and Mathematics, Manchester Metropolitan University, Manchester M15 6BH, Great Britain
| | - Paul L A Popelier
- Manchester Institute of Biotechnology (MIB), 131 Princess Street, Manchester M1 7DN, Great Britain.,Department of Chemistry, University of Manchester, Oxford Road, Manchester M13 9PL, Great Britain
| |
Collapse
|
8
|
Friederich P, Häse F, Proppe J, Aspuru-Guzik A. Machine-learned potentials for next-generation matter simulations. NATURE MATERIALS 2021; 20:750-761. [PMID: 34045696 DOI: 10.1038/s41563-020-0777-6] [Citation(s) in RCA: 108] [Impact Index Per Article: 36.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/21/2019] [Accepted: 07/17/2020] [Indexed: 05/18/2023]
Abstract
The choice of simulation methods in computational materials science is driven by a fundamental trade-off: bridging large time- and length-scales with highly accurate simulations at an affordable computational cost. Venturing the investigation of complex phenomena on large scales requires fast yet accurate computational methods. We review the emerging field of machine-learned potentials, which promises to reach the accuracy of quantum mechanical computations at a substantially reduced computational cost. This Review will summarize the basic principles of the underlying machine learning methods, the data acquisition process and active learning procedures. We highlight multiple recent applications of machine-learned potentials in various fields, ranging from organic chemistry and biomolecules to inorganic crystal structure predictions and surface science. We furthermore discuss the developments required to promote a broader use of ML potentials, and the possibility of using them to help solve open questions in materials science and facilitate fully computational materials design.
Collapse
Affiliation(s)
- Pascal Friederich
- Chemical Physics Theory Group, Department of Chemistry, University of Toronto, Toronto, Ontario, Canada
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
- Institute of Nanotechnology, Karlsruhe Institute of Technology, Eggenstein-Leopoldshafen, Germany
- Institute of Theoretical Informatics, Karlsruhe Institute of Technology, Karlsruhe, Germany
| | - Florian Häse
- Chemical Physics Theory Group, Department of Chemistry, University of Toronto, Toronto, Ontario, Canada
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
- Vector Institute for Artificial Intelligence, Toronto, Ontario, Canada
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA, USA
| | - Jonny Proppe
- Chemical Physics Theory Group, Department of Chemistry, University of Toronto, Toronto, Ontario, Canada
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
- Institute of Physical Chemistry, Georg-August University, Göttingen, Germany
| | - Alán Aspuru-Guzik
- Chemical Physics Theory Group, Department of Chemistry, University of Toronto, Toronto, Ontario, Canada.
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada.
- Vector Institute for Artificial Intelligence, Toronto, Ontario, Canada.
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA, USA.
- Lebovic Fellow, Canadian Institute for Advanced Research (CIFAR), Toronto, Ontario, Canada.
| |
Collapse
|
9
|
Yuan Y, Ma Y, Huo D, Mills MJL, Wei J, Su W, Zhang R. Multipolar Description of Atom-Atom Electrostatic Interaction Energies in Single/Double-Stranded DNAs. J Phys Chem B 2020; 124:10089-10103. [PMID: 33138384 DOI: 10.1021/acs.jpcb.0c06757] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Molecular force field simulation is an effective method to explore the properties of DNA molecules in depth. Almost all current popular force fields calculate atom-atom electrostatic interaction energies for DNAs based on the atomic charge and dipole or quadrupole moments, without considering high-rank atomic multipole moments for more accurate electrostatics. Actually, the distribution of electrons around atomic nuclei is not spherically symmetric but is geometry dependent. In this work, a multipole expansion method that allows us to combine polarizability and anisotropy was applied. One single-stranded DNA and one double-stranded DNA were selected as pilot systems. Deoxynucleotides were cut out from pilot systems and capped by mimicking the original DNA environment. Atomic multipole moments were integrated instead of fixed-point charges to calculate atom-atom electrostatic energies to improve the accuracy of force fields for DNA simulations. Also, the applicability of modeling the behavior of both single-stranded and double-stranded DNAs was investigated. The calculation results indicated that the models can be transferred from pilot systems to test systems, which is of great significance for the development of future DNA force fields.
Collapse
Affiliation(s)
- Yongna Yuan
- School of Information Science & Engineering, Lanzhou University, No. 222 South Tianshui Road, Lanzhou 730000, China
| | - Yan Ma
- School of Information Science & Engineering, Lanzhou University, No. 222 South Tianshui Road, Lanzhou 730000, China
| | - Dongxu Huo
- School of Information Science & Engineering, Lanzhou University, No. 222 South Tianshui Road, Lanzhou 730000, China
| | - Matthew J L Mills
- 3M Corporate Research Analytical Laboratory, Saint Paul, Minnesota 55114, United States
| | - Jiaxuan Wei
- School of Information Science & Engineering, Lanzhou University, No. 222 South Tianshui Road, Lanzhou 730000, China
| | - Wei Su
- School of Information Science & Engineering, Lanzhou University, No. 222 South Tianshui Road, Lanzhou 730000, China
| | - Ruisheng Zhang
- School of Information Science & Engineering, Lanzhou University, No. 222 South Tianshui Road, Lanzhou 730000, China
| |
Collapse
|
10
|
Konovalov A, Symons BCB, Popelier PLA. On the many-body nature of intramolecular forces in FFLUX and its implications. J Comput Chem 2020; 42:107-116. [PMID: 33107993 DOI: 10.1002/jcc.26438] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2020] [Revised: 09/28/2020] [Accepted: 09/28/2020] [Indexed: 12/24/2022]
Abstract
FFLUX is a biomolecular force field under construction, based on Quantum Chemical Topology (QCT) and machine learning (kriging), with a minimalistic and physically motivated design. A detailed analysis of the forces within the kriging models as treated in FFLUX is presented, taking as a test example a liquid water model. The energies of topological atoms are modeled as 3Natoms -6 dimensional potential energy surfaces, using atomic local frames to represent the internal degrees of freedom. As a result, the forces within the kriging models in FFLUX are inherently N-body in nature where N refers to Natoms . This provides a fuller picture that is closer to a true quantum mechanical representation of interactions between atoms. The presented computational example quantitatively showcases the non-negligible (as much as 9%) three-body nature of bonded forces and angular forces in a water molecule. We discuss the practical impact on the pressure calculation with N-body forces and periodic boundary conditions (PBC) in molecular dynamics, as opposed to classical force fields with two-body forces. The equivalence between the PBC-related correction terms in the general virial equation is shown mathematically.
Collapse
Affiliation(s)
- Anton Konovalov
- Manchester Institute of Biotechnology (MIB), Manchester, United Kingdom.,Department of Chemistry, University of Manchester, Manchester, United Kingdom
| | - Benjamin C B Symons
- Manchester Institute of Biotechnology (MIB), Manchester, United Kingdom.,Department of Chemistry, University of Manchester, Manchester, United Kingdom
| | - Paul L A Popelier
- Manchester Institute of Biotechnology (MIB), Manchester, United Kingdom.,Department of Chemistry, University of Manchester, Manchester, United Kingdom
| |
Collapse
|
11
|
Burn MJ, Popelier PLA. Creating Gaussian process regression models for molecular simulations using adaptive sampling. J Chem Phys 2020; 153:054111. [DOI: 10.1063/5.0017887] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Affiliation(s)
- Matthew J. Burn
- Manchester Institute of Biotechnology, The University of Manchester, Manchester M1 7DN, United Kingdom and Department of Chemistry, The University of Manchester, Manchester M13 9PL, United Kingdom
| | - Paul L. A. Popelier
- Manchester Institute of Biotechnology, The University of Manchester, Manchester M1 7DN, United Kingdom and Department of Chemistry, The University of Manchester, Manchester M13 9PL, United Kingdom
| |
Collapse
|
12
|
Vincent MA, Silva AF, Popelier PLA. Atomic Partitioning of the MPn (n = 2, 3, 4) Dynamic Electron Correlation Energy by the Interacting Quantum Atoms Method: A Fast and Accurate Electrostatic Potential Integral Approach. J Comput Chem 2019; 40:2793-2800. [PMID: 31373709 PMCID: PMC6900022 DOI: 10.1002/jcc.26037] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2019] [Revised: 07/10/2019] [Accepted: 07/11/2019] [Indexed: 11/13/2022]
Abstract
Recently, the quantum topological energy partitioning method called interacting quantum atoms (IQA) has been extended to MPn (n = 2, 3, 4) wave functions. This enables the extraction of chemical insight related to dynamic electron correlation. The large computational expense of the IQA-MPn approach is compensated by the advantages that IQA offers compared to older nontopological energy decomposition schemes. This expense is problematic in the construction of a machine learning training set to create kriging models for topological atoms. However, the algorithm presented here markedly accelerates the calculation of atomically partitioned electron correlation energies. Then again, the algorithm cannot calculate pairwise interatomic energies because it applies analytical integrals over whole space (rather than over atomic volumes). However, these pairwise energies are not needed in the quantum topological force field FFLUX, which only uses the energy of an atom interacting with all remaining atoms of the system that it is part of. Thus, it is now feasible to generate accurate and sizeable training sets at MPn level of theory. © 2019 The Authors. Journal of Computational Chemistry published by Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Mark A. Vincent
- Manchester Institute of BiotechnologyThe University of ManchesterManchesterM1 7DNUK
- School of ChemistryThe University of ManchesterManchesterM13 9PLUK
| | - Arnaldo F. Silva
- Manchester Institute of BiotechnologyThe University of ManchesterManchesterM1 7DNUK
- School of ChemistryThe University of ManchesterManchesterM13 9PLUK
| | - Paul L. A. Popelier
- Manchester Institute of BiotechnologyThe University of ManchesterManchesterM1 7DNUK
- School of ChemistryThe University of ManchesterManchesterM13 9PLUK
| |
Collapse
|
13
|
Hughes ZE, Thacker JCR, Wilson AL, Popelier PLA. Description of Potential Energy Surfaces of Molecules Using FFLUX Machine Learning Models. J Chem Theory Comput 2018; 15:116-126. [DOI: 10.1021/acs.jctc.8b00806] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Affiliation(s)
- Zak E. Hughes
- Manchester Institute of Biotechnology, The University of Manchester, Manchester M1 7DN, U.K
- School of Chemistry, The University of Manchester, Manchester M13 9PL, U.K
| | - Joseph C. R. Thacker
- Manchester Institute of Biotechnology, The University of Manchester, Manchester M1 7DN, U.K
- School of Chemistry, The University of Manchester, Manchester M13 9PL, U.K
| | - Alex L. Wilson
- Manchester Institute of Biotechnology, The University of Manchester, Manchester M1 7DN, U.K
- School of Chemistry, The University of Manchester, Manchester M13 9PL, U.K
| | - Paul L. A. Popelier
- Manchester Institute of Biotechnology, The University of Manchester, Manchester M1 7DN, U.K
- School of Chemistry, The University of Manchester, Manchester M13 9PL, U.K
| |
Collapse
|
14
|
Yuan Y, Zhang Z, Mills MJL, Hu R, Zhang R. Assessing Force Field Potential Energy Function Accuracy via a Multipolar Description of Atomic Electrostatic Interactions in RNA. J Chem Inf Model 2018; 58:2239-2254. [PMID: 30362754 DOI: 10.1021/acs.jcim.8b00328] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Computational investigations of RNA properties often rely on a molecular mechanical approach to define molecular potential energy. Force fields for RNA typically employ a point charge model of electrostatics, which does not provide a realistic quantum-mechanical picture. In reality, electron distributions around nuclei are not spherically symmetric and are geometry dependent. A multipole expansion method which allows for incorporation of polarizability and anisotropy in a force field is described, and its applicability to modeling the behavior of RNA molecules is investigated. Transferability of the model, critical for force field development, is also investigated.
Collapse
Affiliation(s)
- Yongna Yuan
- School of Information Science & Engineering , Lanzhou University , Lanzhou , Gansu 730000 , China
| | - Zhuangzhuang Zhang
- School of Information Science & Engineering , Lanzhou University , Lanzhou , Gansu 730000 , China
| | | | - Rongjing Hu
- School of Information Science & Engineering , Lanzhou University , Lanzhou , Gansu 730000 , China
| | - Ruisheng Zhang
- School of Information Science & Engineering , Lanzhou University , Lanzhou , Gansu 730000 , China
| |
Collapse
|
15
|
Geometry Optimization with Machine Trained Topological Atoms. Sci Rep 2017; 7:12817. [PMID: 28993674 PMCID: PMC5634454 DOI: 10.1038/s41598-017-12600-3] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2017] [Accepted: 09/06/2017] [Indexed: 11/19/2022] Open
Abstract
The geometry optimization of a water molecule with a novel type of energy function called FFLUX is presented, which bypasses the traditional bonded potentials. Instead, topologically-partitioned atomic energies are trained by the machine learning method kriging to predict their IQA atomic energies for a previously unseen molecular geometry. Proof-of-concept that FFLUX’s architecture is suitable for geometry optimization is rigorously demonstrated. It is found that accurate kriging models can optimize 2000 distorted geometries to within 0.28 kJ mol−1 of the corresponding ab initio energy, and 50% of those to within 0.05 kJ mol−1. Kriging models are robust enough to optimize the molecular geometry to sub-noise accuracy, when two thirds of the geometric inputs are outside the training range of that model. Finally, the individual components of the potential energy are analyzed, and chemical intuition is reflected in the independent behavior of the three energy terms \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${E}_{{\rm{intra}}}^{{\rm{A}}}$$\end{document}EintraA(intra-atomic), \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${V}_{{\rm{cl}}}^{\text{AA}\text{'}}$$\end{document}VclAA' (electrostatic) and \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${V}_{{\rm{x}}}^{\text{AA}\text{'}}$$\end{document}VxAA' (exchange), in contrast to standard force fields.
Collapse
|
16
|
Cardamone S, Popelier PLA. Prediction of conformationally dependent atomic multipole moments in carbohydrates. J Comput Chem 2015; 36:2361-73. [PMID: 26547500 PMCID: PMC5031233 DOI: 10.1002/jcc.24215] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2015] [Revised: 08/18/2015] [Accepted: 09/10/2015] [Indexed: 01/18/2023]
Abstract
The conformational flexibility of carbohydrates is challenging within the field of computational chemistry. This flexibility causes the electron density to change, which leads to fluctuating atomic multipole moments. Quantum Chemical Topology (QCT) allows for the partitioning of an "atom in a molecule," thus localizing electron density to finite atomic domains, which permits the unambiguous evaluation of atomic multipole moments. By selecting an ensemble of physically realistic conformers of a chemical system, one evaluates the various multipole moments at defined points in configuration space. The subsequent implementation of the machine learning method kriging delivers the evaluation of an analytical function, which smoothly interpolates between these points. This allows for the prediction of atomic multipole moments at new points in conformational space, not trained for but within prediction range. In this work, we demonstrate that the carbohydrates erythrose and threose are amenable to the above methodology. We investigate how kriging models respond when the training ensemble incorporating multiple energy minima and their environment in conformational space. Additionally, we evaluate the gains in predictive capacity of our models as the size of the training ensemble increases. We believe this approach to be entirely novel within the field of carbohydrates. For a modest training set size of 600, more than 90% of the external test configurations have an error in the total (predicted) electrostatic energy (relative to ab initio) of maximum 1 kJ mol(-1) for open chains and just over 90% an error of maximum 4 kJ mol(-1) for rings.
Collapse
Affiliation(s)
- Salvatore Cardamone
- Manchester Institute of Biotechnology (MIB)131 Princess StreetManchesterM1 7DNGreat Britain
- School of ChemistryUniversity of ManchesterOxford RoadManchesterM13 9PLGreat Britain
| | - Paul L. A. Popelier
- Manchester Institute of Biotechnology (MIB)131 Princess StreetManchesterM1 7DNGreat Britain
- School of ChemistryUniversity of ManchesterOxford RoadManchesterM13 9PLGreat Britain
| |
Collapse
|
17
|
Transferable kriging machine learning models for the multipolar electrostatics of helical deca-alanine. Theor Chem Acc 2015. [DOI: 10.1007/s00214-015-1739-y] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
18
|
Gresh N, Sponer JE, Devereux M, Gkionis K, de Courcy B, Piquemal JP, Sponer J. Stacked and H-Bonded Cytosine Dimers. Analysis of the Intermolecular Interaction Energies by Parallel Quantum Chemistry and Polarizable Molecular Mechanics. J Phys Chem B 2015; 119:9477-95. [DOI: 10.1021/acs.jpcb.5b01695] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Affiliation(s)
- Nohad Gresh
- Chemistry & Biology, Nucleo(s)tides & Immunology for Therapy (CBNIT), CNRS UMR8601, Université Paris Descartes, PRES Sorbonne Paris Cité, UFR Biomédicale, 45 rue des Saints-Pères, 75270 Paris Cedex 06, France
- Laboratoire
de Chimie Théorique, Sorbonne Universités, UPMC, Paris 6, case courrier
137, 4, place Jussieu, Paris, F75252, France
- Laboratoire
de Chimie Théorique, UMR 7616 CNRS, case courrier 137, 4, place Jussieu, Paris, F75252, France
| | - Judit E. Sponer
- Institute
of Biophysics, Academy of Sciences of the Czech Republic, Kralovopolska,
135, 612 65 Brno, Czech Republic
- CEITEC − Central European Institute of Technology, Campus Bohunice, Kamenice 5, 625 00 Brno, Czech Republic
| | - Mike Devereux
- Department
of Chemistry, University of Basel, Klingelbergstrasse 80, Basel CH 4056, Switzerland
| | - Konstantinos Gkionis
- Institute
of Biophysics, Academy of Sciences of the Czech Republic, Kralovopolska,
135, 612 65 Brno, Czech Republic
| | - Benoit de Courcy
- Chemistry & Biology, Nucleo(s)tides & Immunology for Therapy (CBNIT), CNRS UMR8601, Université Paris Descartes, PRES Sorbonne Paris Cité, UFR Biomédicale, 45 rue des Saints-Pères, 75270 Paris Cedex 06, France
- Laboratoire
de Chimie Théorique, Sorbonne Universités, UPMC, Paris 6, case courrier
137, 4, place Jussieu, Paris, F75252, France
- Laboratoire
de Chimie Théorique, UMR 7616 CNRS, case courrier 137, 4, place Jussieu, Paris, F75252, France
| | - Jean-Philip Piquemal
- Laboratoire
de Chimie Théorique, Sorbonne Universités, UPMC, Paris 6, case courrier
137, 4, place Jussieu, Paris, F75252, France
- Laboratoire
de Chimie Théorique, UMR 7616 CNRS, case courrier 137, 4, place Jussieu, Paris, F75252, France
| | - Jiri Sponer
- Institute
of Biophysics, Academy of Sciences of the Czech Republic, Kralovopolska,
135, 612 65 Brno, Czech Republic
- CEITEC − Central European Institute of Technology, Campus Bohunice, Kamenice 5, 625 00 Brno, Czech Republic
| |
Collapse
|
19
|
Mutter ST, Zielinski F, Popelier PLA, Blanch EW. Calculation of Raman optical activity spectra for vibrational analysis. Analyst 2015; 140:2944-56. [DOI: 10.1039/c4an02357a] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
This review provides the necessary knowledge to accurately model ROA spectra of solvated systems and interpret their vibrational characteristics.
Collapse
Affiliation(s)
- Shaun T. Mutter
- Manchester Institute of Biotechnology and Faculty of Life Sciences
- University of Manchester
- Manchester
- UK
| | - François Zielinski
- Manchester Institute of Biotechnology and School of Chemistry
- University of Manchester
- Manchester
- UK
| | - Paul L. A. Popelier
- Manchester Institute of Biotechnology and School of Chemistry
- University of Manchester
- Manchester
- UK
| | - Ewan W. Blanch
- Manchester Institute of Biotechnology and Faculty of Life Sciences
- University of Manchester
- Manchester
- UK
| |
Collapse
|