1
|
Chen J, Gao Q, Huang M, Yu K. Application of modern artificial intelligence techniques in the development of organic molecular force fields. Phys Chem Chem Phys 2025; 27:2294-2319. [PMID: 39820957 DOI: 10.1039/d4cp02989e] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2025]
Abstract
The molecular force field (FF) determines the accuracy of molecular dynamics (MD) and is one of the major bottlenecks that limits the application of MD in molecular design. Recently, artificial intelligence (AI) techniques, such as machine-learning potentials (MLPs), have been rapidly reshaping the landscape of MD. Meanwhile, organic molecular systems feature unique characteristics, and require more careful treatment in both model construction, optimization, and validation. While an accurate and generic organic molecular force field is still missing, significant progress has been made with the facilitation of AI, warranting a promising future. In this review, we provide an overview of the various types of AI techniques used in molecular FF development and discuss both the advantages and weaknesses of these methodologies. We show how AI methods provide unprecedented capabilities in many tasks such as potential fitting, atom typification, and automatic optimization. Meanwhile, it is also worth noting that more efforts are needed to improve the transferability of the model, develop a more comprehensive database, and establish more standardized validation procedures. With these discussions, we hope to inspire more efforts to solve the existing problems, eventually leading to the birth of next-generation generic organic FFs.
Collapse
Affiliation(s)
- Junmin Chen
- Institute of Materials Research (IMR), Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen, 518055, China.
- Tsinghua-Berkeley Shenzhen Institute (TBSI), Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen, 518055, China
| | - Qian Gao
- Institute of Materials Research (IMR), Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen, 518055, China.
| | - Miaofei Huang
- Institute of Materials Research (IMR), Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen, 518055, China.
| | - Kuang Yu
- Institute of Materials Research (IMR), Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen, 518055, China.
- Tsinghua-Berkeley Shenzhen Institute (TBSI), Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen, 518055, China
| |
Collapse
|
2
|
Zossimova E, Fiedler J, Vollmer F, Walter M. Hybrid quantum-classical polarizability model for single molecule biosensing. NANOSCALE 2024; 16:5820-5828. [PMID: 38436120 DOI: 10.1039/d3nr05396b] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/05/2024]
Abstract
Optical whispering gallery mode biosensors are able to detect single molecules through effects of their polarizability. We address the factors that affect the polarizability of amino acids, which are the building blocks of life, via electronic structure theory. Amino acids are detected in aqueous environments, where their polarizability is different compared to the gasphase due to solvent effects. Solvent effects include structural changes, protonation and the local field enhancement through the solvent (water). We analyse the impact of these effects and find that all contribute to an increased effective polarizability in the solvent. We also address the excess polarizability relative to the displaced water cavity and develop a hybrid quantum-classical model that is in good agreement with self-consistent calculations. We apply our model to calculate the excess polarizability of 20 proteinogenic amino acids and determine the minimum resolution required to distinguish the different molecules and their ionised conformers based on their polarizability.
Collapse
Affiliation(s)
- Ekaterina Zossimova
- Department of Physics and Astronomy, Living Systems Institute, University of Exeter, EX4 4QD, Exeter, UK.
- Freiburg Center for Interactive Materials and Bioinspired Technologies (FIT), University of Freiburg, D-79110 Freiburg, Germany
| | - Johannes Fiedler
- Department of Physics and Technology, University of Bergen, Allégaten 55, 5007 Bergen, Norway
| | - Frank Vollmer
- Department of Physics and Astronomy, Living Systems Institute, University of Exeter, EX4 4QD, Exeter, UK.
| | - Michael Walter
- Freiburg Center for Interactive Materials and Bioinspired Technologies (FIT), University of Freiburg, D-79110 Freiburg, Germany
- Cluster of Excellence livMatS @ FIT, Freiburg, Germany
- Fraunhofer IWM, MikroTribologie Centrum μTC, Freiburg, Germany
| |
Collapse
|
3
|
Hu X, Amin KS, Schneider M, Lim C, Salahub D, Baldauf C. System-Specific Parameter Optimization for Nonpolarizable and Polarizable Force Fields. J Chem Theory Comput 2024; 20:1448-1464. [PMID: 38279917 PMCID: PMC10867808 DOI: 10.1021/acs.jctc.3c01141] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Revised: 12/04/2023] [Accepted: 12/05/2023] [Indexed: 01/29/2024]
Abstract
The accuracy of classical force fields (FFs) has been shown to be limited for the simulation of cation-protein systems despite their importance in understanding the processes of life. Improvements can result from optimizing the parameters of classical FFs or by extending the FF formulation by terms describing charge transfer (CT) and polarization (POL) effects. In this work, we introduce our implementation of the CTPOL model in OpenMM, which extends the classical additive FF formula by adding CT and POL. Furthermore, we present an open-source parametrization tool, called FFAFFURR, that enables the (system-specific) parametrization of OPLS-AA and CTPOL models. The performance of our workflow was evaluated by its ability to reproduce quantum chemistry energies and by molecular dynamics simulations of a zinc-finger protein.
Collapse
Affiliation(s)
- Xiaojuan Hu
- Fritz-Haber-Institut
der Max-Planck-Gesellschaft, Faradayweg 4-6, 14195 Berlin, Germany
| | - Kazi S. Amin
- Centre
for Molecular Simulation and Department of Biological Sciences, University of Calgary, 2500 University Drive NW, Calgary, Alberta T2N 1N4, Canada
| | - Markus Schneider
- Fritz-Haber-Institut
der Max-Planck-Gesellschaft, Faradayweg 4-6, 14195 Berlin, Germany
| | - Carmay Lim
- Institute
of Biomedical Sciences, Academia Sinica, Taipei 115, Taiwan
- Department
of Chemistry, National Tsing Hua University, Hsinchu 300, Taiwan
| | - Dennis Salahub
- Centre
for Molecular Simulation and Department of Chemistry, University of Calgary, 2500 University Drive NW, Calgary, Alberta T2N 1N4, Canada
| | - Carsten Baldauf
- Fritz-Haber-Institut
der Max-Planck-Gesellschaft, Faradayweg 4-6, 14195 Berlin, Germany
| |
Collapse
|
4
|
Khire SS, Gattadahalli N, Gurav ND, Kumar A, Gadre SR. Constructing Potential Energy Surface with Correlated Theory for Dipeptides Using Molecular Tailoring Approach. Chemphyschem 2023; 24:e202200784. [PMID: 36735449 DOI: 10.1002/cphc.202200784] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2022] [Revised: 01/31/2023] [Accepted: 02/01/2023] [Indexed: 02/04/2023]
Abstract
We demonstrate a cost-effective alternative employing the fragment-based molecular tailoring approach (MTA) for building the potential energy surface (PES) for two dipeptides viz. alanine-alanine and alanine-proline employing correlated theory, with augmented Dunning basis sets. About 1369 geometries are generated for each test dipeptide by systematically varying the dihedral angles Φ ${{\rm{\Phi }}}$ and Ψ ${{{\Psi }}}$ . These conformational geometries are partially optimized by relaxing all the other Z-matrix parameters, fixing the values of Φ ${{\rm{\Phi }}}$ and Ψ ${{{\Psi }}}$ . The MP2 level PES is constructed from the MTA-energies of chemically intact geometries using minimal hardware. The fidelity of MP2/aug-cc-pVDZ level PES is brought out by comparing it with its full calculation counterpart. Further, we bring out the power of the method by reporting the MTA-based CCSD/aug-cc-pVDZ level PES for these two dipeptides containing 498 and 562 basis functions respectively.
Collapse
Affiliation(s)
- Subodh S Khire
- RIKEN Center for Computational Science, Kobe, 650-0047, Japan.,Department of Scientific Computing Modelling and Simulation, Savitribai Phule Pune University, Pune, 411 007, India
| | - Nandini Gattadahalli
- Department of Scientific Computing Modelling and Simulation, Savitribai Phule Pune University, Pune, 411 007, India
| | - Nalini D Gurav
- Department of Scientific Computing Modelling and Simulation, Savitribai Phule Pune University, Pune, 411 007, India.,Organisch-Chemisches Institut and Center for Multiscale Theory and Computation (CMTC), Westfälische Wilhelms-Universität Münster, Corrensstrasse 36, 48149, Münster, Germany
| | - Anmol Kumar
- School of Pharmacy, University of Maryland, Baltimore, 20 Penn Street, HSFII, Baltimore, Maryland, 21201, U.S.A
| | - Shridhar R Gadre
- Department of Scientific Computing Modelling and Simulation, Savitribai Phule Pune University, Pune, 411 007, India
| |
Collapse
|
5
|
Nacsa AB, Czakó G. Benchmark Ab Initio Determination of the Conformers, Proton Affinities, and Gas-Phase Basicities of Cysteine. J Phys Chem A 2022; 126:9667-9679. [PMID: 36524999 PMCID: PMC9806835 DOI: 10.1021/acs.jpca.2c07035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
A systematic conformational mapping combined with literature data leads to 85 stable neutral cysteine conformers. The implementation of the same mapping process for the protonated counterparts reveals 21 N-(amino-), 64 O-(carbonyl-), and 37 S-(thiol-)protonated cysteine conformers. Their relative energies and harmonic vibrational frequencies are given at the MP2/aug-cc-pVDZ level of theory. Further benchmark ab initio computations are performed for the 10 lowest-lying neutral and protonated amino acid conformers (for each type) such as CCSD(T)-F12a/cc-pVDZ-F12 geometry optimizations (and frequency computations for cysteine) as well as auxiliary correction computations of the basis set effects up to CCSD(T)-F12b/cc-pVQZ-F12, electron correlation effects up to CCSDT(Q), core correlation effects, second-order Douglass-Kroll relativistic effects, and zero-point energy contributions. Boltzmann-averaged 0 (298.15) K proton affinity and [298.15 K gas-phase basicity] values of cysteine are predicted to be 214.96 (216.39) [208.21], 201.83 (203.55) [194.16], and 193.31 (194.74) [186.40] kcal/mol for N-, O-, and S-protonation, respectively, also considering the previously described auxiliary corrections.
Collapse
|
6
|
Chakraborty A, Brumme T, Schmahl S, Weiske H, Baldauf C, Asmis KR. Impact of anion polarizability on ion pairing in microhydrated salt clusters. Chem Sci 2022; 13:13187-13200. [PMID: 36425505 PMCID: PMC9668056 DOI: 10.1039/d2sc03431j] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2022] [Accepted: 09/28/2022] [Indexed: 09/08/2024] Open
Abstract
Despite longstanding interest in the mechanism of salt dissolution in aqueous media, a molecular level understanding remains incomplete. Here, cryogenic ion trap vibrational action spectroscopy is combined with electronic structure calculations to track salt hydration in a gas phase model system one water molecule at a time. The infrared photodissociation spectra of microhydrated lithium dihalide anions [LiXX'(H2O) n ]- (XX' = I2, ClI and Cl2; n = 1-3) in the OH stretching region (3800-2800 cm-1) provide a detailed picture of how anion polarizability influences the competition among ion-ion, ion-water and water-water interactions. While exclusively contact ion pairs are observed for n = 1, the formation of solvent-shared ion pairs, identified by markedly red-shifted OH stretching bands (<3200 cm-1), originating from the bridging water molecules, is favored already for n = 2. For n = 3, Li+ reaches its maximum coordination number of four only in [LiI2(H2O)3]-, in accordance with the hard and soft Lewis acid and base principle. Water-water hydrogen bond formation leads to a different solvent-shared ion pair motif in [LiI2(H2O)3]- and network formation even restabilizes the contact ion pair motif in [LiCl2(H2O)3]-. Structural assignments are exclusively possible after the consideration of anharmonic effects. Molecular dynamics simulations confirm that the significance of large amplitude motion (of the water molecules) increases with increasing anion polarizability and that needs to be considered already at cryogenic temperatures.
Collapse
Affiliation(s)
- Arghya Chakraborty
- Wilhelm-Ostwald-Institut für Physikalische und Theoretische Chemie, Universität Leipzig Linnéstrasse 2 D-04103 Leipzig Germany
| | - Thomas Brumme
- Wilhelm-Ostwald-Institut für Physikalische und Theoretische Chemie, Universität Leipzig Linnéstrasse 2 D-04103 Leipzig Germany
- Theoretische Chemie, Technische Universität Dresden Dresden Germany
| | - Sonja Schmahl
- Wilhelm-Ostwald-Institut für Physikalische und Theoretische Chemie, Universität Leipzig Linnéstrasse 2 D-04103 Leipzig Germany
| | - Hendrik Weiske
- Wilhelm-Ostwald-Institut für Physikalische und Theoretische Chemie, Universität Leipzig Linnéstrasse 2 D-04103 Leipzig Germany
| | - Carsten Baldauf
- Fritz-Haber-Institut der Max-Planck-Gesellschaft Berlin Germany
| | - Knut R Asmis
- Wilhelm-Ostwald-Institut für Physikalische und Theoretische Chemie, Universität Leipzig Linnéstrasse 2 D-04103 Leipzig Germany
| |
Collapse
|
7
|
OSTEOCALCIN ACTIVE CENTER MODELS: electrochemical adsorption on platinum AND QUANTUM CHEMICAL ANALYSIS. Electrochim Acta 2022. [DOI: 10.1016/j.electacta.2022.141466] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
8
|
Kalvoda T, Culka M, Rulíšek L, Andris E. Exhaustive Mapping of the Conformational Space of Natural Dipeptides by the DFT-D3//COSMO-RS Method. J Phys Chem B 2022; 126:5949-5958. [PMID: 35930560 DOI: 10.1021/acs.jpcb.2c02861] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
We extensively mapped energy landscapes and conformations of 22 (including three His protonation states) proteinogenic α-amino acids in trans configuration and the corresponding 484 (222) dipeptides. To mimic the environment in a protein chain, the N- and C-termini of the studied systems were capped with acetyl and N-methylamide groups, respectively. We systematically varied the main chain dihedral angles (ϕ, ψ) by 40° steps and all side chain angles by 90° or 120° steps. We optimized the molecular geometries with the GFN2-xTB semiempirical (SQM) method and performed single point density functional theory calculations at the BP86-D3/DGauss-DZVP//COSMO-RS level in water, 1-octanol, N,N-dimethylformamide, and n-hexane. For each restrained (nonequilibrium) structure, we also calculated energy gradients (in water) and natural atomic charges. The exhaustive and unprecedented QM-based sampling enabled us to construct Ramachandran plots of quantum mechanical (QM(BP86-D3)//COSMO-RS) energies calculated on SQM structures, for all 506 (484 dipeptides and 22 amino acids) studied systems. We showed how the character of an amino acid side chain influences the conformational space of single amino acids and dipeptides. With clustering techniques, we were able to identify unique minima of amino acids and dipeptides (i.e., minima on the GFN2-xTB potential energy surfaces) and analyze the distribution of their BP86-D3//COSMO-RS conformational energies in all four solvents. We also derived an empirical formula for the number of unique minima based on the overall number of rotatable bonds within each peptide. The final peptide conformer data set (PeptideCs) comprises over 400 million structures, all of them annotated with QM(BP86-D3)//COSMO-RS energies. Thanks to its completeness and unbiased nature, the PeptideCs can serve, inter alia, as a data set for the validation of new methods for predicting the energy landscapes of protein structures. This data set may also prove to be useful in the development and reparameterization of biomolecular force fields. The data set is deposited at Figshare (10.25452/figshare.plus.19607172) and can be accessed using a simple web interface at http://peptidecs.uochb.cas.cz.
Collapse
Affiliation(s)
- Tadeáš Kalvoda
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo náměstí 2, 166 10 Praha, Czech Republic
| | - Martin Culka
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo náměstí 2, 166 10 Praha, Czech Republic
| | - Lubomír Rulíšek
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo náměstí 2, 166 10 Praha, Czech Republic
| | - Erik Andris
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo náměstí 2, 166 10 Praha, Czech Republic
| |
Collapse
|
9
|
Hu X, Lenz-Himmer MO, Baldauf C. Better force fields start with better data: A data set of cation dipeptide interactions. Sci Data 2022; 9:327. [PMID: 35715420 PMCID: PMC9205945 DOI: 10.1038/s41597-022-01297-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2021] [Accepted: 03/18/2022] [Indexed: 11/08/2022] Open
Abstract
We present a data set from a first-principles study of amino-methylated and acetylated (capped) dipeptides of the 20 proteinogenic amino acids - including alternative possible side chain protonation states and their interactions with selected divalent cations (Ca2+, Mg2+ and Ba2+). The data covers 21,909 stationary points on the respective potential-energy surfaces in a wide relative energy range of up to 4 eV (390 kJ/mol). Relevant properties of interest, like partial charges, were derived for the conformers. The motivation was to provide a solid data basis for force field parameterization and further applications like machine learning or benchmarking. In particular the process of creating all this data on the same first-principles footing, i.e. density-functional theory calculations employing the generalized gradient approximation with a van der Waals correction, makes this data suitable for first principles data-driven force field development. To make the data accessible across domain borders and to machines, we formalized the metadata in an ontology.
Collapse
Affiliation(s)
- Xiaojuan Hu
- Fritz-Haber-Institut der Max-Planck-Gesellschaft, Faradayweg 4-6, 14195, Berlin, Germany.
| | | | - Carsten Baldauf
- Fritz-Haber-Institut der Max-Planck-Gesellschaft, Faradayweg 4-6, 14195, Berlin, Germany.
| |
Collapse
|
10
|
Gervasoni S, Malloci G, Bosin A, Vargiu AV, Zgurskaya HI, Ruggerone P. AB-DB: Force-Field parameters, MD trajectories, QM-based data, and Descriptors of Antimicrobials. Sci Data 2022; 9:148. [PMID: 35365662 PMCID: PMC8976083 DOI: 10.1038/s41597-022-01261-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2021] [Accepted: 03/11/2022] [Indexed: 12/13/2022] Open
Abstract
Antibiotic resistance is a major threat to public health. The development of chemo-informatic tools to guide medicinal chemistry campaigns in the efficint design of antibacterial libraries is urgently needed. We present AB-DB, an open database of all-atom force-field parameters, molecular dynamics trajectories, quantum-mechanical properties, and curated physico-chemical descriptors of antimicrobial compounds. We considered more than 300 molecules belonging to 25 families that include the most relevant antibiotic classes in clinical use, such as β-lactams and (fluoro)quinolones, as well as inhibitors of key bacterial proteins. We provide traditional descriptors together with properties obtained with Density Functional Theory calculations. Noteworthy, AB-DB contains less conventional descriptors extracted from μs-long molecular dynamics simulations in explicit solvent. In addition, for each compound we make available force-field parameters for the major micro-species at physiological pH. With the rise of multi-drug-resistant pathogens and the consequent need for novel antibiotics, inhibitors, and drug re-purposing strategies, curated databases containing reliable and not straightforward properties facilitate the integration of data mining and statistics into the discovery of new antimicrobials.
Collapse
Affiliation(s)
- Silvia Gervasoni
- University of Cagliari, Department of Physics, I-09042, Monserrato (Cagliari), Italy
| | - Giuliano Malloci
- University of Cagliari, Department of Physics, I-09042, Monserrato (Cagliari), Italy.
| | - Andrea Bosin
- University of Cagliari, Department of Physics, I-09042, Monserrato (Cagliari), Italy
| | - Attilio V Vargiu
- University of Cagliari, Department of Physics, I-09042, Monserrato (Cagliari), Italy
| | - Helen I Zgurskaya
- University of Oklahoma, Department of Chemistry and Biochemistry, Norman, OK, 73072, United States
| | - Paolo Ruggerone
- University of Cagliari, Department of Physics, I-09042, Monserrato (Cagliari), Italy
| |
Collapse
|
11
|
Fabregat R, Fabrizio A, Engel EA, Meyer B, Juraskova V, Ceriotti M, Corminboeuf C. Local Kernel Regression and Neural Network Approaches to the Conformational Landscapes of Oligopeptides. J Chem Theory Comput 2022; 18:1467-1479. [PMID: 35179897 PMCID: PMC8908737 DOI: 10.1021/acs.jctc.1c00813] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2021] [Indexed: 11/30/2022]
Abstract
The application of machine learning to theoretical chemistry has made it possible to combine the accuracy of quantum chemical energetics with the thorough sampling of finite-temperature fluctuations. To reach this goal, a diverse set of methods has been proposed, ranging from simple linear models to kernel regression and highly nonlinear neural networks. Here we apply two widely different approaches to the same, challenging problem: the sampling of the conformational landscape of polypeptides at finite temperature. We develop a local kernel regression (LKR) coupled with a supervised sparsity method and compare it with a more established approach based on Behler-Parrinello type neural networks. In the context of the LKR, we discuss how the supervised selection of the reference pool of environments is crucial to achieve accurate potential energy surfaces at a competitive computational cost and leverage the locality of the model to infer which chemical environments are poorly described by the DFTB baseline. We then discuss the relative merits of the two frameworks and perform Hamiltonian-reservoir replica-exchange Monte Carlo sampling and metadynamics simulations, respectively, to demonstrate that both frameworks can achieve converged and transferable sampling of the conformational landscape of complex and flexible biomolecules with comparable accuracy and computational cost.
Collapse
Affiliation(s)
- Raimon Fabregat
- Laboratory for Computational
Molecular Design, Institute of Chemical
Sciences and Engineering, National Centre for Computational Design and Discovery
of Novel Materials (MARVEL), École
Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
| | - Alberto Fabrizio
- Laboratory for Computational
Molecular Design, Institute of Chemical
Sciences and Engineering, National Centre for Computational Design and Discovery
of Novel Materials (MARVEL), École
Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
| | - Edgar A. Engel
- Laboratory for Computational
Molecular Design, Institute of Chemical
Sciences and Engineering, National Centre for Computational Design and Discovery
of Novel Materials (MARVEL), École
Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
- Laboratory
of Computational Science and Modeling, IMX,
École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Benjamin Meyer
- Laboratory for Computational
Molecular Design, Institute of Chemical
Sciences and Engineering, National Centre for Computational Design and Discovery
of Novel Materials (MARVEL), École
Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
| | - Veronika Juraskova
- Laboratory for Computational
Molecular Design, Institute of Chemical
Sciences and Engineering, National Centre for Computational Design and Discovery
of Novel Materials (MARVEL), École
Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
| | - Michele Ceriotti
- Laboratory for Computational
Molecular Design, Institute of Chemical
Sciences and Engineering, National Centre for Computational Design and Discovery
of Novel Materials (MARVEL), École
Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
- Laboratory
of Computational Science and Modeling, IMX,
École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Clemence Corminboeuf
- Laboratory for Computational
Molecular Design, Institute of Chemical
Sciences and Engineering, National Centre for Computational Design and Discovery
of Novel Materials (MARVEL), École
Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
| |
Collapse
|
12
|
Ferro-Costas D, Mosquera-Lois I, Fernández-Ramos A. TorsiFlex: an automatic generator of torsional conformers. Application to the twenty proteinogenic amino acids. J Cheminform 2021; 13:100. [PMID: 34952644 PMCID: PMC8710030 DOI: 10.1186/s13321-021-00578-0] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2021] [Accepted: 12/08/2021] [Indexed: 11/10/2022] Open
Abstract
In this work, we introduce TorsiFlex, a user-friendly software written in Python 3 and designed to find all the torsional conformers of flexible acyclic molecules in an automatic fashion. For the mapping of the torsional potential energy surface, the algorithm implemented in TorsiFlex combines two searching strategies: preconditioned and stochastic. The former is a type of systematic search based on chemical knowledge and should be carried out before the stochastic (random) search. The algorithm applies several validation tests to accelerate the exploration of the torsional space. For instance, the optimized structures are stored and this information is used to prevent revisiting these points and their surroundings in future iterations. TorsiFlex operates with a dual-level strategy by which the initial search is carried out at an inexpensive electronic structure level of theory and the located conformers are reoptimized at a higher level. Additionally, the program takes advantage of conformational enantiomerism, when possible. As a case study, and in order to exemplify the effectiveness and capabilities of this program, we have employed TorsiFlex to locate the conformers of the twenty proteinogenic amino acids in their neutral canonical form. TorsiFlex has produced a number of conformers that roughly doubles the amount of the most complete work to date.
Collapse
Affiliation(s)
- David Ferro-Costas
- Centro Singular de Investigación en Química Biolóxica e Materiais Moleculares (CIQUS), Universidade de Santiago de Compostela, 15782, Santiago de Compostela, Spain.
| | - Irea Mosquera-Lois
- Centro Singular de Investigación en Química Biolóxica e Materiais Moleculares (CIQUS), Universidade de Santiago de Compostela, 15782, Santiago de Compostela, Spain
| | - Antonio Fernández-Ramos
- Centro Singular de Investigación en Química Biolóxica e Materiais Moleculares (CIQUS), Universidade de Santiago de Compostela, 15782, Santiago de Compostela, Spain.
| |
Collapse
|
13
|
Tsubaki M, Mizoguchi T. Quantum Deep Descriptor: Physically Informed Transfer Learning from Small Molecules to Polymers. J Chem Theory Comput 2021; 17:7814-7821. [PMID: 34846893 DOI: 10.1021/acs.jctc.1c00568] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
In this study, we propose a physically informed transfer learning approach for materials informatics (MI) using a quantum deep descriptor (QDD) obtained from the quantum deep field (QDF). The QDF is a machine learning model based on density functional theory (DFT) and can be trained with a large database of molecular properties. The pre-trained QDF model can provide an effective molecular descriptor that encodes the fundamental quantum-chemical characteristics (i.e., the wave function or orbital, electron density, and energies of a molecule) learned from the large database; we refer to this descriptor as a QDD. We show that a QDD pre-trained with certain properties of small molecules can predict different properties (e.g., the band gap and dielectric constant) of polymers compared with some existing descriptors. We believe that our DFT-based, physically informed transfer learning approach will not only be useful for practical applications in MI but will also provide quantum-chemical insights into materials in the future. All codes used in this study are available at https://github.com/masashitsubaki.
Collapse
Affiliation(s)
- Masashi Tsubaki
- National Institute of Advanced Industrial Science and Technology, Tokyo 135-0064, Japan
| | - Teruyasu Mizoguchi
- Institute of Industrial Science, The University of Tokyo, Tokyo 113-0033, Japan
| |
Collapse
|
14
|
Lin K, TomHon P, Lehmkuhl S, Laasner R, Theis T, Blum V. Density Functional Theory Study of Reaction Equilibria in Signal Amplification by Reversible Exchange. Chemphyschem 2021; 22:1937-1938. [PMID: 34617650 PMCID: PMC8725239 DOI: 10.1002/cphc.202100678] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
The front cover artwork is provided by the groups of Prof. Thomas Theis (North Carolina State University) Prof. Volker Blum (Duke University). The image shows the reaction network of Signal Amplification by Reversible Exchange (SABRE), elucidated by density functional theory (DFT). Read the full text of the Review at 10.1002/cphc.202100204.
Collapse
Affiliation(s)
- Kailai Lin
- Department of Chemistry, Duke University, Durham, NC 27708, USA
| | - Patrick TomHon
- Department of Chemistry, North Carolina State University, Raleigh, NC 27606, USA
| | - Sören Lehmkuhl
- Department of Chemistry, North Carolina State University, Raleigh, NC 27606, USA
| | - Raul Laasner
- Thomas Lord Department of Mechanical Engineering and Materials Science, Duke University, Durham, NC 27708, USA
| | - Thomas Theis
- Department of Chemistry, North Carolina State University, Raleigh, NC 27606, USA
- Joint Department of Biomedical Engineering, UNC, Chapel Hill, and NC State University, Raleigh, NC 27606, USA
- Department of Physics, North Carolina State University, Raleigh, NC 27606, USA
| | - Volker Blum
- Department of Chemistry, Duke University, Durham, NC 27708, USA
- Thomas Lord Department of Mechanical Engineering and Materials Science, Duke University, Durham, NC 27708, USA
| |
Collapse
|
15
|
Lin K, TomHon P, Lehmkuhl S, Laasner R, Theis T, Blum V. Density Functional Theory Study of Reaction Equilibria in Signal Amplification by Reversible Exchange. Chemphyschem 2021; 22:1947-1957. [PMID: 34549869 DOI: 10.1002/cphc.202100204] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2021] [Revised: 05/19/2021] [Indexed: 11/07/2022]
Abstract
An in-depth theoretical analysis of key chemical equilibria in Signal Amplification by Reversible Exchange (SABRE) is provided, employing density functional theory calculations to characterize the likely reaction network. For all reactions in the network, the potential energy surface is probed to identify minimum energy pathways. Energy barriers and transition states are calculated, and harmonic transition state theory is applied to calculate exchange rates that approximate experimental values. The reaction network energy surface can be modulated by chemical potentials that account for the dependence on concentration, temperature, and partial pressure of molecular constituents (hydrogen, methanol, pyridine) supplied to the experiment under equilibrium conditions. We show that, under typical experimental conditions, the Gibbs free energies of the two key states involved in pyridine-hydrogen exchange at the common Ir-IMes catalyst system in methanol are essentially the same, i. e., nearly optimal for SABRE. We also show that a methanol-containing intermediate is plausible as a transient species in the process.
Collapse
Affiliation(s)
- Kailai Lin
- Department of Chemistry, Duke University, Durham, NC 27708, USA
| | - Patrick TomHon
- Department of Chemistry, North Carolina State University, Raleigh, NC 27606, USA
| | - Sören Lehmkuhl
- Department of Chemistry, North Carolina State University, Raleigh, NC 27606, USA
| | - Raul Laasner
- Thomas Lord Department of Mechanical Engineering and Materials Science, Duke University, Durham, NC 27708, USA
| | - Thomas Theis
- Department of Chemistry, North Carolina State University, Raleigh, NC 27606, USA.,Joint Department of Biomedical Engineering, UNC, Chapel Hill, and NC State University, Raleigh, NC 27606, USA.,Department of Physics, North Carolina State University, Raleigh, NC 27606, USA
| | - Volker Blum
- Department of Chemistry, Duke University, Durham, NC 27708, USA.,Thomas Lord Department of Mechanical Engineering and Materials Science, Duke University, Durham, NC 27708, USA
| |
Collapse
|
16
|
Musil F, Grisafi A, Bartók AP, Ortner C, Csányi G, Ceriotti M. Physics-Inspired Structural Representations for Molecules and Materials. Chem Rev 2021; 121:9759-9815. [PMID: 34310133 DOI: 10.1021/acs.chemrev.1c00021] [Citation(s) in RCA: 173] [Impact Index Per Article: 43.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
The first step in the construction of a regression model or a data-driven analysis, aiming to predict or elucidate the relationship between the atomic-scale structure of matter and its properties, involves transforming the Cartesian coordinates of the atoms into a suitable representation. The development of atomic-scale representations has played, and continues to play, a central role in the success of machine-learning methods for chemistry and materials science. This review summarizes the current understanding of the nature and characteristics of the most commonly used structural and chemical descriptions of atomistic structures, highlighting the deep underlying connections between different frameworks and the ideas that lead to computationally efficient and universally applicable models. It emphasizes the link between properties, structures, their physical chemistry, and their mathematical description, provides examples of recent applications to a diverse set of chemical and materials science problems, and outlines the open questions and the most promising research directions in the field.
Collapse
Affiliation(s)
- Felix Musil
- Laboratory of Computational Science and Modeling, IMX, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland.,National Centre for Computational Design and Discovery of Novel Materials (MARVEL), École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Andrea Grisafi
- Laboratory of Computational Science and Modeling, IMX, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Albert P Bartók
- Department of Physics and Warwick Centre for Predictive Modelling, School of Engineering, University of Warwick, Coventry CV4 7AL, United Kingdom
| | - Christoph Ortner
- University of British Columbia, Vancouver, British Columbia V6T 1Z2, Canada
| | - Gábor Csányi
- Engineering Laboratory, University of Cambridge, Trumpington Street, Cambridge CB2 1PZ, United Kingdom
| | - Michele Ceriotti
- Laboratory of Computational Science and Modeling, IMX, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland.,National Centre for Computational Design and Discovery of Novel Materials (MARVEL), École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| |
Collapse
|
17
|
Terayama K, Sumita M, Katouda M, Tsuda K, Okuno Y. Efficient Search for Energetically Favorable Molecular Conformations against Metastable States via Gray-Box Optimization. J Chem Theory Comput 2021; 17:5419-5427. [PMID: 34261321 DOI: 10.1021/acs.jctc.1c00301] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
In order to accurately understand and estimate molecular properties, finding energetically favorable molecular conformations is the most fundamental task for atomistic computational research on molecules and materials. Geometry optimization based on quantum chemical calculations has enabled the conformation prediction of arbitrary molecules, including de novo ones. However, it is computationally expensive to perform geometry optimizations for enormous conformers. In this study, we introduce the gray-box optimization (GBO) framework, which enables optimal control over the entire geometry optimization process, among multiple conformers. Algorithms designed for GBO roughly estimate energetically preferable conformers during their geometry optimization iterations. They then preferentially compute promising conformers. To evaluate the performance of the GBO framework, we applied it to a test set consisting of seven dipeptides and mycophenolic acid to determine their stable conformations at the density functional theory level. We thus preferentially obtained energetically favorable conformations. Furthermore, the computational costs required to find the most stable conformation were significantly reduced (approximately 1% on average, compared to the naive approach for the dipeptides).
Collapse
Affiliation(s)
- Kei Terayama
- Graduate School of Medical Life Science, Yokohama City University, Tsurumi-ku, Yokohama 230-0045, Japan.,RIKEN Center for Advanced Intelligence Project, Tokyo 103-0027, Japan.,Medical Sciences Innovation Hub Program, RIKEN, Yokohama 230-0045, Japan.,Graduate School of Medicine, Kyoto University, Sakyo-ku, Kyoto 606-8507, Japan
| | - Masato Sumita
- RIKEN Center for Advanced Intelligence Project, Tokyo 103-0027, Japan.,International Center for Materials Nanoarchitectonics(WPI-MANA), National Institute for Materials Science, Tsukuba 305-0044, Japan
| | - Michio Katouda
- Department of Computational Science and Technology, Research Organization for Information Science and Technology, Minato-ku, Tokyo 105-0013, Japan.,Waseda Research Institute for Science and Engineering, Waseda University, Sinjuku-ku, Tokyo 169-8555, Japan
| | - Koji Tsuda
- RIKEN Center for Advanced Intelligence Project, Tokyo 103-0027, Japan.,Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa 277-8561, Japan.,Research and Services Division of Materials Data and Integrated System, National Institute for Materials Science, Tsukuba 305-0047, Japan
| | - Yasushi Okuno
- Medical Sciences Innovation Hub Program, RIKEN, Yokohama 230-0045, Japan.,Graduate School of Medicine, Kyoto University, Sakyo-ku, Kyoto 606-8507, Japan
| |
Collapse
|
18
|
Kulichenko M, Smith JS, Nebgen B, Li YW, Fedik N, Boldyrev AI, Lubbers N, Barros K, Tretiak S. The Rise of Neural Networks for Materials and Chemical Dynamics. J Phys Chem Lett 2021; 12:6227-6243. [PMID: 34196559 DOI: 10.1021/acs.jpclett.1c01357] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Machine learning (ML) is quickly becoming a premier tool for modeling chemical processes and materials. ML-based force fields, trained on large data sets of high-quality electron structure calculations, are particularly attractive due their unique combination of computational efficiency and physical accuracy. This Perspective summarizes some recent advances in the development of neural network-based interatomic potentials. Designing high-quality training data sets is crucial to overall model accuracy. One strategy is active learning, in which new data are automatically collected for atomic configurations that produce large ML uncertainties. Another strategy is to use the highest levels of quantum theory possible. Transfer learning allows training to a data set of mixed fidelity. A model initially trained to a large data set of density functional theory calculations can be significantly improved by retraining to a relatively small data set of expensive coupled cluster theory calculations. These advances are exemplified by applications to molecules and materials.
Collapse
Affiliation(s)
- Maksim Kulichenko
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
- Department of Chemistry and Biochemistry, Utah State University, Logan, Utah 84322, United States
| | - Justin S Smith
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
- Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
| | - Benjamin Nebgen
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
| | - Ying Wai Li
- Computer, Computational, and Statistical Sciences Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
| | - Nikita Fedik
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
- Department of Chemistry and Biochemistry, Utah State University, Logan, Utah 84322, United States
| | - Alexander I Boldyrev
- Department of Chemistry and Biochemistry, Utah State University, Logan, Utah 84322, United States
| | - Nicholas Lubbers
- Computer, Computational, and Statistical Sciences Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
| | - Kipton Barros
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
- Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
| | - Sergei Tretiak
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
- Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
- Center for Integrated Nanotechnologies, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
| |
Collapse
|
19
|
Stuke A, Rinke P, Todorović M. Efficient hyperparameter tuning for kernel ridge regression with Bayesian optimization. MACHINE LEARNING: SCIENCE AND TECHNOLOGY 2021. [DOI: 10.1088/2632-2153/abee59] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
Abstract
Machine learning methods usually depend on internal parameters—so called hyperparameters—that need to be optimized for best performance. Such optimization poses a burden on machine learning practitioners, requiring expert knowledge, intuition or computationally demanding brute-force parameter searches. We here assess three different hyperparameter selection methods: grid search, random search and an efficient automated optimization technique based on Bayesian optimization (BO). We apply these methods to a machine learning problem based on kernel ridge regression in computational chemistry. Two different descriptors are employed to represent the atomic structure of organic molecules, one of which introduces its own set of hyperparameters to the method. We identify optimal hyperparameter configurations and infer entire prediction error landscapes in hyperparameter space that serve as visual guides for the hyperparameter performance. We further demonstrate that for an increasing number of hyperparameters, BO and random search become significantly more efficient in computational time than an exhaustive grid search, while delivering an equivalent or even better accuracy.
Collapse
|
20
|
Morawietz T, Artrith N. Machine learning-accelerated quantum mechanics-based atomistic simulations for industrial applications. J Comput Aided Mol Des 2021; 35:557-586. [PMID: 33034008 PMCID: PMC8018928 DOI: 10.1007/s10822-020-00346-6] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2020] [Accepted: 09/26/2020] [Indexed: 01/13/2023]
Abstract
Atomistic simulations have become an invaluable tool for industrial applications ranging from the optimization of protein-ligand interactions for drug discovery to the design of new materials for energy applications. Here we review recent advances in the use of machine learning (ML) methods for accelerated simulations based on a quantum mechanical (QM) description of the system. We show how recent progress in ML methods has dramatically extended the applicability range of conventional QM-based simulations, allowing to calculate industrially relevant properties with enhanced accuracy, at reduced computational cost, and for length and time scales that would have otherwise not been accessible. We illustrate the benefits of ML-accelerated atomistic simulations for industrial R&D processes by showcasing relevant applications from two very different areas, drug discovery (pharmaceuticals) and energy materials. Writing from the perspective of both a molecular and a materials modeling scientist, this review aims to provide a unified picture of the impact of ML-accelerated atomistic simulations on the pharmaceutical, chemical, and materials industries and gives an outlook on the exciting opportunities that could emerge in the future.
Collapse
Affiliation(s)
- Tobias Morawietz
- Bayer AG, Pharmaceuticals, R&D, Digital Technologies, Computational Molecular Design, 42096 Wuppertal, Germany
| | - Nongnuch Artrith
- Department of Chemical Engineering, Columbia University, New York, NY 10027 USA
| |
Collapse
|
21
|
Moerman E, Furman D, Wales DJ. Systematic Evaluation of ReaxFF Reactive Force Fields for Biochemical Applications. J Chem Theory Comput 2020; 17:497-514. [DOI: 10.1021/acs.jctc.0c01043] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Affiliation(s)
- Evgeny Moerman
- Yusuf Hamied Department of Chemistry, University of Cambridge, Lens_eld Road, Cambridge CB2 1EW, U.K
| | - David Furman
- Yusuf Hamied Department of Chemistry, University of Cambridge, Lens_eld Road, Cambridge CB2 1EW, U.K
- Division of Chemistry, NRCN, P.O. Box 9001, Beer-Sheva 84190, Israel
| | - David J. Wales
- Yusuf Hamied Department of Chemistry, University of Cambridge, Lens_eld Road, Cambridge CB2 1EW, U.K
| |
Collapse
|
22
|
Helfrecht BA, Cersonsky RK, Fraux G, Ceriotti M. Structure-property maps with Kernel principal covariates regression. MACHINE LEARNING-SCIENCE AND TECHNOLOGY 2020. [DOI: 10.1088/2632-2153/aba9ef] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
|
23
|
Amin KS, Hu X, Salahub DR, Baldauf C, Lim C, Noskov S. Benchmarking polarizable and non-polarizable force fields for Ca2+–peptides against a comprehensive QM dataset. J Chem Phys 2020; 153:144102. [DOI: 10.1063/5.0020768] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Affiliation(s)
- Kazi S. Amin
- CMS – Centre for Molecular Simulation and Department of Biological Sciences, University of Calgary, 2500 University Drive NW, Calgary, Alberta T2N 1N4, Canada
| | - Xiaojuan Hu
- Fritz-Haber-Institut der Max-Planck-Gesellschaft, Faradayweg 4-6, 14195 Berlin, Germany
| | - Dennis R. Salahub
- Department of Chemistry, CMS – Centre for Molecular Simulation, IQST – Institute for Quantum Science and Technology, Quantum Alberta, University of Calgary, 2500 University Drive NW, Calgary, Alberta T2N 1N4, Canada
| | - Carsten Baldauf
- Fritz-Haber-Institut der Max-Planck-Gesellschaft, Faradayweg 4-6, 14195 Berlin, Germany
| | - Carmay Lim
- Institute of Biomedical Sciences, Academia Sinica, Taipei 115, Taiwan
- Department of Chemistry, National Tsing Hua University, Hsinchu 300, Taiwan
| | - Sergei Noskov
- CMS – Centre for Molecular Simulation and Department of Biological Sciences, University of Calgary, 2500 University Drive NW, Calgary, Alberta T2N 1N4, Canada
| |
Collapse
|
24
|
Toward accurate prediction of amino acid derivatives structure and energetics from DFT: glycine conformers and their interconversions. J Mol Model 2020; 26:129. [DOI: 10.1007/s00894-020-4342-7] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2019] [Accepted: 03/04/2020] [Indexed: 12/30/2022]
|
25
|
Stuke A, Kunkel C, Golze D, Todorović M, Margraf JT, Reuter K, Rinke P, Oberhofer H. Atomic structures and orbital energies of 61,489 crystal-forming organic molecules. Sci Data 2020; 7:58. [PMID: 32071311 PMCID: PMC7029047 DOI: 10.1038/s41597-020-0385-y] [Citation(s) in RCA: 43] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2019] [Accepted: 01/21/2020] [Indexed: 12/14/2022] Open
Abstract
Data science and machine learning in materials science require large datasets of technologically relevant molecules or materials. Currently, publicly available molecular datasets with realistic molecular geometries and spectral properties are rare. We here supply a diverse benchmark spectroscopy dataset of 61,489 molecules extracted from organic crystals in the Cambridge Structural Database (CSD), denoted OE62. Molecular equilibrium geometries are reported at the Perdew-Burke-Ernzerhof (PBE) level of density functional theory (DFT) including van der Waals corrections for all 62 k molecules. For these geometries, OE62 supplies total energies and orbital eigenvalues at the PBE and the PBE hybrid (PBE0) functional level of DFT for all 62 k molecules in vacuum as well as at the PBE0 level for a subset of 30,876 molecules in (implicit) water. For 5,239 molecules in vacuum, the dataset provides quasiparticle energies computed with many-body perturbation theory in the G0W0 approximation with a PBE0 starting point (denoted GW5000 in analogy to the GW100 benchmark set (M. van Setten et al. J. Chem. Theory Comput. 12, 5076 (2016))).
Collapse
Affiliation(s)
- Annika Stuke
- Department of Applied Physics, Aalto University, P.O. Box 11100, Aalto, FI-00076, Finland.
| | - Christian Kunkel
- Chair for Theoretical Chemistry and Catalysis Research Center, Technische Universität München, Lichtenbergstraße 4, D-85747, Garching, Germany
| | - Dorothea Golze
- Department of Applied Physics, Aalto University, P.O. Box 11100, Aalto, FI-00076, Finland
| | - Milica Todorović
- Department of Applied Physics, Aalto University, P.O. Box 11100, Aalto, FI-00076, Finland
| | - Johannes T Margraf
- Chair for Theoretical Chemistry and Catalysis Research Center, Technische Universität München, Lichtenbergstraße 4, D-85747, Garching, Germany
| | - Karsten Reuter
- Chair for Theoretical Chemistry and Catalysis Research Center, Technische Universität München, Lichtenbergstraße 4, D-85747, Garching, Germany
| | - Patrick Rinke
- Department of Applied Physics, Aalto University, P.O. Box 11100, Aalto, FI-00076, Finland
- Chair for Theoretical Chemistry and Catalysis Research Center, Technische Universität München, Lichtenbergstraße 4, D-85747, Garching, Germany
| | - Harald Oberhofer
- Chair for Theoretical Chemistry and Catalysis Research Center, Technische Universität München, Lichtenbergstraße 4, D-85747, Garching, Germany
| |
Collapse
|
26
|
Perfetto E, Trabattoni A, Calegari F, Nisoli M, Marini A, Stefanucci G. Ultrafast Quantum Interference in the Charge Migration of Tryptophan. J Phys Chem Lett 2020; 11:891-899. [PMID: 31944766 DOI: 10.1021/acs.jpclett.9b03517] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Extreme-ultraviolet-induced charge migration in biorelevant molecules is a fundamental step in the complex path leading to photodamage. In this work we propose a simple interpretation of the charge migration recently observed in an attosecond pump-probe experiment on the amino acid tryptophan. We find that the decay of the prominent low-frequency spectral structure with increasing pump-probe delay is due to a quantum beating between two geometrically distinct, almost degenerate charge oscillations. Quantum beating is ubiquitous in these systems, and at least on the few-to-tens of femtosecond time scales, it may dominate over decoherence the line intensities of time-resolved spectra. We also address the experimentally observed phase shift in the charge oscillations of two different amino acids, tryptophan and phenylalanine. Our results indicate that a beyond mean-field treatment of the electron dynamics is necessary to reproduce the correct behavior.
Collapse
Affiliation(s)
- E Perfetto
- Dipartimento di Fisica , Università di Roma Tor Vergata , Via della Ricerca Scientifica 1 , 00133 Rome , Italy
- CNR-ISM , Division of Ultrafast Processes in Materials (FLASHit) , Area della Ricerca di Roma 1, Via Salaria Km 29.3 , I-00016 Monterotondo Scalo , Italy
| | - A Trabattoni
- Center for Free-Electron Laser Science (CFEL) , DESY , 22607 Hamburg , Germany
| | - F Calegari
- Center for Free-Electron Laser Science (CFEL) , DESY , 22607 Hamburg , Germany
- Institute for Photonics and Nanotechnologies , IFN-CNR , 20133 Milano , Italy
- Institut fur Experimentalphysik , Universität Hamburg , D-22761 Hamburg , Germany
| | - M Nisoli
- Institute for Photonics and Nanotechnologies , IFN-CNR , 20133 Milano , Italy
- Dipartimento di Fisica , Politecnico di Milano , 20133 Milano , Italy
| | - A Marini
- CNR-ISM , Division of Ultrafast Processes in Materials (FLASHit) , Area della Ricerca di Roma 1, Via Salaria Km 29.3 , I-00016 Monterotondo Scalo , Italy
| | - G Stefanucci
- Dipartimento di Fisica , Università di Roma Tor Vergata , Via della Ricerca Scientifica 1 , 00133 Rome , Italy
- INFN , Sezione di Roma Tor Vergata , Via della Ricerca Scientifica 1 , 00133 Rome , Italy
| |
Collapse
|
27
|
Štejfa V, Fulem M, Růžička K. Ideal-gas thermodynamic properties of proteinogenic aliphatic amino acids calculated by R1SM approach. J Chem Phys 2019; 151:144504. [PMID: 31615223 DOI: 10.1063/1.5123450] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
Abstract
In this work, a R1SM approach was applied for the calculation of ideal-gas thermodynamic properties of five amino acids with aliphatic side chains: glycine, alanine, valine, leucine, and isoleucine. The first step of the calculation was an extensive conformational analysis that located several conformers not reported previously. A new systematic and user-friendly nomenclature of the conformers was introduced, and the stable conformers were clearly assigned with the previously used labeling where possible. Stability and calculated relative energies of the conformers were compared between various levels of theory and with several experimental studies, demonstrating a good performance of the selected B3LYP-D3/6-311+G(2df,p) level of theory. As a second step, the theoretically calculated vibrational frequencies were compared to the previously reported experimental spectra to verify the performance of the applied double-linear scaling factor. Finally, ideal-gas heat capacities, enthalpies, and absolute entropies were calculated, accounting for all stable conformers using the R1SM model. The resulting thermodynamic data are presented for the first time, since they cannot be determined experimentally and their rigorous calculation requires a complex thermodynamic model.
Collapse
Affiliation(s)
- Vojtěch Štejfa
- Department of Physical Chemistry, University of Chemistry and Technology, Prague, Technická 5, CZ-166 28 Prague 6, Czech Republic
| | - Michal Fulem
- Department of Physical Chemistry, University of Chemistry and Technology, Prague, Technická 5, CZ-166 28 Prague 6, Czech Republic
| | - Květoslav Růžička
- Department of Physical Chemistry, University of Chemistry and Technology, Prague, Technická 5, CZ-166 28 Prague 6, Czech Republic
| |
Collapse
|
28
|
Červinka C, Fulem M. Cohesive properties of the crystalline phases of twenty proteinogenic α-aminoacids from first-principles calculations. Phys Chem Chem Phys 2019; 21:18501-18515. [PMID: 31411212 DOI: 10.1039/c9cp03102b] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Cohesive properties (lattice and cohesive energy of the crystal and corresponding sublimation enthalpy) of the complete set of twenty enantiopure anhydrous proteinogenic amino acids are investigated using first-principles calculations. In contrast to neutral amino acid molecules in the vapor phase, all amino acids form crystals in their zwitterionic form. Therefore, reliable ab initio calculations of the proton transfer energy are an indispensable step of such calculations. Simplifying procedures, designed to rationalize the computational cost of the quasi-harmonic approximation, which proves too demanding if performed fully at the given quantum level of theory, are presented and tested. For this purpose, atomic multipoles (up to the quadrupoles) for the amoeba force field are parametrized for all amino acid zwitterions. While the calculated lattice energies of the amino acids range from 235-458 kJ mol-1 in absolute value, the proton transfer energies typically amount to 100-220 kJ mol-1, which translates to sublimation enthalpies ranging from 117-202 kJ mol-1, appreciably exceeding the sublimation enthalpy values common for nonionic molecular crystals. Critically assessed experimental data on sublimation enthalpies are used as a benchmark for comparison of the data calculated in this work. Cohesive properties of most amino acids calculated in this work, combining the PBE-D3(BJ)/PAW and CCSD(T)-F12/aug-cc-pVDZ levels of theory used for predictions of the lattice energies and of the proton transfer energies, respectively, exhibit a reasonable agreement with the experiment. At the same time, this work contains the first published data on cohesive properties for several enantiopure amino acids.
Collapse
Affiliation(s)
- Ctirad Červinka
- Department of Physical Chemistry, University of Chemistry and Technology, Prague, Technická 5, CZ-166 28 Prague 6, Czech Republic.
| | | |
Collapse
|
29
|
Kim H, Park JY, Choi S. Energy refinement and analysis of structures in the QM9 database via a highly accurate quantum chemical method. Sci Data 2019; 6:109. [PMID: 31270326 PMCID: PMC6610095 DOI: 10.1038/s41597-019-0121-7] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2019] [Accepted: 06/13/2019] [Indexed: 12/12/2022] Open
Abstract
A wide variety of data-driven approaches have been introduced in the field of quantum chemistry. To extend the applicable range and improve the prediction power of those approaches, highly accurate quantum chemical benchmarks that cover extremely large chemical spaces are required. Here, we report ~134 k quantum chemical calculations performed with G4MP2, the fourth generation of the G-n series in which second-order perturbation theory is employed. A single composite method calculation executes several low-level calculations to reproduce the results of high-level ab initio calculations with the aim of saving computational costs. Therefore, our database reports the results of the various methods (e.g., density functional theory, Hartree-Fock, Møller-Plesset perturbation theory, and coupled-cluster theory). Additionally, we examined the structure information of both the QM9 and the revised databases via chemical graph analysis. Our database can be applied to refine and improve the quality of data-driven quantum chemical prediction. Furthermore, we reported the raw outputs of all calculations performed in this work for other potential applications.
Collapse
Affiliation(s)
- Hyungjun Kim
- Department of Chemistry, Incheon National University, 119 Academy-ro, Yeonsu-gu, Incheon, 22012, Republic of Korea
| | - Ji Young Park
- Department of Chemistry, Korea Advanced Institute of Science and Technology (KAIST), 291 Daehak-ro, Yuseong-gu, Daejeon, 34141, Republic of Korea
| | - Sunghwan Choi
- National Institute of Supercomputing and Network, Korea Institute of Science and Technology Information, Daejeon, 34141, Republic of Korea.
| |
Collapse
|
30
|
Stuke A, Todorović M, Rupp M, Kunkel C, Ghosh K, Himanen L, Rinke P. Chemical diversity in molecular orbital energy predictions with kernel ridge regression. J Chem Phys 2019; 150:204121. [DOI: 10.1063/1.5086105] [Citation(s) in RCA: 33] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Affiliation(s)
- Annika Stuke
- Department of Applied Physics, Aalto University, P.O. Box 11100, Aalto FI-00076, Finland
| | - Milica Todorović
- Department of Applied Physics, Aalto University, P.O. Box 11100, Aalto FI-00076, Finland
| | - Matthias Rupp
- Fritz Haber Institute of the Max Planck Society, Faradayweg 4-6, 14195 Berlin, Germany
| | - Christian Kunkel
- Department of Applied Physics, Aalto University, P.O. Box 11100, Aalto FI-00076, Finland
- Chair for Theoretical Chemistry and Catalysis Research Center, Technische Universität München, Lichtenbergstr. 4, 85747 Garching, Germany
| | - Kunal Ghosh
- Department of Applied Physics, Aalto University, P.O. Box 11100, Aalto FI-00076, Finland
- Department of Computer Science, Aalto University, P.O. Box 15400, Aaalto FI-00076, Finland
| | - Lauri Himanen
- Department of Applied Physics, Aalto University, P.O. Box 11100, Aalto FI-00076, Finland
| | - Patrick Rinke
- Department of Applied Physics, Aalto University, P.O. Box 11100, Aalto FI-00076, Finland
- Chair for Theoretical Chemistry and Catalysis Research Center, Technische Universität München, Lichtenbergstr. 4, 85747 Garching, Germany
| |
Collapse
|
31
|
Mucha E, Stuckmann A, Marianski M, Struwe WB, Meijer G, Pagel K. In-depth structural analysis of glycans in the gas phase. Chem Sci 2019; 10:1272-1284. [PMID: 30809341 PMCID: PMC6357860 DOI: 10.1039/c8sc05426f] [Citation(s) in RCA: 52] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2018] [Accepted: 01/04/2019] [Indexed: 12/26/2022] Open
Abstract
Although there have been substantial improvements in glycan analysis over the past decade, the lack of both high-resolution and high-throughput methods hampers progress in glycomics. This perspective article highlights the current developments of liquid chromatography, mass spectrometry, ion-mobility spectrometry and cryogenic IR spectroscopy for glycan analysis and gives a critical insight to their individual strengths and limitations. Moreover, we discuss a novel concept in which ion mobility-mass spectrometry and cryogenic IR spectroscopy is combined in a single instrument such that datasets consisting of m/z, collision cross sections and IR fingerprints can be obtained. This multidimensional data will then be compared to a comprehensive reference library of intact glycans and their fragments to accurately identify unknown glycans on a high-throughput scale with minimal sample requirements. Due to the complementarity of the obtained information, this novel approach is highly diagnostic and also suitable for the identification of larger glycans; however, the workflow and instrumentation is straightforward enough to be implemented into a user-friendly setup.
Collapse
Affiliation(s)
- Eike Mucha
- Fritz Haber Institute of the Max Planck Society , Department of Molecular Physics , Faradayweg 4-6 , 14195 Berlin , Germany .
- Institute of Chemistry and Biochemistry , Freie Universität Berlin , Takustraße 3 , 14195 Berlin , Germany
| | - Alexandra Stuckmann
- Fritz Haber Institute of the Max Planck Society , Department of Molecular Physics , Faradayweg 4-6 , 14195 Berlin , Germany .
- Institute of Chemistry and Biochemistry , Freie Universität Berlin , Takustraße 3 , 14195 Berlin , Germany
| | - Mateusz Marianski
- Fritz Haber Institute of the Max Planck Society , Department of Molecular Physics , Faradayweg 4-6 , 14195 Berlin , Germany .
| | - Weston B Struwe
- Oxford Glycobiology Institute , Department of Biochemistry , University of Oxford , OX1 3QU Oxford , UK
| | - Gerard Meijer
- Fritz Haber Institute of the Max Planck Society , Department of Molecular Physics , Faradayweg 4-6 , 14195 Berlin , Germany .
| | - Kevin Pagel
- Fritz Haber Institute of the Max Planck Society , Department of Molecular Physics , Faradayweg 4-6 , 14195 Berlin , Germany .
- Institute of Chemistry and Biochemistry , Freie Universität Berlin , Takustraße 3 , 14195 Berlin , Germany
| |
Collapse
|
32
|
Klyne J, Dopfer O. Microhydration of protonated 5-hydroxyindole revealed by infrared spectroscopy. Phys Chem Chem Phys 2019; 21:2706-2718. [PMID: 30663737 DOI: 10.1039/c8cp06950f] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Controlled microsolvation of protonated aromatic biomolecules with water is fundamental to understand proton transfer reactions in aqueous environments. We measured infrared photodissociation (IRPD) spectra of mass-selected microhydrates of protonated 5-hydroxyindole (5HIH+-Wn, W = H2O, n = 1-3) in the OH and NH stretch ranges (2700-3800 cm-1), which are sensitive to the spectroscopic characteristics of interior solvation, water network formation, and proton transfer to solvent. Analysis of the IRPD spectra by dispersion-corrected density functional theory calculations (B3LYP-D3/aug-cc-pVTZ) reveals the coexistence of C3- and C4-protonated carbenium ions, 5HIH+(C3) and 5HIH+(C4), as well as the O-protonated oxonium ion, 5HIH+(O). Monohydrated 5HIH+-W clusters are formed by hydrogen-bonding (H-bonding) of the first water to the most acidic functional group, namely, the NH group in the case of 5HIH+(C3), the OH group for 5HIH+(C4), and the OH2 group for 5HIH+(O). The latter benefits from its twofold degeneracy and the outstandingly high binding energy of D0 ∼ 100 kJ mol-1. Larger 5HIH+-W2/3 clusters preferably grow (i) by H-bonding of the second water to the remaining vacant functional group and and/or (ii) by formation of W2 water chains at the respective most acidic functional group. Our IRPD spectra of 5HIH+-Wn do not indicate any proton transfer to the solvent up to n = 3, in line with the proton affinities of 5HI and Wn. Comparison of 5HIH+-Wn to neutral 5HI-W and cationic 5HI+-Wn clusters elucidates the impact of different charge states on the topology of the initial solvation shell. Furthermore, to access the influence of the size of the arene ion and a second functional group, we draw a comparison to microhydration of protonated phenol.
Collapse
Affiliation(s)
- Johanna Klyne
- Institut für Optik und Atomare Physik, Technische Universität Berlin, Hardenbergstr. 36, 10623 Berlin, Germany.
| | | |
Collapse
|
33
|
PEPCONF, a diverse data set of peptide conformational energies. Sci Data 2019; 6:180310. [PMID: 30667382 PMCID: PMC6343515 DOI: 10.1038/sdata.2018.310] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2018] [Accepted: 11/30/2018] [Indexed: 12/13/2022] Open
Abstract
We present an extensive and diverse database of peptide conformational energies. Our database contains five different classes of model geometries: dipeptides, tripeptides, and disulfide-bridged, bioactive, and cyclic peptides. In total, the database consists of 3775 conformational energy data points and 4530 conformer geometries. All the reference energies have been calculated at the LC-ωPBE-XDM/aug-cc-pVTZ level of theory, which is shown to yield conformational energies with an accuracy in the order of tenths of a kcal/mol when compared to complete-basis-set coupled-cluster reference data. The peptide conformational data set (PEPCONF) is presented as a high-quality reference set for the development and benchmarking of molecular-mechanics and semi-empirical electronic structure methods, which are the most commonly used techniques in the modeling of medium to large proteins.
Collapse
|
34
|
Sharapa DI, Genaev A, Cavallo L, Minenkov Y. A Robust and Cost‐Efficient Scheme for Accurate Conformational Energies of Organic Molecules. Chemphyschem 2018; 20:92-102. [DOI: 10.1002/cphc.201801063] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2018] [Indexed: 11/09/2022]
Affiliation(s)
- Dmitry I. Sharapa
- Institute of Catalysis Research and TechnologyKarlsruhe Institute of Technology (KIT) Hermann-von-Helmholtz Platz 1 Eggenstein-Leopoldshafen D-76344 Germany
| | - Alexander Genaev
- Vorozhtsov Novosibirsk Institute of Organic Chemistry Academician Lavrent'ev Ave., 9 Novosibirsk 630090 Russian Federation
| | - Luigi Cavallo
- KAUST Catalysis Center (KCC)King Abdullah University of Science and Technology Thuwal- 23955-6900 Saudi Arabia
| | - Yury Minenkov
- Moscow Institute of Physics and Technology Institutskiy Pereulok 9, Dolgoprudny Moscow Region 141700 Russia
| |
Collapse
|
35
|
Klyne J, Bouchet A, Ishiuchi SI, Fujii M, Dopfer O. Cation-Size-Dependent Conformational Locking of Glutamic Acid by Alkali Ions: Infrared Photodissociation Spectroscopy of Cryogenic Ions. J Phys Chem B 2018; 122:2295-2306. [DOI: 10.1021/acs.jpcb.7b12601] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Affiliation(s)
- Johanna Klyne
- Institut
für Optik und Atomare Physik, Technische Universität Berlin, Hardenbergstr. 36, 10623 Berlin, Germany
| | - Aude Bouchet
- Institut
für Optik und Atomare Physik, Technische Universität Berlin, Hardenbergstr. 36, 10623 Berlin, Germany
- Laboratory
for Chemistry and Life Science, Institute of Innovation Research, Tokyo Institute of Technology, 4259, Nagatsuta-cho, Midori-ku, 226-8503 Yokohama, Japan
| | - Shun-ichi Ishiuchi
- Laboratory
for Chemistry and Life Science, Institute of Innovation Research, Tokyo Institute of Technology, 4259, Nagatsuta-cho, Midori-ku, 226-8503 Yokohama, Japan
| | - Masaaki Fujii
- Laboratory
for Chemistry and Life Science, Institute of Innovation Research, Tokyo Institute of Technology, 4259, Nagatsuta-cho, Midori-ku, 226-8503 Yokohama, Japan
| | - Otto Dopfer
- Institut
für Optik und Atomare Physik, Technische Universität Berlin, Hardenbergstr. 36, 10623 Berlin, Germany
| |
Collapse
|
36
|
Jiang Z, Biczysko M, Moriarty NW. Accurate geometries for “Mountain pass” regions of the Ramachandran plot using quantum chemical calculations. Proteins 2018; 86:273-278. [DOI: 10.1002/prot.25451] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2017] [Revised: 12/16/2017] [Accepted: 01/01/2018] [Indexed: 12/28/2022]
Affiliation(s)
- Zhongming Jiang
- International Centre for Quantum and Molecular Structures, College of Sciences; Shanghai University, 99 Shangda Road; 200444 Shanghai China
| | - Malgorzata Biczysko
- International Centre for Quantum and Molecular Structures, College of Sciences; Shanghai University, 99 Shangda Road; 200444 Shanghai China
| | - Nigel W. Moriarty
- Molecular Biophysics and Integrated Bioimaging Division; Lawrence Berkeley National Laboratory; Berkeley California 94720
| |
Collapse
|
37
|
Klyne J, Bouchet A, Ishiuchi SI, Fujii M, Schneider M, Baldauf C, Dopfer O. Probing chirality recognition of protonated glutamic acid dimers by gas-phase vibrational spectroscopy and first-principles simulations. Phys Chem Chem Phys 2018; 20:28452-28464. [DOI: 10.1039/c8cp05855e] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
We characterize stereospecific aspects of homochiral and heterochiral dimers of glutamic acid by infrared spectroscopy and first-principles molecular dynamics simulations.
Collapse
Affiliation(s)
- Johanna Klyne
- Institut für Optik und Atomare Physik
- Technische Universität Berlin
- 10623 Berlin
- Germany
| | - Aude Bouchet
- Laboratory for Chemistry and Life Science
- Institute of Innovation Research
- Tokyo Institute of Technology
- Yokohama
- Japan
| | - Shun-ichi Ishiuchi
- Laboratory for Chemistry and Life Science
- Institute of Innovation Research
- Tokyo Institute of Technology
- Yokohama
- Japan
| | - Masaaki Fujii
- Laboratory for Chemistry and Life Science
- Institute of Innovation Research
- Tokyo Institute of Technology
- Yokohama
- Japan
| | | | - Carsten Baldauf
- Fritz-Haber-Institut der MPG
- 14195 Berlin
- Germany
- Wilhelm-Ostwald-Institut für Physikalische und Theoretische Chemie
- Universität Leipzig
| | - Otto Dopfer
- Institut für Optik und Atomare Physik
- Technische Universität Berlin
- 10623 Berlin
- Germany
- Tokyo Tech World Research Hub Initiative (WRHI)
| |
Collapse
|
38
|
Bartók AP, De S, Poelking C, Bernstein N, Kermode JR, Csányi G, Ceriotti M. Machine learning unifies the modeling of materials and molecules. SCIENCE ADVANCES 2017; 3:e1701816. [PMID: 29242828 PMCID: PMC5729016 DOI: 10.1126/sciadv.1701816] [Citation(s) in RCA: 318] [Impact Index Per Article: 39.8] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/30/2017] [Accepted: 11/14/2017] [Indexed: 05/22/2023]
Abstract
Determining the stability of molecules and condensed phases is the cornerstone of atomistic modeling, underpinning our understanding of chemical and materials properties and transformations. We show that a machine-learning model, based on a local description of chemical environments and Bayesian statistical learning, provides a unified framework to predict atomic-scale properties. It captures the quantum mechanical effects governing the complex surface reconstructions of silicon, predicts the stability of different classes of molecules with chemical accuracy, and distinguishes active and inactive protein ligands with more than 99% reliability. The universality and the systematic nature of our framework provide new insight into the potential energy surface of materials and molecules.
Collapse
Affiliation(s)
- Albert P. Bartók
- Scientific Computing Department, Science and Technology Facilities Council, Rutherford Appleton Laboratory, Oxfordshire OX11 0QX, UK
| | - Sandip De
- National Center for Computational Design and Discovery of Novel Materials (MARVEL), Lausanne, Switzerland
- Laboratory of Computational Science and Modelling, Institute of Materials, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| | - Carl Poelking
- Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, UK
| | - Noam Bernstein
- Center for Materials Physics and Technology, U.S. Naval Research Laboratory, Washington, DC 20375, USA
| | - James R. Kermode
- Warwick Centre for Predictive Modelling, School of Engineering, University of Warwick, Coventry CV4 7AL, UK
| | - Gábor Csányi
- Engineering Laboratory, University of Cambridge, Cambridge, UK
| | - Michele Ceriotti
- National Center for Computational Design and Discovery of Novel Materials (MARVEL), Lausanne, Switzerland
- Laboratory of Computational Science and Modelling, Institute of Materials, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Corresponding author.
| |
Collapse
|
39
|
Schneider M, Masellis C, Rizzo T, Baldauf C. Kinetically Trapped Liquid-State Conformers of a Sodiated Model Peptide Observed in the Gas Phase. J Phys Chem A 2017; 121:6838-6844. [DOI: 10.1021/acs.jpca.7b06431] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Affiliation(s)
- Markus Schneider
- Theory
Department, Fritz-Haber-Institut der Max-Planck-Gesellschaft, Faradayweg 4-6, D-14195 Berlin, Germany
| | - Chiara Masellis
- Laboratoire
de Chimie Physique Moléculaire, EPFL SB ISIC LCPM, Ecole Polytechnique Fédérale de Lausanne, Station 6, CH-1015 Lausanne, Switzerland
| | - Thomas Rizzo
- Laboratoire
de Chimie Physique Moléculaire, EPFL SB ISIC LCPM, Ecole Polytechnique Fédérale de Lausanne, Station 6, CH-1015 Lausanne, Switzerland
| | - Carsten Baldauf
- Theory
Department, Fritz-Haber-Institut der Max-Planck-Gesellschaft, Faradayweg 4-6, D-14195 Berlin, Germany
| |
Collapse
|
40
|
De S, Musil F, Ingram T, Baldauf C, Ceriotti M. Mapping and classifying molecules from a high-throughput structural database. J Cheminform 2017; 9:6. [PMID: 28203290 PMCID: PMC5289135 DOI: 10.1186/s13321-017-0192-4] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2016] [Accepted: 01/17/2017] [Indexed: 11/10/2022] Open
Abstract
High-throughput computational materials design promises to greatly accelerate the process of discovering new materials and compounds, and of optimizing their properties. The large databases of structures and properties that result from computational searches, as well as the agglomeration of data of heterogeneous provenance leads to considerable challenges when it comes to navigating the database, representing its structure at a glance, understanding structure-property relations, eliminating duplicates and identifying inconsistencies. Here we present a case study, based on a data set of conformers of amino acids and dipeptides, of how machine-learning techniques can help addressing these issues. We will exploit a recently-developed strategy to define a metric between structures, and use it as the basis of both clustering and dimensionality reduction techniques-showing how these can help reveal structure-property relations, identify outliers and inconsistent structures, and rationalise how perturbations (e.g. binding of ions to the molecule) affect the stability of different conformers.
Collapse
Affiliation(s)
- Sandip De
- National Center for Computational Design and Discovery of Novel Materials (MARVEL), Lausanne, Switzerland.,Laboratory of Computational Science and Modelling, Institute of Materials, Ecole Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| | - Felix Musil
- National Center for Computational Design and Discovery of Novel Materials (MARVEL), Lausanne, Switzerland.,Laboratory of Computational Science and Modelling, Institute of Materials, Ecole Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| | - Teresa Ingram
- Theory Department of the Fritz Haber Institute, Faradayweg 4-6, 14195 Berlin-Dahlem, Germany
| | - Carsten Baldauf
- Theory Department of the Fritz Haber Institute, Faradayweg 4-6, 14195 Berlin-Dahlem, Germany
| | - Michele Ceriotti
- National Center for Computational Design and Discovery of Novel Materials (MARVEL), Lausanne, Switzerland.,Laboratory of Computational Science and Modelling, Institute of Materials, Ecole Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| |
Collapse
|
41
|
Marianski M, Supady A, Ingram T, Schneider M, Baldauf C. Assessing the Accuracy of Across-the-Scale Methods for Predicting Carbohydrate Conformational Energies for the Examples of Glucose and α-Maltose. J Chem Theory Comput 2016; 12:6157-6168. [DOI: 10.1021/acs.jctc.6b00876] [Citation(s) in RCA: 73] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Affiliation(s)
- Mateusz Marianski
- Fritz-Haber-Institut der Max-Planck-Gesellschaft, Faradayweg 4-6, D-14195 Berlin, Germany
| | - Adriana Supady
- Fritz-Haber-Institut der Max-Planck-Gesellschaft, Faradayweg 4-6, D-14195 Berlin, Germany
| | - Teresa Ingram
- Fritz-Haber-Institut der Max-Planck-Gesellschaft, Faradayweg 4-6, D-14195 Berlin, Germany
| | - Markus Schneider
- Fritz-Haber-Institut der Max-Planck-Gesellschaft, Faradayweg 4-6, D-14195 Berlin, Germany
| | - Carsten Baldauf
- Fritz-Haber-Institut der Max-Planck-Gesellschaft, Faradayweg 4-6, D-14195 Berlin, Germany
| |
Collapse
|
42
|
Ropo M, Blum V, Baldauf C. Trends for isolated amino acids and dipeptides: Conformation, divalent ion binding, and remarkable similarity of binding to calcium and lead. Sci Rep 2016; 6:35772. [PMID: 27808109 PMCID: PMC5093913 DOI: 10.1038/srep35772] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2016] [Accepted: 10/03/2016] [Indexed: 12/16/2022] Open
Abstract
We derive structural and binding energy trends for twenty amino acids, their dipeptides, and their interactions with the divalent cations Ca2+, Ba2+, Sr2+, Cd2+, Pb2+, and Hg2+. The underlying data set consists of more than 45,000 first-principles predicted conformers with relative energies up to ~4 eV (~400 kJ/mol). We show that only very few distinct backbone structures of isolated amino acids and their dipeptides emerge as lowest-energy conformers. The isolated amino acids predominantly adopt structures that involve an acidic proton shared between the carboxy and amino function. Dipeptides adopt one of two intramolecular-hydrogen bonded conformations C5 or . Upon complexation with a divalent cation, the accessible conformational space shrinks and intramolecular hydrogen bonding is prevented due to strong electrostatic interaction of backbone and side chain functional groups with cations. Clear correlations emerge from the binding energies of the six divalent ions with amino acids and dipeptides. Cd2+ and Hg2+ show the largest binding energies-a potential correlation with their known high acute toxicities. Ca2+ and Pb2+ reveal almost identical binding energies across the entire series of amino acids and dipeptides. This observation validates past indications that ion-mimicry of calcium and lead should play an important role in a toxicological context.
Collapse
Affiliation(s)
- M. Ropo
- Fritz-Haber-Institut der Max-Planck-Gesellschaft, Faradayweg 4-6, D-14195 Berlin, Germany
- Department of Physics, Tampere University of Technology, Finland
- COMP, Department of Applied Physics, Aalto University, Finland
| | - V. Blum
- Fritz-Haber-Institut der Max-Planck-Gesellschaft, Faradayweg 4-6, D-14195 Berlin, Germany
- Department of Mechanical Engineering and Materials Science, Duke University, Durham, NC, USA
| | - C. Baldauf
- Fritz-Haber-Institut der Max-Planck-Gesellschaft, Faradayweg 4-6, D-14195 Berlin, Germany
| |
Collapse
|
43
|
De S, Bartók AP, Csányi G, Ceriotti M. Comparing molecules and solids across structural and alchemical space. Phys Chem Chem Phys 2016; 18:13754-69. [DOI: 10.1039/c6cp00415f] [Citation(s) in RCA: 368] [Impact Index Per Article: 40.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
A general procedure to compare molecules and materials powers insightful representations of energy landscapes and precise machine-learning predictions of properties.
Collapse
Affiliation(s)
- Sandip De
- National Center for Computational Design and Discovery of Novel Materials (MARVEL)
- Switzerland
- Laboratory of Computational Science and Modelling
- Institute of Materials
- Ecole Polytechnique Fédérale de Lausanne
| | - Albert P. Bartók
- Engineering Laboratory
- University of Cambridge
- Cambridge CB2 1PZ
- UK
| | - Gábor Csányi
- Engineering Laboratory
- University of Cambridge
- Cambridge CB2 1PZ
- UK
| | - Michele Ceriotti
- National Center for Computational Design and Discovery of Novel Materials (MARVEL)
- Switzerland
- Laboratory of Computational Science and Modelling
- Institute of Materials
- Ecole Polytechnique Fédérale de Lausanne
| |
Collapse
|