51
|
Secor M, Soudackov AV, Hammes-Schiffer S. Artificial Neural Networks as Mappings between Proton Potentials, Wave Functions, Densities, and Energy Levels. J Phys Chem Lett 2021; 12:2206-2212. [PMID: 33630595 PMCID: PMC8021271 DOI: 10.1021/acs.jpclett.1c00229] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/17/2023]
Abstract
Artificial neural networks (ANNs) have become important in quantum chemistry. Herein, applications to nuclear quantum effects, such as zero-point energy, vibrationally excited states, and hydrogen tunneling, are explored. ANNs are used to solve the time-independent Schrödinger equation for single- and double-well potentials representing hydrogen-bonded molecular systems capable of proton transfer. ANN mappings are trained to predict the lowest five proton vibrational energies, wave functions, and densities from the proton potentials and to predict the excited state proton vibrational energies and densities from the proton ground state density. For the inverse problem, ANN mappings are trained to predict the proton potential from the proton vibrational energy levels or the proton ground state density. This latter mapping is theoretically justified by the first Hohenberg-Kohn theorem establishing a one-to-one correspondence between the external potential and the ground state density. ANNs for two- and three-dimensional systems are also presented to illustrate the straightforward extension to higher dimensions.
Collapse
Affiliation(s)
- Maxim Secor
- Department of Chemistry, Yale University, 225 Prospect Street, New Haven, Connecticut 06520
| | - Alexander V. Soudackov
- Department of Chemistry, Yale University, 225 Prospect Street, New Haven, Connecticut 06520
| | - Sharon Hammes-Schiffer
- Department of Chemistry, Yale University, 225 Prospect Street, New Haven, Connecticut 06520
| |
Collapse
|
52
|
Vanova V, Mitrevska K, Milosavljevic V, Hynek D, Richtera L, Adam V. Peptide-based electrochemical biosensors utilized for protein detection. Biosens Bioelectron 2021; 180:113087. [PMID: 33662844 DOI: 10.1016/j.bios.2021.113087] [Citation(s) in RCA: 49] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2020] [Revised: 02/04/2021] [Accepted: 02/10/2021] [Indexed: 02/07/2023]
Abstract
Proteins are generally detected as biomarkers for tracing or determining various disorders in organisms. Biomarker proteins can be tracked in samples with various origins and in different concentrations, revealing whether an organism is in a healthy or unhealthy state. In regard to detection, electrochemical biosensors are a potential fusion of electronics, chemistry, and biology, allowing for fast and early point-of-care detection from a biological sample with the advantages of high sensitivity, simple construction, and easy operation. Peptides present a promising approach as a biorecognition element when connected with electrochemical biosensors. The benefits of short peptides lie mainly in their good stability and selective affinity to a target analyte. Therefore, peptide-based electrochemical biosensors (PBEBs) represent an alternative approach for the detection of different protein biomarkers. This review provides a summary of the past decade of recently proposed PBEBs designed for protein detection, dividing them according to different protein types: (i) enzyme detection, including proteases and kinases; (ii) antibody detection; and (iii) other protein detection. According to these protein types, different sensing mechanisms are discussed, such as the peptide cleavage by a proteases, phosphorylation by kinases, presence of antibodies, and exploiting of affinities; furthermore, measurements are obtained by different electrochemical methods. A discussion and comparison of various constructions, modifications, immobilization strategies and different sensing techniques in terms of high sensitivity, selectivity, repeatability, and potential for practical application are presented.
Collapse
Affiliation(s)
- Veronika Vanova
- Department of Chemistry and Biochemistry, Mendel University in Brno, Zemedelska 1, 613 00, Brno, Czech Republic
| | - Katerina Mitrevska
- Department of Chemistry and Biochemistry, Mendel University in Brno, Zemedelska 1, 613 00, Brno, Czech Republic
| | - Vedran Milosavljevic
- Department of Chemistry and Biochemistry, Mendel University in Brno, Zemedelska 1, 613 00, Brno, Czech Republic; Central European Institute of Technology, Brno University of Technology, Purkynova 123, 61 200, Brno, Czech Republic
| | - David Hynek
- Department of Chemistry and Biochemistry, Mendel University in Brno, Zemedelska 1, 613 00, Brno, Czech Republic; Central European Institute of Technology, Brno University of Technology, Purkynova 123, 61 200, Brno, Czech Republic
| | - Lukas Richtera
- Department of Chemistry and Biochemistry, Mendel University in Brno, Zemedelska 1, 613 00, Brno, Czech Republic; Central European Institute of Technology, Brno University of Technology, Purkynova 123, 61 200, Brno, Czech Republic
| | - Vojtech Adam
- Department of Chemistry and Biochemistry, Mendel University in Brno, Zemedelska 1, 613 00, Brno, Czech Republic; Central European Institute of Technology, Brno University of Technology, Purkynova 123, 61 200, Brno, Czech Republic.
| |
Collapse
|
53
|
Wieduwilt EK, Boisson JC, Terraneo G, Hénon E, Genoni A. A Step toward the Quantification of Noncovalent Interactions in Large Biological Systems: The Independent Gradient Model-Extremely Localized Molecular Orbital Approach. J Chem Inf Model 2021; 61:795-809. [PMID: 33444021 DOI: 10.1021/acs.jcim.0c01188] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
The independent gradient model (IGM) is a recent electron density-based computational method that enables to detect and quantify covalent and noncovalent interactions. When applied to large systems, the original version of the technique still relies on promolecular electron densities given by the sum of spherically averaged atomic electron distributions, which leads to approximate evaluations of the inter- and intramolecular interactions occurring in systems of biological interest. To overcome this drawback and perform IGM analyses based on quantum mechanically rigorous electron densities also for macromolecular systems, we coupled the IGM approach with the recently constructed libraries of extremely localized molecular orbitals (ELMOs) that allow fast and reliable reconstructions of polypeptide and protein electron densities. The validation tests performed on small polypeptides and peptide dimers have shown that the novel IGM-ELMO strategy provides results that are systematically closer to the fully quantum mechanical ones and outperforms the IGM method based on the crude promolecular approximation, but still keeping a quite low computational cost. The results of the test calculations carried out on proteins have also confirmed the trends observed for the IGM analyses conducted on small systems. This makes us envisage the future application of the novel IGM-ELMO approach to unravel complicated noncovalent interaction networks (e.g., in protein-protein contacts) or to rationally design new drugs through molecular docking calculations and virtual high-throughput screenings.
Collapse
Affiliation(s)
- Erna K Wieduwilt
- Université de Lorraine & CNRS, Laboratoire de Physique et Chimie Théoriques, UMR CNRS 7019, 1 Boulevard Arago, Metz F-57078, France
| | - Jean-Charles Boisson
- CReSTIC EA 3804, Université de Reims Champagne-Ardenne, Moulin de la Housse, Reims Cedex 02 BP39, F-51687, France
| | - Giancarlo Terraneo
- Laboratory of Supramolecular and Bio-Nanomaterials (SupraBioNanoLab), Department of Chemistry, Materials and Chemical Engineering "Giulio Natta", Politecnico di Milano, Via L. Mancinelli 7, Milan I-20131, Italy
| | - Eric Hénon
- Institut de Chimie Moléculaire de Reims UMR CNRS 7312, Université de Reims Champagne-Ardenne, Moulin de la Housse, Reims Cedex 02 BP39, F-51687, France
| | - Alessandro Genoni
- Université de Lorraine & CNRS, Laboratoire de Physique et Chimie Théoriques, UMR CNRS 7019, 1 Boulevard Arago, Metz F-57078, France
| |
Collapse
|
54
|
Pure non-local machine-learned density functional theory for electron correlation. Nat Commun 2021; 12:344. [PMID: 33436595 PMCID: PMC7804195 DOI: 10.1038/s41467-020-20471-y] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2020] [Accepted: 12/02/2020] [Indexed: 01/02/2023] Open
Abstract
Density-functional theory (DFT) is a rigorous and (in principle) exact framework for the description of the ground state properties of atoms, molecules and solids based on their electron density. While computationally efficient density-functional approximations (DFAs) have become essential tools in computational chemistry, their (semi-)local treatment of electron correlation has a number of well-known pathologies, e.g. related to electron self-interaction. Here, we present a type of machine-learning (ML) based DFA (termed Kernel Density Functional Approximation, KDFA) that is pure, non-local and transferable, and can be efficiently trained with fully quantitative reference methods. The functionals retain the mean-field computational cost of common DFAs and are shown to be applicable to non-covalent, ionic and covalent interactions, as well as across different system sizes. We demonstrate their remarkable possibilities by computing the free energy surface for the protonated water dimer at hitherto unfeasible gold-standard coupled cluster quality on a single commodity workstation. Semilocal density functionals, while computationally efficient, do not account for non-local correlation. Here, the authors propose a machine-learning approach to DFT that leads to non-local and transferable functionals applicable to non-covalent, ionic and covalent interactions across system of different sizes.
Collapse
|
55
|
Fabrizio A, Briling KR, Girardier DD, Corminboeuf C. Learning on-top: Regressing the on-top pair density for real-space visualization of electron correlation. J Chem Phys 2020; 153:204111. [DOI: 10.1063/5.0033326] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open
Affiliation(s)
- Alberto Fabrizio
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
- National Centre for Computational Design and Discovery of Novel Materials (MARVEL), École Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
| | - Ksenia R. Briling
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
| | - David D. Girardier
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
| | - Clemence Corminboeuf
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
- National Centre for Computational Design and Discovery of Novel Materials (MARVEL), École Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
| |
Collapse
|
56
|
Ramírez-Palma DI, Cortés-Guzmán F. From the Linnett-Gillespie model to the polarization of the spin valence shells of metals in complexes. Phys Chem Chem Phys 2020; 22:24201-24212. [PMID: 32851390 DOI: 10.1039/d0cp02064h] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
In this paper, we present a novel approach to track the origin of the metal complex structure from the topology of the α and β spin densities as an extension of the Linnett-Gillespie model. Usually, the theories that explain the metal-ligand interactions consider the disposition and the relative energies of the empty or occupied set of d orbitals, ignoring the spin contribution explicitly. Our quantum topological approach considers the spatial distribution of the α and β spin valence shells, and the energy interaction between them. We used the properties of the atomic graph, a topological object that summarises the charge concentrations and depletions on the valence shell of an atom in a molecule, and the interacting quantum atoms (IQA) energy partition scheme. Unlike the Linnett-Gillespie model, which is based on electron-electron repulsion, our approach states that the ligands provoke a redistribution of the electron density to maximize the nuclear-electron interactions in each spin valence shell to bypass the concentration of electron-electron interactions, resulting in a polarization pattern which determines the position of the ligands.
Collapse
Affiliation(s)
- David I Ramírez-Palma
- Universidad Nacional Autónoma de México, Instituto de Química, Ciudad Universitaria, Ciudad de Mexico, 04510, Mexico.
| | | |
Collapse
|
57
|
Del Rio BG, Kuenneth C, Tran HD, Ramprasad R. An Efficient Deep Learning Scheme To Predict the Electronic Structure of Materials and Molecules: The Example of Graphene-Derived Allotropes. J Phys Chem A 2020; 124:9496-9502. [PMID: 33138367 DOI: 10.1021/acs.jpca.0c07458] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Computations based on density functional theory (DFT) are transforming various aspects of materials research and discovery. However, the effort required to solve the central equation of DFT, namely the Kohn-Sham equation, which remains a major obstacle for studying large systems with hundreds of atoms in a practical amount of time with routine computational resources. Here, we propose a deep learning architecture that systematically learns the input-output behavior of the Kohn-Sham equation and predicts the electronic density of states, a primary output of DFT calculations, with unprecedented speed and chemical accuracy. The algorithm also adapts and progressively improves in predictive power and versatility as it is exposed to new diverse atomic configurations. We demonstrate this capability for a diverse set of carbon allotropes spanning a large configurational and phase space. The electronic density of states, along with the electronic charge density, may be used downstream to predict a variety of materials properties, bypassing the Kohn-Sham equation, leading to an ultrafast and high-fidelity DFT emulator.
Collapse
Affiliation(s)
- Beatriz G Del Rio
- School of Materials Science and Engineering, Georgia Institute of Technology, 771 Ferst Drive NW, Atlanta, Georgia 30332, United States
| | - Christopher Kuenneth
- School of Materials Science and Engineering, Georgia Institute of Technology, 771 Ferst Drive NW, Atlanta, Georgia 30332, United States
| | - Huan Doan Tran
- School of Materials Science and Engineering, Georgia Institute of Technology, 771 Ferst Drive NW, Atlanta, Georgia 30332, United States
| | - Rampi Ramprasad
- School of Materials Science and Engineering, Georgia Institute of Technology, 771 Ferst Drive NW, Atlanta, Georgia 30332, United States
| |
Collapse
|
58
|
Bogojeski M, Vogt-Maranto L, Tuckerman ME, Müller KR, Burke K. Quantum chemical accuracy from density functional approximations via machine learning. Nat Commun 2020; 11:5223. [PMID: 33067479 PMCID: PMC7567867 DOI: 10.1038/s41467-020-19093-1] [Citation(s) in RCA: 146] [Impact Index Per Article: 29.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2020] [Accepted: 09/24/2020] [Indexed: 12/21/2022] Open
Abstract
Kohn-Sham density functional theory (DFT) is a standard tool in most branches of chemistry, but accuracies for many molecules are limited to 2-3 kcal ⋅ mol-1 with presently-available functionals. Ab initio methods, such as coupled-cluster, routinely produce much higher accuracy, but computational costs limit their application to small molecules. In this paper, we leverage machine learning to calculate coupled-cluster energies from DFT densities, reaching quantum chemical accuracy (errors below 1 kcal ⋅ mol-1) on test data. Moreover, density-based Δ-learning (learning only the correction to a standard DFT calculation, termed Δ-DFT ) significantly reduces the amount of training data required, particularly when molecular symmetries are included. The robustness of Δ-DFT is highlighted by correcting "on the fly" DFT-based molecular dynamics (MD) simulations of resorcinol (C6H4(OH)2) to obtain MD trajectories with coupled-cluster accuracy. We conclude, therefore, that Δ-DFT facilitates running gas-phase MD simulations with quantum chemical accuracy, even for strained geometries and conformer changes where standard DFT fails.
Collapse
Affiliation(s)
- Mihail Bogojeski
- Machine Learning Group, Technische Universität Berlin, Marchstr. 23, 10587, Berlin, Germany
| | | | - Mark E Tuckerman
- Department of Chemistry, New York University, New York, NY, 10003, USA.
- Courant Institute of Mathematical Science, New York University, New York, NY, 10012, USA.
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai, 3663 Zhongshan Road North, Shanghai, 200062, China.
| | - Klaus-Robert Müller
- Machine Learning Group, Technische Universität Berlin, Marchstr. 23, 10587, Berlin, Germany.
- Department of Artificial Intelligence, Korea University, Anam-dong, Seongbuk-gu, Seoul, 02841, Korea.
- Max-Planck-Institut für Informatik, Stuhlsatzenhausweg, 66123, Saarbrücken, Germany.
| | - Kieron Burke
- Department of Physics and Astronomy, University of California, Irvine, CA, 92697, USA.
- Department of Chemistry, University of California, Irvine, CA, 92697, USA.
| |
Collapse
|
59
|
Casey AD, Son SF, Bilionis I, Barnes BC. Prediction of Energetic Material Properties from Electronic Structure Using 3D Convolutional Neural Networks. J Chem Inf Model 2020; 60:4457-4473. [PMID: 33054184 DOI: 10.1021/acs.jcim.0c00259] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Affiliation(s)
- Alex D. Casey
- School of Mechanical Engineering, Purdue University, West Lafayette, Indiana 47907, United States
| | - Steven F. Son
- School of Mechanical Engineering, Purdue University, West Lafayette, Indiana 47907, United States
| | - Ilias Bilionis
- School of Mechanical Engineering, Purdue University, West Lafayette, Indiana 47907, United States
| | - Brian C. Barnes
- CCDC Army Research Laboratory, Aberdeen Proving Ground, Maryland 21005, United States
| |
Collapse
|
60
|
Bahlke MP, Mogos N, Proppe J, Herrmann C. Exchange Spin Coupling from Gaussian Process Regression. J Phys Chem A 2020; 124:8708-8723. [DOI: 10.1021/acs.jpca.0c05983] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Affiliation(s)
- Marc Philipp Bahlke
- Department of Chemistry, University of Hamburg, Martin-Luther-King-Platz 6, 20146 Hamburg, Germany
| | - Natnael Mogos
- Department of Chemistry, University of Hamburg, Martin-Luther-King-Platz 6, 20146 Hamburg, Germany
| | - Jonny Proppe
- Institute of Physical Chemistry, Georg-August University, Tammannstr. 6, 37077 Göttingen, Germany
| | - Carmen Herrmann
- Department of Chemistry, University of Hamburg, Martin-Luther-King-Platz 6, 20146 Hamburg, Germany
| |
Collapse
|
61
|
Meyer R, Weichselbaum M, Hauser AW. Machine Learning Approaches toward Orbital-free Density Functional Theory: Simultaneous Training on the Kinetic Energy Density Functional and Its Functional Derivative. J Chem Theory Comput 2020; 16:5685-5694. [PMID: 32786898 PMCID: PMC7482319 DOI: 10.1021/acs.jctc.0c00580] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
![]()
Orbital-free
approaches might offer a way to boost the applicability
of density functional theory by orders of magnitude in system size.
An important ingredient for this endeavor is the kinetic energy density
functional. Snyder et al. [2012, 108, 25300223004593] presented a machine
learning approximation for this functional achieving chemical accuracy
on a one-dimensional model system. However, a poor performance with
respect to the functional derivative, a crucial element in iterative
energy minimization procedures, enforced the application of a computationally
expensive projection method. In this work we circumvent this issue
by including the functional derivative into the training of various
machine learning models. Besides kernel ridge regression, the original
method of choice, we also test the performance of convolutional neural
network techniques borrowed from the field of image recognition.
Collapse
Affiliation(s)
- Ralf Meyer
- Institute of Experimental Physics, Graz University of Technology, Petersgasse 16, 8010 Graz, Austria
| | - Manuel Weichselbaum
- Institute of Experimental Physics, Graz University of Technology, Petersgasse 16, 8010 Graz, Austria
| | - Andreas W Hauser
- Institute of Experimental Physics, Graz University of Technology, Petersgasse 16, 8010 Graz, Austria
| |
Collapse
|
62
|
Dick S, Fernandez-Serra M. Machine learning accurate exchange and correlation functionals of the electronic density. Nat Commun 2020; 11:3509. [PMID: 32665540 PMCID: PMC7360771 DOI: 10.1038/s41467-020-17265-7] [Citation(s) in RCA: 77] [Impact Index Per Article: 15.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2019] [Accepted: 06/17/2020] [Indexed: 11/23/2022] Open
Abstract
Density functional theory (DFT) is the standard formalism to study the electronic structure of matter at the atomic scale. In Kohn–Sham DFT simulations, the balance between accuracy and computational cost depends on the choice of exchange and correlation functional, which only exists in approximate form. Here, we propose a framework to create density functionals using supervised machine learning, termed NeuralXC. These machine-learned functionals are designed to lift the accuracy of baseline functionals towards that provided by more accurate methods while maintaining their efficiency. We show that the functionals learn a meaningful representation of the physical information contained in the training data, making them transferable across systems. A NeuralXC functional optimized for water outperforms other methods characterizing bond breaking and excels when comparing against experimental results. This work demonstrates that NeuralXC is a first step towards the design of a universal, highly accurate functional valid for both molecules and solids. Increasing the non-locality of the exchange and correlation functional in DFT theory comes at a steep increase in computational cost. Here, the authors develop NeuralXC, a supervised machine learning approach to generate density functionals close to coupled-cluster level of accuracy yet computationally efficient.
Collapse
Affiliation(s)
- Sebastian Dick
- Physics and Astronomy Department, Stony Brook University, Stony Brook, NY, 11794-3800, USA. .,Institute for Advanced Computational Science, Stony Brook University, Stony Brook, Stony Brook, NY, 11794-3800, USA.
| | - Marivi Fernandez-Serra
- Physics and Astronomy Department, Stony Brook University, Stony Brook, NY, 11794-3800, USA. .,Institute for Advanced Computational Science, Stony Brook University, Stony Brook, Stony Brook, NY, 11794-3800, USA.
| |
Collapse
|
63
|
von Lilienfeld OA, Müller KR, Tkatchenko A. Exploring chemical compound space with quantum-based machine learning. Nat Rev Chem 2020; 4:347-358. [PMID: 37127950 DOI: 10.1038/s41570-020-0189-9] [Citation(s) in RCA: 144] [Impact Index Per Article: 28.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/23/2020] [Indexed: 12/16/2022]
Abstract
Rational design of compounds with specific properties requires understanding and fast evaluation of molecular properties throughout chemical compound space - the huge set of all potentially stable molecules. Recent advances in combining quantum-mechanical calculations with machine learning provide powerful tools for exploring wide swathes of chemical compound space. We present our perspective on this exciting and quickly developing field by discussing key advances in the development and applications of quantum-mechanics-based machine-learning methods to diverse compounds and properties, and outlining the challenges ahead. We argue that significant progress in the exploration and understanding of chemical compound space can be made through a systematic combination of rigorous physical theories, comprehensive synthetic data sets of microscopic and macroscopic properties, and modern machine-learning methods that account for physical and chemical knowledge.
Collapse
|
64
|
Hoffmann R, Malrieu JP. Simulation vs. Understanding: A Tension, in Quantum Chemistry and Beyond. Part B. The March of Simulation, for Better or Worse. Angew Chem Int Ed Engl 2020; 59:13156-13178. [PMID: 31675462 DOI: 10.1002/anie.201910283] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2019] [Indexed: 11/08/2022]
Abstract
In the second part of this Essay, we leave philosophy, and begin by describing Roald's being trashed by simulation. This leads us to a general sketch of artificial intelligence (AI), Searle's Chinese room, and Strevens' account of what a go-playing program knows. Back to our terrain-we ask "Quantum Chemistry, † ca. 2020?" Then we move to examples of Big Data, machine learning and neural networks in action, first in chemistry and then affecting social matters, trivial to scary. We argue that moral decisions are hardly to be left to a computer. And that posited causes, even if recognized as provisional, represent a much deeper level of understanding than correlations. At this point, we try to pull the reader up, giving voice to the opposing view of an optimistic, limitless future. But we don't do justice to that view-how could we, older mammals on the way to extinction that we are? We try. But then we return to fuss, questioning the ascetic dimension of scientists, their romance with black boxes. And argue for a science of many tongues.
Collapse
Affiliation(s)
- Roald Hoffmann
- Dept. of Chemistry and Chemical Biology, Cornell University, Ithaca, NY, 14850, USA
| | - Jean-Paul Malrieu
- Laboratoire de Chimie et Physique Quantiques, Université de Toulouse 3, 118 route de Narbonne, 31062, Toulouse, France
| |
Collapse
|
65
|
Hoffmann R, Malrieu J. Simulation vs. Understanding: A Tension, in Quantum Chemistry and Beyond. Part B. The March of Simulation, for Better or Worse. Angew Chem Int Ed Engl 2020. [DOI: 10.1002/ange.201910283] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Affiliation(s)
- Roald Hoffmann
- Dept. of Chemistry and Chemical Biology Cornell University Ithaca NY 14850 USA
| | - Jean‐Paul Malrieu
- Laboratoire de Chimie et Physique Quantiques Université de Toulouse 3 118 route de Narbonne 31062 Toulouse France
| |
Collapse
|
66
|
Pihlajamäki A, Hämäläinen J, Linja J, Nieminen P, Malola S, Kärkkäinen T, Häkkinen H. Monte Carlo Simulations of Au 38(SCH 3) 24 Nanocluster Using Distance-Based Machine Learning Methods. J Phys Chem A 2020; 124:4827-4836. [PMID: 32412747 DOI: 10.1021/acs.jpca.0c01512] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
We present an implementation of distance-based machine learning (ML) methods to create a realistic atomistic interaction potential to be used in Monte Carlo simulations of thermal dynamics of thiolate (SR) protected gold nanoclusters. The ML potential is trained for Au38(SR)24 by using previously published, density functional theory (DFT) based, molecular dynamics (MD) simulation data on two experimentally characterized structural isomers of the cluster and validated against independent DFT MD simulations. This method opens a door to efficient probing of the configuration space for further investigations of thermal-dependent electronic and optical properties of Au38(SR)24. Our ML implementation strategy allows for generalization and accuracy control of distance-based ML models for complex nanostructures having several chemical elements and interactions of varying strength.
Collapse
Affiliation(s)
- Antti Pihlajamäki
- Department of Physics, Nanoscience Center, University of Jyväskylä, FI-40014 Jyväskylä, Finland
| | - Joonas Hämäläinen
- Faculty of Information Technology, University of Jyväskylä, FI-40014 Jyväskylä, Finland
| | - Joakim Linja
- Faculty of Information Technology, University of Jyväskylä, FI-40014 Jyväskylä, Finland
| | - Paavo Nieminen
- Faculty of Information Technology, University of Jyväskylä, FI-40014 Jyväskylä, Finland
| | - Sami Malola
- Department of Physics, Nanoscience Center, University of Jyväskylä, FI-40014 Jyväskylä, Finland
| | - Tommi Kärkkäinen
- Faculty of Information Technology, University of Jyväskylä, FI-40014 Jyväskylä, Finland
| | - Hannu Häkkinen
- Department of Physics, Nanoscience Center, University of Jyväskylä, FI-40014 Jyväskylä, Finland.,Department of Chemistry, Nanoscience Center, University of Jyväskylä, FI-40014 Jyväskylä, Finland
| |
Collapse
|
67
|
Cendagorta JR, Tolpin J, Schneider E, Topper RQ, Tuckerman ME. Comparison of the Performance of Machine Learning Models in Representing High-Dimensional Free Energy Surfaces and Generating Observables. J Phys Chem B 2020; 124:3647-3660. [PMID: 32275148 DOI: 10.1021/acs.jpcb.0c01218] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
Free energy surfaces of chemical and physical systems are often generated using a popular class of enhanced sampling methods that target a set of collective variables (CVs) chosen to distinguish the characteristic features of these surfaces. While some of these approaches are typically limited to low (∼1-3)-dimensional CV subspaces, methods such as driven adiabatic free-energy dynamics/temperature-accelerated molecular dynamics have been shown to be capable of generating free energy surfaces of quite high dimension by sampling the associated marginal probability distribution via full sweeps over the CV landscape. These approaches repeatedly visit conformational basins, producing a scattering of points within the basins on each visit. Consequently, they are particularly amenable to synergistic combination with regression machine learning methods for filling in the surfaces between the sampled points and for providing a compact and continuous (or semicontinuous) representation of the surfaces that can be easily stored and used for further computation of observable properties. Given the central role of machine learning techniques in this combined approach, it is timely to provide a detailed comparison of the performance of different machine learning strategies and models, including neural networks, kernel ridge regression, support vector machines, and weighted neighbor schemes, for their ability to learn these high-dimensional surfaces as a function of the amount of sampled training data and, once trained, to subsequently generate accurate ensemble averages corresponding to observable properties of the systems. In this article, we perform such a comparison on a set of oligopeptides, in both gas and aqueous phases, corresponding to CV spaces of 2-10 dimensions and assess their ability to provide a global representation of the free energy surfaces and to generate accurate ensemble averages.
Collapse
Affiliation(s)
- Joseph R Cendagorta
- Department of Chemistry, New York University, New York, New York 10003, United States
| | - Jocelyn Tolpin
- Department of Statistics, Harvard University, Cambridge, Massachusetts 02138, United States
| | - Elia Schneider
- Department of Chemistry, New York University, New York, New York 10003, United States
| | - Robert Q Topper
- Department of Chemistry, The Cooper Union for the Advancement of Science and Art, New York, New York 10003, United States
| | - Mark E Tuckerman
- Department of Chemistry, New York University, New York, New York 10003, United States.,Courant Institute of Mathematical Sciences, New York University, New York, New York 10012, United States.,NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai 200062, China
| |
Collapse
|
68
|
Fabrizio A, Petraglia R, Corminboeuf C. Balancing Density Functional Theory Interaction Energies in Charged Dimers Precursors to Organic Semiconductors. J Chem Theory Comput 2020; 16:3530-3542. [DOI: 10.1021/acs.jctc.9b01193] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Affiliation(s)
- Alberto Fabrizio
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
- National Centre for Computational Design and Discovery of Novel Materials (MARVEL), École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Riccardo Petraglia
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Clemence Corminboeuf
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
- National Centre for Computational Design and Discovery of Novel Materials (MARVEL), École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| |
Collapse
|
69
|
Fabrizio A, Meyer B, Corminboeuf C. Machine learning models of the energy curvature vs particle number for optimal tuning of long-range corrected functionals. J Chem Phys 2020; 152:154103. [DOI: 10.1063/5.0005039] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023] Open
Affiliation(s)
- Alberto Fabrizio
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
- National Centre for Computational Design and Discovery of Novel Materials (MARVEL), École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Benjamin Meyer
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
- National Centre for Computational Design and Discovery of Novel Materials (MARVEL), École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Clemence Corminboeuf
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
- National Centre for Computational Design and Discovery of Novel Materials (MARVEL), École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| |
Collapse
|
70
|
Abstract
As the quantum chemistry (QC) community embraces machine learning (ML), the number of new methods and applications based on the combination of QC and ML is surging. In this Perspective, a view of the current state of affairs in this new and exciting research field is offered, challenges of using machine learning in quantum chemistry applications are described, and potential future developments are outlined. Specifically, examples of how machine learning is used to improve the accuracy and accelerate quantum chemical research are shown. Generalization and classification of existing techniques are provided to ease the navigation in the sea of literature and to guide researchers entering the field. The emphasis of this Perspective is on supervised machine learning.
Collapse
Affiliation(s)
- Pavlo O Dral
- State Key Laboratory of Physical Chemistry of Solid Surfaces, Fujian Provincial Key Laboratory of Theoretical and Computational Chemistry, Department of Chemistry, and College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China
| |
Collapse
|
71
|
Kamal D, Chandrasekaran A, Batra R, Ramprasad R. A charge density prediction model for hydrocarbons using deep neural networks. MACHINE LEARNING-SCIENCE AND TECHNOLOGY 2020. [DOI: 10.1088/2632-2153/ab5929] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
|
72
|
Sethio D, Martins JBL, Lawson Daku LM, Hagemann H, Kraka E. Modified Density Functional Dispersion Correction for Inorganic Layered MFX Compounds (M = Ca, Sr, Ba, Pb and X = Cl, Br, I). J Phys Chem A 2020; 124:1619-1633. [PMID: 31999454 DOI: 10.1021/acs.jpca.9b10357] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
MFX (M = Ca, Ba, Sr, Pb and X = Cl, Br, I) compounds have received considerable attention due to their technological application as X-ray detectors, pressure sensors, and optical data storage materials, when doped with rare-earth ions. MFX compounds belong to the class of layered materials with a tetragonal Matlockite crystal structure, characterized by weakly stacked double-halide layers along the crystallographic c-axis. These layers predominantly determine phase transitions, elastic, and mechanical properties. However, the correct description of the lattice parameter c is a challenge for most standard DFT functionals, which tend to overestimate the lattice parameter c. Because of the weak interactions between the halide layers, dispersion-corrected functionals seem to be a better choice. We investigated 11 different inorganic layered MFX compounds for which experimental data are available, with standard and dispersion-corrected functionals to assess their performance in reproducing the lattice parameter c, structural, and vibrational properties of the MFX compounds. Our results revealed that these functionals do not describe the weak interactions between the halide layers in a balanced way. Therefore, we modified Grimme's popular DFT-D2 dispersion correction scheme in two different ways by (i) replacing the dispersion coefficients and van der Waals radii with those of noble gas atoms or (ii) increasing the van der Waals radii of the MFX atoms up to 40%. Comparison with the available experimental data revealed that the latter approach applied to the PBE (Perdew-Burke-Ernzerhof)-D2 functional with 30% increased van der Waals radii, which we coined PBE-D2* (Srvdw 1.30) is best suited to fine-tune the description of the weak interlayer interactions in MFX compounds, thus significantly improving the description of their structural, vibrational, and mechanical properties. Work is in progress applying this new, computationally inexpensive scheme to other inorganic layered compounds and periodic systems with weakly stacked layers.
Collapse
Affiliation(s)
- Daniel Sethio
- Computational and Theoretical Chemistry Group (CATCO), Department of Chemistry , Southern Methodist University , 3215 Daniel Avenue , Dallas , Texas 75275-0314 , United States
| | - João B L Martins
- Institute of Chemistry , University of Brasilia , Brasilia , DF 70910-900 , Brazil
| | - Latévi Max Lawson Daku
- Department of Physical Chemistry , University of Geneva , 30 Quai Ernest-Ansermet , CH-1211 Geneva 4 , Switzerland
| | - Hans Hagemann
- Department of Physical Chemistry , University of Geneva , 30 Quai Ernest-Ansermet , CH-1211 Geneva 4 , Switzerland
| | - Elfi Kraka
- Computational and Theoretical Chemistry Group (CATCO), Department of Chemistry , Southern Methodist University , 3215 Daniel Avenue , Dallas , Texas 75275-0314 , United States
| |
Collapse
|
73
|
Metcalf DP, Koutsoukas A, Spronk SA, Claus BL, Loughney DA, Johnson SR, Cheney DL, Sherrill CD. Approaches for machine learning intermolecular interaction energies and application to energy components from symmetry adapted perturbation theory. J Chem Phys 2020; 152:074103. [DOI: 10.1063/1.5142636] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Affiliation(s)
- Derek P. Metcalf
- Center for Computational Molecular Science and Technology, School of Chemistry and Biochemistry and School of Computational Science and Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332-0400, USA
| | - Alexios Koutsoukas
- Molecular Structure and Design, Bristol-Myers Squibb Company, P.O. Box 5400, Princeton, New Jersey 08543, USA
| | - Steven A. Spronk
- Molecular Structure and Design, Bristol-Myers Squibb Company, P.O. Box 5400, Princeton, New Jersey 08543, USA
| | - Brian L. Claus
- Molecular Structure and Design, Bristol-Myers Squibb Company, P.O. Box 5400, Princeton, New Jersey 08543, USA
| | - Deborah A. Loughney
- Molecular Structure and Design, Bristol-Myers Squibb Company, P.O. Box 5400, Princeton, New Jersey 08543, USA
| | - Stephen R. Johnson
- Molecular Structure and Design, Bristol-Myers Squibb Company, P.O. Box 5400, Princeton, New Jersey 08543, USA
| | - Daniel L. Cheney
- Molecular Structure and Design, Bristol-Myers Squibb Company, P.O. Box 5400, Princeton, New Jersey 08543, USA
| | - C. David Sherrill
- Center for Computational Molecular Science and Technology, School of Chemistry and Biochemistry and School of Computational Science and Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332-0400, USA
| |
Collapse
|
74
|
Proppe J, Gugler S, Reiher M. Gaussian Process-Based Refinement of Dispersion Corrections. J Chem Theory Comput 2019; 15:6046-6060. [PMID: 31603673 DOI: 10.1021/acs.jctc.9b00627] [Citation(s) in RCA: 32] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
We employ Gaussian process (GP) regression to adjust for systematic errors in D3-type dispersion corrections. We refer to the associated, statistically improved model as D3-GP. It is trained on differences between interaction energies obtained from PBE-D3(BJ)/ma-def2-QZVPP and DLPNO-CCSD(T)/CBS calculations. We generated a data set containing interaction energies for 1248 molecular dimers, which resemble the dispersion-dominated systems contained in the S66 data set. Our systems represent not only equilibrium structures but also dimers with various relative orientations and conformations at both shorter and longer distances. A reparametrization of the D3(BJ) model based on 66 of these dimers suggests that two of its three empirical parameters, a1 and s8, are zero, whereas a2 = 5.6841 bohr. For the remaining 1182 dimers, we find that this new set of parameters is superior to all previously published D3(BJ) parameter sets. To train our D3-GP model, we engineered two different vectorial representations of (supra-)molecular systems, both derived from the matrix of atom-pairwise D3(BJ) interaction terms: (a) a distance-resolved interaction energy histogram, histD3(BJ), and (b) eigenvalues of the interaction matrix ordered according to their decreasing absolute value, eigD3(BJ). Hence, the GP learns a mapping from D3(BJ) information only, which renders D3-GP-type dispersion corrections comparable to those obtained with the original D3 approach. They improve systematically if the underlying training set is selected carefully. Here, we harness the prediction variance obtained from GP regression to select optimal training sets in an automated fashion. The larger the variance, the more information the corresponding data point may add to the training set. For a given set of molecular systems, variance-based sampling can approximately determine the smallest subset being subjected to reference calculations such that all dispersion corrections for the remaining systems fall below a predefined accuracy threshold. To render the entire D3-GP workflow as efficient as possible, we present an improvement over our variance-based, sequential active-learning scheme [ J. Chem. Theory Comput. 2018 , 14 , 5238 ]. Our refined learning algorithm selects multiple (instead of single) systems that can be subjected to reference calculations simultaneously. We refer to the underlying selection strategy as batchwise variance-based sampling (BVS). BVS-guided active learning is an essential component of our D3-GP workflow, which is implemented in a black-box fashion. Once provided with reference data for new molecular systems, the underlying GP model automatically learns to adapt to these and similar systems. This approach leads overall to a self-improving model (D3-GP) that predicts system-focused and GP-refined D3-type dispersion corrections for any given system of reference data.
Collapse
Affiliation(s)
- Jonny Proppe
- Department of Chemistry , and Department of Computer Science , University of Toronto , Toronto , Ontario M5S , Canada.,Laboratory of Physical Chemistry , ETH Zurich , Vladimir-Prelog-Weg 2 , 8093 Zurich , Switzerland
| | - Stefan Gugler
- Laboratory of Physical Chemistry , ETH Zurich , Vladimir-Prelog-Weg 2 , 8093 Zurich , Switzerland
| | - Markus Reiher
- Laboratory of Physical Chemistry , ETH Zurich , Vladimir-Prelog-Weg 2 , 8093 Zurich , Switzerland
| |
Collapse
|