1
|
Kalikadien AV, Mirza A, Hossaini AN, Sreenithya A, Pidko EA. Paving the road towards automated homogeneous catalyst design. Chempluschem 2024; 89:e202300702. [PMID: 38279609 DOI: 10.1002/cplu.202300702] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Revised: 12/20/2023] [Indexed: 01/28/2024]
Abstract
In the past decade, computational tools have become integral to catalyst design. They continue to offer significant support to experimental organic synthesis and catalysis researchers aiming for optimal reaction outcomes. More recently, data-driven approaches utilizing machine learning have garnered considerable attention for their expansive capabilities. This Perspective provides an overview of diverse initiatives in the realm of computational catalyst design and introduces our automated tools tailored for high-throughput in silico exploration of the chemical space. While valuable insights are gained through methods for high-throughput in silico exploration and analysis of chemical space, their degree of automation and modularity are key. We argue that the integration of data-driven, automated and modular workflows is key to enhancing homogeneous catalyst design on an unprecedented scale, contributing to the advancement of catalysis research.
Collapse
Affiliation(s)
- Adarsh V Kalikadien
- Inorganic Systems Engineering, Department of Chemical Engineering, Faculty of Applied Sciences, Delft University of Technology, Van der Maasweg 9, 2629 HZ, Delft, The Netherlands
| | - Adrian Mirza
- Inorganic Systems Engineering, Department of Chemical Engineering, Faculty of Applied Sciences, Delft University of Technology, Van der Maasweg 9, 2629 HZ, Delft, The Netherlands
| | - Aydin Najl Hossaini
- Inorganic Systems Engineering, Department of Chemical Engineering, Faculty of Applied Sciences, Delft University of Technology, Van der Maasweg 9, 2629 HZ, Delft, The Netherlands
| | - Avadakkam Sreenithya
- Inorganic Systems Engineering, Department of Chemical Engineering, Faculty of Applied Sciences, Delft University of Technology, Van der Maasweg 9, 2629 HZ, Delft, The Netherlands
| | - Evgeny A Pidko
- Inorganic Systems Engineering, Department of Chemical Engineering, Faculty of Applied Sciences, Delft University of Technology, Van der Maasweg 9, 2629 HZ, Delft, The Netherlands
| |
Collapse
|
2
|
Focke K, De Santis M, Wolter M, Martinez B JA, Vallet V, Pereira Gomes AS, Olejniczak M, Jacob CR. Interoperable workflows by exchanging grid-based data between quantum-chemical program packages. J Chem Phys 2024; 160:162503. [PMID: 38686818 DOI: 10.1063/5.0201701] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2024] [Accepted: 04/02/2024] [Indexed: 05/02/2024] Open
Abstract
Quantum-chemical subsystem and embedding methods require complex workflows that may involve multiple quantum-chemical program packages. Moreover, such workflows require the exchange of voluminous data that go beyond simple quantities, such as molecular structures and energies. Here, we describe our approach for addressing this interoperability challenge by exchanging electron densities and embedding potentials as grid-based data. We describe the approach that we have implemented to this end in a dedicated code, PyEmbed, currently part of a Python scripting framework. We discuss how it has facilitated the development of quantum-chemical subsystem and embedding methods and highlight several applications that have been enabled by PyEmbed, including wave-function theory (WFT) in density-functional theory (DFT) embedding schemes mixing non-relativistic and relativistic electronic structure methods, real-time time-dependent DFT-in-DFT approaches, the density-based many-body expansion, and workflows including real-space data analysis and visualization. Our approach demonstrates, in particular, the merits of exchanging (complex) grid-based data and, in general, the potential of modular software development in quantum chemistry, which hinges upon libraries that facilitate interoperability.
Collapse
Affiliation(s)
- Kevin Focke
- Institute of Physical and Theoretical Chemistry, Technische Universität Braunschweig, Gaußstraße 17, 38106 Braunschweig, Germany
| | - Matteo De Santis
- CNRS, UMR 8523-PhLAM-Physique des Lasers Atomes et Molécules, Univ. Lille, F-59000 Lille, France
| | - Mario Wolter
- Institute of Physical and Theoretical Chemistry, Technische Universität Braunschweig, Gaußstraße 17, 38106 Braunschweig, Germany
| | - Jessica A Martinez B
- CNRS, UMR 8523-PhLAM-Physique des Lasers Atomes et Molécules, Univ. Lille, F-59000 Lille, France
- Department of Chemistry, Rutgers University, Newark, New Jersey 07102, USA
| | - Valérie Vallet
- CNRS, UMR 8523-PhLAM-Physique des Lasers Atomes et Molécules, Univ. Lille, F-59000 Lille, France
| | | | - Małgorzata Olejniczak
- Centre of New Technologies, University of Warsaw, S. Banacha 2c, 02-097 Warsaw, Poland
| | - Christoph R Jacob
- Institute of Physical and Theoretical Chemistry, Technische Universität Braunschweig, Gaußstraße 17, 38106 Braunschweig, Germany
| |
Collapse
|
3
|
Jin H, Merz KM. Modeling Zinc Complexes Using Neural Networks. J Chem Inf Model 2024; 64:3140-3148. [PMID: 38587510 PMCID: PMC11040731 DOI: 10.1021/acs.jcim.4c00095] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2024] [Revised: 03/04/2024] [Accepted: 03/28/2024] [Indexed: 04/09/2024]
Abstract
Understanding the energetic landscapes of large molecules is necessary for the study of chemical and biological systems. Recently, deep learning has greatly accelerated the development of models based on quantum chemistry, making it possible to build potential energy surfaces and explore chemical space. However, most of this work has focused on organic molecules due to the simplicity of their electronic structures as well as the availability of data sets. In this work, we build a deep learning architecture to model the energetics of zinc organometallic complexes. To achieve this, we have compiled a configurationally and conformationally diverse data set of zinc complexes using metadynamics to overcome the limitations of traditional sampling methods. In terms of the neural network potentials, our results indicate that for zinc complexes, partial charges play an important role in modeling the long-range interactions with a neural network. Our developed model outperforms semiempirical methods in predicting the relative energy of zinc conformers, yielding a mean absolute error (MAE) of 1.32 kcal/mol with reference to the double-hybrid PWPB95 method.
Collapse
Affiliation(s)
- Hongni Jin
- Department
of Chemistry, Michigan State University, East Lansing, Michigan 48824, United States
| | - Kenneth M. Merz
- Department
of Chemistry, Michigan State University, East Lansing, Michigan 48824, United States
- Department
of Biochemistry and Molecular Biology, Michigan
State University, East Lansing, Michigan 48824, United States
| |
Collapse
|
4
|
Sgueglia G, Vrettas MD, Chino M, De Simone A, Lombardi A. MetalHawk: Enhanced Classification of Metal Coordination Geometries by Artificial Neural Networks. J Chem Inf Model 2024; 64:2356-2367. [PMID: 37956388 PMCID: PMC11005052 DOI: 10.1021/acs.jcim.3c00873] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2023] [Revised: 09/29/2023] [Accepted: 10/26/2023] [Indexed: 11/15/2023]
Abstract
The chemical properties of metal complexes are strongly dependent on the number and geometrical arrangement of ligands coordinated to the metal center. Existing methods for determining either coordination number or geometry rely on a trade-off between accuracy and computational costs, which hinders their application to the study of large structure data sets. Here, we propose MetalHawk (https://github.com/vrettasm/MetalHawk), a machine learning-based approach to perform simultaneous classification of metal site coordination number and geometry through artificial neural networks (ANNs), which were trained using the Cambridge Structural Database (CSD) and Metal Protein Data Bank (MetalPDB). We demonstrate that the CSD-trained model can be used to classify sites belonging to the most common coordination numbers and geometry classes with balanced accuracy equal to 96.51% for CSD-deposited metal sites. The CSD-trained model was also found to be capable of classifying bioinorganic metal sites from the MetalPDB database, with balanced accuracy equal to 84.29% on the whole PDB data set and to 91.66% on manually reviewed sites in the PDB validation set. Moreover, we report evidence that the output vectors of the CSD-trained model can be considered as a proxy indicator of metal-site distortions, showing that these can be interpreted as a low-dimensional representation of subtle geometrical features present in metal site structures.
Collapse
Affiliation(s)
- Gianmattia Sgueglia
- Department
of Chemical Sciences, University of Naples
Federico II, Via Cintia 21, 80126 Napoli, Italy
| | - Michail D. Vrettas
- Department
of Pharmacy, University of Naples Federico
II, Via Domenico Montesano
49, 80131 Napoli, Italy
| | - Marco Chino
- Department
of Chemical Sciences, University of Naples
Federico II, Via Cintia 21, 80126 Napoli, Italy
| | - Alfonso De Simone
- Department
of Pharmacy, University of Naples Federico
II, Via Domenico Montesano
49, 80131 Napoli, Italy
| | - Angela Lombardi
- Department
of Chemical Sciences, University of Naples
Federico II, Via Cintia 21, 80126 Napoli, Italy
| |
Collapse
|
5
|
Romero S, Baruah T, Zope RR. Spin-state gaps and self-interaction-corrected density functional approximations: Octahedral Fe(II) complexes as case study. J Chem Phys 2023; 158:054305. [PMID: 36754787 DOI: 10.1063/5.0133999] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open
Abstract
Accurate prediction of a spin-state energy difference is crucial for understanding the spin crossover phenomena and is very challenging for density functional approximations, especially for local and semi-local approximations due to delocalization errors. Here, we investigate the effect of the self-interaction error removal from the local spin density approximation (LSDA) and Perdew-Burke-Ernzerhof generalized gradient approximation on the spin-state gaps of Fe(II) complexes with various ligands using recently developed locally scaled self-interaction correction (LSIC) by Zope et al. [J. Chem. Phys. 151, 214108 (2019)]. The LSIC method is exact for one-electron density, recovers the uniform electron gas limit of the underlying functional, and approaches the well-known Perdew-Zunger self-interaction correction (PZSIC) as a particular case when the scaling factor is set to unity. Our results, when compared with reference diffusion Monte Carlo results, show that the PZSIC method significantly overestimates spin-state gaps favoring low spin states for all ligands and does not improve upon density functional approximations. The perturbative LSIC-LSDA using PZSIC densities significantly improves the gaps with a mean absolute error of 0.51 eV but slightly overcorrects for the stronger CO ligands. The quasi-self-consistent LSIC-LSDA, such as coupled-cluster single double and perturbative triple [CCSD(T)], gives a correct sign of spin-state gaps for all ligands with a mean absolute error of 0.56 eV, comparable to that of CCSD(T) (0.49 eV).
Collapse
Affiliation(s)
- Selim Romero
- Computational Science Program, The University of Texas at El Paso, El Paso, Texas 79968, USA
| | - Tunna Baruah
- Department of Physics, University of Texas at El Paso, El Paso, Texas 79968, USA
| | - Rajendra R Zope
- Department of Physics, University of Texas at El Paso, El Paso, Texas 79968, USA
| |
Collapse
|
6
|
Fan Y, Xia W, Ma C, Huang Y, Li S, Wang X, Qian C, Chen K, Liu D. Recent advances of computational studies on bioethanol to light olefin reactions using zeolite and metal oxide catalysts. Chem Eng Sci 2023. [DOI: 10.1016/j.ces.2023.118532] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
|
7
|
Cheng L, Sun J, Deustua JE, Bhethanabotla VC, Miller TF. Molecular-orbital-based machine learning for open-shell and multi-reference systems with kernel addition Gaussian process regression. J Chem Phys 2022; 157:154105. [PMID: 36272799 DOI: 10.1063/5.0110886] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
We introduce a novel machine learning strategy, kernel addition Gaussian process regression (KA-GPR), in molecular-orbital-based machine learning (MOB-ML) to learn the total correlation energies of general electronic structure theories for closed- and open-shell systems by introducing a machine learning strategy. The learning efficiency of MOB-ML(KA-GPR) is the same as the original MOB-ML method for the smallest criegee molecule, which is a closed-shell molecule with multi-reference characters. In addition, the prediction accuracies of different small free radicals could reach the chemical accuracy of 1 kcal/mol by training on one example structure. Accurate potential energy surfaces for the H10 chain (closed-shell) and water OH bond dissociation (open-shell) could also be generated by MOB-ML(KA-GPR). To explore the breadth of chemical systems that KA-GPR can describe, we further apply MOB-ML to accurately predict the large benchmark datasets for closed- (QM9, QM7b-T, and GDB-13-T) and open-shell (QMSpin) molecules.
Collapse
Affiliation(s)
- Lixue Cheng
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, USA
| | - Jiace Sun
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, USA
| | - J Emiliano Deustua
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, USA
| | - Vignesh C Bhethanabotla
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, USA
| | - Thomas F Miller
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, USA
| |
Collapse
|
8
|
Kuntz D, Wilson AK. Machine learning, artificial intelligence, and chemistry: how smart algorithms are reshaping simulation and the laboratory. PURE APPL CHEM 2022. [DOI: 10.1515/pac-2022-0202] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Abstract
Machine learning and artificial intelligence are increasingly gaining in prominence through image analysis, language processing, and automation, to name a few applications. Machine learning is also making profound changes in chemistry. From revisiting decades-old analytical techniques for the purpose of creating better calibration curves, to assisting and accelerating traditional in silico simulations, to automating entire scientific workflows, to being used as an approach to deduce underlying physics of unexplained chemical phenomena, machine learning and artificial intelligence are reshaping chemistry, accelerating scientific discovery, and yielding new insights. This review provides an overview of machine learning and artificial intelligence from a chemist’s perspective and focuses on a number of examples of the use of these approaches in computational chemistry and in the laboratory.
Collapse
Affiliation(s)
- David Kuntz
- Department of Chemistry , University of North Texas , Denton , TX 76201 , USA
| | - Angela K. Wilson
- Department of Chemistry , Michigan State University , East Lansing , MI 48824 , USA
| |
Collapse
|
9
|
Boyn JN, McNamara LE, Anderson JS, Mazziotti DA. Interplay of Electronic and Geometric Structure Tunes Organic Biradical Character in Bimetallic Tetrathiafulvalene Tetrathiolate Complexes. J Phys Chem A 2022; 126:3329-3337. [PMID: 35604797 DOI: 10.1021/acs.jpca.2c01773] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
The synthesis and design of organic biradicals with tunable singlet-triplet gaps have become the subject of significant research interest, owing to their possible photochemical applications and use in the development of molecular switches and conductors. Recently, tetrathiafulvalene tetrathiolate (TTFtt) has been demonstrated to exhibit such organic biradical character in doubly ionized bimetallic complexes. In this article we use high-level ab initio calculations to interrogate the electronic structure of a series of TTFtt-bridged metal complexes, resolving the factors governing their biradical character and singlet-triplet gaps. We show that the degree of biradical character correlates with a readily measured experimental predictor, the central TTFtt C-C bond length, and that it may be described by a one-parameter model, providing valuable insight for the future rational design of TTFtt based biradical compounds and materials.
Collapse
Affiliation(s)
- Jan-Niklas Boyn
- Department of Chemistry and The James Franck Institute, The University of Chicago, Chicago, Illinois 60637, United States
| | - Lauren E McNamara
- Department of Chemistry, The University of Chicago, Chicago, Illinois 60637, United States
| | - John S Anderson
- Department of Chemistry, The University of Chicago, Chicago, Illinois 60637, United States
| | - David A Mazziotti
- Department of Chemistry and The James Franck Institute, The University of Chicago, Chicago, Illinois 60637, United States
| |
Collapse
|
10
|
Westermayr J, Marquetand P. Machine Learning for Electronically Excited States of Molecules. Chem Rev 2021; 121:9873-9926. [PMID: 33211478 PMCID: PMC8391943 DOI: 10.1021/acs.chemrev.0c00749] [Citation(s) in RCA: 173] [Impact Index Per Article: 57.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2020] [Indexed: 12/11/2022]
Abstract
Electronically excited states of molecules are at the heart of photochemistry, photophysics, as well as photobiology and also play a role in material science. Their theoretical description requires highly accurate quantum chemical calculations, which are computationally expensive. In this review, we focus on not only how machine learning is employed to speed up such excited-state simulations but also how this branch of artificial intelligence can be used to advance this exciting research field in all its aspects. Discussed applications of machine learning for excited states include excited-state dynamics simulations, static calculations of absorption spectra, as well as many others. In order to put these studies into context, we discuss the promises and pitfalls of the involved machine learning techniques. Since the latter are mostly based on quantum chemistry calculations, we also provide a short introduction into excited-state electronic structure methods and approaches for nonadiabatic dynamics simulations and describe tricks and problems when using them in machine learning for excited states of molecules.
Collapse
Affiliation(s)
- Julia Westermayr
- Institute
of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, Währinger Strasse 17, 1090 Vienna, Austria
| | - Philipp Marquetand
- Institute
of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, Währinger Strasse 17, 1090 Vienna, Austria
- Vienna
Research Platform on Accelerating Photoreaction Discovery, University of Vienna, Währinger Strasse 17, 1090 Vienna, Austria
- Data
Science @ Uni Vienna, University of Vienna, Währinger Strasse 29, 1090 Vienna, Austria
| |
Collapse
|
11
|
Nandy A, Duan C, Taylor MG, Liu F, Steeves AH, Kulik HJ. Computational Discovery of Transition-metal Complexes: From High-throughput Screening to Machine Learning. Chem Rev 2021; 121:9927-10000. [PMID: 34260198 DOI: 10.1021/acs.chemrev.1c00347] [Citation(s) in RCA: 70] [Impact Index Per Article: 23.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Transition-metal complexes are attractive targets for the design of catalysts and functional materials. The behavior of the metal-organic bond, while very tunable for achieving target properties, is challenging to predict and necessitates searching a wide and complex space to identify needles in haystacks for target applications. This review will focus on the techniques that make high-throughput search of transition-metal chemical space feasible for the discovery of complexes with desirable properties. The review will cover the development, promise, and limitations of "traditional" computational chemistry (i.e., force field, semiempirical, and density functional theory methods) as it pertains to data generation for inorganic molecular discovery. The review will also discuss the opportunities and limitations in leveraging experimental data sources. We will focus on how advances in statistical modeling, artificial intelligence, multiobjective optimization, and automation accelerate discovery of lead compounds and design rules. The overall objective of this review is to showcase how bringing together advances from diverse areas of computational chemistry and computer science have enabled the rapid uncovering of structure-property relationships in transition-metal chemistry. We aim to highlight how unique considerations in motifs of metal-organic bonding (e.g., variable spin and oxidation state, and bonding strength/nature) set them and their discovery apart from more commonly considered organic molecules. We will also highlight how uncertainty and relative data scarcity in transition-metal chemistry motivate specific developments in machine learning representations, model training, and in computational chemistry. Finally, we will conclude with an outlook of areas of opportunity for the accelerated discovery of transition-metal complexes.
Collapse
Affiliation(s)
- Aditya Nandy
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States.,Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Chenru Duan
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States.,Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Michael G Taylor
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Fang Liu
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Adam H Steeves
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Heather J Kulik
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| |
Collapse
|
12
|
Abstract
Electronically excited states of molecules are at the heart of photochemistry, photophysics, as well as photobiology and also play a role in material science. Their theoretical description requires highly accurate quantum chemical calculations, which are computationally expensive. In this review, we focus on not only how machine learning is employed to speed up such excited-state simulations but also how this branch of artificial intelligence can be used to advance this exciting research field in all its aspects. Discussed applications of machine learning for excited states include excited-state dynamics simulations, static calculations of absorption spectra, as well as many others. In order to put these studies into context, we discuss the promises and pitfalls of the involved machine learning techniques. Since the latter are mostly based on quantum chemistry calculations, we also provide a short introduction into excited-state electronic structure methods and approaches for nonadiabatic dynamics simulations and describe tricks and problems when using them in machine learning for excited states of molecules.
Collapse
Affiliation(s)
- Julia Westermayr
- Institute of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, Währinger Strasse 17, 1090 Vienna, Austria
| | - Philipp Marquetand
- Institute of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, Währinger Strasse 17, 1090 Vienna, Austria
- Vienna Research Platform on Accelerating Photoreaction Discovery, University of Vienna, Währinger Strasse 17, 1090 Vienna, Austria
- Data Science @ Uni Vienna, University of Vienna, Währinger Strasse 29, 1090 Vienna, Austria
| |
Collapse
|
13
|
Krieger AM, Pidko EA. The Impact of Computational Uncertainties on the Enantioselectivity Predictions: A Microkinetic Modeling of Ketone Transfer Hydrogenation with a Noyori-type Mn-diamine Catalyst. ChemCatChem 2021; 13:3517-3524. [PMID: 34589158 PMCID: PMC8453751 DOI: 10.1002/cctc.202100341] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2021] [Revised: 05/23/2021] [Indexed: 12/26/2022]
Abstract
Selectivity control is one of the most important functions of a catalyst. In asymmetric catalysis the enantiomeric excess (e.e.) is a property of major interest, with a lot of effort dedicated to developing the most enantioselective catalyst, understanding the origin of selectivity, and predicting stereoselectivity. Herein, we investigate the relationship between predicted selectivity and the uncertainties in the computed energetics of the catalytic reaction mechanism obtained by DFT calculations in a case study of catalytic asymmetric transfer hydrogenation (ATH) of ketones with an Mn-diamine catalyst. Data obtained from our analysis of DFT data by microkinetic modeling is compared to results from experiment. We discuss the limitations of the conventional reductionist approach of e.e. estimation from assessing the enantiodetermining steps only. Our analysis shows that the energetics of other reaction steps in the reaction mechanism have a substantial impact on the predicted reaction selectivity. The uncertainty of DFT calculations within the commonly accepted energy ranges of chemical accuracy may reverse the predicted e.e. with the non-enantiodetermining steps contributing to e.e. deviations of up to 25 %.
Collapse
Affiliation(s)
- Annika M. Krieger
- Inorganic Systems EngineeringDepartment of Chemical EngineeringFaculty of Applied SciencesDelft University of TechnologyVan der Maasweg 92629 HZDelftThe Netherlands
| | - Evgeny A. Pidko
- Inorganic Systems EngineeringDepartment of Chemical EngineeringFaculty of Applied SciencesDelft University of TechnologyVan der Maasweg 92629 HZDelftThe Netherlands
| |
Collapse
|
14
|
McCarver GA, Rajeshkumar T, Vogiatzis KD. Computational catalysis for metal-organic frameworks: An overview. Coord Chem Rev 2021. [DOI: 10.1016/j.ccr.2021.213777] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
|
15
|
Affiliation(s)
- Heather J. Kulik
- Department of Chemical Engineering Massachusetts Institute of Technology 77 Massachusetts Ave Rm 66–464 Cambridge MA 02139 USA
| |
Collapse
|
16
|
Mirth J, Zhai Y, Bush J, Alvarado EG, Jordan H, Heim M, Krishnamoorthy B, Pflaum M, Clark A, Z Y, Adams H. Representations of energy landscapes by sublevelset persistent homology: An example with n-alkanes. J Chem Phys 2021; 154:114114. [PMID: 33752361 DOI: 10.1063/5.0036747] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Encoding the complex features of an energy landscape is a challenging task, and often, chemists pursue the most salient features (minima and barriers) along a highly reduced space, i.e., two- or three-dimensions. Even though disconnectivity graphs or merge trees summarize the connectivity of the local minima of an energy landscape via the lowest-barrier pathways, there is much information to be gained by also considering the topology of each connected component at different energy thresholds (or sublevelsets). We propose sublevelset persistent homology as an appropriate tool for this purpose. Our computations on the configuration phase space of n-alkanes from butane to octane allow us to conjecture, and then prove, a complete characterization of the sublevelset persistent homology of the alkane CmH2m+2 Potential Energy Landscapes (PELs), for all m, in all homological dimensions. We further compare both the analytical configurational PELs and sampled data from molecular dynamics simulation using the united and all-atom descriptions of the intramolecular interactions. In turn, this supports the application of distance metrics to quantify sampling fidelity and lays the foundation for future work regarding new metrics that quantify differences between the topological features of high-dimensional energy landscapes.
Collapse
Affiliation(s)
- Joshua Mirth
- Department of Mathematics, Colorado State University, Fort Collins, Colorado 80524, USA
| | - Yanqin Zhai
- Department of Nuclear, Plasma, and Radiological Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
| | - Johnathan Bush
- Department of Mathematics, Colorado State University, Fort Collins, Colorado 80524, USA
| | - Enrique G Alvarado
- Department of Mathematics and Statistics, Washington State University, Pullman, Washington 99164, USA
| | - Howie Jordan
- Department of Mathematics, University of Colorado, Boulder, Colorado 80309, USA
| | - Mark Heim
- Department of Mathematics, Colorado State University, Fort Collins, Colorado 80524, USA
| | - Bala Krishnamoorthy
- Department of Mathematics and Statistics, Washington State University, Vancouver, Washington 98686, USA
| | - Markus Pflaum
- Department of Mathematics, University of Colorado, Boulder, Colorado 80309, USA
| | - Aurora Clark
- Department of Chemistry, Washington State University, Pullman, Washington 99164, USA
| | - Y Z
- Department of Nuclear, Plasma, and Radiological Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
| | - Henry Adams
- Department of Mathematics, Colorado State University, Fort Collins, Colorado 80524, USA
| |
Collapse
|
17
|
Janet JP, Duan C, Nandy A, Liu F, Kulik HJ. Navigating Transition-Metal Chemical Space: Artificial Intelligence for First-Principles Design. Acc Chem Res 2021; 54:532-545. [PMID: 33480674 DOI: 10.1021/acs.accounts.0c00686] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
The variability of chemical bonding in open-shell transition-metal complexes not only motivates their study as functional materials and catalysts but also challenges conventional computational modeling tools. Here, tailoring ligand chemistry can alter preferred spin or oxidation states as well as electronic structure properties and reactivity, creating vast regions of chemical space to explore when designing new materials atom by atom. Although first-principles density functional theory (DFT) remains the workhorse of computational chemistry in mechanism deduction and property prediction, it is of limited use here. DFT is both far too computationally costly for widespread exploration of transition-metal chemical space and also prone to inaccuracies that limit its predictive performance for localized d electrons in transition-metal complexes. These challenges starkly contrast with the well-trodden regions of small-organic-molecule chemical space, where the analytical forms of molecular mechanics force fields and semiempirical theories have for decades accelerated the discovery of new molecules, accurate DFT functional performance has been demonstrated, and gold-standard methods from correlated wavefunction theory can predict experimental results to chemical accuracy.The combined promise of transition-metal chemical space exploration and lack of established tools has mandated a distinct approach. In this Account, we outline the path we charted in exploration of transition-metal chemical space starting from the first machine learning (ML) models (i.e., artificial neural network and kernel ridge regression) and representations for the prediction of open-shell transition-metal complex properties. The distinct importance of the immediate coordination environment of the metal center as well as the lack of low-level methods to accurately predict structural properties in this coordination environment first motivated and then benefited from these ML models and representations. Once developed, the recipe for prediction of geometric, spin state, and redox potential properties was straightforwardly extended to a diverse range of other properties, including in catalysis, computational "feasibility", and the gas separation properties of periodic metal-organic frameworks. Interpretation of selected features most important for model prediction revealed new ways to encapsulate design rules and confirmed that models were robustly mapping essential structure-property relationships. Encountering the special challenge of ensuring that good model performance could generalize to new discovery targets motivated investigation of how to best carry out model uncertainty quantification. Distance-based approaches, whether in model latent space or in carefully engineered feature space, provided intuitive measures of the domain of applicability. With all of these pieces together, ML can be harnessed as an engine to tackle the large-scale exploration of transition-metal chemical space needed to satisfy multiple objectives using efficient global optimization methods. In practical terms, bringing these artificial intelligence tools to bear on the problems of transition-metal chemical space exploration has resulted in ML-model assessments of large, multimillion compound spaces in minutes and validated new design leads in weeks instead of decades.
Collapse
Affiliation(s)
- Jon Paul Janet
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Chenru Duan
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Aditya Nandy
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Fang Liu
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Heather J. Kulik
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| |
Collapse
|
18
|
SSnet: A Deep Learning Approach for Protein-Ligand Interaction Prediction. Int J Mol Sci 2021; 22:ijms22031392. [PMID: 33573266 PMCID: PMC7869013 DOI: 10.3390/ijms22031392] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2020] [Revised: 01/24/2021] [Accepted: 01/27/2021] [Indexed: 12/15/2022] Open
Abstract
Computational prediction of Protein-Ligand Interaction (PLI) is an important step in the modern drug discovery pipeline as it mitigates the cost, time, and resources required to screen novel therapeutics. Deep Neural Networks (DNN) have recently shown excellent performance in PLI prediction. However, the performance is highly dependent on protein and ligand features utilized for the DNN model. Moreover, in current models, the deciphering of how protein features determine the underlying principles that govern PLI is not trivial. In this work, we developed a DNN framework named SSnet that utilizes secondary structure information of proteins extracted as the curvature and torsion of the protein backbone to predict PLI. We demonstrate the performance of SSnet by comparing against a variety of currently popular machine and non-Machine Learning (ML) models using various metrics. We visualize the intermediate layers of SSnet to show a potential latent space for proteins, in particular to extract structural elements in a protein that the model finds influential for ligand binding, which is one of the key features of SSnet. We observed in our study that SSnet learns information about locations in a protein where a ligand can bind, including binding sites, allosteric sites and cryptic sites, regardless of the conformation used. We further observed that SSnet is not biased to any specific molecular interaction and extracts the protein fold information critical for PLI prediction. Our work forms an important gateway to the general exploration of secondary structure-based Deep Learning (DL), which is not just confined to protein-ligand interactions, and as such will have a large impact on protein research, while being readily accessible for de novo drug designers as a standalone package.
Collapse
|
19
|
Li X, Paier W, Paier J. Machine Learning in Computational Surface Science and Catalysis: Case Studies on Water and Metal-Oxide Interfaces. Front Chem 2021; 8:601029. [PMID: 33425857 PMCID: PMC7793815 DOI: 10.3389/fchem.2020.601029] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2020] [Accepted: 10/27/2020] [Indexed: 11/13/2022] Open
Abstract
The goal of many computational physicists and chemists is the ability to bridge the gap between atomistic length scales of about a few multiples of an Ångström (Å), i. e., 10−10 m, and meso- or macroscopic length scales by virtue of simulations. The same applies to timescales. Machine learning techniques appear to bring this goal into reach. This work applies the recently published on-the-fly machine-learned force field techniques using a variant of the Gaussian approximation potentials combined with Bayesian regression and molecular dynamics as efficiently implemented in the Vienna ab initio simulation package, VASP. The generation of these force fields follows active-learning schemes. We apply these force fields to simple oxides such as MgO and more complex reducible oxides such as iron oxide, examine their generalizability, and further increase complexity by studying water adsorption on these metal oxide surfaces. We successfully examined surface properties of pristine and reconstructed MgO and Fe3O4 surfaces. However, the accurate description of water–oxide interfaces by machine-learned force fields, especially for iron oxides, remains a field offering plenty of research opportunities.
Collapse
Affiliation(s)
- Xiaoke Li
- Institut für Chemie, Humboldt-Universität zu Berlin, Berlin, Germany
| | - Wolfgang Paier
- Fraunhofer Institute for Telecommunications, Heinrich Hertz Institute HHI, Berlin, Germany
| | - Joachim Paier
- Institut für Chemie, Humboldt-Universität zu Berlin, Berlin, Germany
| |
Collapse
|
20
|
Krieger A, Kuliaev P, Armstrong Hall FQ, Sun D, Pidko EA. Composition- and Condition-Dependent Kinetics of Homogeneous Ester Hydrogenation by a Mn-Based Catalyst. THE JOURNAL OF PHYSICAL CHEMISTRY. C, NANOMATERIALS AND INTERFACES 2020; 124:26990-26998. [PMID: 33335641 PMCID: PMC7735017 DOI: 10.1021/acs.jpcc.0c09953] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/04/2020] [Revised: 11/10/2020] [Indexed: 06/12/2023]
Abstract
The reaction medium and conditions are the key parameters defining the efficiency and performance of a homogeneous catalyst. In the state-of-the-art molecular descriptions of catalytic systems by density functional theory (DFT) calculations, the reaction medium is commonly reduced to an infinitely diluted ideal solution model. In this work, we carry out a detailed operando computational modeling analysis of the condition dependencies and nonideal solution effects on the mechanism and kinetics of a model ester hydrogenation reaction by a homogeneous Mn(I)-P,N catalyst. By combining DFT calculations, COSMO-RS solvent model, and the microkinetic modeling approach, the kinetic behavior of the multicomponent homogeneous catalyst system under realistic reaction conditions was investigated in detail. The effects of the reaction medium and its dynamic evolution in the course of the reaction were analyzed by comparing the results obtained for the model methyl acetate hydrogenation reaction in a THF solution and under solvent-free neat reaction conditions. The dynamic representations of the reaction medium give rise to strongly nonlinear effects in the kinetic models. The nonideal representation of the reaction medium results in pronounced condition dependencies of the computed energetics of the elementary reaction steps and the computed kinetic profiles but affects only slightly such experimentally accessible kinetic descriptors as the apparent activation energy and the degree of rate control.
Collapse
Affiliation(s)
- Annika
M. Krieger
- Inorganic
Systems Engineering Group, Department of Chemical Engineering, Delft University of Technology, Van der Maasweg 9, Delft 2629 HZ, The Netherlands
| | - Pavel Kuliaev
- TheoMAT
group, ChemBio Cluster, ITMO University, Lomonosova str. 9, St. Petersburg, 191002 Russia
| | - Felix Q. Armstrong Hall
- Inorganic
Systems Engineering Group, Department of Chemical Engineering, Delft University of Technology, Van der Maasweg 9, Delft 2629 HZ, The Netherlands
| | - Dapeng Sun
- Inorganic
Systems Engineering Group, Department of Chemical Engineering, Delft University of Technology, Van der Maasweg 9, Delft 2629 HZ, The Netherlands
| | - Evgeny A. Pidko
- Inorganic
Systems Engineering Group, Department of Chemical Engineering, Delft University of Technology, Van der Maasweg 9, Delft 2629 HZ, The Netherlands
- TheoMAT
group, ChemBio Cluster, ITMO University, Lomonosova str. 9, St. Petersburg, 191002 Russia
| |
Collapse
|
21
|
Bahlke MP, Mogos N, Proppe J, Herrmann C. Exchange Spin Coupling from Gaussian Process Regression. J Phys Chem A 2020; 124:8708-8723. [DOI: 10.1021/acs.jpca.0c05983] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Affiliation(s)
- Marc Philipp Bahlke
- Department of Chemistry, University of Hamburg, Martin-Luther-King-Platz 6, 20146 Hamburg, Germany
| | - Natnael Mogos
- Department of Chemistry, University of Hamburg, Martin-Luther-King-Platz 6, 20146 Hamburg, Germany
| | - Jonny Proppe
- Institute of Physical Chemistry, Georg-August University, Tammannstr. 6, 37077 Göttingen, Germany
| | - Carmen Herrmann
- Department of Chemistry, University of Hamburg, Martin-Luther-King-Platz 6, 20146 Hamburg, Germany
| |
Collapse
|
22
|
Zou W, Tao Y, Kraka E. Describing Polytopal Rearrangements of Fluxional Molecules with Curvilinear Coordinates Derived from Normal Vibrational Modes: A Conceptual Extension of Cremer-Pople Puckering Coordinates. J Chem Theory Comput 2020; 16:3162-3193. [PMID: 32208729 DOI: 10.1021/acs.jctc.9b01274] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
In this work a new curvilinear coordinate system is presented for the comprehensive description of polytopal rearrangements of N-coordinate compounds (N = 4-7) and systems containing an N-coordinate subunit. It is based on normal vibrational modes and a natural extension of the Cremer-Pople puckering coordinates ( J. Am. Chem. Soc. 1975, 97, 1354) together with the Zou-Izotov-Cremer deformation coordinates ( J. Phys. Chem. A 2011, 115, 8731) for ring structures to N-coordinate systems. We demonstrate that the new curvilinear coordinates are ideal reaction coordinates describing fluxional rearrangement pathways by revisiting the Berry pseudorotation and the lever mechanism in sulfur tetrafluoride, the Berry pseudorotation and two Muetterties' mechanisms in pentavalent compounds, the chimeric pseudorotation in iodine pentafluoride, Bailar and Ray-Dutt twists in hexacoordinate tris-chelates as well as the Bartell mechanism in iodine heptafluoride. The results of our study reveal that this dedicated curvilinear coordinate system can be applied to most coordination compounds opening new ways for the systematic modeling of fluxional processes.
Collapse
Affiliation(s)
- Wenli Zou
- Institute of Modern Physics, Northwest University, and Shaanxi Key Laboratory for Theoretical Physics Frontiers, Xi'an, Shaanxi 710127, P. R. China.,Department of Chemistry, Southern Methodist University, 3215 Daniel Avenue, Dallas, Texas 75275-0314, United States
| | - Yunwen Tao
- Department of Chemistry, Southern Methodist University, 3215 Daniel Avenue, Dallas, Texas 75275-0314, United States
| | - Elfi Kraka
- Department of Chemistry, Southern Methodist University, 3215 Daniel Avenue, Dallas, Texas 75275-0314, United States
| |
Collapse
|
23
|
Taylor MG, Yang T, Lin S, Nandy A, Janet JP, Duan C, Kulik HJ. Seeing Is Believing: Experimental Spin States from Machine Learning Model Structure Predictions. J Phys Chem A 2020; 124:3286-3299. [PMID: 32223165 PMCID: PMC7311053 DOI: 10.1021/acs.jpca.0c01458] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
![]()
Determination of ground-state spins
of open-shell transition-metal
complexes is critical to understanding catalytic and materials properties
but also challenging with approximate electronic structure methods.
As an alternative approach, we demonstrate how structure alone can
be used to guide assignment of ground-state spin from experimentally
determined crystal structures of transition-metal complexes. We first
identify the limits of distance-based heuristics from distributions
of metal–ligand bond lengths of over 2000 unique mononuclear
Fe(II)/Fe(III) transition-metal complexes. To overcome these limits,
we employ artificial neural networks (ANNs) to predict spin-state-dependent
metal–ligand bond lengths and classify experimental ground-state
spins based on agreement of experimental structures with the ANN predictions.
Although the ANN is trained on hybrid density functional theory data,
we exploit the method-insensitivity of geometric properties to enable
assignment of ground states for the majority (ca. 80–90%) of
structures. We demonstrate the utility of the ANN by data-mining the
literature for spin-crossover (SCO) complexes, which have experimentally
observed temperature-dependent geometric structure changes, by correctly
assigning almost all (>95%) spin states in the 46 Fe(II) SCO complex
set. This approach represents a promising complement to more conventional
energy-based spin-state assignment from electronic structure theory
at the low cost of a machine learning model.
Collapse
Affiliation(s)
- Michael G Taylor
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Tzuhsiung Yang
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Sean Lin
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Aditya Nandy
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States.,Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Jon Paul Janet
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Chenru Duan
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States.,Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Heather J Kulik
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| |
Collapse
|
24
|
Affiliation(s)
- Marco Foscato
- Department of Chemistry, University of Bergen, Allégaten 41, N-5007 Bergen, Norway
| | - Vidar R. Jensen
- Department of Chemistry, University of Bergen, Allégaten 41, N-5007 Bergen, Norway
| |
Collapse
|
25
|
Hong Y, Hou B, Jiang H, Zhang J. Machine learning and artificial neural network accelerated computational discoveries in materials science. WIRES COMPUTATIONAL MOLECULAR SCIENCE 2019. [DOI: 10.1002/wcms.1450] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Affiliation(s)
- Yang Hong
- Department of Chemistry University of Nebraska‐Lincoln Lincoln Nebraska
| | - Bo Hou
- Department of Engineering University of Cambridge Cambridge UK
| | - Hengle Jiang
- Holland Computing Center University of Nebraska‐Lincoln Lincoln Nebraska
| | - Jingchao Zhang
- Holland Computing Center University of Nebraska‐Lincoln Lincoln Nebraska
| |
Collapse
|