151
|
Botella R, Kistanov AA. A Unified View of Vibrational Spectroscopy Simulation through Kernel Density Estimations. J Phys Chem Lett 2023; 14:3691-3697. [PMID: 37037010 PMCID: PMC10123815 DOI: 10.1021/acs.jpclett.3c00665] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2023] [Accepted: 03/30/2023] [Indexed: 06/19/2023]
Abstract
To date, vibrational simulation results constitute more of an experimental support than a predictive tool, as the simulated vibrational modes are discrete due to quantization. This is different from what is obtained experimentally. Here, we propose a way to combine outputs such as the phonon density of states surrogate and peak intensities obtained from ab initio simulations to allow comparison with experimental data by using machine learning. This work is paving the way for using simulated vibrational spectra as a tool to identify materials with defined stoichiometry, enabling the separation of genuine vibrational features of pure phases from morphological and defect-induced signals.
Collapse
|
152
|
Schütt KT, Hessmann SSP, Gebauer NWA, Lederer J, Gastegger M. SchNetPack 2.0: A neural network toolbox for atomistic machine learning. J Chem Phys 2023; 158:144801. [PMID: 37061495 DOI: 10.1063/5.0138367] [Citation(s) in RCA: 15] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/17/2023] Open
Abstract
SchNetPack is a versatile neural network toolbox that addresses both the requirements of method development and the application of atomistic machine learning. Version 2.0 comes with an improved data pipeline, modules for equivariant neural networks, and a PyTorch implementation of molecular dynamics. An optional integration with PyTorch Lightning and the Hydra configuration framework powers a flexible command-line interface. This makes SchNetPack 2.0 easily extendable with a custom code and ready for complex training tasks, such as the generation of 3D molecular structures.
Collapse
Affiliation(s)
- Kristof T Schütt
- Machine Learning Group, Technische Universität Berlin, 10587 Berlin, Germany
| | | | - Niklas W A Gebauer
- Machine Learning Group, Technische Universität Berlin, 10587 Berlin, Germany
| | - Jonas Lederer
- Machine Learning Group, Technische Universität Berlin, 10587 Berlin, Germany
| | - Michael Gastegger
- Machine Learning Group, Technische Universität Berlin, 10587 Berlin, Germany
| |
Collapse
|
153
|
Perrella F, Coppola F, Rega N, Petrone A. An Expedited Route to Optical and Electronic Properties at Finite Temperature via Unsupervised Learning. Molecules 2023; 28:3411. [PMID: 37110644 PMCID: PMC10144358 DOI: 10.3390/molecules28083411] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2023] [Revised: 04/06/2023] [Accepted: 04/07/2023] [Indexed: 04/29/2023] Open
Abstract
Electronic properties and absorption spectra are the grounds to investigate molecular electronic states and their interactions with the environment. Modeling and computations are required for the molecular understanding and design strategies of photo-active materials and sensors. However, the interpretation of such properties demands expensive computations and dealing with the interplay of electronic excited states with the conformational freedom of the chromophores in complex matrices (i.e., solvents, biomolecules, crystals) at finite temperature. Computational protocols combining time dependent density functional theory and ab initio molecular dynamics (MD) have become very powerful in this field, although they require still a large number of computations for a detailed reproduction of electronic properties, such as band shapes. Besides the ongoing research in more traditional computational chemistry fields, data analysis and machine learning methods have been increasingly employed as complementary approaches for efficient data exploration, prediction and model development, starting from the data resulting from MD simulations and electronic structure calculations. In this work, dataset reduction capabilities by unsupervised clustering techniques applied to MD trajectories are proposed and tested for the ab initio modeling of electronic absorption spectra of two challenging case studies: a non-covalent charge-transfer dimer and a ruthenium complex in solution at room temperature. The K-medoids clustering technique is applied and is proven to be able to reduce by ∼100 times the total cost of excited state calculations on an MD sampling with no loss in the accuracy and it also provides an easier understanding of the representative structures (medoids) to be analyzed on the molecular scale.
Collapse
Affiliation(s)
- Fulvio Perrella
- Scuola Superiore Meridionale, Largo San Marcellino 10, I-80138 Napoli, Italy; (F.P.); (F.C.); (N.R.)
| | - Federico Coppola
- Scuola Superiore Meridionale, Largo San Marcellino 10, I-80138 Napoli, Italy; (F.P.); (F.C.); (N.R.)
| | - Nadia Rega
- Scuola Superiore Meridionale, Largo San Marcellino 10, I-80138 Napoli, Italy; (F.P.); (F.C.); (N.R.)
- Department of Chemical Sciences, University of Napoli Federico II, Complesso Universitario di M.S. Angelo, via Cintia 21, I-80126 Napoli, Italy
- Istituto Nazionale di Fisica Nucleare, Sezione di Napoli, Complesso Universitario di M.S. Angelo ed. 6, via Cintia 21, I-80126 Napoli, Italy
| | - Alessio Petrone
- Scuola Superiore Meridionale, Largo San Marcellino 10, I-80138 Napoli, Italy; (F.P.); (F.C.); (N.R.)
- Department of Chemical Sciences, University of Napoli Federico II, Complesso Universitario di M.S. Angelo, via Cintia 21, I-80126 Napoli, Italy
- Istituto Nazionale di Fisica Nucleare, Sezione di Napoli, Complesso Universitario di M.S. Angelo ed. 6, via Cintia 21, I-80126 Napoli, Italy
| |
Collapse
|
154
|
Armeli G, Peters JH, Koop T. Machine-Learning-Based Prediction of the Glass Transition Temperature of Organic Compounds Using Experimental Data. ACS OMEGA 2023; 8:12298-12309. [PMID: 37033862 PMCID: PMC10077449 DOI: 10.1021/acsomega.2c08146] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/02/2023] [Accepted: 03/07/2023] [Indexed: 06/19/2023]
Abstract
Knowledge of the glass transition temperature of molecular compounds that occur in atmospheric aerosol particles is important for estimating their viscosity, as it directly influences the kinetics of chemical reactions and particle phase state. While there is a great diversity of organic compounds present in aerosol particles, for only a minor fraction of them experimental glass transition temperatures are known. Therefore, we have developed a machine learning model designed to predict the glass transition temperature of organic molecular compounds based on molecule-derived input variables. The extremely randomized trees (extra trees) procedure was chosen for this purpose. Two approaches using different sets of input variables were followed. The first one uses the number of selected functional groups present in the compound, while the second one generates descriptors from a SMILES (Simplified Molecular Input Line Entry System) string. Organic compounds containing carbon, hydrogen, oxygen, nitrogen, and halogen atoms are included. For improved results, both approaches can be combined with the melting temperature of the compound as an additional input variable. The results show that the predictions of both approaches show a similar mean absolute error of about 12-13 K, with the SMILES-based predictions performing slightly better. In general, the model shows good predictive power considering the diversity of the experimental input data. Furthermore, we also show that its performance exceeds that of previous parameterizations developed for this purpose and also performs better than existing machine learning models. In order to provide user-friendly versions of the model for applications, we have developed a web site where the model can be run by interested scientists via a web-based interface without prior technical knowledge. We also provide Python code of the model. Additionally, all experimental input data are provided in form of the Bielefeld Molecular Organic Glasses (BIMOG) database. We believe that this model is a powerful tool for many applications in atmospheric aerosol science and material science.
Collapse
|
155
|
Anstine D, Isayev O. Machine Learning Interatomic Potentials and Long-Range Physics. J Phys Chem A 2023; 127:2417-2431. [PMID: 36802360 PMCID: PMC10041642 DOI: 10.1021/acs.jpca.2c06778] [Citation(s) in RCA: 33] [Impact Index Per Article: 33.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2022] [Revised: 02/03/2023] [Indexed: 02/23/2023]
Abstract
Advances in machine learned interatomic potentials (MLIPs), such as those using neural networks, have resulted in short-range models that can infer interaction energies with near ab initio accuracy and orders of magnitude reduced computational cost. For many atom systems, including macromolecules, biomolecules, and condensed matter, model accuracy can become reliant on the description of short- and long-range physical interactions. The latter terms can be difficult to incorporate into an MLIP framework. Recent research has produced numerous models with considerations for nonlocal electrostatic and dispersion interactions, leading to a large range of applications that can be addressed using MLIPs. In light of this, we present a Perspective focused on key methodologies and models being used where the presence of nonlocal physics and chemistry are crucial for describing system properties. The strategies covered include MLIPs augmented with dispersion corrections, electrostatics calculated with charges predicted from atomic environment descriptors, the use of self-consistency and message passing iterations to propagated nonlocal system information, and charges obtained via equilibration schemes. We aim to provide a pointed discussion to support the development of machine learning-based interatomic potentials for systems where contributions from only nearsighted terms are deficient.
Collapse
Affiliation(s)
- Dylan
M. Anstine
- Department of Chemistry,
Mellon College of Science, Carnegie Mellon
University, Pittsburgh, Pennsylvania 15213, United States
| | - Olexandr Isayev
- Department of Chemistry,
Mellon College of Science, Carnegie Mellon
University, Pittsburgh, Pennsylvania 15213, United States
| |
Collapse
|
156
|
Stevens K, Thamwattana N, Tran‐Duc T. Continuum Modeling with Functional Lennard–Jones Parameters for DNA‐Graphene Interactions. ADVANCED THEORY AND SIMULATIONS 2023. [DOI: 10.1002/adts.202200896] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/14/2023]
Affiliation(s)
- Kyle Stevens
- School of Information and Physical Sciences University of Newcastle, University Dr Callaghan New South Wales 2308 Australia
| | - Ngamta Thamwattana
- School of Information and Physical Sciences University of Newcastle, University Dr Callaghan New South Wales 2308 Australia
| | - Thien Tran‐Duc
- School of Mathematical Sciences University of Adelaide Adelaide South Australia 5005 Australia
| |
Collapse
|
157
|
|
158
|
Schubert M, Panzarasa G, Burgert I. Sustainability in Wood Products: A New Perspective for Handling Natural Diversity. Chem Rev 2023; 123:1889-1924. [PMID: 36535040 DOI: 10.1021/acs.chemrev.2c00360] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Wood is a renewable resource with excellent qualities and the potential to become a key element of a future bioeconomy. The increasing environmental awareness and drive to achieve sustainability is leading to a resurgence of research on wood materials. Nevertheless, the global climate changes and associated consequences will soon challenge the wood-value chains in several regions (e.g., central Europe). To cope with these challenges, it is necessary to rethink the current practice of wood sourcing and transformation. The goal of this review is to address the intrinsic natural diversity of wood, from its origin to its technological consequences for the present and future manufacturing of wood products. So far, industrial processes have been optimized to repress the variability of wood properties, enabling more efficient processing and production of reliable products. However, the need to preserve biodiversity and the impact of climate change on forests call for new wood processing techniques and green chemistry protocols for wood modification as enabling factors necessary for managing a more diverse wood provision in the future. This article discusses the past developments that have resulted in the current wood value chains and provides a perspective about how natural variability could be turned into an asset for making truly sustainable wood products. After briefly introducing the chemical and structural complexity of wood, the methods conventionally adopted for industrial homogenization and modification of wood are discussed in relation to their evolution toward increased sustainability. Finally, a perspective is given on technological potentials of machine learning techniques and of novel functional wood materials. Here the main message is that through a combination of sustainable forestry, adherence to green chemistry principles and adapted processes based on machine learning, the wood industry could not only overcome current challenges but also thrive in the near future despite the awaiting challenges.
Collapse
Affiliation(s)
- Mark Schubert
- WoodTec Group, Cellulose & Wood Materials, Empa, CH-8600 Dübendorf, Switzerland
| | - Guido Panzarasa
- Wood Materials Science, Institute for Building Materials, ETH Zürich, CH-8093 Zurich, Switzerland
| | - Ingo Burgert
- WoodTec Group, Cellulose & Wood Materials, Empa, CH-8600 Dübendorf, Switzerland.,Wood Materials Science, Institute for Building Materials, ETH Zürich, CH-8093 Zurich, Switzerland
| |
Collapse
|
159
|
Kulichenko M, Barros K, Lubbers N, Li YW, Messerly R, Tretiak S, Smith JS, Nebgen B. Uncertainty-driven dynamics for active learning of interatomic potentials. NATURE COMPUTATIONAL SCIENCE 2023; 3:230-239. [PMID: 38177878 PMCID: PMC10766548 DOI: 10.1038/s43588-023-00406-5] [Citation(s) in RCA: 15] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Accepted: 01/24/2023] [Indexed: 01/06/2024]
Abstract
Machine learning (ML) models, if trained to data sets of high-fidelity quantum simulations, produce accurate and efficient interatomic potentials. Active learning (AL) is a powerful tool to iteratively generate diverse data sets. In this approach, the ML model provides an uncertainty estimate along with its prediction for each new atomic configuration. If the uncertainty estimate passes a certain threshold, then the configuration is included in the data set. Here we develop a strategy to more rapidly discover configurations that meaningfully augment the training data set. The approach, uncertainty-driven dynamics for active learning (UDD-AL), modifies the potential energy surface used in molecular dynamics simulations to favor regions of configuration space for which there is large model uncertainty. The performance of UDD-AL is demonstrated for two AL tasks: sampling the conformational space of glycine and sampling the promotion of proton transfer in acetylacetone. The method is shown to efficiently explore the chemically relevant configuration space, which may be inaccessible using regular dynamical sampling at target temperature conditions.
Collapse
Affiliation(s)
- Maksim Kulichenko
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM, USA.
| | - Kipton Barros
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM, USA
- Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - Nicholas Lubbers
- Computer, Computational, and Statistical Sciences Division, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - Ying Wai Li
- Computer, Computational, and Statistical Sciences Division, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - Richard Messerly
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - Sergei Tretiak
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM, USA
- Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, NM, USA
- Center for Integrated Nanotechnologies, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - Justin S Smith
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM, USA.
- Nvidia Corporation, Santa Clara, CA, USA.
| | - Benjamin Nebgen
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM, USA.
| |
Collapse
|
160
|
Feng C, Xi J, Zhang Y, Jiang B, Zhou Y. Accurate and Interpretable Dipole Interaction Model-Based Machine Learning for Molecular Polarizability. J Chem Theory Comput 2023; 19:1207-1217. [PMID: 36753749 DOI: 10.1021/acs.jctc.2c01094] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/10/2023]
Abstract
Polarizabilities play significant roles in describing dispersive and inductive interactions of the atom and molecular systems. However, an accurate prediction of molecular polarizabilities from first principles is computationally prohibitive. Although physical models or statistical machine learning models have been proposed, either a lack of accurate description of local chemical environments or demanding a large number of samples for training has limited their practical applications. In this study, we combine a physically inspired dipole interaction model and an accurate neural network method for predicting the polarizability tensors of molecules. With the local chemical environment precisely described and the requirement of rotational covariance naturally fulfilled, this hybrid model is proven to give an accurate molecular polarizability prediction, essentially reducing the number of training samples. The atomic polarizabilities are physically interpretable and transferable to larger molecules unseen in the training set. This promising method may find its wide range of applications, such as spectroscopic simulations and the construction of polarizable force fields.
Collapse
Affiliation(s)
- Chaoqiang Feng
- Anhui Key Laboratory of Optoelectric Materials Science and Technology, Department of Physics, Anhui Normal University, Wuhu, Anhui 241000, China.,Hefei National Research Center for Physical Sciences at the Microscale, Department of Chemical Physics, Key Laboratory of Surface and Interface Chemistry and Energy Catalysis of Anhui Higher Education Institutes, University of Science and Technology of China, Hefei, Anhui 230026, China
| | - Jin Xi
- Anhui Key Laboratory of Optoelectric Materials Science and Technology, Department of Physics, Anhui Normal University, Wuhu, Anhui 241000, China
| | - Yaolong Zhang
- Hefei National Research Center for Physical Sciences at the Microscale, Department of Chemical Physics, Key Laboratory of Surface and Interface Chemistry and Energy Catalysis of Anhui Higher Education Institutes, University of Science and Technology of China, Hefei, Anhui 230026, China
| | - Bin Jiang
- Hefei National Research Center for Physical Sciences at the Microscale, Department of Chemical Physics, Key Laboratory of Surface and Interface Chemistry and Energy Catalysis of Anhui Higher Education Institutes, University of Science and Technology of China, Hefei, Anhui 230026, China
| | - Yong Zhou
- Anhui Key Laboratory of Optoelectric Materials Science and Technology, Department of Physics, Anhui Normal University, Wuhu, Anhui 241000, China
| |
Collapse
|
161
|
Li H, Jiao Y, Davey K, Qiao SZ. Data-Driven Machine Learning for Understanding Surface Structures of Heterogeneous Catalysts. Angew Chem Int Ed Engl 2023; 62:e202216383. [PMID: 36509704 DOI: 10.1002/anie.202216383] [Citation(s) in RCA: 15] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Revised: 12/11/2022] [Accepted: 12/12/2022] [Indexed: 12/15/2022]
Abstract
The design of heterogeneous catalysts is necessarily surface-focused, generally achieved via optimization of adsorption energy and microkinetic modelling. A prerequisite is to ensure the adsorption energy is physically meaningful is the stable existence of the conceived active-site structure on the surface. The development of improved understanding of the catalyst surface, however, is challenging practically because of the complex nature of dynamic surface formation and evolution under in-situ reactions. We propose therefore data-driven machine-learning (ML) approaches as a solution. In this Minireview we summarize recent progress in using machine-learning to search and predict (meta)stable structures, assist operando simulation under reaction conditions and micro-environments, and critically analyze experimental characterization data. We conclude that ML will become the new norm to lower costs associated with discovery and design of optimal heterogeneous catalysts.
Collapse
Affiliation(s)
- Haobo Li
- School of Chemical Engineering and Advanced Materials, The University of Adelaide, Adelaide, SA 5005, Australia
| | - Yan Jiao
- School of Chemical Engineering and Advanced Materials, The University of Adelaide, Adelaide, SA 5005, Australia
| | - Kenneth Davey
- School of Chemical Engineering and Advanced Materials, The University of Adelaide, Adelaide, SA 5005, Australia
| | - Shi-Zhang Qiao
- School of Chemical Engineering and Advanced Materials, The University of Adelaide, Adelaide, SA 5005, Australia
| |
Collapse
|
162
|
Käser S, Vazquez-Salazar LI, Meuwly M, Töpfer K. Neural network potentials for chemistry: concepts, applications and prospects. DIGITAL DISCOVERY 2023; 2:28-58. [PMID: 36798879 PMCID: PMC9923808 DOI: 10.1039/d2dd00102k] [Citation(s) in RCA: 17] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/23/2022] [Accepted: 12/20/2022] [Indexed: 12/24/2022]
Abstract
Artificial Neural Networks (NN) are already heavily involved in methods and applications for frequent tasks in the field of computational chemistry such as representation of potential energy surfaces (PES) and spectroscopic predictions. This perspective provides an overview of the foundations of neural network-based full-dimensional potential energy surfaces, their architectures, underlying concepts, their representation and applications to chemical systems. Methods for data generation and training procedures for PES construction are discussed and means for error assessment and refinement through transfer learning are presented. A selection of recent results illustrates the latest improvements regarding accuracy of PES representations and system size limitations in dynamics simulations, but also NN application enabling direct prediction of physical results without dynamics simulations. The aim is to provide an overview for the current state-of-the-art NN approaches in computational chemistry and also to point out the current challenges in enhancing reliability and applicability of NN methods on a larger scale.
Collapse
Affiliation(s)
- Silvan Käser
- Department of Chemistry, University of Basel Klingelbergstrasse 80 CH-4056 Basel Switzerland
| | | | - Markus Meuwly
- Department of Chemistry, University of Basel Klingelbergstrasse 80 CH-4056 Basel Switzerland
| | - Kai Töpfer
- Department of Chemistry, University of Basel Klingelbergstrasse 80 CH-4056 Basel Switzerland
| |
Collapse
|
163
|
Guseva DV, Glagolev MK, Lazutin AA, Vasilevskaya VV. Revealing Structural and Physical Properties of Polylactide: What Simulation Can Do beyond the Experimental Methods. POLYM REV 2023. [DOI: 10.1080/15583724.2023.2174136] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/16/2023]
Affiliation(s)
- D. V. Guseva
- A. N. Nesmeyanov Institute of Organoelement Compounds RAS, Moscow, Russia
| | - M. K. Glagolev
- A. N. Nesmeyanov Institute of Organoelement Compounds RAS, Moscow, Russia
| | - A. A. Lazutin
- A. N. Nesmeyanov Institute of Organoelement Compounds RAS, Moscow, Russia
| | - V. V. Vasilevskaya
- A. N. Nesmeyanov Institute of Organoelement Compounds RAS, Moscow, Russia
- Chemistry Department, M. V. Lomonosov Moscow State University, Moscow, Russia
| |
Collapse
|
164
|
Elishav O, Podgaetsky R, Meikler O, Hirshberg B. Collective Variables for Conformational Polymorphism in Molecular Crystals. J Phys Chem Lett 2023; 14:971-976. [PMID: 36689770 PMCID: PMC9900638 DOI: 10.1021/acs.jpclett.2c03491] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Accepted: 01/12/2023] [Indexed: 06/17/2023]
Abstract
Controlling polymorphism in molecular crystals is crucial in the pharmaceutical, dye, and pesticide industries. However, its theoretical description is extremely challenging, due to the associated long time scales (>1 μs). We present an efficient procedure for identifying collective variables that promote transitions between conformational polymorphs in molecular dynamics simulations. It involves applying a simple dimensionality reduction algorithm to data from short (∼ps) simulations of the isolated conformers that correspond to each polymorph. We demonstrate the utility of our method in the challenging case of the important energetic material, CL-20, which has three anhydrous conformational polymorphs at ambient pressure. Using these collective variables in Metadynamics simulations, we observe transitions between all solid polymorphs in the biased trajectories. We reconstruct the free energy surface and identify previously unknown defect and intermediate forms in the transition from one known polymorph to another. Our method provides insights into complex conformational polymorphic transitions of flexible molecular crystals.
Collapse
Affiliation(s)
- Oren Elishav
- School
of Chemistry, Tel Aviv University, Tel Aviv 6997801, Israel
| | - Roy Podgaetsky
- School
of Chemistry, Tel Aviv University, Tel Aviv 6997801, Israel
| | - Olga Meikler
- Rafael
Ltd., P.O. Box 2250, Haifa 3102102, Israel
| | - Barak Hirshberg
- School
of Chemistry, Tel Aviv University, Tel Aviv 6997801, Israel
- The
Center for Computational Molecular and Materials Science, Tel Aviv University, Tel Aviv 6997801, Israel
- The
Ratner Center for Single Molecule Science, Tel Aviv University, Tel Aviv 6997801, Israel
| |
Collapse
|
165
|
Wang L, Ore RM, Jayamaha PK, Wu ZP, Zhong CJ. Density functional theory based computational investigations on the stability of highly active trimetallic PtPdCu nanoalloys for electrochemical oxygen reduction. Faraday Discuss 2023; 242:429-442. [PMID: 36173024 DOI: 10.1039/d2fd00101b] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
Activity, cost, and durability are the trinity of catalysis research for the electrochemical oxygen reduction reaction (ORR). While studies towards increasing activity and reducing cost of ORR catalysts have been carried out extensively, much effort is needed in durability investigation of highly active ORR catalysts. In this work, we examined the stability of a trimetallic PtPdCu catalyst that has demonstrated high activity and incredible durability during ORR using density functional theory (DFT) based computations. Specifically, we studied the processes of dissolution/deposition and diffusion between the surface and inner layer of Cu species of Pt20Pd20Cu60 catalysts at electrode potentials up to 1.2 V to understand their role towards stabilizing Pt20Pd20Cu60 catalysts. The results show there is a dynamic Cu surface composition range that is dictated by the interplay of the four processes, dissolution, deposition, diffusion from the surface to inner layer, and diffusion from the inner layer to the surface of Cu species, in the stability and observed oscillation of lattice constants of Cu-rich PtPdCu nanoalloys.
Collapse
Affiliation(s)
- Lichang Wang
- School of Chemical and Biomolecular Sciences and the Materials Technology Center, Southern Illinois University, Carbondale, IL 62901, USA.
| | - Rotimi M Ore
- School of Chemical and Biomolecular Sciences and the Materials Technology Center, Southern Illinois University, Carbondale, IL 62901, USA.
| | - Peshala K Jayamaha
- School of Chemical and Biomolecular Sciences and the Materials Technology Center, Southern Illinois University, Carbondale, IL 62901, USA.
| | - Zhi-Peng Wu
- Department of Chemistry, State University of New York at Binghamton, Binghamton, NY 13902, USA
| | - Chuan-Jian Zhong
- Department of Chemistry, State University of New York at Binghamton, Binghamton, NY 13902, USA
| |
Collapse
|
166
|
Chmiela S, Vassilev-Galindo V, Unke OT, Kabylda A, Sauceda HE, Tkatchenko A, Müller KR. Accurate global machine learning force fields for molecules with hundreds of atoms. SCIENCE ADVANCES 2023; 9:eadf0873. [PMID: 36630510 PMCID: PMC9833674 DOI: 10.1126/sciadv.adf0873] [Citation(s) in RCA: 40] [Impact Index Per Article: 40.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/29/2022] [Accepted: 11/28/2022] [Indexed: 05/25/2023]
Abstract
Global machine learning force fields, with the capacity to capture collective interactions in molecular systems, now scale up to a few dozen atoms due to considerable growth of model complexity with system size. For larger molecules, locality assumptions are introduced, with the consequence that nonlocal interactions are not described. Here, we develop an exact iterative approach to train global symmetric gradient domain machine learning (sGDML) force fields (FFs) for several hundred atoms, without resorting to any potentially uncontrolled approximations. All atomic degrees of freedom remain correlated in the global sGDML FF, allowing the accurate description of complex molecules and materials that present phenomena with far-reaching characteristic correlation lengths. We assess the accuracy and efficiency of sGDML on a newly developed MD22 benchmark dataset containing molecules from 42 to 370 atoms. The robustness of our approach is demonstrated in nanosecond path-integral molecular dynamics simulations for supramolecular complexes in the MD22 dataset.
Collapse
Affiliation(s)
- Stefan Chmiela
- Machine Learning Group, Technische Universität Berlin, 10587 Berlin, Germany
- Berlin Institute for the Foundations of Learning and Data – BIFOLD, Germany
| | - Valentin Vassilev-Galindo
- Department of Physics and Materials Science, University of Luxembourg, L-1511 Luxembourg City, Luxembourg
| | - Oliver T. Unke
- Machine Learning Group, Technische Universität Berlin, 10587 Berlin, Germany
- Google Research, Brain Team, Berlin, Germany
| | - Adil Kabylda
- Department of Physics and Materials Science, University of Luxembourg, L-1511 Luxembourg City, Luxembourg
| | - Huziel E. Sauceda
- Machine Learning Group, Technische Universität Berlin, 10587 Berlin, Germany
- Berlin Institute for the Foundations of Learning and Data – BIFOLD, Germany
- Departamento de Materia Condensada, Instituto de Física, Universidad Nacional Autónoma de México, Cd. de México C.P. 04510, Mexico
- BASLEARN - TU Berlin/BASF Joint Lab for Machine Learning, Technische Universität Berlin, 10587 Berlin, Germany
| | - Alexandre Tkatchenko
- Department of Physics and Materials Science, University of Luxembourg, L-1511 Luxembourg City, Luxembourg
| | - Klaus-Robert Müller
- Machine Learning Group, Technische Universität Berlin, 10587 Berlin, Germany
- Berlin Institute for the Foundations of Learning and Data – BIFOLD, Germany
- Google Research, Brain Team, Berlin, Germany
- Max Planck Institute for Informatics, Stuhlsatzenhausweg, 66123 Saarbrücken, Germany
- Department of Artificial Intelligence, Korea University, Anam-dong, Seongbuk-gu, Seoul 02841, Korea
| |
Collapse
|
167
|
Xia S, Zhang D, Zhang Y. Multitask Deep Ensemble Prediction of Molecular Energetics in Solution: From Quantum Mechanics to Experimental Properties. J Chem Theory Comput 2023; 19:10.1021/acs.jctc.2c01024. [PMID: 36607141 PMCID: PMC10323048 DOI: 10.1021/acs.jctc.2c01024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
The past few years have witnessed significant advances in developing machine learning methods for molecular energetics predictions, including calculated electronic energies with high-level quantum mechanical methods and experimental properties, such as solvation free energy and logP. Typically, task-specific machine learning models are developed for distinct prediction tasks. In this work, we present a multitask deep ensemble model, sPhysNet-MT-ens5, which can simultaneously and accurately predict electronic energies of molecules in gas, water, and octanol phases, as well as transfer free energies at both calculated and experimental levels. On the calculated data set Frag20-solv-678k, which is developed in this work and contains 678,916 molecular conformations, up to 20 heavy atoms, and their properties calculated at B3LYP/6-31G* level of theory with continuum solvent models, sPhysNet-MT-ens5 predicts density functional theory (DFT)-level electronic energies directly from force field-optimized geometry within chemical accuracy. On the experimental data sets, sPhysNet-MT-ens5 achieves state-of-the-art performances, which predict both experimental hydration free energy with a RMSE of 0.620 kcal/mol on the FreeSolv data set and experimental logP with a RMSE of 0.393 on the PHYSPROP data set. Furthermore, sPhysNet-MT-ens5 also provides a reasonable estimation of model uncertainty which shows correlations with prediction error. Finally, by analyzing the atomic contributions of its predictions, we find that the developed deep learning model is aware of the chemical environment of each atom by assigning reasonable atomic contributions consistent with our chemical knowledge.
Collapse
Affiliation(s)
- Song Xia
- Department of Chemistry, New York University, New York, New York 10003, United States
| | - Dongdong Zhang
- Department of Chemistry, New York University, New York, New York 10003, United States
| | - Yingkai Zhang
- Department of Chemistry, New York University, New York, New York 10003, United States
- Simons Center for Computational Physical Chemistry at New York University, New York, New York 10003, United States
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai 200062, China
| |
Collapse
|
168
|
Zhang W, Huang W, Tan J, Huang D, Ma J, Wu B. Modeling, optimization and understanding of adsorption process for pollutant removal via machine learning: Recent progress and future perspectives. CHEMOSPHERE 2023; 311:137044. [PMID: 36330979 DOI: 10.1016/j.chemosphere.2022.137044] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Revised: 10/22/2022] [Accepted: 10/25/2022] [Indexed: 06/16/2023]
Abstract
It is crucial to reduce the concentration of pollutants in water environment to below safe levels. Some cost-effective pollutant removal technologies have been developed, among which adsorption technology is considered as a promising solution. However, the batch experiments and adsorption isotherms widely employed at present are inefficient and time-consuming to some extent, which limits the development of adsorption technology. As a new research paradigm, machine learning (ML) is expected to innovate traditional adsorption models. This reviews summarized the general workflow of ML and commonly employed ML algorithms for pollutant adsorption. Then, the latest progress of ML for pollutant adsorption was reviewed from the perspective of all-round regulation of adsorption process, including adsorption efficiency, operating conditions and adsorption mechanism. General guidelines of ML for pollutant adsorption were presented. Finally, the existing problems and future perspectives of ML for pollutant adsorption were put forward. We highly expect that this review will promote the application of ML in pollutant adsorption and improve the interpretability of ML.
Collapse
Affiliation(s)
- Wentao Zhang
- Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen, 518055, People's Republic of China
| | - Wenguang Huang
- South China Institute of Environmental Sciences, Ministry of Ecology and Environment of PR China, Guangzhou, 510655, People's Republic of China.
| | - Jie Tan
- South China Institute of Environmental Sciences, Ministry of Ecology and Environment of PR China, Guangzhou, 510655, People's Republic of China
| | - Dawei Huang
- South China Institute of Environmental Sciences, Ministry of Ecology and Environment of PR China, Guangzhou, 510655, People's Republic of China
| | - Jun Ma
- South China Institute of Environmental Sciences, Ministry of Ecology and Environment of PR China, Guangzhou, 510655, People's Republic of China
| | - Bingdang Wu
- School of Environmental Science and Engineering, Suzhou University of Science and Technology, Suzhou, 215009, People's Republic of China; Key Laboratory of Suzhou Sponge City Technology, Suzhou, 215002, People's Republic of China.
| |
Collapse
|
169
|
Atom hybridization of metallic elements: Emergence of subnano metallurgy for the post-nanotechnology. Coord Chem Rev 2023. [DOI: 10.1016/j.ccr.2022.214826] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
170
|
Zhang KH, Jiang Y, Zhang LS. Inferring the Physics of Structural Evolution of Multicomponent Polymers via Machine-Learning-Accelerated Method. CHINESE JOURNAL OF POLYMER SCIENCE 2022. [DOI: 10.1007/s10118-023-2891-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
|
171
|
Gelin MF, Chen L, Domcke W. Equation-of-Motion Methods for the Calculation of Femtosecond Time-Resolved 4-Wave-Mixing and N-Wave-Mixing Signals. Chem Rev 2022; 122:17339-17396. [PMID: 36278801 DOI: 10.1021/acs.chemrev.2c00329] [Citation(s) in RCA: 23] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
Femtosecond nonlinear spectroscopy is the main tool for the time-resolved detection of photophysical and photochemical processes. Since most systems of chemical interest are rather complex, theoretical support is indispensable for the extraction of the intrinsic system dynamics from the detected spectroscopic responses. There exist two alternative theoretical formalisms for the calculation of spectroscopic signals, the nonlinear response-function (NRF) approach and the spectroscopic equation-of-motion (EOM) approach. In the NRF formalism, the system-field interaction is assumed to be sufficiently weak and is treated in lowest-order perturbation theory for each laser pulse interacting with the sample. The conceptual alternative to the NRF method is the extraction of the spectroscopic signals from the solutions of quantum mechanical, semiclassical, or quasiclassical EOMs which govern the time evolution of the material system interacting with the radiation field of the laser pulses. The NRF formalism and its applications to a broad range of material systems and spectroscopic signals have been comprehensively reviewed in the literature. This article provides a detailed review of the suite of EOM methods, including applications to 4-wave-mixing and N-wave-mixing signals detected with weak or strong fields. Under certain circumstances, the spectroscopic EOM methods may be more efficient than the NRF method for the computation of various nonlinear spectroscopic signals.
Collapse
Affiliation(s)
- Maxim F Gelin
- School of Science, Hangzhou Dianzi University, Hangzhou 310018, China
| | - Lipeng Chen
- Max-Planck-Institut für Physik komplexer Systeme, Nöthnitzer Strasse 38, D-01187 Dresden, Germany
| | - Wolfgang Domcke
- Department of Chemistry, Technical University of Munich, D-85747 Garching,Germany
| |
Collapse
|
172
|
Ranbir, Kumar M, Singh G, Singh J, Kaur N, Singh N. Machine Learning-Based Analytical Systems: Food Forensics. ACS OMEGA 2022; 7:47518-47535. [PMID: 36591133 PMCID: PMC9798398 DOI: 10.1021/acsomega.2c05632] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Accepted: 11/29/2022] [Indexed: 02/06/2024]
Abstract
Despite a large amount of money being spent on both food analyses and control measures, various food-borne illnesses associated with pathogens, toxins, pesticides, adulterants, colorants, and other contaminants pose a serious threat to human health, and thus food safety draws considerable attention in the modern pace of the world. The presence of various biogenic amines in processed food have been frequently considered as the primary quality parameter in order to check food freshness and spoilage of protein-rich food. Various conventional detection methods for detecting hazardous analytes including microscopy, nucleic acid, and immunoassay-based techniques have been employed; however, recently, array-based sensing strategies are becoming popular for the development of a highly accurate and precise analytical method. Array-based sensing is majorly facilitated by the advancements in multivariate analytical techniques as well as machine learning-based approaches. These techniques allow one to solve the typical problem associated with the interpretation of the complex response patterns generated in array-based strategies. Consequently, the machine learning-based neural networks enable the fast, robust, and accurate detection of analytes using sensor arrays. Thus, for commercial applications, most of the focus has shifted toward the development of analytical methods based on electrical and chemical sensor arrays. Therefore, herein, we briefly highlight and review the recently reported array-based sensor systems supported by machine learning and multivariate analytics to monitor food safety and quality in the field of food forensics.
Collapse
Affiliation(s)
- Ranbir
- Department
of Chemistry, Indian Institute of Technology
Ropar, Rupnagar 140001, Punjab, India
| | - Manish Kumar
- Department
of Chemistry, Indian Institute of Technology
Ropar, Rupnagar 140001, Punjab, India
| | - Gagandeep Singh
- Department
of Biomedical Engineering, Indian Institute
of Technology Ropar, Rupnagar 140001, Punjab, India
| | - Jasvir Singh
- Department
of Chemistry, Himachal Pradesh University, Shimla 171005, India
| | - Navneet Kaur
- Department
of Chemistry, Panjab University, Chandigarh 160014, India
| | - Narinder Singh
- Department
of Chemistry, Indian Institute of Technology
Ropar, Rupnagar 140001, Punjab, India
- Department
of Biomedical Engineering, Indian Institute
of Technology Ropar, Rupnagar 140001, Punjab, India
| |
Collapse
|
173
|
Zhang W, Huang W, Tan J, Guo Q, Wu B. Heterogeneous catalysis mediated by light, electricity and enzyme via machine learning: Paradigms, applications and prospects. CHEMOSPHERE 2022; 308:136447. [PMID: 36116627 DOI: 10.1016/j.chemosphere.2022.136447] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/09/2022] [Revised: 09/08/2022] [Accepted: 09/11/2022] [Indexed: 06/15/2023]
Abstract
Energy crisis and environmental pollution have become the bottleneck of human sustainable development. Therefore, there is an urgent need to develop new catalysts for energy production and environmental remediation. Due to the high cost caused by blind screening and limited valuable computing resources, the traditional experimental methods and theoretical calculations are difficult to meet with the requirements. In the past decades, computer science has made great progress, especially in the field of machine learning (ML). As a new research paradigm, ML greatly accelerates the theoretical calculation methods represented by first principal calculation and molecular dynamics, and establish the physical picture of heterogeneous catalytic processes for energy and environment. This review firstly summarized the general research paradigms of ML in the discovery of catalysts. Then, the latest progresses of ML in light-, electricity- and enzyme-mediated heterogeneous catalysis were reviewed from the perspective of catalytic performance, operating conditions and reaction mechanism. The general guidelines of ML for heterogeneous catalysis were proposed. Finally, the existing problems and future development trend of ML in heterogeneous catalysis mediated by light, electricity and enzyme were summarized. We highly expect that this review will facilitate the interaction between ML and heterogeneous catalysis, and illuminate the development prospect of heterogeneous catalysis.
Collapse
Affiliation(s)
- Wentao Zhang
- Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen, 518055, People's Republic of China
| | - Wenguang Huang
- South China Institute of Environmental Sciences, Ministry of Ecology and Environment of PRC, Guangzhou, 510655, People's Republic of China.
| | - Jie Tan
- South China Institute of Environmental Sciences, Ministry of Ecology and Environment of PRC, Guangzhou, 510655, People's Republic of China
| | - Qingwei Guo
- South China Institute of Environmental Sciences, Ministry of Ecology and Environment of PRC, Guangzhou, 510655, People's Republic of China
| | - Bingdang Wu
- School of Environmental Science and Engineering, Suzhou University of Science and Technology, Suzhou, 215009, People's Republic of China; Key Laboratory of Suzhou Sponge City Technology, Suzhou, 215002, People's Republic of China.
| |
Collapse
|
174
|
Vazquez-Salazar LI, Boittier ED, Meuwly M. Uncertainty quantification for predictions of atomistic neural networks. Chem Sci 2022; 13:13068-13084. [PMID: 36425481 PMCID: PMC9667919 DOI: 10.1039/d2sc04056e] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Accepted: 10/16/2022] [Indexed: 12/31/2023] Open
Abstract
The value of uncertainty quantification on predictions for trained neural networks (NNs) on quantum chemical reference data is quantitatively explored. For this, the architecture of the PhysNet NN was suitably modified and the resulting model (PhysNet-DER) was evaluated with different metrics to quantify its calibration, the quality of its predictions, and whether prediction error and the predicted uncertainty can be correlated. Training on the QM9 database and evaluating data in the test set within and outside the distribution indicate that error and uncertainty are not linearly related. However, the observed variance provides insight into the quality of the data used for training. Additionally, the influence of the chemical space covered by the training data set was studied by using a biased database. The results clarify that noise and redundancy complicate property prediction for molecules even in cases for which changes - such as double bond migration in two otherwise identical molecules - are small. The model was also applied to a real database of tautomerization reactions. Analysis of the distance between members in feature space in combination with other parameters shows that redundant information in the training dataset can lead to large variances and small errors whereas the presence of similar but unspecific information returns large errors but small variances. This was, e.g., observed for nitro-containing aliphatic chains for which predictions were difficult although the training set contained several examples for nitro groups bound to aromatic molecules. The finding underlines the importance of the composition of the training data and provides chemical insight into how this affects the prediction capabilities of a ML model. Finally, the presented method can be used for information-based improvement of chemical databases for target applications through active learning optimization.
Collapse
Affiliation(s)
| | - Eric D Boittier
- Department of Chemistry, University of Basel Basel Switzerland
| | - Markus Meuwly
- Department of Chemistry, University of Basel Basel Switzerland
- Department of Chemistry, Brown University USA
| |
Collapse
|
175
|
Kraka E, Quintano M, La Force HW, Antonio JJ, Freindorf M. The Local Vibrational Mode Theory and Its Place in the Vibrational Spectroscopy Arena. J Phys Chem A 2022; 126:8781-8798. [DOI: 10.1021/acs.jpca.2c05962] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Affiliation(s)
- Elfi Kraka
- Computational and Theoretical Chemistry Group (CATCO), Department of Chemistry, Southern Methodist University, 3215 Daniel Ave, Dallas, Texas75275-0314, United States
| | - Mateus Quintano
- Computational and Theoretical Chemistry Group (CATCO), Department of Chemistry, Southern Methodist University, 3215 Daniel Ave, Dallas, Texas75275-0314, United States
| | - Hunter W. La Force
- Computational and Theoretical Chemistry Group (CATCO), Department of Chemistry, Southern Methodist University, 3215 Daniel Ave, Dallas, Texas75275-0314, United States
| | - Juliana J. Antonio
- Computational and Theoretical Chemistry Group (CATCO), Department of Chemistry, Southern Methodist University, 3215 Daniel Ave, Dallas, Texas75275-0314, United States
| | - Marek Freindorf
- Computational and Theoretical Chemistry Group (CATCO), Department of Chemistry, Southern Methodist University, 3215 Daniel Ave, Dallas, Texas75275-0314, United States
| |
Collapse
|
176
|
Cuierrier E, Roy PO, Wang R, Ernzerhof M. The fourth-order expansion of the exchange hole and neural networks to construct exchange–correlation functionals. J Chem Phys 2022; 157:171103. [DOI: 10.1063/5.0122761] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
The curvature Q σ of spherically averaged exchange (X) holes ρX, σ(r, u) is one of the crucial variables for the construction of approximations to the exchange–correlation energy of Kohn–Sham theory, the most prominent example being the Becke–Roussel model [A. D. Becke and M. R. Roussel, Phys. Rev. A 39, 3761 (1989)]. Here, we consider the next higher nonzero derivative of the spherically averaged X hole, the fourth-order term T σ. This variable contains information about the nonlocality of the X hole and we employ it to approximate hybrid functionals, eliminating the sometimes demanding calculation of the exact X energy. The new functional is constructed using machine learning; having identified a physical correlation between T σ and the nonlocality of the X hole, we employ a neural network to express this relation. While we only modify the X functional of the Perdew–Burke–Ernzerhof functional [Perdew et al., Phys. Rev. Lett. 77, 3865 (1996)], a significant improvement over this method is achieved.
Collapse
Affiliation(s)
- Etienne Cuierrier
- Département de Chimie, Université de Montréal, C.P. 6128 Succursale A, Montréal, Québec H3C 3J7, Canada
| | - Pierre-Olivier Roy
- Département de Chimie, Université de Montréal, C.P. 6128 Succursale A, Montréal, Québec H3C 3J7, Canada
| | - Rodrigo Wang
- Good Chemistry Company, Vancouver, British Columbia V6E 4B1, Canada
| | - Matthias Ernzerhof
- Département de Chimie, Université de Montréal, C.P. 6128 Succursale A, Montréal, Québec H3C 3J7, Canada
| |
Collapse
|
177
|
A New Theobromine-Based EGFRWT and EGFRT790M Inhibitor and Apoptosis Inducer: Design, Semi-Synthesis, Docking, DFT, MD Simulations, and In Vitro Studies. Processes (Basel) 2022. [DOI: 10.3390/pr10112290] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
The essential pharmacophoric structural properties were applied to design a new derivative of theobromine as an antiangiogenic EGFR inhibitor. The designed candidate is a (para-nitrophenyl)acetamide derivative of the natural alkaloid, theobromine (T-2-PNPA). The potentialities of T-2-PNPA to inhibit the EGFR protein were studied computationally in an extensive way. Firstly, the molecular docking against EGFRWT and EGFRT790M demonstrated T-2-PNPA’s capabilities of binding with the targeted receptors. Then, the MD experiments (for 100 ns) illustrated through six different studies the changes that occurred in the energy as well as in the structure of EGFR–T-2-PNPA complex. Additionally, an MM-GBSA analysis determined the exact energy of binding and the essential residues. Furthermore, DFT calculations investigated the stability, reactivity, and electrostatic potential of T-2-PNPA. Finally, ADMET and toxicity studies confirmed both the safety as well as the general likeness of T-2-PNPA. Consequently, T-2-PNPA was prepared for the in vitro biological studies. T-2-PNPA inhibited EGFRWT and EGFRT790M with IC50 values of 7.05 and 126.20 nM, respectively, which is comparable with erlotinib activities (5.91 and 202.40, respectively). Interestingly, T-2-PNPA expressed cytotoxic potentialities against A549 and HCT-116 cells with IC50 values of 11.09 and 21.01 µM, respectively, which is again comparable with erlotinib activities (6.73 and 16.35, respectively). T-2-PNPA was much safer against WI-38 (IC50 = 48.06 µM) than erlotinib (IC50 = 31.17 µM). The calculated selectivity indices of T-2-PNPA against A549 and HCT-116 cells were 4.3 and 2.3, respectively. This manuscript presents a new lead anticancer compound (T-2-PNPA) that has been synthesized for the first time and exhibited promising in silico and in vitro anticancer potentialities.
Collapse
|
178
|
Schmitz N, Müller KR, Chmiela S. Algorithmic Differentiation for Automated Modeling of Machine Learned Force Fields. J Phys Chem Lett 2022; 13:10183-10189. [PMID: 36279418 PMCID: PMC9639201 DOI: 10.1021/acs.jpclett.2c02632] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2022] [Accepted: 10/20/2022] [Indexed: 05/09/2023]
Abstract
Reconstructing force fields (FFs) from atomistic simulation data is a challenge since accurate data can be highly expensive. Here, machine learning (ML) models can help to be data economic as they can be successfully constrained using the underlying symmetry and conservation laws of physics. However, so far, every descriptor newly proposed for an ML model has required a cumbersome and mathematically tedious remodeling. We therefore propose using modern techniques from algorithmic differentiation within the ML modeling process, effectively enabling the usage of novel descriptors or models fully automatically at an order of magnitude higher computational efficiency. This paradigmatic approach enables not only a versatile usage of novel representations and the efficient computation of larger systems─all of high value to the FF community─but also the simple inclusion of further physical knowledge, such as higher-order information (e.g., Hessians, more complex partial differential equations constraints etc.), even beyond the presented FF domain.
Collapse
Affiliation(s)
| | - Klaus-Robert Müller
- Machine
Learning Group, Technische Universität
Berlin, 10587Berlin, Germany
- BIFOLD
- Berlin Institute for the Foundations of Learning and Data, 10587Berlin, Germany
- Department
of Artificial Intelligence, Korea University, Seongbuk-gu, Seoul02841, Korea
- Max
Planck Institute for Informatics, Stuhlsatzenhausweg, 66123Saarbrücken, Germany
- Google
Research, Brain Team, 10117Berlin, Germany
| | - Stefan Chmiela
- Machine
Learning Group, Technische Universität
Berlin, 10587Berlin, Germany
- BIFOLD
- Berlin Institute for the Foundations of Learning and Data, 10587Berlin, Germany
| |
Collapse
|
179
|
Fan G, McSloy A, Aradi B, Yam CY, Frauenheim T. Obtaining Electronic Properties of Molecules through Combining Density Functional Tight Binding with Machine Learning. J Phys Chem Lett 2022; 13:10132-10139. [PMID: 36269857 DOI: 10.1021/acs.jpclett.2c02586] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
We have introduced a machine learning workflow that allows for optimizing electronic properties in the density functional tight binding method (DFTB). The workflow allows for the optimization of electronic properties by generating two-center integrals, either by training basis function parameters directly or by training a spline model for the diatomic integrals, which are then used to build the Hamiltonian and the overlap matrices. Using our workflow, we have managed to obtain improved electronic properties, such as charge distributions, dipole moments, and approximated polarizabilities. While both machine learning approaches enabled us to improve on the electronic properties of the molecules as compared with existing DFTB parametrizations, only by training on the basis function parameters we were able to obtain consistent Hamiltonians and overlap matrices in the physically reasonable ranges or to improve on multiple electronic properties simultaneously.
Collapse
Affiliation(s)
- Guozheng Fan
- Bremen Center for Computational Materials Science, University of Bremen, 28359Bremen, Germany
| | - Adam McSloy
- Bremen Center for Computational Materials Science, University of Bremen, 28359Bremen, Germany
| | - Bálint Aradi
- Bremen Center for Computational Materials Science, University of Bremen, 28359Bremen, Germany
| | - Chi-Yung Yam
- Shenzhen Institute for Advanced Study, University of Electronic Science and Technology of China, Shenzhen, 518000China
| | - Thomas Frauenheim
- Bremen Center for Computational Materials Science, University of Bremen, 28359Bremen, Germany
- Beijing Computational Science Research Center, 100193Beijing, China
- Shenzhen JL Computational Science and Applied Research Institute, 518110Shenzhen, China
| |
Collapse
|
180
|
Schnake T, Eberle O, Lederer J, Nakajima S, Schutt KT, Muller KR, Montavon G. Higher-Order Explanations of Graph Neural Networks via Relevant Walks. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2022; 44:7581-7596. [PMID: 34559639 DOI: 10.1109/tpami.2021.3115452] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Graph Neural Networks (GNNs) are a popular approach for predicting graph structured data. As GNNs tightly entangle the input graph into the neural network structure, common explainable AI approaches are not applicable. To a large extent, GNNs have remained black-boxes for the user so far. In this paper, we show that GNNs can in fact be naturally explained using higher-order expansions, i.e., by identifying groups of edges that jointly contribute to the prediction. Practically, we find that such explanations can be extracted using a nested attribution scheme, where existing techniques such as layer-wise relevance propagation (LRP) can be applied at each step. The output is a collection of walks into the input graph that are relevant for the prediction. Our novel explanation method, which we denote by GNN-LRP, is applicable to a broad range of graph neural networks and lets us extract practically relevant insights on sentiment analysis of text data, structure-property relationships in quantum chemistry, and image classification.
Collapse
|
181
|
Reid AG, Hooe SL, Moreno JJ, Dickie DA, Machan CW. Homogeneous Electrocatalytic Reduction of CO 2 by a CrN 3O Complex: Electronic Coupling with a Redox-Active Terpyridine Fragment Favors Selectivity for CO. Inorg Chem 2022; 61:16963-16970. [PMID: 36260749 DOI: 10.1021/acs.inorgchem.2c02013] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Electrocatalyst design and optimization strategies continue to be an active area of research interest for the applied use of renewable energy resources. The electrocatalytic conversion of carbon dioxide (CO2) is an attractive approach in this context because of the added potential benefit of addressing its rising atmospheric concentrations. In previous experimental and computational studies, we have described the mechanism of the first molecular Cr complex capable of electrocatalytically reducing CO2 to carbon monoxide (CO) in the presence of an added proton donor, which contained a redox-active 2,2'-bipyridine (bpy) fragment, CrN2O2. The high selectivity for CO in the bpy-based system was dependent on a delocalized CrII(bpy•-) active state. Subsequently, we became interested in exploring how expanding the polypyridyl ligand core would impact the selectivity and activity during electrocatalytic CO2 reduction. Here, we report a new CrN3O catalyst, Cr(tpytbupho)Cl2 (1), where 2-(2,2':6',2″-terpyridin-6-yl)-4,6-di-tert-butylphenolate = [tpytbupho]-, which reduces CO2 to CO with almost quantitative selectivity via a different mechanism than our previously reported Cr(tbudhbpy)Cl(H2O) catalyst. Computational analyses indicate that, although the stoichiometry of both reactions is identical, changes in the observed rate law are the combined result of a decrease in the intrinsic ligand charge (L3X vs L2X2) and an increase in the ligand redox activity, which result in increased electronic coupling between the doubly reduced tpy fragment of the ligand and the CrII center. The strong electronic coupling enhances the rate of protonation and subsequent C-OH bond cleavage, resulting in CO2 binding becoming the rate-determining step, which is an uncommon mechanism during protic CO2 reduction.
Collapse
Affiliation(s)
- Amelia G Reid
- Department of Chemistry, University of Virginia, P.O. Box 400319, Charlottesville, Virginia22904-4319, United States
| | - Shelby L Hooe
- Department of Chemistry, University of Virginia, P.O. Box 400319, Charlottesville, Virginia22904-4319, United States
| | - Juan J Moreno
- Department of Chemistry, University of Virginia, P.O. Box 400319, Charlottesville, Virginia22904-4319, United States
| | - Diane A Dickie
- Department of Chemistry, University of Virginia, P.O. Box 400319, Charlottesville, Virginia22904-4319, United States
| | - Charles W Machan
- Department of Chemistry, University of Virginia, P.O. Box 400319, Charlottesville, Virginia22904-4319, United States
| |
Collapse
|
182
|
Krenn M, Ai Q, Barthel S, Carson N, Frei A, Frey NC, Friederich P, Gaudin T, Gayle AA, Jablonka KM, Lameiro RF, Lemm D, Lo A, Moosavi SM, Nápoles-Duarte JM, Nigam A, Pollice R, Rajan K, Schatzschneider U, Schwaller P, Skreta M, Smit B, Strieth-Kalthoff F, Sun C, Tom G, Falk von Rudorff G, Wang A, White AD, Young A, Yu R, Aspuru-Guzik A. SELFIES and the future of molecular string representations. PATTERNS (NEW YORK, N.Y.) 2022; 3:100588. [PMID: 36277819 PMCID: PMC9583042 DOI: 10.1016/j.patter.2022.100588] [Citation(s) in RCA: 44] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Artificial intelligence (AI) and machine learning (ML) are expanding in popularity for broad applications to challenging tasks in chemistry and materials science. Examples include the prediction of properties, the discovery of new reaction pathways, or the design of new molecules. The machine needs to read and write fluently in a chemical language for each of these tasks. Strings are a common tool to represent molecular graphs, and the most popular molecular string representation, Smiles, has powered cheminformatics since the late 1980s. However, in the context of AI and ML in chemistry, Smiles has several shortcomings-most pertinently, most combinations of symbols lead to invalid results with no valid chemical interpretation. To overcome this issue, a new language for molecules was introduced in 2020 that guarantees 100% robustness: SELF-referencing embedded string (Selfies). Selfies has since simplified and enabled numerous new applications in chemistry. In this perspective, we look to the future and discuss molecular string representations, along with their respective opportunities and challenges. We propose 16 concrete future projects for robust molecular representations. These involve the extension toward new chemical domains, exciting questions at the interface of AI and robust languages, and interpretability for both humans and machines. We hope that these proposals will inspire several follow-up works exploiting the full potential of molecular string representations for the future of AI in chemistry and materials science.
Collapse
Affiliation(s)
- Mario Krenn
- Max Planck Institute for the Science of Light (MPL), Erlangen, Germany
| | - Qianxiang Ai
- Department of Chemistry, Fordham University, The Bronx, NY, USA
| | - Senja Barthel
- Department of Mathematics, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
| | - Nessa Carson
- Syngenta Jealott’s Hill International Research Centre, Bracknell, Berkshire, UK
| | - Angelo Frei
- Department of Chemistry, Imperial College London, Molecular Sciences Research Hub, White City Campus, Wood Lane, London, UK
| | - Nathan C. Frey
- Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Pascal Friederich
- Institute of Theoretical Informatics, Karlsruhe Institute of Technology, Karlsruhe, Germany
- Institute of Nanotechnology, Karlsruhe Institute of Technology, Eggenstein-Leopoldshafen, Germany
| | - Théophile Gaudin
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
- IBM Research Europe, Zürich, Switzerland
| | | | - Kevin Maik Jablonka
- Laboratory of Molecular Simulation (LSMO), Institut des Sciences et Ingénierie Chimiques, Ecole Polytechnique Fédérale de Lausanne (EPFL), Sion, Valais, Switzerland
| | - Rafael F. Lameiro
- Medicinal and Biological Chemistry Group, São Carlos Institute of Chemistry, University of São Paulo, São Paulo, Brazil
| | - Dominik Lemm
- Faculty of Physics, University of Vienna, Vienna, Austria
| | - Alston Lo
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
| | - Seyed Mohamad Moosavi
- Department of Mathematics and Computer Science, Freie Universität Berlin, Berlin, Germany
| | | | - AkshatKumar Nigam
- Department of Computer Science, Stanford University, Stanford, CA, USA
| | - Robert Pollice
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
- Chemical Physics Theory Group, Department of Chemistry, University of Toronto, Toronto, ON, Canada
| | - Kohulan Rajan
- Institute for Inorganic and Analytical Chemistry, Friedrich-Schiller Universität Jena, Jena, Germany
| | - Ulrich Schatzschneider
- Institut für Anorganische Chemie, Julius-Maximilians-Universität Würzburg, Würzburg, Germany
| | - Philippe Schwaller
- IBM Research Europe, Zürich, Switzerland
- Laboratory of Artificial Chemical Intelligence (LIAC), Institut des Sciences et Ingénierie Chimiques, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
- National Centre of Competence in Research (NCCR) Catalysis, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Marta Skreta
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
- Vector Institute for Artificial Intelligence, Toronto, ON, Canada
| | - Berend Smit
- Laboratory of Molecular Simulation (LSMO), Institut des Sciences et Ingénierie Chimiques, Ecole Polytechnique Fédérale de Lausanne (EPFL), Sion, Valais, Switzerland
| | - Felix Strieth-Kalthoff
- Chemical Physics Theory Group, Department of Chemistry, University of Toronto, Toronto, ON, Canada
| | - Chong Sun
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
| | - Gary Tom
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
- Chemical Physics Theory Group, Department of Chemistry, University of Toronto, Toronto, ON, Canada
| | | | - Andrew Wang
- Chemical Physics Theory Group, Department of Chemistry, University of Toronto, Toronto, ON, Canada
- Solar Fuels Group, Department of Chemistry, University of Toronto, Toronto, ON, Canada
| | - Andrew D. White
- Department of Chemical Engineering, University of Rochester, Rochester, NY, USA
| | - Adamo Young
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
- Vector Institute for Artificial Intelligence, Toronto, ON, Canada
| | - Rose Yu
- Department of Computer Science and Engineering, University of California, San Diego, La Jolla, CA, USA
| | - Alán Aspuru-Guzik
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
- Chemical Physics Theory Group, Department of Chemistry, University of Toronto, Toronto, ON, Canada
- Vector Institute for Artificial Intelligence, Toronto, ON, Canada
- Department of Chemical Engineering and Applied Chemistry, University of Toronto, Toronto, ON, Canada
- Department of Materials Science, University of Toronto, Toronto, ON, Canada
- Canadian Institute for Advanced Research (CIFAR) Lebovic Fellow, Toronto, ON, Canada
| |
Collapse
|
183
|
Towards fully ab initio simulation of atmospheric aerosol nucleation. Nat Commun 2022; 13:6067. [PMID: 36241616 PMCID: PMC9568664 DOI: 10.1038/s41467-022-33783-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2022] [Accepted: 09/29/2022] [Indexed: 11/08/2022] Open
Abstract
Atmospheric aerosol nucleation contributes to approximately half of the worldwide cloud condensation nuclei. Despite the importance of climate, detailed nucleation mechanisms are still poorly understood. Understanding aerosol nucleation dynamics is hindered by the nonreactivity of force fields (FFs) and high computational costs due to the rare event nature of aerosol nucleation. Developing reactive FFs for nucleation systems is even more challenging than developing covalently bonded materials because of the wide size range and high dimensional characteristics of noncovalent hydrogen bonding bridging clusters. Here, we propose a general workflow that is also applicable to other systems to train an accurate reactive FF based on a deep neural network (DNN) and further bridge DNN-FF-based molecular dynamics (MD) with a cluster kinetics model based on Poisson distributions of reactive events to overcome the high computational costs of direct MD. We found that previously reported acid-base formation rates tend to be significantly underestimated, especially in polluted environments, emphasizing that acid-base nucleation observed in multiple environments should be revisited.
Collapse
|
184
|
Galuzzi B, Mirarchi A, Viganò EL, De Gioia L, Damiani C, Arrigoni F. Machine Learning for Efficient Prediction of Protein Redox Potential: The Flavoproteins Case. J Chem Inf Model 2022; 62:4748-4759. [PMID: 36126254 PMCID: PMC9554915 DOI: 10.1021/acs.jcim.2c00858] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2022] [Indexed: 11/29/2022]
Abstract
Determining the redox potentials of protein cofactors and how they are influenced by their molecular neighborhoods is essential for basic research and many biotechnological applications, from biosensors and biocatalysis to bioremediation and bioelectronics. The laborious determination of redox potential with current experimental technologies pushes forward the need for computational approaches that can reliably predict it. Although current computational approaches based on quantum and molecular mechanics are accurate, their large computational costs hinder their usage. In this work, we explored the possibility of using more efficient QSPR models based on machine learning (ML) for the prediction of protein redox potential, as an alternative to classical approaches. As a proof of concept, we focused on flavoproteins, one of the most important families of enzymes directly involved in redox processes. To train and test different ML models, we retrieved a dataset of flavoproteins with a known midpoint redox potential (Em) and 3D structure. The features of interest, accounting for both short- and long-range effects of the protein matrix on the flavin cofactor, have been automatically extracted from each protein PDB file. Our best ML model (XGB) has a performance error below 1 kcal/mol (∼36 mV), comparing favorably to more sophisticated computational approaches. We also provided indications on the features that mostly affect the Em value, and when possible, we rationalized them on the basis of previous studies.
Collapse
Affiliation(s)
- Bruno
Giovanni Galuzzi
- Department
of Biotechnology and Biosciences, University
of Milano-Bicocca, Piazza della Scienza 2, 20126 Milan, Italy
- SYSBIO
Centre of Systems Biology/ISBE.IT, Piazza della Scienza 2, 20126, Milan, Italy
| | - Antonio Mirarchi
- Department
of Biotechnology and Biosciences, University
of Milano-Bicocca, Piazza della Scienza 2, 20126 Milan, Italy
| | - Edoardo Luca Viganò
- Istituto
di Ricerche Farmacologiche Mario Negri, Via Mario Negri 2, 20156 Milan, Italy
| | - Luca De Gioia
- Department
of Biotechnology and Biosciences, University
of Milano-Bicocca, Piazza della Scienza 2, 20126 Milan, Italy
| | - Chiara Damiani
- Department
of Biotechnology and Biosciences, University
of Milano-Bicocca, Piazza della Scienza 2, 20126 Milan, Italy
- SYSBIO
Centre of Systems Biology/ISBE.IT, Piazza della Scienza 2, 20126, Milan, Italy
| | - Federica Arrigoni
- Department
of Biotechnology and Biosciences, University
of Milano-Bicocca, Piazza della Scienza 2, 20126 Milan, Italy
| |
Collapse
|
185
|
Ojih J, Onyekpe U, Rodriguez A, Hu J, Peng C, Hu M. Machine Learning Accelerated Discovery of Promising Thermal Energy Storage Materials with High Heat Capacity. ACS APPLIED MATERIALS & INTERFACES 2022; 14:43277-43289. [PMID: 36106746 DOI: 10.1021/acsami.2c11350] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Thermal energy storage offers numerous benefits by reducing energy consumption and promoting the use of renewable energy sources. Thermal energy storage materials have been investigated for many decades with the aim of improving the overall efficiency of energy systems. However, finding solid materials that meet the requirement of high heat capacity has been a grand challenge for material scientists. Herewith, by training various machine learning models on 3377 high-quality data from full density functional theory (DFT) calculations, we efficiently search for potential materials with high heat capacity. We build four traditional machine learning models and two graph neural network models. Cross-comparison of the prediction performance and model accuracy was conducted among different models. The deeperGATGNN model exhibits high prediction accuracy and is used for predicting the heat capacity of 32,026 structures screened from the open quantum material database. We gain deep insight into the correlation between heat capacity and structure descriptors such as space group, prototype, lattice volume, atomic weight, etc. Twenty-two structures were predicted to possess high heat capacity, and the results were further validated with DFT calculations. We also identified one special structure, namely, MnIn2Se4, with space group no. 227 (Fd3̅m), that exhibits extremely high heat capacity, even higher than that of the Dulong-Petit limit at room temperature. This study paves the way for accelerating the discovery of novel thermal energy storage materials by combining machine learning with minimal DFT inquiry.
Collapse
Affiliation(s)
- Joshua Ojih
- Department of Mechanical Engineering, University of South Carolina, Columbia, South Carolina 29208, United States
| | - Uche Onyekpe
- Department of Computer and Data Science, School of Science, Technology and Health, York St. John University, York YO31 7EX, United Kingdom
- Centre for Computational Sciences and Mathematical Modelling, Coventry University, Priory Road, Coventry CV1 5FB, United Kingdom
| | - Alejandro Rodriguez
- Department of Mechanical Engineering, University of South Carolina, Columbia, South Carolina 29208, United States
| | - Jianjun Hu
- Department of Computer Science and Engineering, University of South Carolina, Columbia, South Carolina 29208, United States
| | - Chengxiao Peng
- Institute for Computational Materials Science, School of Physics and electronics, Henan University, Kaifeng 475004, People's Republic of China
| | - Ming Hu
- Department of Mechanical Engineering, University of South Carolina, Columbia, South Carolina 29208, United States
| |
Collapse
|
186
|
Shmilovich K, Willmott D, Batalov I, Kornbluth M, Mailoa J, Kolter JZ. Orbital Mixer: Using Atomic Orbital Features for Basis-Dependent Prediction of Molecular Wavefunctions. J Chem Theory Comput 2022; 18:6021-6030. [PMID: 36122312 DOI: 10.1021/acs.jctc.2c00555] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Leveraging ab initio data at scale has enabled the development of machine learning models capable of extremely accurate and fast molecular property prediction. A central paradigm of many previous studies focuses on generating predictions for only a fixed set of properties. Recent lines of research instead aim to explicitly learn the electronic structure via molecular wavefunctions, from which other quantum chemical properties can be directly derived. While previous methods generate predictions as a function of only the atomic configuration, in this work we present an alternate approach that directly purposes basis-dependent information to predict molecular electronic structure. Our model, Orbital Mixer, is composed entirely of multi-layer perceptrons (MLPs) using MLP-Mixer layers within a simple, intuitive, and scalable architecture that achieves competitive Hamiltonian and molecular orbital energy and coefficient prediction accuracies compared to the state-of-the-art.
Collapse
Affiliation(s)
- Kirill Shmilovich
- Pritzker School of Molecular Engineering, University of Chicago, Chicago, Illinois 60637, United States
| | - Devin Willmott
- Bosch Center for Artificial Intelligence, Pittsburgh, Pennsylvania 15222, United States
| | - Ivan Batalov
- Bosch Center for Artificial Intelligence, Pittsburgh, Pennsylvania 15222, United States
| | - Mordechai Kornbluth
- Bosch Research and Technology Center, Cambridge, Massachusetts 02139, United States
| | - Jonathan Mailoa
- Tencent Quantum Laboratory, Shenzhen, Guangdong 518057, China
| | - J Zico Kolter
- Bosch Center for Artificial Intelligence, Pittsburgh, Pennsylvania 15222, United States.,Carnegie Mellon University, Pittsburgh, Pennsylvania 15222, United States
| |
Collapse
|
187
|
Ratcliff LE, Oshima T, Nippert F, Janzen BM, Kluth E, Goldhahn R, Feneberg M, Mazzolini P, Bierwagen O, Wouters C, Nofal M, Albrecht M, Swallow JEN, Jones LAH, Thakur PK, Lee TL, Kalha C, Schlueter C, Veal TD, Varley JB, Wagner MR, Regoutz A. Tackling Disorder in γ-Ga 2 O 3. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2022; 34:e2204217. [PMID: 35866491 DOI: 10.1002/adma.202204217] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/10/2022] [Revised: 07/05/2022] [Indexed: 06/15/2023]
Abstract
Ga2 O3 and its polymorphs are attracting increasing attention. The rich structural space of polymorphic oxide systems such as Ga2 O3 offers potential for electronic structure engineering, which is of particular interest for a range of applications, such as power electronics. γ-Ga2 O3 presents a particular challenge across synthesis, characterization, and theory due to its inherent disorder and resulting complex structure-electronic-structure relationship. Here, density functional theory is used in combination with a machine-learning approach to screen nearly one million potential structures, thereby developing a robust atomistic model of the γ-phase. Theoretical results are compared with surface and bulk sensitive soft and hard X-ray photoelectron spectroscopy, X-ray absorption spectroscopy, spectroscopic ellipsometry, and photoluminescence excitation spectroscopy experiments representative of the occupied and unoccupied states of γ-Ga2 O3 . The first onset of strong absorption at room temperature is found at 5.1 eV from spectroscopic ellipsometry, which agrees well with the excitation maximum at 5.17 eV obtained by photoluminescence excitation spectroscopy, where the latter shifts to 5.33 eV at 5 K. This work presents a leap forward in the treatment of complex, disordered oxides and is a crucial step toward exploring how their electronic structure can be understood in terms of local coordination and overall structure.
Collapse
Affiliation(s)
- Laura E Ratcliff
- Department of Materials, Imperial College London, London, SW7 2AZ, UK
- Center for Computational Chemistry, School of Chemistry, University of Bristol, Bristol, BS8 1TS, UK
| | - Takayoshi Oshima
- Department of Electrical and Electronic Engineering, Saga University, Saga, 840-8502, Japan
| | - Felix Nippert
- Technische Universität Berlin, Institute of Solid State Physics, Hardenbergstrasse 36, 10623, Berlin, Germany
| | - Benjamin M Janzen
- Technische Universität Berlin, Institute of Solid State Physics, Hardenbergstrasse 36, 10623, Berlin, Germany
| | - Elias Kluth
- Institut für Physik, Otto-von-Guericke-Universität Magdeburg, Universitätsplatz 2, 39106, Magdeburg, Germany
| | - Rüdiger Goldhahn
- Institut für Physik, Otto-von-Guericke-Universität Magdeburg, Universitätsplatz 2, 39106, Magdeburg, Germany
| | - Martin Feneberg
- Institut für Physik, Otto-von-Guericke-Universität Magdeburg, Universitätsplatz 2, 39106, Magdeburg, Germany
| | - Piero Mazzolini
- Paul-Drude-Institut für Festkörperelektronik, Leibniz-Institut im Forschungsverbund Berlin e.V., Hausvogteiplatz 5-7, 10117, Berlin, Germany
| | - Oliver Bierwagen
- Paul-Drude-Institut für Festkörperelektronik, Leibniz-Institut im Forschungsverbund Berlin e.V., Hausvogteiplatz 5-7, 10117, Berlin, Germany
| | - Charlotte Wouters
- Leibniz-Institut für Kristallzüchtung, Max-Born-Str. 2, 12489, Berlin, Germany
| | - Musbah Nofal
- Leibniz-Institut für Kristallzüchtung, Max-Born-Str. 2, 12489, Berlin, Germany
| | - Martin Albrecht
- Leibniz-Institut für Kristallzüchtung, Max-Born-Str. 2, 12489, Berlin, Germany
| | - Jack E N Swallow
- Department of Materials, University of Oxford, Parks Road, Oxford, OX1 3PH, UK
| | - Leanne A H Jones
- Stephenson Institute for Renewable Energy and Department of Physics, University of Liverpool, Liverpool, L69 7ZF, UK
| | - Pardeep K Thakur
- Diamond Light Source Ltd., Diamond House, Harwell Science and Innovation Campus, Didcot, OX11 0DE, UK
| | - Tien-Lin Lee
- Diamond Light Source Ltd., Diamond House, Harwell Science and Innovation Campus, Didcot, OX11 0DE, UK
| | - Curran Kalha
- Department of Chemistry, University College London, 20 Gordon Street, London, WC1H 0AJ, UK
| | - Christoph Schlueter
- Deutsches Elektronen-Synchrotron DESY, Notkestrasse 85, 22607, Hamburg, Germany
| | - Tim D Veal
- Stephenson Institute for Renewable Energy and Department of Physics, University of Liverpool, Liverpool, L69 7ZF, UK
| | - Joel B Varley
- Lawrence Livermore National Laboratory, Livermore, CA, 94550, USA
| | - Markus R Wagner
- Technische Universität Berlin, Institute of Solid State Physics, Hardenbergstrasse 36, 10623, Berlin, Germany
| | - Anna Regoutz
- Department of Chemistry, University College London, 20 Gordon Street, London, WC1H 0AJ, UK
| |
Collapse
|
188
|
Qiu J, Xie J, Su S, Gao Y, Meng H, Yang Y, Liao K. Selective functionalization of hindered meta-C–H bond of o-alkylaryl ketones promoted by automation and deep learning. Chem 2022. [DOI: 10.1016/j.chempr.2022.08.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
189
|
Fedik N, Zubatyuk R, Kulichenko M, Lubbers N, Smith JS, Nebgen B, Messerly R, Li YW, Boldyrev AI, Barros K, Isayev O, Tretiak S. Extending machine learning beyond interatomic potentials for predicting molecular properties. Nat Rev Chem 2022; 6:653-672. [PMID: 37117713 DOI: 10.1038/s41570-022-00416-3] [Citation(s) in RCA: 25] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/15/2022] [Indexed: 11/09/2022]
Abstract
Machine learning (ML) is becoming a method of choice for modelling complex chemical processes and materials. ML provides a surrogate model trained on a reference dataset that can be used to establish a relationship between a molecular structure and its chemical properties. This Review highlights developments in the use of ML to evaluate chemical properties such as partial atomic charges, dipole moments, spin and electron densities, and chemical bonding, as well as to obtain a reduced quantum-mechanical description. We overview several modern neural network architectures, their predictive capabilities, generality and transferability, and illustrate their applicability to various chemical properties. We emphasize that learned molecular representations resemble quantum-mechanical analogues, demonstrating the ability of the models to capture the underlying physics. We also discuss how ML models can describe non-local quantum effects. Finally, we conclude by compiling a list of available ML toolboxes, summarizing the unresolved challenges and presenting an outlook for future development. The observed trends demonstrate that this field is evolving towards physics-based models augmented by ML, which is accompanied by the development of new methods and the rapid growth of user-friendly ML frameworks for chemistry.
Collapse
|
190
|
Wang X, Jiang S, Hu W, Ye S, Wang T, Wu F, Yang L, Li X, Zhang G, Chen X, Jiang J, Luo Y. Quantitatively Determining Surface-Adsorbate Properties from Vibrational Spectroscopy with Interpretable Machine Learning. J Am Chem Soc 2022; 144:16069-16076. [PMID: 36001497 DOI: 10.1021/jacs.2c06288] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Learning microscopic properties of a material from its macroscopic measurables is a grand and challenging goal in physical science. Conventional wisdom is to first identify material structures exploiting characterization tools, such as spectroscopy, and then to infer properties of interest, often with assistance of theory and simulations. This indirect approach has limitations due to the accumulation of errors from retrieving structures from spectral signals and the lack of quantitative structure-property relationship. A new pathway directly from spectral signals to microscopic properties is highly desirable, as it would offer valuable guidance toward materials evaluation and design via spectroscopic measurements. Herein, we exploit machine-learned vibrational spectroscopy to establish quantitative spectrum-property relationships. Key interaction properties of substrate-adsorbate systems, including adsorption energy and charge transfer, are quantitatively determined directly from Infrared and Raman spectroscopic signals of the adsorbates. The machine-learned spectrum-property relationships are presented as mathematical formulas, which are physically interpretable and therefore transferrable to a series of metal/alloy surfaces. The demonstrated ability of quantitative determination of hard-to-measure microscopic properties using machine-learned spectroscopy will significantly broaden the applicability of conventional spectroscopic techniques for materials design and high throughput screening under operando conditions.
Collapse
Affiliation(s)
- Xijun Wang
- Hefei National Research Center for Physical Sciences at the Microscale, School of Chemistry and Materials Science, University of Science and Technology of China, Hefei 230026, China
| | - Shuang Jiang
- Hefei National Research Center for Physical Sciences at the Microscale, School of Chemistry and Materials Science, University of Science and Technology of China, Hefei 230026, China
| | - Wei Hu
- School of Chemistry and Chemical Engineering, Qilu University of Technology (Shandong Academy of Science), Jinan 250353, China
| | - Sheng Ye
- Hefei National Research Center for Physical Sciences at the Microscale, School of Chemistry and Materials Science, University of Science and Technology of China, Hefei 230026, China.,School of Artificial Intelligence, Anhui University, Hefei 230601, China
| | - Tairan Wang
- Hefei National Research Center for Physical Sciences at the Microscale, School of Chemistry and Materials Science, University of Science and Technology of China, Hefei 230026, China
| | - Fan Wu
- Hefei National Research Center for Physical Sciences at the Microscale, School of Chemistry and Materials Science, University of Science and Technology of China, Hefei 230026, China
| | - Li Yang
- Hefei National Research Center for Physical Sciences at the Microscale, School of Chemistry and Materials Science, University of Science and Technology of China, Hefei 230026, China
| | - Xiyu Li
- Hefei National Research Center for Physical Sciences at the Microscale, School of Chemistry and Materials Science, University of Science and Technology of China, Hefei 230026, China
| | - Guozhen Zhang
- Hefei National Research Center for Physical Sciences at the Microscale, School of Chemistry and Materials Science, University of Science and Technology of China, Hefei 230026, China
| | - Xin Chen
- GuSu Laboratory of Materials, Suzhou 215123, China
| | - Jun Jiang
- Hefei National Research Center for Physical Sciences at the Microscale, School of Chemistry and Materials Science, University of Science and Technology of China, Hefei 230026, China.,Hefei National Laboratory, University of Science and Technology of China, Hefei 230088, China
| | - Yi Luo
- Hefei National Research Center for Physical Sciences at the Microscale, School of Chemistry and Materials Science, University of Science and Technology of China, Hefei 230026, China.,Hefei National Laboratory, University of Science and Technology of China, Hefei 230088, China
| |
Collapse
|
191
|
Cencer MM, Suslick BA, Moore JS. From skeptic to believer: The power of models. Tetrahedron 2022. [DOI: 10.1016/j.tet.2022.132984] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/14/2022]
|
192
|
Mai H, Le TC, Chen D, Winkler DA, Caruso RA. Machine Learning for Electrocatalyst and Photocatalyst Design and Discovery. Chem Rev 2022; 122:13478-13515. [PMID: 35862246 DOI: 10.1021/acs.chemrev.2c00061] [Citation(s) in RCA: 80] [Impact Index Per Article: 40.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Electrocatalysts and photocatalysts are key to a sustainable future, generating clean fuels, reducing the impact of global warming, and providing solutions to environmental pollution. Improved processes for catalyst design and a better understanding of electro/photocatalytic processes are essential for improving catalyst effectiveness. Recent advances in data science and artificial intelligence have great potential to accelerate electrocatalysis and photocatalysis research, particularly the rapid exploration of large materials chemistry spaces through machine learning. Here a comprehensive introduction to, and critical review of, machine learning techniques used in electrocatalysis and photocatalysis research are provided. Sources of electro/photocatalyst data and current approaches to representing these materials by mathematical features are described, the most commonly used machine learning methods summarized, and the quality and utility of electro/photocatalyst models evaluated. Illustrations of how machine learning models are applied to novel electro/photocatalyst discovery and used to elucidate electrocatalytic or photocatalytic reaction mechanisms are provided. The review offers a guide for materials scientists on the selection of machine learning methods for electrocatalysis and photocatalysis research. The application of machine learning to catalysis science represents a paradigm shift in the way advanced, next-generation catalysts will be designed and synthesized.
Collapse
Affiliation(s)
- Haoxin Mai
- Applied Chemistry and Environmental Science, School of Science, STEM College, RMIT University, GPO Box 2476, Melbourne, Victoria 3001, Australia
| | - Tu C Le
- School of Engineering, STEM College, RMIT University, GPO Box 2476, Melbourne, Victoria 3001, Australia
| | - Dehong Chen
- Applied Chemistry and Environmental Science, School of Science, STEM College, RMIT University, GPO Box 2476, Melbourne, Victoria 3001, Australia
| | - David A Winkler
- Monash Institute of Pharmaceutical Sciences, Monash University, Parkville, Victoria 3052, Australia.,Biochemistry and Chemistry, La Trobe University, Kingsbury Drive, Bundoora, Victoria 3042, Australia.,School of Pharmacy, University of Nottingham, Nottingham NG7 2RD, United Kingdom
| | - Rachel A Caruso
- Applied Chemistry and Environmental Science, School of Science, STEM College, RMIT University, GPO Box 2476, Melbourne, Victoria 3001, Australia
| |
Collapse
|
193
|
Design, Synthesis, Docking, DFT, MD Simulation Studies of a New Nicotinamide-Based Derivative: In Vitro Anticancer and VEGFR-2 Inhibitory Effects. Molecules 2022; 27:molecules27144606. [PMID: 35889478 PMCID: PMC9317904 DOI: 10.3390/molecules27144606] [Citation(s) in RCA: 58] [Impact Index Per Article: 29.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2022] [Revised: 07/16/2022] [Accepted: 07/18/2022] [Indexed: 12/29/2022] Open
Abstract
A nicotinamide-based derivative was designed as an antiproliferative VEGFR-2 inhibitor with the key pharmacophoric features needed to interact with the VEGFR-2 catalytic pocket. The ability of the designed congener ((E)-N-(4-(1-(2-(4-benzamidobenzoyl)hydrazono)ethyl)phenyl)nicotinamide), compound 10, to bind with the VEGFR-2 enzyme was demonstrated by molecular docking studies. Furthermore, six various MD simulations studies established the excellent binding of compound 10 with VEGFR-2 over 100 ns, exhibiting optimum dynamics. MM-GBSA confirmed the proper binding with a total exact binding energy of −38.36 Kcal/Mol. MM-GBSA studies also revealed the crucial amino acids in the binding through the free binding energy decomposition and declared the interactions variation of compound 10 inside VEGFR-2 via the Protein–Ligand Interaction Profiler (PLIP). Being new, its molecular structure was optimized by DFT. The DFT studies also confirmed the binding mode of compound 10 with the VEGFR-2. ADMET (in silico) profiling indicated the examined compound’s acceptable range of drug-likeness. The designed compound was synthesized through the condensation of N-(4-(hydrazinecarbonyl)phenyl)benzamide with N-(4-acetylphenyl)nicotinamide, where the carbonyl group has been replaced by an imine group. The in-vitro studies were consonant with the obtained in silico results as compound 10 prohibited VEGFR-2 with an IC50 value of 51 nM. Compound 10 also showed antiproliferative effects against MCF-7 and HCT 116 cancer cell lines with IC50 values of 8.25 and 6.48 μM, revealing magnificent selectivity indexes of 12.89 and 16.41, respectively.
Collapse
|
194
|
Weinreich J, Lemm D, von Rudorff GF, von Lilienfeld OA. Ab initio machine learning of phase space averages. J Chem Phys 2022; 157:024303. [PMID: 35840379 DOI: 10.1063/5.0095674] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Equilibrium structures determine material properties and biochemical functions. We here propose to machine learn phase space averages, conventionally obtained by ab initio or force-field-based molecular dynamics (MD) or Monte Carlo (MC) simulations. In analogy to ab initio MD, our ab initio machine learning (AIML) model does not require bond topologies and, therefore, enables a general machine learning pathway to obtain ensemble properties throughout the chemical compound space. We demonstrate AIML for predicting Boltzmann averaged structures after training on hundreds of MD trajectories. The AIML output is subsequently used to train machine learning models of free energies of solvation using experimental data and to reach competitive prediction errors (mean absolute error ∼ 0.8 kcal/mol) for out-of-sample molecules-within milliseconds. As such, AIML effectively bypasses the need for MD or MC-based phase space sampling, enabling exploration campaigns of Boltzmann averages throughout the chemical compound space at a much accelerated pace. We contextualize our findings by comparison to state-of-the-art methods resulting in a Pareto plot for the free energy of solvation predictions in terms of accuracy and time.
Collapse
Affiliation(s)
- Jan Weinreich
- Faculty of Physics, University of Vienna, Kolingasse 14-16, AT-1090 Wien, Austria
| | - Dominik Lemm
- Faculty of Physics, University of Vienna, Kolingasse 14-16, AT-1090 Wien, Austria
| | | | | |
Collapse
|
195
|
Duan C, Ladera AJ, Liu JCL, Taylor MG, Ariyarathna IR, Kulik HJ. Exploiting Ligand Additivity for Transferable Machine Learning of Multireference Character across Known Transition Metal Complex Ligands. J Chem Theory Comput 2022; 18:4836-4845. [PMID: 35834742 DOI: 10.1021/acs.jctc.2c00468] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Accurate virtual high-throughput screening (VHTS) of transition metal complexes (TMCs) remains challenging due to the possibility of high multireference (MR) character that complicates property evaluation. We compute MR diagnostics for over 5,000 ligands present in previously synthesized octahedral mononuclear transition metal complexes in the Cambridge Structural Database (CSD). To accomplish this task, we introduce an iterative approach for consistent ligand charge assignment for ligands in the CSD. Across this set, we observe that the MR character correlates linearly with the inverse value of the averaged bond order over all bonds in the molecule. We then demonstrate that ligand additivity of the MR character holds in TMCs, which suggests that the TMC MR character can be inferred from the sum of the MR character of the ligands. Encouraged by this observation, we leverage ligand additivity and develop a ligand-derived machine learning representation to train neural networks to predict the MR character of TMCs from properties of the constituent ligands. This approach yields models with excellent performance and superior transferability to unseen ligand chemistry and compositions.
Collapse
Affiliation(s)
- Chenru Duan
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States.,Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Adriana J Ladera
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Julian C-L Liu
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Michael G Taylor
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Isuru R Ariyarathna
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Heather J Kulik
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| |
Collapse
|
196
|
Heterogeneous catalyst design by generative adversarial network and first-principles based microkinetics. Sci Rep 2022; 12:11657. [PMID: 35803991 PMCID: PMC9270484 DOI: 10.1038/s41598-022-15586-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2022] [Accepted: 06/27/2022] [Indexed: 11/27/2022] Open
Abstract
Microkinetic analysis based on density functional theory (DFT) was combined with a generative adversarial network (GAN) to enable the artificial proposal of heterogeneous catalysts based on the DFT-calculated dataset. The approach was applied to the NH3 formation reaction on Rh−Ru alloy surfaces as an example. The NH3 formation turnover frequency (TOF) was calculated by DFT-based microkinetics. Six elementary reactions, namely, N2 dissociation, H2 dissociation, NHx (x = 1–3) formation, and NH3 desorption, were explicitly considered, and their reaction energies were evaluated by DFT calculations. Based on the TOF values and atomic compositions, new alloy surfaces were generated using the GAN. This approach successfully generated the surfaces that were not included in the initial dataset but exhibited higher TOF values. The N2 dissociation reaction was more exothermic for the generated surfaces, leading to higher TOF. The present study demonstrates that the automatic improvement of catalyst materials is possible using DFT calculations and GAN sample generation.
Collapse
|
197
|
Sauceda HE, Gálvez-González LE, Chmiela S, Paz-Borbón LO, Müller KR, Tkatchenko A. BIGDML-Towards accurate quantum machine learning force fields for materials. Nat Commun 2022; 13:3733. [PMID: 35768400 PMCID: PMC9243122 DOI: 10.1038/s41467-022-31093-x] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2021] [Accepted: 06/01/2022] [Indexed: 12/16/2022] Open
Abstract
Machine-learning force fields (MLFF) should be accurate, computationally and data efficient, and applicable to molecules, materials, and interfaces thereof. Currently, MLFFs often introduce tradeoffs that restrict their practical applicability to small subsets of chemical space or require exhaustive datasets for training. Here, we introduce the Bravais-Inspired Gradient-Domain Machine Learning (BIGDML) approach and demonstrate its ability to construct reliable force fields using a training set with just 10-200 geometries for materials including pristine and defect-containing 2D and 3D semiconductors and metals, as well as chemisorbed and physisorbed atomic and molecular adsorbates on surfaces. The BIGDML model employs the full relevant symmetry group for a given material, does not assume artificial atom types or localization of atomic interactions and exhibits high data efficiency and state-of-the-art energy accuracies (errors substantially below 1 meV per atom) for an extended set of materials. Extensive path-integral molecular dynamics carried out with BIGDML models demonstrate the counterintuitive localization of benzene-graphene dynamics induced by nuclear quantum effects and their strong contributions to the hydrogen diffusion coefficient in a Pd crystal for a wide range of temperatures.
Collapse
Affiliation(s)
- Huziel E Sauceda
- Departamento de Materia Condensada, Instituto de Física, Universidad Nacional Autónoma de México, Cd. de México C.P., 04510, Mexico.
- Machine Learning Group, Technische Universität Berlin, 10587, Berlin, Germany.
- BASLEARN - TU Berlin/BASF Joint Lab for Machine Learning, Technische Universität Berlin, 10587, Berlin, Germany.
| | - Luis E Gálvez-González
- Programa de Doctorado en Ciencias (Física), División de Ciencias Exactas y Naturales, Universidad de Sonora, Blvd. Luis Encinas & Rosales, Hermosillo, C.P., 83000, Mexico
| | - Stefan Chmiela
- Machine Learning Group, Technische Universität Berlin, 10587, Berlin, Germany
- BIFOLD - Berlin Institute for the Foundations of Learning and Data, Berlin, Germany
| | - Lauro Oliver Paz-Borbón
- Departamento de Física Química, Instituto de Física, Universidad Nacional Autónoma de México, Cd. de México C.P., 04510, Mexico
| | - Klaus-Robert Müller
- Machine Learning Group, Technische Universität Berlin, 10587, Berlin, Germany.
- BIFOLD - Berlin Institute for the Foundations of Learning and Data, Berlin, Germany.
- Google Research, Brain team, Berlin, Germany.
- Department of Artificial Intelligence, Korea University, Anam-dong, Seongbuk-gu, 02841, Seoul, Korea.
- Max Planck Institute for Informatics, Stuhlsatzenhausweg, 66123, Saarbrücken, Germany.
| | - Alexandre Tkatchenko
- Department of Physics and Materials Science, University of Luxembourg, L-1511, Luxembourg City, Luxembourg.
| |
Collapse
|
198
|
Xiouras C, Cameli F, Quilló GL, Kavousanakis ME, Vlachos DG, Stefanidis GD. Applications of Artificial Intelligence and Machine Learning Algorithms to Crystallization. Chem Rev 2022; 122:13006-13042. [PMID: 35759465 DOI: 10.1021/acs.chemrev.2c00141] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
Artificial intelligence and specifically machine learning applications are nowadays used in a variety of scientific applications and cutting-edge technologies, where they have a transformative impact. Such an assembly of statistical and linear algebra methods making use of large data sets is becoming more and more integrated into chemistry and crystallization research workflows. This review aims to present, for the first time, a holistic overview of machine learning and cheminformatics applications as a novel, powerful means to accelerate the discovery of new crystal structures, predict key properties of organic crystalline materials, simulate, understand, and control the dynamics of complex crystallization process systems, as well as contribute to high throughput automation of chemical process development involving crystalline materials. We critically review the advances in these new, rapidly emerging research areas, raising awareness in issues such as the bridging of machine learning models with first-principles mechanistic models, data set size, structure, and quality, as well as the selection of appropriate descriptors. At the same time, we propose future research at the interface of applied mathematics, chemistry, and crystallography. Overall, this review aims to increase the adoption of such methods and tools by chemists and scientists across industry and academia.
Collapse
Affiliation(s)
- Christos Xiouras
- Chemical Process R&D, Crystallization Technology Unit, Janssen R&D, Turnhoutseweg 30, 2340 Beerse, Belgium
| | - Fabio Cameli
- Department of Chemical and Biomolecular Engineering, University of Delaware, 150 Academy Street, Newark, Delaware 19716, United States
| | - Gustavo Lunardon Quilló
- Chemical Process R&D, Crystallization Technology Unit, Janssen R&D, Turnhoutseweg 30, 2340 Beerse, Belgium.,Chemical and BioProcess Technology and Control, Department of Chemical Engineering, Faculty of Engineering Technology, KU Leuven, Gebroeders de Smetstraat 1, 9000 Ghent, Belgium
| | - Mihail E Kavousanakis
- School of Chemical Engineering, National Technical University of Athens, Heroon Polytechniou 9, 15780 Zografou, Greece
| | - Dionisios G Vlachos
- Department of Chemical and Biomolecular Engineering, University of Delaware, 150 Academy Street, Newark, Delaware 19716, United States
| | - Georgios D Stefanidis
- School of Chemical Engineering, National Technical University of Athens, Heroon Polytechniou 9, 15780 Zografou, Greece.,Laboratory for Chemical Technology, Ghent University; Tech Lane Ghent Science Park 125, B-9052 Ghent, Belgium
| |
Collapse
|
199
|
Melville J, Hargis C, Davenport MT, Hamilton RS, Ess DH. Machine Learning Analysis of Dynamic‐Dependent Bond Formation in Trajectories with Consecutive Transition States. J PHYS ORG CHEM 2022. [DOI: 10.1002/poc.4405] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Jesse Melville
- Department of Chemistry and Biochemistry Brigham Young University Provo Utah USA
| | - Cal Hargis
- Department of Chemistry and Biochemistry Brigham Young University Provo Utah USA
| | - Michael T. Davenport
- Department of Chemistry and Biochemistry Brigham Young University Provo Utah USA
| | - R. Spencer Hamilton
- Department of Chemistry and Biochemistry Brigham Young University Provo Utah USA
| | - Daniel H. Ess
- Department of Chemistry and Biochemistry Brigham Young University Provo Utah USA
| |
Collapse
|
200
|
Peters B. Simple Model and Spectral Analysis for a Fluxional Catalyst: Intermediate Abundances, Pathway Fluxes, Rates, and Transients. ACS Catal 2022. [DOI: 10.1021/acscatal.2c01875] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Baron Peters
- Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| |
Collapse
|