1
|
Feng C, Zhang Y, Jiang B. Efficient Sampling for Machine Learning Electron Density and Its Response in Real Space. J Chem Theory Comput 2025. [PMID: 39750024 DOI: 10.1021/acs.jctc.4c01355] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2025]
Abstract
Electron density is a fundamental quantity that can in principle determine all ground state electronic properties of a given system. Although machine learning (ML) models for electron density based on either an atom-centered basis or a real-space grid have been proposed, the demand for a number of high-order basis functions or grid points is enormous. In this work, we propose an efficient grid-point sampling strategy that combines targeted sampling favoring a large density and a screening of grid points associated with linearly independent atomic features. This new sampling strategy is integrated with a field-induced recursively embedded atom neural network model to develop a real-space grid-based ML model for the electron density and its response to an electric field. This approach is applied to a QM9 molecular data set, a H2O/Pt(111) interfacial system, an Au(100) electrode, and an Au nanoparticle under an electric field. The number of training points is found to be much smaller than previous models, while yielding comparably accurate predictions for the electron density of the entire grid. The resultant machine-learned electron density model enables us to properly partition partial charge onto each atom and analyze the charge variation upon proton transfer in the H2O/Pt(111) system. The machine-learning electronic response model allows us to predict charge transfer and the electrostatic potential change induced by an electric field applied to an Au(100) electrode or an Au nanoparticle.
Collapse
Affiliation(s)
- Chaoqiang Feng
- Hefei National Research Center for Physical Sciences at the Microscale, Department of Chemical Physics, University of Science and Technology of China, Hefei, Anhui 230026, China
| | - Yaolong Zhang
- Department of Chemistry and Chemical Biology, University of New Mexico, Albuquerque, New Mexico 87131, United States
| | - Bin Jiang
- Key Laboratory of Precision and Intelligent Chemistry, Department of Chemical Physics, University of Science and Technology of China, Hefei, Anhui 230026, China
- Hefei National Laboratory, University of Science and Technology of China, Hefei 230088, China
| |
Collapse
|
2
|
Getie FA, Ayele DW, Habtu NG, Yemata TA, Yihun FA, Worku AK, Teshager MA. Recent advances and various detection strategies of deep eutectic solvents for zinc air batteries. Heliyon 2024; 10:e40383. [PMID: 39641049 PMCID: PMC11617240 DOI: 10.1016/j.heliyon.2024.e40383] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2024] [Revised: 10/21/2024] [Accepted: 11/12/2024] [Indexed: 12/07/2024] Open
Abstract
Sustainable technology in energy-related applications will be crucial in the coming decades. As a result, developing new materials for existing processes has presently arisen as a major research priority. Recently, Deep eutectic solvents (DESs) have been expected as low-cost task-specific solvents for zinc-air batteries (ZABs). Here in, initially the various preparation methods of DESs their detection strategies, and the fundamental characteristics of DESs are summarized. Then, the recent utilization of DESs on ZABs has been reviewed. After that, the chemical and physical characteristics of DESs including phase behavior, viscosity, density, ionic conductivity, refractive index, pH, surface tension and stability have been studied. Lastly, the challenges, limitations, and possible upcoming research fields of DESs for ZABs were discussed.
Collapse
Affiliation(s)
- Fentahun Adamu Getie
- Bahir Dar Energy Center, Bahir Dar Institute of Technology, Bahir Dar University, P.O. Box 26, Bahir Dar, Ethiopia
- Department of Chemistry, College of Natural and Computational Science, Injibara University, P.O. Box 40, Injibara, Ethiopia
| | - Delele Worku Ayele
- Bahir Dar Energy Center, Bahir Dar Institute of Technology, Bahir Dar University, P.O. Box 26, Bahir Dar, Ethiopia
- Department of Chemistry, College of Science, Bahir Dar University, P.O. Box 79, Bahir Dar, Ethiopia
| | - Nigus Gabbiye Habtu
- Bahir Dar Energy Center, Bahir Dar Institute of Technology, Bahir Dar University, P.O. Box 26, Bahir Dar, Ethiopia
- Department of Chemical Engineering, Bahir Dar Institute of Technology, Bahir Dar University, P.O. Box 26, Bahir Dar, Ethiopia
| | - Temesgen Atnafu Yemata
- Department of Chemical Engineering, Bahir Dar Institute of Technology, Bahir Dar University, P.O. Box 26, Bahir Dar, Ethiopia
| | - Fantahun Aklog Yihun
- Department of Industrial Chemistry, College of Science, Bahir Dar University, P.O. Box 79, Bahir Dar, Ethiopia
| | - Ababay Ketema Worku
- Bahir Dar Energy Center, Bahir Dar Institute of Technology, Bahir Dar University, P.O. Box 26, Bahir Dar, Ethiopia
| | - Minbale Admas Teshager
- Department of Chemistry, College of Science, Bahir Dar University, P.O. Box 79, Bahir Dar, Ethiopia
| |
Collapse
|
3
|
Cheng AH, Ser CT, Skreta M, Guzmán-Cordero A, Thiede L, Burger A, Aldossary A, Leong SX, Pablo-García S, Strieth-Kalthoff F, Aspuru-Guzik A. Spiers Memorial Lecture: How to do impactful research in artificial intelligence for chemistry and materials science. Faraday Discuss 2024. [PMID: 39400305 DOI: 10.1039/d4fd00153b] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/15/2024]
Abstract
Machine learning has been pervasively touching many fields of science. Chemistry and materials science are no exception. While machine learning has been making a great impact, it is still not reaching its full potential or maturity. In this perspective, we first outline current applications across a diversity of problems in chemistry. Then, we discuss how machine learning researchers view and approach problems in the field. Finally, we provide our considerations for maximizing impact when researching machine learning for chemistry.
Collapse
Affiliation(s)
- Austin H Cheng
- Department of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada.
- Department of Computer Science, University of Toronto, Toronto, Ontario M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, Toronto, Ontario M5G 1M1, Canada
| | - Cher Tian Ser
- Department of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada.
- Department of Computer Science, University of Toronto, Toronto, Ontario M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, Toronto, Ontario M5G 1M1, Canada
| | - Marta Skreta
- Department of Computer Science, University of Toronto, Toronto, Ontario M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, Toronto, Ontario M5G 1M1, Canada
| | - Andrés Guzmán-Cordero
- Vector Institute for Artificial Intelligence, Toronto, Ontario M5G 1M1, Canada
- Tinbergen Institute, University of Amsterdam, Amsterdam, Netherlands
| | - Luca Thiede
- Department of Computer Science, University of Toronto, Toronto, Ontario M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, Toronto, Ontario M5G 1M1, Canada
| | - Andreas Burger
- Department of Computer Science, University of Toronto, Toronto, Ontario M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, Toronto, Ontario M5G 1M1, Canada
| | | | - Shi Xuan Leong
- Department of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada.
- School of Chemistry, Chemical Engineering and Biotechnology, Nanyang Technological University, Singapore 63737, Singapore
| | | | | | - Alán Aspuru-Guzik
- Department of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada.
- Department of Computer Science, University of Toronto, Toronto, Ontario M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, Toronto, Ontario M5G 1M1, Canada
- Acceleration Consortium, Toronto, Ontario M5G 1X6, Canada
- Department of Chemical Engineering and Applied Chemistry, University of Toronto, Canada
- Department of Materials Science and Engineering, University of Toronto, Canada
- Lebovic Fellow, Canadian Institute for Advanced Research (CIFAR), Canada
| |
Collapse
|
4
|
Gong X, Louie SG, Duan W, Xu Y. Generalizing deep learning electronic structure calculation to the plane-wave basis. NATURE COMPUTATIONAL SCIENCE 2024; 4:752-760. [PMID: 39363113 PMCID: PMC11499277 DOI: 10.1038/s43588-024-00701-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/19/2024] [Accepted: 09/04/2024] [Indexed: 10/05/2024]
Abstract
Deep neural networks capable of representing the density functional theory (DFT) Hamiltonian as a function of material structure hold great promise for revolutionizing future electronic structure calculations. However, a notable limitation of previous neural networks is their compatibility solely with the atomic-orbital (AO) basis, excluding the widely used plane-wave (PW) basis. Here we overcome this critical limitation by proposing an accurate and efficient real-space reconstruction method for directly computing AO Hamiltonian matrices from PW DFT results. The reconstruction method is orders of magnitude faster than traditional projection-based methods to convert PW results to the AO basis, and the reconstructed Hamiltonian matrices can faithfully reproduce the PW electronic structure, thus bridging the longstanding gap between the AO basis deep learning electronic structure approach and PW DFT. Advantages of the PW methods, such as high accuracy, high flexibility and wide applicability, thus can be all integrated into deep learning electronic structure methods without sacrificing these methods' inherent benefits. This allows for the construction of large-scale and high-fidelity training datasets with the help of PW DFT results towards the development of precise and broadly applicable deep learning electronic structure models.
Collapse
Affiliation(s)
- Xiaoxun Gong
- State Key Laboratory of Low Dimensional Quantum Physics and Department of Physics, Tsinghua University, Beijing, China
- Department of Physics, University of California at Berkeley, Berkeley, CA, USA
- Materials Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Steven G Louie
- Department of Physics, University of California at Berkeley, Berkeley, CA, USA.
- Materials Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
| | - Wenhui Duan
- State Key Laboratory of Low Dimensional Quantum Physics and Department of Physics, Tsinghua University, Beijing, China.
- Institute for Advanced Study, Tsinghua University, Beijing, China.
- Frontier Science Center for Quantum Information, Beijing, China.
| | - Yong Xu
- State Key Laboratory of Low Dimensional Quantum Physics and Department of Physics, Tsinghua University, Beijing, China.
- Frontier Science Center for Quantum Information, Beijing, China.
- RIKEN Center for Emergent Matter Science (CEMS), Wako, Japan.
| |
Collapse
|
5
|
van Gerwen P, Briling KR, Bunne C, Somnath VR, Laplaza R, Krause A, Corminboeuf C. 3DReact: Geometric Deep Learning for Chemical Reactions. J Chem Inf Model 2024; 64:5771-5785. [PMID: 39007724 PMCID: PMC11323278 DOI: 10.1021/acs.jcim.4c00104] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2024] [Revised: 07/03/2024] [Accepted: 07/08/2024] [Indexed: 07/16/2024]
Abstract
Geometric deep learning models, which incorporate the relevant molecular symmetries within the neural network architecture, have considerably improved the accuracy and data efficiency of predictions of molecular properties. Building on this success, we introduce 3DReact, a geometric deep learning model to predict reaction properties from three-dimensional structures of reactants and products. We demonstrate that the invariant version of the model is sufficient for existing reaction data sets. We illustrate its competitive performance on the prediction of activation barriers on the GDB7-22-TS, Cyclo-23-TS, and Proparg-21-TS data sets in different atom-mapping regimes. We show that, compared to existing models for reaction property prediction, 3DReact offers a flexible framework that exploits atom-mapping information, if available, as well as geometries of reactants and products (in an invariant or equivariant fashion). Accordingly, it performs systematically well across different data sets, atom-mapping regimes, as well as both interpolation and extrapolation tasks.
Collapse
Affiliation(s)
- Puck van Gerwen
- Laboratory
for Computational Molecular Design, Institute of Chemical Sciences
and Engineering, École Polytechnique
Fédérale de Lausanne, 1015 Lausanne, Switzerland
- National
Center for Competence in Research − Catalysis (NCCR-Catalysis), École Polytechnique Fédérale
de Lausanne, 1015 Lausanne, Switzerland
| | - Ksenia R. Briling
- Laboratory
for Computational Molecular Design, Institute of Chemical Sciences
and Engineering, École Polytechnique
Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Charlotte Bunne
- National
Center for Competence in Research − Catalysis (NCCR-Catalysis), École Polytechnique Fédérale
de Lausanne, 1015 Lausanne, Switzerland
- Learning
& Adaptive Systems Group, Department of Computer Science, ETH Zurich, 8092 Zurich, Switzerland
| | - Vignesh Ram Somnath
- National
Center for Competence in Research − Catalysis (NCCR-Catalysis), École Polytechnique Fédérale
de Lausanne, 1015 Lausanne, Switzerland
- Learning
& Adaptive Systems Group, Department of Computer Science, ETH Zurich, 8092 Zurich, Switzerland
| | - Ruben Laplaza
- Laboratory
for Computational Molecular Design, Institute of Chemical Sciences
and Engineering, École Polytechnique
Fédérale de Lausanne, 1015 Lausanne, Switzerland
- National
Center for Competence in Research − Catalysis (NCCR-Catalysis), École Polytechnique Fédérale
de Lausanne, 1015 Lausanne, Switzerland
| | - Andreas Krause
- National
Center for Competence in Research − Catalysis (NCCR-Catalysis), École Polytechnique Fédérale
de Lausanne, 1015 Lausanne, Switzerland
- Learning
& Adaptive Systems Group, Department of Computer Science, ETH Zurich, 8092 Zurich, Switzerland
| | - Clemence Corminboeuf
- Laboratory
for Computational Molecular Design, Institute of Chemical Sciences
and Engineering, École Polytechnique
Fédérale de Lausanne, 1015 Lausanne, Switzerland
- National
Center for Competence in Research − Catalysis (NCCR-Catalysis), École Polytechnique Fédérale
de Lausanne, 1015 Lausanne, Switzerland
| |
Collapse
|
6
|
Voss J. Machine learning for accuracy in density functional approximations. J Comput Chem 2024; 45:1829-1845. [PMID: 38668453 DOI: 10.1002/jcc.27366] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2023] [Revised: 02/16/2024] [Accepted: 03/25/2024] [Indexed: 07/21/2024]
Abstract
Machine learning techniques have found their way into computational chemistry as indispensable tools to accelerate atomistic simulations and materials design. In addition, machine learning approaches hold the potential to boost the predictive power of computationally efficient electronic structure methods, such as density functional theory, to chemical accuracy and to correct for fundamental errors in density functional approaches. Here, recent progress in applying machine learning to improve the accuracy of density functional and related approximations is reviewed. Promises and challenges in devising machine learning models transferable between different chemistries and materials classes are discussed with the help of examples applying promising models to systems far outside their training sets.
Collapse
Affiliation(s)
- Johannes Voss
- SUNCAT Center for Interface Science and Catalysis, SLAC National Accelerator Laboratory, Menlo Park, California, USA
| |
Collapse
|
7
|
Berger E, Niemelä J, Lampela O, Juffer AH, Komsa HP. Raman Spectra of Amino Acids and Peptides from Machine Learning Polarizabilities. J Chem Inf Model 2024; 64:4601-4612. [PMID: 38829726 DOI: 10.1021/acs.jcim.4c00077] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/05/2024]
Abstract
Raman spectroscopy is an important tool in the study of vibrational properties and composition of molecules, peptides, and even proteins. Raman spectra can be simulated based on the change of the electronic polarizability with vibrations, which can nowadays be efficiently obtained via machine learning models trained on first-principles data. However, the transferability of the models trained on small molecules to larger structures is unclear, and direct training on large structures is prohibitively expensive. In this work, we first train two machine learning models to predict the polarizabilities of all 20 amino acids. Both models are carefully benchmarked and compared to density functional theory (DFT) calculations, with the neural network method being found to offer better transferability. By combination of machine learning models with classical force field molecular dynamics, Raman spectra of all amino acids are also obtained and investigated, showing good agreement with experiments. The models are further extended to small peptides. We find that adding structures containing peptide bonds to the training set greatly improves predictions, even for peptides not included in training sets.
Collapse
Affiliation(s)
- Ethan Berger
- Microelectronics Research Unit, Faculty of Information Technology and Electrical Engineering, University of Oulu, P.O. Box 4500, Oulu FIN-90014, Finland
| | - Juha Niemelä
- Faculty of Biochemistry and Molecular Medicine, University of Oulu, Oulu FIN-90014, Finland
| | - Outi Lampela
- Biocenter Oulu and Faculty of Biochemistry and Molecular Medicine, University of Oulu, Oulu FIN-90014, Finland
| | - André H Juffer
- Biocenter Oulu and Faculty of Biochemistry and Molecular Medicine, University of Oulu, Oulu FIN-90014, Finland
| | - Hannu-Pekka Komsa
- Microelectronics Research Unit, Faculty of Information Technology and Electrical Engineering, University of Oulu, P.O. Box 4500, Oulu FIN-90014, Finland
| |
Collapse
|
8
|
Lee AJ, Rackers JA, Pathak S, Bricker WP. Building an ab initio solvated DNA model using Euclidean neural networks. PLoS One 2024; 19:e0297502. [PMID: 38358990 PMCID: PMC10868815 DOI: 10.1371/journal.pone.0297502] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2023] [Accepted: 01/06/2024] [Indexed: 02/17/2024] Open
Abstract
Accurately modeling large biomolecules such as DNA from first principles is fundamentally challenging due to the steep computational scaling of ab initio quantum chemistry methods. This limitation becomes even more prominent when modeling biomolecules in solution due to the need to include large numbers of solvent molecules. We present a machine-learned electron density model based on a Euclidean neural network framework that includes a built-in understanding of equivariance to model explicitly solvated double-stranded DNA. By training the machine learning model using molecular fragments that sample the key DNA and solvent interactions, we show that the model predicts electron densities of arbitrary systems of solvated DNA accurately, resolves polarization effects that are neglected by classical force fields, and captures the physics of the DNA-solvent interaction at the ab initio level.
Collapse
Affiliation(s)
- Alex J. Lee
- Department of Chemical and Biological Engineering, University of New Mexico, Albuquerque, NM, United States of America
| | - Joshua A. Rackers
- Center for Computing Research, Sandia National Laboratories, Albuquerque, NM, United States of America
| | - Shivesh Pathak
- Center for Computing Research, Sandia National Laboratories, Albuquerque, NM, United States of America
| | - William P. Bricker
- Department of Chemical and Biological Engineering, University of New Mexico, Albuquerque, NM, United States of America
| |
Collapse
|
9
|
Briling K, Calvino Alonso Y, Fabrizio A, Corminboeuf C. SPA HM(a,b): Encoding the Density Information from Guess Hamiltonian in Quantum Machine Learning Representations. J Chem Theory Comput 2024; 20:1108-1117. [PMID: 38227222 PMCID: PMC10867806 DOI: 10.1021/acs.jctc.3c01040] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Revised: 12/20/2023] [Accepted: 12/26/2023] [Indexed: 01/17/2024]
Abstract
Recently, we introduced a class of molecular representations for kernel-based regression methods─the spectrum of approximated Hamiltonian matrices (SPAHM)─that takes advantage of lightweight one-electron Hamiltonians traditionally used as a self-consistent field initial guess. The original SPAHM variant is built from occupied-orbital energies (i.e., eigenvalues) and naturally contains all of the information about nuclear charges, atomic positions, and symmetry requirements. Its advantages were demonstrated on data sets featuring a wide variation of charge and spin, for which traditional structure-based representations commonly fail. SPAHM(a,b), as introduced here, expand the eigenvalue SPAHM into local and transferable representations. They rely upon one-electron density matrices to build fingerprints from atomic and bond density overlap contributions inspired from preceding state-of-the-art representations. The performance and efficiency of SPAHM(a,b) is assessed on the predictions for data sets of prototypical organic molecules (QM7) of different charges and azoheteroarene dyes in an excited state. Overall, both SPAHM(a) and SPAHM(b) outperform state-of-the-art representations on difficult prediction tasks such as the atomic properties of charged open-shell species and of π-conjugated systems.
Collapse
Affiliation(s)
- Ksenia
R. Briling
- Laboratory
for Computational Molecular Design, Institute of Chemical Sciences
and Engineering, École Polytechnique
Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Yannick Calvino Alonso
- Laboratory
for Computational Molecular Design, Institute of Chemical Sciences
and Engineering, École Polytechnique
Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Alberto Fabrizio
- Laboratory
for Computational Molecular Design, Institute of Chemical Sciences
and Engineering, École Polytechnique
Fédérale de Lausanne, 1015 Lausanne, Switzerland
- National
Centre for Computational Design and Discovery of Novel Materials (MARVEL), École Polytechnique Fédérale
de Lausanne, 1015 Lausanne, Switzerland
| | - Clemence Corminboeuf
- Laboratory
for Computational Molecular Design, Institute of Chemical Sciences
and Engineering, École Polytechnique
Fédérale de Lausanne, 1015 Lausanne, Switzerland
- National
Centre for Computational Design and Discovery of Novel Materials (MARVEL), École Polytechnique Fédérale
de Lausanne, 1015 Lausanne, Switzerland
| |
Collapse
|
10
|
Pang S, Yu H, Zhang Y, Jiao Y, Zheng Z, Wang M, Zhang H, Liu A. Bioscreening specific peptide-expressing phage and its application in sensitive dual-mode immunoassay of SARS-CoV-2 spike antigen. Talanta 2024; 266:125093. [PMID: 37611368 DOI: 10.1016/j.talanta.2023.125093] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2023] [Revised: 08/12/2023] [Accepted: 08/17/2023] [Indexed: 08/25/2023]
Abstract
Biorecognition components with high affinity and selectivity are vital in bioassay to diagnose and treat epidemic disease. Herein a phage display strategy of combining single-amplification-panning with non-amplification-panning was developed, by which a phage displaying cyclic heptapeptide ACLDWLFNSC (peptide J4) with good affinity and specificity to SARS-CoV-2 spike protein (SP) was identified. Molecular docking suggests that peptide J4 binds to S2 subunit by hydrogen bonding and hydrophobic interaction. Then the J4-phage was used as the capture antibody to establish phage-based chemiluminescence immunoassay (CLIA) and electrochemical impedance spectroscopy (EIS) analytical systems. The as-proposed dual-modal immunoassay platform exhibited good sensitivity and reliability in SARS-CoV-2 SP and pseudovirus assay. The limit of detection for SARS-CoV-2 SP by EIS immunoassay is 0.152 pg/mL, which is dramatically lower than that of 42 pg/mL for J4-phage based CLIA. Further, low to 40 transducing units (TU)/mL, 10 TU/mL SARS-CoV-2 pseudoviruses can be detected by the proposed J4-phage based CLIA and electrochemical immunosensor, respectively. Therefore, the as-developed dual mode immunoassays are potential methods to detect SARS-CoV-2. It is also expected to explore various phages with specific peptides to different targets for bioanalysis.
Collapse
Affiliation(s)
- Shuang Pang
- Institute for Chemical Biology & Biosensing, College of Life Sciences, Qingdao University, 308 Ningxia Rd, Qingdao, 266071, China
| | - Haipeng Yu
- Institute for Chemical Biology & Biosensing, College of Life Sciences, Qingdao University, 308 Ningxia Rd, Qingdao, 266071, China
| | - Yaru Zhang
- Institute for Chemical Biology & Biosensing, College of Life Sciences, Qingdao University, 308 Ningxia Rd, Qingdao, 266071, China
| | - Yiming Jiao
- Institute for Chemical Biology & Biosensing, College of Life Sciences, Qingdao University, 308 Ningxia Rd, Qingdao, 266071, China
| | - Zongmei Zheng
- Institute for Chemical Biology & Biosensing, College of Life Sciences, Qingdao University, 308 Ningxia Rd, Qingdao, 266071, China; Qingdao Hightop Biotech Co., Ltd, 369 Hedong Road, Hi-tech Industrial Development Zone, Qingdao, 266112, China
| | - Mingyang Wang
- Institute for Chemical Biology & Biosensing, College of Life Sciences, Qingdao University, 308 Ningxia Rd, Qingdao, 266071, China
| | - Haohan Zhang
- Institute for Chemical Biology & Biosensing, College of Life Sciences, Qingdao University, 308 Ningxia Rd, Qingdao, 266071, China
| | - Aihua Liu
- Institute for Chemical Biology & Biosensing, College of Life Sciences, Qingdao University, 308 Ningxia Rd, Qingdao, 266071, China.
| |
Collapse
|
11
|
Huguenin-Dumittan K, Loche P, Haoran N, Ceriotti M. Physics-Inspired Equivariant Descriptors of Nonbonded Interactions. J Phys Chem Lett 2023; 14:9612-9618. [PMID: 37862712 PMCID: PMC10626632 DOI: 10.1021/acs.jpclett.3c02375] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Accepted: 10/13/2023] [Indexed: 10/22/2023]
Abstract
One essential ingredient in many machine learning (ML) based methods for atomistic modeling of materials and molecules is the use of locality. While allowing better system-size scaling, this systematically neglects long-range (LR) effects such as electrostatic or dispersion interactions. We present an extension of the long distance equivariant (LODE) framework that can handle diverse LR interactions in a consistent way and seamlessly integrates with preexisting methods by building new sets of atom centered features. We provide a direct physical interpretation of these using the multipole expansion, which allows for simpler and more efficient implementations. The framework is applied to simple toy systems as proof of concept and a heterogeneous set of molecular dimers to push the method to its limits. By generalizing LODE to arbitrary asymptotic behaviors, we provide a coherent approach to treat arbitrary two- and many-body nonbonded interactions in the data-driven modeling of matter.
Collapse
Affiliation(s)
- Kevin
K. Huguenin-Dumittan
- Laboratory
of Computational Science and Modeling, IMX,
École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Philip Loche
- Laboratory
of Computational Science and Modeling, IMX,
École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Ni Haoran
- Laboratory
of Computational Science and Modeling, IMX,
École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Michele Ceriotti
- Laboratory
of Computational Science and Modeling, IMX,
École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| |
Collapse
|
12
|
Azmat M, Ghalandari B, Jessica J, Xu Y, Li X, Su W, Qiang Z, Deng S, Azmat T, Jiang L, Ding X. PepDRED: De Novo Peptide Design with Strong Binding Affinity for Target Protein. Anal Chem 2023; 95:12264-12272. [PMID: 37553082 DOI: 10.1021/acs.analchem.3c01057] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/10/2023]
Abstract
De novo design of peptides that bind specifically to functional proteins is beneficial for diagnostics and therapeutics. However, complex permutations and combinations of amino acids pose significant challenges to the rational design of peptides with desirable stability and affinity. Herein, we develop a computational-based evolution method, namely, peptidomimetics-driven recognition elements design (PepDRED), to derive hemoglobin-inspired peptidomimetics. PepDRED mimics the natural evolutionism pipeline to generate stable apovariant (AVs) structures for wild-type counterparts via automated point mutations and validates their efficiency through free binding energy analysis and per residue energy decomposition analysis. For application demonstration, we applied PepDRED to design de novo peptides to bind FhuA, a typical TonB-dependent transporter (TBDT). TBDTs are Gram-negative bacterial outer membrane proteins responsible for iron transport and vital for bacterial resistance. PepDRED generated a pool of AVs and proceeded to reach an optimized peptide, AV440, with a remarkable binding affinity of -21 kcal/mol. AV440 is ∼2.5-fold stronger than the existing FhuA inhibitor Microcin J25. Network energy analysis further unveils that incorporating methionine (M42) in the N-terminal region significantly enhances inter-residue contacts and binding affinity. PepDRED offers a prompt and efficient in silico approach to develop potent peptide candidates for target proteins.
Collapse
Affiliation(s)
- Mehmoona Azmat
- Department of Anesthesiology and Surgical Intensive Care Unit, Xinhua Hospital, School of Medicine and School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai 200230, China
- State Key Laboratory of Oncogenes and Related Genes, Institute for Personalized Medicine, Shanghai Jiao Tong University, Shanghai 200230, China
| | - Behafarid Ghalandari
- State Key Laboratory of Oncogenes and Related Genes, Institute for Personalized Medicine, Shanghai Jiao Tong University, Shanghai 200230, China
| | - Jessica Jessica
- State Key Laboratory of Oncogenes and Related Genes, Institute for Personalized Medicine, Shanghai Jiao Tong University, Shanghai 200230, China
| | - Yuechen Xu
- State Key Laboratory of Oncogenes and Related Genes, Institute for Personalized Medicine, Shanghai Jiao Tong University, Shanghai 200230, China
| | - Xinle Li
- State Key Laboratory of Oncogenes and Related Genes, Institute for Personalized Medicine, Shanghai Jiao Tong University, Shanghai 200230, China
| | - Wenqiong Su
- State Key Laboratory of Oncogenes and Related Genes, Institute for Personalized Medicine, Shanghai Jiao Tong University, Shanghai 200230, China
| | - Zhang Qiang
- State Key Laboratory of Oncogenes and Related Genes, Institute for Personalized Medicine, Shanghai Jiao Tong University, Shanghai 200230, China
| | - Shuxin Deng
- State Key Laboratory of Oncogenes and Related Genes, Institute for Personalized Medicine, Shanghai Jiao Tong University, Shanghai 200230, China
| | - Tabina Azmat
- Department of Cyber Security, AIR University, PAF Complex, E-9, Islamabad 44000, Pakistan
| | - Lai Jiang
- Department of Anesthesiology and Surgical Intensive Care Unit, Xinhua Hospital, School of Medicine and School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai 200230, China
- State Key Laboratory of Oncogenes and Related Genes, Institute for Personalized Medicine, Shanghai Jiao Tong University, Shanghai 200230, China
| | - Xianting Ding
- Department of Anesthesiology and Surgical Intensive Care Unit, Xinhua Hospital, School of Medicine and School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai 200230, China
- State Key Laboratory of Oncogenes and Related Genes, Institute for Personalized Medicine, Shanghai Jiao Tong University, Shanghai 200230, China
| |
Collapse
|
13
|
Achar SK, Bernasconi L, Johnson JK. Machine Learning Electron Density Prediction Using Weighted Smooth Overlap of Atomic Positions. NANOMATERIALS (BASEL, SWITZERLAND) 2023; 13:1853. [PMID: 37368284 DOI: 10.3390/nano13121853] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/11/2023] [Revised: 05/29/2023] [Accepted: 06/11/2023] [Indexed: 06/28/2023]
Abstract
Having access to accurate electron densities in chemical systems, especially for dynamical systems involving chemical reactions, ion transport, and other charge transfer processes, is crucial for numerous applications in materials chemistry. Traditional methods for computationally predicting electron density data for such systems include quantum mechanical (QM) techniques, such as density functional theory. However, poor scaling of these QM methods restricts their use to relatively small system sizes and short dynamic time scales. To overcome this limitation, we have developed a deep neural network machine learning formalism, which we call deep charge density prediction (DeepCDP), for predicting charge densities by only using atomic positions for molecules and condensed phase (periodic) systems. Our method uses the weighted smooth overlap of atomic positions to fingerprint environments on a grid-point basis and map it to electron density data generated from QM simulations. We trained models for bulk systems of copper, LiF, and silicon; for a molecular system, water; and for two-dimensional charged and uncharged systems, hydroxyl-functionalized graphane, with and without an added proton. We showed that DeepCDP achieves prediction R2 values greater than 0.99 and mean squared error values on the order of 10-5e2 Å-6 for most systems. DeepCDP scales linearly with system size, is highly parallelizable, and is capable of accurately predicting the excess charge in protonated hydroxyl-functionalized graphane. We demonstrate how DeepCDP can be used to accurately track the location of charges (protons) by computing electron densities at a few selected grid points in the materials, thus significantly reducing the computational cost. We also show that our models can be transferable, allowing prediction of electron densities for systems on which it has not been trained but that contain a subset of atomic species on which it has been trained. Our approach can be used to develop models that span different chemical systems and train them for the study of large-scale charge transport and chemical reactions.
Collapse
Affiliation(s)
- Siddarth K Achar
- Computational Modeling & Simulation Program, University of Pittsburgh, Pittsburgh, PA 15260, USA
- Department of Chemical & Petroleum Engineering, University of Pittsburgh, Pittsburgh, PA 15261, USA
| | - Leonardo Bernasconi
- Center for Research Computing and Department of Chemistry, University of Pittsburgh, Pittsburgh, PA 15260, USA
| | - J Karl Johnson
- Department of Chemical & Petroleum Engineering, University of Pittsburgh, Pittsburgh, PA 15261, USA
| |
Collapse
|
14
|
Schütt KT, Hessmann SSP, Gebauer NWA, Lederer J, Gastegger M. SchNetPack 2.0: A neural network toolbox for atomistic machine learning. J Chem Phys 2023; 158:144801. [PMID: 37061495 DOI: 10.1063/5.0138367] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/17/2023] Open
Abstract
SchNetPack is a versatile neural network toolbox that addresses both the requirements of method development and the application of atomistic machine learning. Version 2.0 comes with an improved data pipeline, modules for equivariant neural networks, and a PyTorch implementation of molecular dynamics. An optional integration with PyTorch Lightning and the Hydra configuration framework powers a flexible command-line interface. This makes SchNetPack 2.0 easily extendable with a custom code and ready for complex training tasks, such as the generation of 3D molecular structures.
Collapse
Affiliation(s)
- Kristof T Schütt
- Machine Learning Group, Technische Universität Berlin, 10587 Berlin, Germany
| | | | - Niklas W A Gebauer
- Machine Learning Group, Technische Universität Berlin, 10587 Berlin, Germany
| | - Jonas Lederer
- Machine Learning Group, Technische Universität Berlin, 10587 Berlin, Germany
| | - Michael Gastegger
- Machine Learning Group, Technische Universität Berlin, 10587 Berlin, Germany
| |
Collapse
|
15
|
Jinnouchi R, Minami S, Karsai F, Verdi C, Kresse G. Proton Transport in Perfluorinated Ionomer Simulated by Machine-Learned Interatomic Potential. J Phys Chem Lett 2023; 14:3581-3588. [PMID: 37018477 DOI: 10.1021/acs.jpclett.3c00293] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Polymers are a class of materials that are highly challenging to deal with using first-principles methods. Here, we present an application of machine-learned interatomic potentials to predict structural and dynamical properties of dry and hydrated perfluorinated ionomers. An improved active-learning algorithm using a small number of descriptors allows to efficiently construct an accurate and transferable model for this multielemental amorphous polymer. Molecular dynamics simulations accelerated by the machine-learned potentials accurately reproduce the heterogeneous hydrophilic and hydrophobic domains formed in this material as well as proton and water diffusion coefficients under a variety of humidity conditions. Our results reveal pronounced contributions of Grotthuss chains consisting of two to three water molecules to the high proton mobility under strongly humidified conditions.
Collapse
Affiliation(s)
- Ryosuke Jinnouchi
- Toyota Central R&D Laboratories., Inc., 41-1 Yokomichi, Nagakute, Aichi 480-1192, Japan
| | - Saori Minami
- Toyota Central R&D Laboratories., Inc., 41-1 Yokomichi, Nagakute, Aichi 480-1192, Japan
| | - Ferenc Karsai
- VASP Software GmbH, Sensengasse 8, 1090 Vienna, Austria
| | - Carla Verdi
- University of Vienna, Faculty of Physics, Computational Materials Physics, Kolingasse 14-16, 1090 Vienna, Austria
| | - Georg Kresse
- VASP Software GmbH, Sensengasse 8, 1090 Vienna, Austria
- University of Vienna, Faculty of Physics, Computational Materials Physics, Kolingasse 14-16, 1090 Vienna, Austria
| |
Collapse
|
16
|
Xiao Y, Shen C, Zhang W, Zhang M, Zhang H, Shao T, Xiong Z, Ding Y, Hao S, Liu L, Chen Y, Li J. Electrocatalytic Biomass Upgrading of Furfural using Transition-Metal Borides via Density Functional Theory Investigation. SMALL (WEINHEIM AN DER BERGSTRASSE, GERMANY) 2023; 19:e2205876. [PMID: 36494175 DOI: 10.1002/smll.202205876] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/25/2022] [Revised: 11/13/2022] [Indexed: 06/17/2023]
Abstract
Electrocatalytic biomass upgrading has proven to be an effective technique for generating value-added products. Herein, the design and development of furfural upgrading using transition-metal borides (MBenes) with simultaneous production of hydrogen are presented. Using density functional theory, the stabilities, selectivities, and activities of 13 MBene candidates are systematically evaluated for furfural upgrading. This research suggests that Fe2 B2 can serve as a promising electrocatalyst for the formation of furoic acid (FAC), with a limiting potential of -0.15 V, and 5-hydroxy-2(5H)-furanone (HFO), with a limiting potential of -0.93 V. Furthermore, Fe2 B2 and Mn2 Fe2 are shown to exhibit favorable limiting potentials of -1.35 and -1.36 V, respectively, for producing 6-hydroxy-2.3-dihydro-6H-pyrano-3-one (HDPO), indicating that they may also serve as electrocatalysts. Based on Sabatier's principle, a descriptor (φ) of material properties is developed for screening catalysts with high catalytic activity considering the electronegativities and d-electron number of metals. Additionally, surface redox potential, electronic properties, and charge-density differences are determined for Fe2 B2 , which is estimated to exhibit high catalytic activity for the oxidation of furfural to FAC and HFO.
Collapse
Affiliation(s)
- Yi Xiao
- Key Laboratory of Advanced Technologies of Materials (Ministry of Education), School of Materials Science and Engineering, Southwest Jiaotong University, Chengdu, 610031, China
- Institute of Materials Science, TU Darmstadt, 64287, Darmstadt, Germany
| | - Chen Shen
- Institute of Materials Science, TU Darmstadt, 64287, Darmstadt, Germany
| | - Weibin Zhang
- Institute of Physics and Electronic Information, Yunnan Normal University, Kunming, 650500, China
| | - Meiling Zhang
- Key Laboratory of Advanced Technologies of Materials (Ministry of Education), School of Materials Science and Engineering, Southwest Jiaotong University, Chengdu, 610031, China
| | - Haowen Zhang
- Key Laboratory of Advanced Technologies of Materials (Ministry of Education), School of Materials Science and Engineering, Southwest Jiaotong University, Chengdu, 610031, China
| | - Taihuan Shao
- Key Laboratory of Advanced Technologies of Materials (Ministry of Education), School of Materials Science and Engineering, Southwest Jiaotong University, Chengdu, 610031, China
| | - Zhengwei Xiong
- Joint Laboratory for Extreme Conditions Matter Properties, Southwest University of Science and Technology, Mianyang, 621010, China
| | - Yingchun Ding
- College of Optoelectronics Technology, Chengdu University of Information Technology, Chengdu, 610225, China
| | - Shu Hao
- Institute of Water Conservancy and hydropower, Xi'an University of Technology, Xi'an, 710048, China
| | - Li Liu
- Laboratory of New Energy and Materials, Xinjiang Institute of Engineering, Urumqi, 830091, China
| | - Yu'an Chen
- College of Materials Science and Engineering, Chongqing University, Chongqing, 400044, China
- National Engineering Research Center for Magnesium Alloys, Chongqing University, Chongqing, 400044, China
| | - Jinyang Li
- Key Laboratory of Advanced Technologies of Materials (Ministry of Education), School of Materials Science and Engineering, Southwest Jiaotong University, Chengdu, 610031, China
- School of Materials Science and Engineering, Yibin Institute of Southwest Jiaotong University, Yibin, 644000, P. R. China
| |
Collapse
|
17
|
Csizi K, Reiher M. Universal
QM
/
MM
approaches for general nanoscale applications. WIRES COMPUTATIONAL MOLECULAR SCIENCE 2023. [DOI: 10.1002/wcms.1656] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Affiliation(s)
| | - Markus Reiher
- Laboratorium für Physikalische Chemie ETH Zürich Zürich Switzerland
| |
Collapse
|
18
|
Bertoni AI, Sánchez CG. Data-driven approach for benchmarking DFTB-approximate excited state methods. Phys Chem Chem Phys 2023; 25:3789-3798. [PMID: 36645084 DOI: 10.1039/d2cp04979a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
In this work we propose a chemically-informed data-driven approach to benchmark the approximate density-functional tight-binding (DFTB) excited state (ES) methods that are currently available within the DFTB+ suite. By taking advantage of the large volume of low-detail ES-data in the machine learning (ML) dataset, QM8, we were able to extract valuable insights regarding the limitations of the benchmarked methods in terms of the approximations made to the parent formalism, density-functional theory (DFT), while providing recommendations on how to overcome them. For this benchmark, we compared the first singlet-singlet vertical excitation energies (E1) predicted by the DFTB-approximate methods with predictions of less approximate methods from the reference ML-dataset. For the nearly 21800 organic molecules in the GDB-8 chemical space, we were able to identify clear trends in the E1 prediction error distributions, with respect to second-order approximate coupled cluster (CC2), showing a strong dependence on chemical identity.
Collapse
Affiliation(s)
- Andrés I Bertoni
- Instituto Interdisciplinario de Ciencias Básicas (ICB-CONICET), Universidad Nacional de Cuyo, Padre Jorge Contreras 1300, Mendoza 5502, Argentina.
| | - Cristián G Sánchez
- Instituto Interdisciplinario de Ciencias Básicas (ICB-CONICET), Universidad Nacional de Cuyo, Padre Jorge Contreras 1300, Mendoza 5502, Argentina.
| |
Collapse
|
19
|
Xue W, Liu H, Zhao B, Tang C, Xia BY, You B. Interheteromolecular Hyperconjugation Boosts (De)hydrogenation for Reversible H 2 Storage. CHEMSUSCHEM 2023; 16:e202201512. [PMID: 36321739 DOI: 10.1002/cssc.202201512] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/07/2022] [Revised: 10/27/2022] [Indexed: 06/16/2023]
Abstract
Interheteromolecular hyperconjugation is ubiquitous in organic systems, affecting bond length, dipole moments, conformations and so on, while its effect on (de)hydrogenation reactivity in a heterogeneous thermo-catalytic system has rarely been explored. Herein, the N-heterocycles containing a benzene ring and aliphatic chain [N-ethylcarbazole (NEC) and N-propylcarbazole (NPC)] were utilized to study the correlation between interheteromolecular hyperconjugation and catalytic (de)hydrogenation. Density functional theory calculations, variable-temperature 1 H nuclear magnetic resonance spectroscopy, and catalytic experiments showed that the presented hyperconjugation between NEC and NPC weakened the electron cloud density of aromatic rings and thus facilitated the reactivity with hydrogen featuring unpaired electrons. Therefore, an extremely low temperature of 80 °C was enough for the hydrogenation. Moreover, this interheteromolecular hyperconjugation was general in other N-heterocycles (e. g., N-methyindole and NPC) and was also effective to (de)deuterate as revealed by isotope experiments. This work expands the application of interheteromolecular hyperconjugation to heterogeneous thermocatalysis for reversible H2 storage.
Collapse
Affiliation(s)
- Wenjie Xue
- Key Laboratory of Material Chemistry for Energy Conversion and Storage, Ministry of Education, Hubei Key Laboratory of Material Chemistry and Service Failure, School of Chemistry and Chemical Engineering, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China
| | - Hongxia Liu
- School of Chemistry and Chemical Engineering, Wuhan Textile University, Wuhan, 430200, China
| | - Binbin Zhao
- Department of Materials Science and Engineering, National University of Singapore, Singapore, 117575, Singapore
| | - Conghui Tang
- Key Laboratory of Material Chemistry for Energy Conversion and Storage, Ministry of Education, Hubei Key Laboratory of Material Chemistry and Service Failure, School of Chemistry and Chemical Engineering, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China
| | - Bao Yu Xia
- Key Laboratory of Material Chemistry for Energy Conversion and Storage, Ministry of Education, Hubei Key Laboratory of Material Chemistry and Service Failure, School of Chemistry and Chemical Engineering, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China
| | - Bo You
- Key Laboratory of Material Chemistry for Energy Conversion and Storage, Ministry of Education, Hubei Key Laboratory of Material Chemistry and Service Failure, School of Chemistry and Chemical Engineering, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China
| |
Collapse
|
20
|
Grisafi A, Lewis AM, Rossi M, Ceriotti M. Electronic-Structure Properties from Atom-Centered Predictions of the Electron Density. J Chem Theory Comput 2022. [PMID: 36453538 DOI: 10.1021/acs.jctc.2c00850] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/05/2022]
Abstract
The electron density of a molecule or material has recently received major attention as a target quantity of machine-learning models. A natural choice to construct a model that yields transferable and linear-scaling predictions is to represent the scalar field using a multicentered atomic basis analogous to that routinely used in density fitting approximations. However, the nonorthogonality of the basis poses challenges for the learning exercise, as it requires accounting for all the atomic density components at once. We devise a gradient-based approach to directly minimize the loss function of the regression problem in an optimized and highly sparse feature space. In so doing, we overcome the limitations associated with adopting an atom-centered model to learn the electron density over arbitrarily complex data sets, obtaining very accurate predictions using a comparatively small training set. The enhanced framework is tested on 32-molecule periodic cells of liquid water, presenting enough complexity to require an optimal balance between accuracy and computational efficiency. We show that starting from the predicted density a single Kohn-Sham diagonalization step can be performed to access total energy components that carry an error of just 0.1 meV/atom with respect to the reference density functional calculations. Finally, we test our method on the highly heterogeneous QM9 benchmark data set, showing that a small fraction of the training data is enough to derive ground-state total energies within chemical accuracy.
Collapse
Affiliation(s)
- Andrea Grisafi
- PASTEUR, Département de chimie, École Normale Supérieure, PSL University, Sorbonne Université, CNRS, 75005 Paris, France
| | - Alan M. Lewis
- Max Planck Institute for the Structure and Dynamics of Matter, Luruper Chaussee 149, 22761 Hamburg, Germany
| | - Mariana Rossi
- Max Planck Institute for the Structure and Dynamics of Matter, Luruper Chaussee 149, 22761 Hamburg, Germany
| | - Michele Ceriotti
- Laboratory of Computational Science and Modeling, IMX, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| |
Collapse
|
21
|
Lee AJ, Rackers JA, Bricker WP. Predicting accurate ab initio DNA electron densities with equivariant neural networks. Biophys J 2022; 121:3883-3895. [PMID: 36057785 PMCID: PMC9674991 DOI: 10.1016/j.bpj.2022.08.045] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2022] [Revised: 07/22/2022] [Accepted: 08/29/2022] [Indexed: 11/19/2022] Open
Abstract
One of the fundamental limitations of accurately modeling biomolecules like DNA is the inability to perform quantum chemistry calculations on large molecular structures. We present a machine learning model based on an equivariant Euclidean neural network framework to obtain accurate ab initio electron densities for arbitrary DNA structures that are much too large for conventional quantum methods. The model is trained on representative B-DNA basepair steps that capture both base pairing and base stacking interactions. The model produces accurate electron densities for arbitrary B-DNA structures with typical errors of less than 1%. Crucially, the error does not increase with system size, which suggests that the model can extrapolate to large DNA structures with negligible loss of accuracy. The model also generalizes reasonably to other DNA structural motifs such as the A- and Z-DNA forms, despite being trained on only B-DNA configurations. The model is used to calculate electron densities of several large-scale DNA structures, and we show that the computational scaling for this model is essentially linear. We also show that this machine learning electron density model can be used to calculate accurate electrostatic potentials for DNA. These electrostatic potentials produce more accurate results compared with classical force fields and do not show the usual deficiencies at short range.
Collapse
Affiliation(s)
- Alex J Lee
- Department of Chemical and Biological Engineering, University of New Mexico, Albuquerque, New Mexico
| | - Joshua A Rackers
- Center for Computing Research, Sandia National Laboratories, Albuquerque, New Mexico.
| | - William P Bricker
- Department of Chemical and Biological Engineering, University of New Mexico, Albuquerque, New Mexico.
| |
Collapse
|
22
|
Li M, Dai L, Hu Y. Machine Learning for Harnessing Thermal Energy: From Materials Discovery to System Optimization. ACS ENERGY LETTERS 2022; 7:3204-3226. [PMID: 37325775 PMCID: PMC10264155 DOI: 10.1021/acsenergylett.2c01836] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
Recent advances in machine learning (ML) have impacted research communities based on statistical perspectives and uncovered invisibles from conventional standpoints. Though the field is still in the early stage, this progress has driven the thermal science and engineering communities to apply such cutting-edge toolsets for analyzing complex data, unraveling abstruse patterns, and discovering non-intuitive principles. In this work, we present a holistic overview of the applications and future opportunities of ML methods on crucial topics in thermal energy research, from bottom-up materials discovery to top-down system design across atomistic levels to multi-scales. In particular, we focus on a spectrum of impressive ML endeavors investigating the state-of-the-art thermal transport modeling (density functional theory, molecular dynamics, and Boltzmann transport equation), different families of materials (semiconductors, polymers, alloys, and composites), assorted aspects of thermal properties (conductivity, emissivity, stability, and thermoelectricity), and engineering prediction and optimization (devices and systems). We discuss the promises and challenges of current ML approaches and provide perspectives for future directions and new algorithms that could make further impacts on thermal energy research.
Collapse
Affiliation(s)
- Man Li
- Department of Mechanical and Aerospace Engineering, University of California, Los Angeles, Los Angeles, California 90095, United States
| | - Lingyun Dai
- Department of Mechanical and Aerospace Engineering, University of California, Los Angeles, Los Angeles, California 90095, United States
| | - Yongjie Hu
- Department of Mechanical and Aerospace Engineering, University of California, Los Angeles, Los Angeles, California 90095, United States
| |
Collapse
|
23
|
Feride Akman, Kazachenko AS, Issaoui N. DFT Calculations of Some Important Radicals Used in the Nitroxide-Mediated Polymerization and Their HOMO‒LUMO, Natural Bond Orbital, and Molecular Electrostatic Potential Comparative Analysis. POLYMER SCIENCE SERIES B 2022. [DOI: 10.1134/s156009042270035x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
|
24
|
Feride Akman, Kazachenko AS, Issaoui N. DFT Calculations of Some Important Radicals Used in the Nitroxide-Mediated Polymerization and Their HOMO‒LUMO, Natural Bond Orbital, and Molecular Electrostatic Potential Comparative Analysis. POLYMER SCIENCE SERIES B 2022. [DOI: doi.org/10.1134/s156009042270035x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
25
|
Fabregat R, van Gerwen P, Haeberle M, Eisenbrand F, Corminboeuf C. Metric Learning for Kernel Ridge Regression: Assessment of Molecular Similarity. MACHINE LEARNING: SCIENCE AND TECHNOLOGY 2022. [DOI: 10.1088/2632-2153/ac8e4f] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022] Open
Abstract
Abstract
Supervised and unsupervised kernel-based algorithms broadly used in physical sciences rely upon the notion of similarity. Their reliance on pre-defined distance metrics - e.g., the Euclidean or Manhattan distance - are problematic especially when used in combination with high-dimensional feature vectors for which the similarity measure does not well-reflect the differences in the target property. Metric learning is an elegant approach to surmount this shortcoming and find a property-informed transformation of the feature space. We propose a new algorithm for metric learning specifically adapted for Kernel Ridge Regression (Metric Learning for Kernel Ridge Regression or MLKRR), with applications in the physical sciences in mind. The MLKRR algorithm is built upon the Metric Learning for Kernel Regression (MLKR) framework based on the Nadaraya-Watson estimator, which we show to be inferior to the KRR estimator for typical physics-based machine learning tasks. Our MLKRR algorithm shows superior predictive performance on the benchmark regression task of atomisation energies of QM9 molecules, as well as generating more meaningful low-dimensional projections of the modified feature space.
Collapse
|
26
|
Fedik N, Zubatyuk R, Kulichenko M, Lubbers N, Smith JS, Nebgen B, Messerly R, Li YW, Boldyrev AI, Barros K, Isayev O, Tretiak S. Extending machine learning beyond interatomic potentials for predicting molecular properties. Nat Rev Chem 2022; 6:653-672. [PMID: 37117713 DOI: 10.1038/s41570-022-00416-3] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/15/2022] [Indexed: 11/09/2022]
Abstract
Machine learning (ML) is becoming a method of choice for modelling complex chemical processes and materials. ML provides a surrogate model trained on a reference dataset that can be used to establish a relationship between a molecular structure and its chemical properties. This Review highlights developments in the use of ML to evaluate chemical properties such as partial atomic charges, dipole moments, spin and electron densities, and chemical bonding, as well as to obtain a reduced quantum-mechanical description. We overview several modern neural network architectures, their predictive capabilities, generality and transferability, and illustrate their applicability to various chemical properties. We emphasize that learned molecular representations resemble quantum-mechanical analogues, demonstrating the ability of the models to capture the underlying physics. We also discuss how ML models can describe non-local quantum effects. Finally, we conclude by compiling a list of available ML toolboxes, summarizing the unresolved challenges and presenting an outlook for future development. The observed trends demonstrate that this field is evolving towards physics-based models augmented by ML, which is accompanied by the development of new methods and the rapid growth of user-friendly ML frameworks for chemistry.
Collapse
|
27
|
Qiao Z, Christensen AS, Welborn M, Manby FR, Anandkumar A, Miller TF. Informing geometric deep learning with electronic interactions to accelerate quantum chemistry. Proc Natl Acad Sci U S A 2022; 119:e2205221119. [PMID: 35901215 PMCID: PMC9351474 DOI: 10.1073/pnas.2205221119] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2022] [Accepted: 06/06/2022] [Indexed: 01/30/2023] Open
Abstract
Predicting electronic energies, densities, and related chemical properties can facilitate the discovery of novel catalysts, medicines, and battery materials. However, existing machine learning techniques are challenged by the scarcity of training data when exploring unknown chemical spaces. We overcome this barrier by systematically incorporating knowledge of molecular electronic structure into deep learning. By developing a physics-inspired equivariant neural network, we introduce a method to learn molecular representations based on the electronic interactions among atomic orbitals. Our method, OrbNet-Equi, leverages efficient tight-binding simulations and learned mappings to recover high-fidelity physical quantities. OrbNet-Equi accurately models a wide spectrum of target properties while being several orders of magnitude faster than density functional theory. Despite only using training samples collected from readily available small-molecule libraries, OrbNet-Equi outperforms traditional semiempirical and machine learning-based methods on comprehensive downstream benchmarks that encompass diverse main-group chemical processes. Our method also describes interactions in challenging charge-transfer complexes and open-shell systems. We anticipate that the strategy presented here will help to expand opportunities for studies in chemistry and materials science, where the acquisition of experimental or reference training data is costly.
Collapse
Affiliation(s)
- Zhuoran Qiao
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA 91125
| | | | | | | | - Anima Anandkumar
- Division of Engineering and Applied Science, California Institute of Technology, Pasadena, CA 91125
- Nvidia Corporation, Santa Clara, CA 95051
| | - Thomas F. Miller
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA 91125
- Entos, Inc., Los Angeles, CA 90027
| |
Collapse
|
28
|
DFT rationalization of metal-catalyst-controlled coupling of carbazole with diazo-naphthalen-2(1H)-one. MOLECULAR CATALYSIS 2022. [DOI: 10.1016/j.mcat.2022.112574] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
29
|
Cheng L, Sun J, Miller TF. Accurate Molecular-Orbital-Based Machine Learning Energies via Unsupervised Clustering of Chemical Space. J Chem Theory Comput 2022; 18:4826-4835. [PMID: 35858242 DOI: 10.1021/acs.jctc.2c00396] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
We introduce an unsupervised clustering algorithm to improve training efficiency and accuracy in predicting energies using molecular-orbital-based machine learning (MOB-ML). This work determines clusters via the Gaussian mixture model (GMM) in an entirely automatic manner and simplifies an earlier supervised clustering approach [ J. Chem. Theory Comput. 2019, 15, 6668] by eliminating both the necessity for user-specified parameters and the training of an additional classifier. Unsupervised clustering results from GMM have the advantages of accurately reproducing chemically intuitive groupings of frontier molecular orbitals and exhibiting improved performance with an increasing number of training examples. The resulting clusters from supervised or unsupervised clustering are further combined with scalable Gaussian process regression (GPR) or linear regression (LR) to learn molecular energies accurately by generating a local regression model in each cluster. Among all four combinations of regressors and clustering methods, GMM combined with scalable exact GPR (GMM/GPR) is the most efficient training protocol for MOB-ML. The numerical tests of molecular energy learning on thermalized data sets of drug-like molecules demonstrate the improved accuracy, transferability, and learning efficiency of GMM/GPR over other training protocols for MOB-ML, i.e., supervised regression clustering combined with GPR (RC/GPR) and GPR without clustering. GMM/GPR also provides the best molecular energy predictions compared with ones from the literature on the same benchmark data sets. With a lower scaling, GMM/GPR has a 10.4-fold speedup in wall-clock training time compared with scalable exact GPR with a training size of 6500 QM7b-T molecules.
Collapse
Affiliation(s)
- Lixue Cheng
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, United States
| | - Jiace Sun
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, United States
| | - Thomas F Miller
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, United States
| |
Collapse
|
30
|
Isert C, Atz K, Jiménez-Luna J, Schneider G. QMugs, quantum mechanical properties of drug-like molecules. Sci Data 2022; 9:273. [PMID: 35672335 PMCID: PMC9174255 DOI: 10.1038/s41597-022-01390-7] [Citation(s) in RCA: 43] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2021] [Accepted: 05/17/2022] [Indexed: 12/16/2022] Open
Abstract
Machine learning approaches in drug discovery, as well as in other areas of the chemical sciences, benefit from curated datasets of physical molecular properties. However, there currently is a lack of data collections featuring large bioactive molecules alongside first-principle quantum chemical information. The open-access QMugs (Quantum-Mechanical Properties of Drug-like Molecules) dataset fills this void. The QMugs collection comprises quantum mechanical properties of more than 665 k biologically and pharmacologically relevant molecules extracted from the ChEMBL database, totaling ~2 M conformers. QMugs contains optimized molecular geometries and thermodynamic data obtained via the semi-empirical method GFN2-xTB. Atomic and molecular properties are provided on both the GFN2-xTB and on the density-functional levels of theory (DFT, ωB97X-D/def2-SVP). QMugs features molecules of significantly larger size than previously-reported collections and comprises their respective quantum mechanical wave functions, including DFT density and orbital matrices. This dataset is intended to facilitate the development of models that learn from molecular data on different levels of theory while also providing insight into the corresponding relationships between molecular structure and biological activity.
Collapse
Affiliation(s)
- Clemens Isert
- Department of Chemistry and Applied Biosciences, RETHINK, ETH Zurich, 8093, Zurich, Switzerland
| | - Kenneth Atz
- Department of Chemistry and Applied Biosciences, RETHINK, ETH Zurich, 8093, Zurich, Switzerland
| | - José Jiménez-Luna
- Department of Chemistry and Applied Biosciences, RETHINK, ETH Zurich, 8093, Zurich, Switzerland.
- Department of Medicinal Chemistry, Boehringer Ingelheim Pharma GmbH & Co. KG, Birkendorfer Straße 65, 88397, Biberach an der Riss, Germany.
| | - Gisbert Schneider
- Department of Chemistry and Applied Biosciences, RETHINK, ETH Zurich, 8093, Zurich, Switzerland.
- ETH Singapore SEC Ltd, 1 CREATE Way, #06-01 CREATE Tower, Singapore, 138602, Singapore.
| |
Collapse
|
31
|
Staacke CG, Wengert S, Kunkel C, Csányi G, Reuter K, Margraf JT. Kernel charge equilibration: efficient and accurate prediction of molecular dipole moments with a machine-learning enhanced electron density model. MACHINE LEARNING: SCIENCE AND TECHNOLOGY 2022. [DOI: 10.1088/2632-2153/ac568d] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022] Open
Abstract
Abstract
State-of-the-art machine learning (ML) interatomic potentials use local representations of atomic environments to ensure linear scaling and size-extensivity. This implies a neglect of long-range interactions, most prominently related to electrostatics. To overcome this limitation, we herein present a ML framework for predicting charge distributions and their interactions termed kernel charge equilibration (kQEq). This model is based on classical charge equilibration (QEq) models expanded with an environment-dependent electronegativity. In contrast to previously reported neural network models with a similar concept, kQEq takes advantage of the linearity of both QEq and Kernel Ridge Regression to obtain a closed-form linear algebra expression for training the models. Furthermore, we avoid the ambiguity of charge partitioning schemes by using dipole moments as reference data. As a first application, we show that kQEq can be used to generate accurate and highly data-efficient models for molecular dipole moments.
Collapse
|
32
|
Computational approach in lignin structural models: Influence of non-covalent intramolecular interactions on βO4 bond properties. J Mol Struct 2022. [DOI: 10.1016/j.molstruc.2021.131938] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
33
|
Alencar WLM, da Silva Arouche T, Neto AFG, de Castro Ramalho T, de Carvalho Júnior RN, de Jesus Chaves Neto AM. Interactions of Co, Cu, and non-metal phthalocyanines with external structures of SARS-CoV-2 using docking and molecular dynamics. Sci Rep 2022; 12:3316. [PMID: 35228662 PMCID: PMC8885651 DOI: 10.1038/s41598-022-07396-w] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Accepted: 02/10/2022] [Indexed: 02/06/2023] Open
Abstract
The new coronavirus, SARS-CoV-2, caused the COVID-19 pandemic, characterized by its high rate of contamination, propagation capacity, and lethality rate. In this work, we approach the use of phthalocyanines as an inhibitor of SARS-CoV-2, as they present several interactive properties of the phthalocyanines (Pc) of Cobalt (CoPc), Copper (CuPc) and without a metal group (NoPc) can interact with SARS-CoV-2, showing potential be used as filtering by adsorption on paints on walls, masks, clothes, and air conditioning filters. Molecular modeling techniques through Molecular Docking and Molecular Dynamics were used, where the target was the external structures of the virus, but specifically the envelope protein, main protease, and Spike glycoprotein proteases. Using the g_MM-GBSA module and with it, the molecular docking studies show that the ligands have interaction characteristics capable of adsorbing the structures. Molecular dynamics provided information on the root-mean-square deviation of the atomic positions provided values between 1 and 2.5. The generalized Born implicit solvation model, Gibbs free energy, and solvent accessible surface area approach were used. Among the results obtained through molecular dynamics, it was noticed that interactions occur since Pc could bind to residues of the active site of macromolecules, demonstrating good interactions; in particular with CoPc. Molecular couplings and free energy showed that S-gly active site residues interacted strongly with phthalocyanines with values of - 182.443 kJ/mol (CoPc), 158.954 kJ/mol (CuPc), and - 129.963 kJ/mol (NoPc). The interactions of Pc's with SARS-CoV-2 may predict some promising candidates for antagonists to the virus, which if confirmed through experimental approaches, may contribute to resolving the global crisis of the COVID-19 pandemic.
Collapse
Affiliation(s)
- Wilson Luna Machado Alencar
- Laboratory of Preparation and Computation of Nanomaterials (LPCN), Federal University of Pará, C. P. 479, Belem, PA, 66075-110, Brazil
- Pos-Graduation Program in Engineering of Natural Resources of the Amazon, ITEC, Federal University of Pará, C. P. 2626, Belém, PA, 66050-540, Brazil
- Federal Institute of Pará (IFPA), C. P. BR 316, Km 61, Castanhal, PA, 68740-970, Brazil
| | - Tiago da Silva Arouche
- Laboratory of Preparation and Computation of Nanomaterials (LPCN), Federal University of Pará, C. P. 479, Belem, PA, 66075-110, Brazil
| | | | | | - Raul Nunes de Carvalho Júnior
- Pos-Graduation Program in Engineering of Natural Resources of the Amazon, ITEC, Federal University of Pará, C. P. 2626, Belém, PA, 66050-540, Brazil
- Pos-Graduation Program in Chemical Engineering, ITEC, Federal University of Pará, C. P. 479, Belém, PA, 66075-900, Brazil
| | - Antonio Maia de Jesus Chaves Neto
- Laboratory of Preparation and Computation of Nanomaterials (LPCN), Federal University of Pará, C. P. 479, Belem, PA, 66075-110, Brazil.
- Pos-Graduation Program in Engineering of Natural Resources of the Amazon, ITEC, Federal University of Pará, C. P. 2626, Belém, PA, 66050-540, Brazil.
- Pos-Graduation Program in Chemical Engineering, ITEC, Federal University of Pará, C. P. 479, Belém, PA, 66075-900, Brazil.
- National Professional Master's in Physics Teaching, Federal University of Pará, C. P. 479, Belém, PA, 66075-110, Brazil.
| |
Collapse
|
34
|
Wang FQ, Choudhary K, Liu Y, Hu J, Hu M. Large scale dataset of real space electronic charge density of cubic inorganic materials from density functional theory (DFT) calculations. Sci Data 2022; 9:59. [PMID: 35190537 PMCID: PMC8861008 DOI: 10.1038/s41597-022-01158-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2021] [Accepted: 01/14/2022] [Indexed: 11/10/2022] Open
Abstract
Driven by the big data science, material informatics has attracted enormous research interests recently along with many recognized achievements. To acquire knowledge of materials by previous experience, both feature descriptors and databases are essential for training machine learning (ML) models with high accuracy. In this regard, the electronic charge density ρ(r), which in principle determines the properties of materials at their ground state, can be considered as one of the most appropriate descriptors. However, the systematic electronic charge density ρ(r) database of inorganic materials is still in its infancy due to the difficulties in collecting raw data in experiment and the expensive first-principles based computational cost in theory. Herein, a real space electronic charge density ρ(r) database of 17,418 cubic inorganic materials is constructed by performing high-throughput density functional theory calculations. The displayed ρ(r) patterns show good agreements with those reported in previous studies, which validates our computations. Further statistical analysis reveals that it possesses abundant and diverse data, which could accelerate ρ(r) related machine learning studies. Moreover, the electronic charge density database will also assists chemical bonding identifications and promotes new crystal discovery in experiments.
Collapse
Affiliation(s)
- Fancy Qian Wang
- State Key Laboratory of High-end Server & Storage Technology, Inspur Electronic Information Industry Co., Ltd, Beijing, 100085, China.
| | - Kamal Choudhary
- Materials Science and Engineering Division, National Institute of Standards and Technology, Gaithersburg, MD, 20899, USA
- Theiss Research, La Jolla, CA, 92037, USA
| | - Yu Liu
- State Key Laboratory of High-end Server & Storage Technology, Inspur Electronic Information Industry Co., Ltd, Beijing, 100085, China
| | - Jianjun Hu
- Department of Computer Science and Engineering, University of South Carolina, Columbia, 29208, South Carolina, United States
| | - Ming Hu
- Department of Mechanical Engineering, University of South Carolina, Columbia, 29208, South Carolina, United States.
| |
Collapse
|
35
|
Tran LN. Can second-order perturbation theory accurately predict electron density of open-shell molecules? The importance of self-consistency. Phys Chem Chem Phys 2022; 24:19393-19400. [DOI: 10.1039/d2cp01495e] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Electron density plays an essential role in predicting molecular properties.
Collapse
Affiliation(s)
- Lan Nguyen Tran
- National Institute of Applied Mechanics and Informatics, Vietnam Academy of Science and Technology, Ho Chi Minh City 700000, Vietnam
| |
Collapse
|
36
|
Landeros-Rivera B, Gallegos M, Munarriz J, Laplaza R, Contreras García J. New venues in electron density analysis. Phys Chem Chem Phys 2022; 24:21538-21548. [DOI: 10.1039/d2cp01517j] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
We provide a comprehensive overview of the chemical information within the electron density: how to extract information, but also how to obtain and how to assess the quality of the...
Collapse
|
37
|
Lewis AM, Grisafi A, Ceriotti M, Rossi M. Learning Electron Densities in the Condensed Phase. J Chem Theory Comput 2021; 17:7203-7214. [PMID: 34669406 PMCID: PMC8582255 DOI: 10.1021/acs.jctc.1c00576] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
![]()
We introduce a local
machine-learning method for predicting the
electron densities of periodic systems. The framework is based on
a numerical, atom-centered auxiliary basis, which enables an accurate
expansion of the all-electron density in a form suitable for learning
isolated and periodic systems alike. We show that, using this formulation,
the electron densities of metals, semiconductors, and molecular crystals
can all be accurately predicted using symmetry-adapted Gaussian process
regression models, properly adjusted for the nonorthogonal nature
of the basis. These predicted densities enable the efficient calculation
of electronic properties, which present errors on the order of tens
of meV/atom when compared to ab initio density-functional
calculations. We demonstrate the key power of this approach by using
a model trained on ice unit cells containing only 4 water molecules
to predict the electron densities of cells containing up to 512 molecules
and see no increase in the magnitude of the errors of derived electronic
properties when increasing the system size. Indeed, we find that these
extrapolated derived energies are more accurate than those predicted
using a direct machine-learning model. Finally, on heterogeneous data
sets SALTED can predict electron densities with errors below 4%.
Collapse
Affiliation(s)
- Alan M Lewis
- Max Planck Institute for the Structure and Dynamics of Matter, Luruper Chaussee 149, 22761 Hamburg, Germany
| | - Andrea Grisafi
- Laboratory of Computational Science and Modeling, IMX, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Michele Ceriotti
- Laboratory of Computational Science and Modeling, IMX, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Mariana Rossi
- Max Planck Institute for the Structure and Dynamics of Matter, Luruper Chaussee 149, 22761 Hamburg, Germany
| |
Collapse
|
38
|
Secor M, Soudackov AV, Hammes-Schiffer S. Artificial Neural Networks as Propagators in Quantum Dynamics. J Phys Chem Lett 2021; 12:10654-10662. [PMID: 34704767 DOI: 10.1021/acs.jpclett.1c03117] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
The utilization of artificial neural networks (ANNs) provides strategies for accelerating molecular simulations. Herein, ANNs are implemented as propagators of the time-dependent Schrödinger equation to simulate the quantum dynamics of systems with time-dependent potentials. These ANN propagators are trained to map nonstationary wavepackets from a given time to a future time within the discrete variable representation. Each propagator is trained for a specified time step, and iterative application of the propagator enables the propagation of wavepackets over long time scales. Such ANN propagators are developed and applied to one- and two-dimensional proton transfer systems, which exhibit nuclear quantum effects such as hydrogen tunneling. These ANN propagators are trained for either a specific time-independent potential or general potentials that can be time-dependent. Hierarchical, multiple time step algorithms enable parallelization, and the extension to higher dimensions is straightforward. This strategy is applicable to quantum dynamical simulations of diverse chemical and biological processes.
Collapse
Affiliation(s)
- Maxim Secor
- Department of Chemistry, Yale University, 225 Prospect Street, New Haven, Connecticut 06520, United States
| | - Alexander V Soudackov
- Department of Chemistry, Yale University, 225 Prospect Street, New Haven, Connecticut 06520, United States
| | - Sharon Hammes-Schiffer
- Department of Chemistry, Yale University, 225 Prospect Street, New Haven, Connecticut 06520, United States
| |
Collapse
|
39
|
Deringer VL, Bartók AP, Bernstein N, Wilkins DM, Ceriotti M, Csányi G. Gaussian Process Regression for Materials and Molecules. Chem Rev 2021; 121:10073-10141. [PMID: 34398616 PMCID: PMC8391963 DOI: 10.1021/acs.chemrev.1c00022] [Citation(s) in RCA: 273] [Impact Index Per Article: 68.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2021] [Indexed: 12/18/2022]
Abstract
We provide an introduction to Gaussian process regression (GPR) machine-learning methods in computational materials science and chemistry. The focus of the present review is on the regression of atomistic properties: in particular, on the construction of interatomic potentials, or force fields, in the Gaussian Approximation Potential (GAP) framework; beyond this, we also discuss the fitting of arbitrary scalar, vectorial, and tensorial quantities. Methodological aspects of reference data generation, representation, and regression, as well as the question of how a data-driven model may be validated, are reviewed and critically discussed. A survey of applications to a variety of research questions in chemistry and materials science illustrates the rapid growth in the field. A vision is outlined for the development of the methodology in the years to come.
Collapse
Affiliation(s)
- Volker L. Deringer
- Department
of Chemistry, Inorganic Chemistry Laboratory, University of Oxford, Oxford OX1 3QR, United Kingdom
| | - Albert P. Bartók
- Department
of Physics and Warwick Centre for Predictive Modelling, School of
Engineering, University of Warwick, Coventry CV4 7AL, United Kingdom
| | - Noam Bernstein
- Center
for Computational Materials Science, U.S.
Naval Research Laboratory, Washington D.C. 20375, United States
| | - David M. Wilkins
- Atomistic
Simulation Centre, School of Mathematics and Physics, Queen’s University Belfast, Belfast BT7 1NN, Northern Ireland, United Kingdom
| | - Michele Ceriotti
- Laboratory
of Computational Science and Modeling, IMX, École Polytechnique Fédérale de Lausanne, Lausanne 1015, Switzerland
- National
Centre for Computational Design and Discovery of Novel Materials (MARVEL), École Polytechnique Fédérale
de Lausanne, Lausanne, Switzerland
| | - Gábor Csányi
- Engineering
Laboratory, University of Cambridge, Cambridge CB2 1PZ, United Kingdom
| |
Collapse
|
40
|
Westermayr J, Marquetand P. Machine Learning for Electronically Excited States of Molecules. Chem Rev 2021; 121:9873-9926. [PMID: 33211478 PMCID: PMC8391943 DOI: 10.1021/acs.chemrev.0c00749] [Citation(s) in RCA: 176] [Impact Index Per Article: 44.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2020] [Indexed: 12/11/2022]
Abstract
Electronically excited states of molecules are at the heart of photochemistry, photophysics, as well as photobiology and also play a role in material science. Their theoretical description requires highly accurate quantum chemical calculations, which are computationally expensive. In this review, we focus on not only how machine learning is employed to speed up such excited-state simulations but also how this branch of artificial intelligence can be used to advance this exciting research field in all its aspects. Discussed applications of machine learning for excited states include excited-state dynamics simulations, static calculations of absorption spectra, as well as many others. In order to put these studies into context, we discuss the promises and pitfalls of the involved machine learning techniques. Since the latter are mostly based on quantum chemistry calculations, we also provide a short introduction into excited-state electronic structure methods and approaches for nonadiabatic dynamics simulations and describe tricks and problems when using them in machine learning for excited states of molecules.
Collapse
Affiliation(s)
- Julia Westermayr
- Institute
of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, Währinger Strasse 17, 1090 Vienna, Austria
| | - Philipp Marquetand
- Institute
of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, Währinger Strasse 17, 1090 Vienna, Austria
- Vienna
Research Platform on Accelerating Photoreaction Discovery, University of Vienna, Währinger Strasse 17, 1090 Vienna, Austria
- Data
Science @ Uni Vienna, University of Vienna, Währinger Strasse 29, 1090 Vienna, Austria
| |
Collapse
|
41
|
Abstract
Electronically excited states of molecules are at the heart of photochemistry, photophysics, as well as photobiology and also play a role in material science. Their theoretical description requires highly accurate quantum chemical calculations, which are computationally expensive. In this review, we focus on not only how machine learning is employed to speed up such excited-state simulations but also how this branch of artificial intelligence can be used to advance this exciting research field in all its aspects. Discussed applications of machine learning for excited states include excited-state dynamics simulations, static calculations of absorption spectra, as well as many others. In order to put these studies into context, we discuss the promises and pitfalls of the involved machine learning techniques. Since the latter are mostly based on quantum chemistry calculations, we also provide a short introduction into excited-state electronic structure methods and approaches for nonadiabatic dynamics simulations and describe tricks and problems when using them in machine learning for excited states of molecules.
Collapse
Affiliation(s)
- Julia Westermayr
- Institute of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, Währinger Strasse 17, 1090 Vienna, Austria
| | - Philipp Marquetand
- Institute of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, Währinger Strasse 17, 1090 Vienna, Austria
- Vienna Research Platform on Accelerating Photoreaction Discovery, University of Vienna, Währinger Strasse 17, 1090 Vienna, Austria
- Data Science @ Uni Vienna, University of Vienna, Währinger Strasse 29, 1090 Vienna, Austria
| |
Collapse
|
42
|
King DS, Truhlar DG, Gagliardi L. Machine-Learned Energy Functionals for Multiconfigurational Wave Functions. J Phys Chem Lett 2021; 12:7761-7767. [PMID: 34374555 DOI: 10.1021/acs.jpclett.1c02042] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
We introduce multiconfiguration data-driven functional methods (MC-DDFMs), a group of methods which aim to correct the total or classical energy of a qualitatively accurate multiconfigurational wave function using a machine-learned functional of some featurization of the wave function such as its density, on-top density, or both. On a data set of carbene singlet-triplet energy splittings, we show that MC-DDFMs are able to achieve near-benchmark performance on systems not used for training with a robust degree of active-space independence. Beyond demonstrating that the density and on-top density hold the information necessary to correct the singlet-triplet energy splittings of multiconfigurational wave functions, this approach shows great promise for the development of functionals for MC-PDFT because corrections to the classical energy appear to be more transferable to types of molecules not included in the training data than corrections to total energies such as those yielded by CASSCF or NEVPT2.
Collapse
Affiliation(s)
- Daniel S King
- Department of Chemistry, University of Chicago, Chicago, Illinois, United States
| | - Donald G Truhlar
- Department of Chemistry, University of Minnesota, Minneapolis, Minnesota, United States
| | - Laura Gagliardi
- Department of Chemistry, Pritzker School of Molecular Engineering, James Franck Institute, Chicago Center for Theoretical Chemistry, University of Chicago, Chicago, Illinois, United States
| |
Collapse
|
43
|
Abstract
Various eutectic systems have been proposed and studied over the past few decades. Most of the studies have focused on three typical types of eutectics: eutectic metals, eutectic salts, and deep eutectic solvents. On the one hand, they are all eutectic systems, and their eutectic principle is the same. On the other hand, they are representative of metals, inorganic salts, and organic substances, respectively. They have applications in almost all fields related to chemistry. Their different but overlapping applications stem from their very different properties. In addition, the proposal of new eutectic systems has greatly boosted the development of cross-field research involving chemistry, materials, engineering, and energy. The goal of this review is to provide a comprehensive overview of these typical eutectics and describe task-specific strategies to address growing demands.
Collapse
Affiliation(s)
- Dongkun Yu
- Department of Chemistry, Renmin University of China, Beijing 100872, P. R. China.
| | - Zhimin Xue
- Beijing Key Laboratory of Lignocellulosic Chemistry, College of Materials Science and Technology, Beijing Forestry University, Beijing 100083, P. R. China.
| | - Tiancheng Mu
- Department of Chemistry, Renmin University of China, Beijing 100872, P. R. China.
| |
Collapse
|
44
|
Musil F, Grisafi A, Bartók AP, Ortner C, Csányi G, Ceriotti M. Physics-Inspired Structural Representations for Molecules and Materials. Chem Rev 2021; 121:9759-9815. [PMID: 34310133 DOI: 10.1021/acs.chemrev.1c00021] [Citation(s) in RCA: 168] [Impact Index Per Article: 42.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
The first step in the construction of a regression model or a data-driven analysis, aiming to predict or elucidate the relationship between the atomic-scale structure of matter and its properties, involves transforming the Cartesian coordinates of the atoms into a suitable representation. The development of atomic-scale representations has played, and continues to play, a central role in the success of machine-learning methods for chemistry and materials science. This review summarizes the current understanding of the nature and characteristics of the most commonly used structural and chemical descriptions of atomistic structures, highlighting the deep underlying connections between different frameworks and the ideas that lead to computationally efficient and universally applicable models. It emphasizes the link between properties, structures, their physical chemistry, and their mathematical description, provides examples of recent applications to a diverse set of chemical and materials science problems, and outlines the open questions and the most promising research directions in the field.
Collapse
Affiliation(s)
- Felix Musil
- Laboratory of Computational Science and Modeling, IMX, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland.,National Centre for Computational Design and Discovery of Novel Materials (MARVEL), École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Andrea Grisafi
- Laboratory of Computational Science and Modeling, IMX, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Albert P Bartók
- Department of Physics and Warwick Centre for Predictive Modelling, School of Engineering, University of Warwick, Coventry CV4 7AL, United Kingdom
| | - Christoph Ortner
- University of British Columbia, Vancouver, British Columbia V6T 1Z2, Canada
| | - Gábor Csányi
- Engineering Laboratory, University of Cambridge, Trumpington Street, Cambridge CB2 1PZ, United Kingdom
| | - Michele Ceriotti
- Laboratory of Computational Science and Modeling, IMX, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland.,National Centre for Computational Design and Discovery of Novel Materials (MARVEL), École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| |
Collapse
|
45
|
Tan Z, Li Y, Shi W, Yang S. A Multitask Approach to Learn Molecular Properties. J Chem Inf Model 2021; 61:3824-3834. [PMID: 34289687 DOI: 10.1021/acs.jcim.1c00646] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The endeavors to pursue a robust multitask model to resolve intertask correlations have lasted for many years. A multitask deep neural network, as the most widely used multitask framework, however, experiences several issues such as inconsistent performance improvement over the independent model benchmark. The research aims to introduce an alternative framework by using the problem transformation methods. We build our multitask models essentially based on the stacking of a base regressor and classifier, where the multitarget predictions are realized from an additional training stage on the expanded molecular feature space. The model architecture is implemented on the QM9, Alchemy, and Tox21 datasets, by using a variety of baseline machine learning techniques. The resultant multitask performance shows 1 to 10% enhancement of forecasting precision, with the task prediction accuracy being consistently improved over the independent single-target models. The proposed method demonstrates a notable superiority in tackling the intertarget dependence and, moreover, a great potential to simulate a wide range of molecular properties under the transformation framework.
Collapse
Affiliation(s)
- Zheng Tan
- Chengdu Polytechnic, 83 Tianyi Street, Chengdu, Sichuan 610000, P. R. China
| | - Yan Li
- Xiyuan Quantitative Technology, 388 Yizhou Road, Chengdu, Sichuan 610000, P. R. China
| | - Weimei Shi
- Chengdu Polytechnic, 83 Tianyi Street, Chengdu, Sichuan 610000, P. R. China
| | - Shiqing Yang
- Chengdu Polytechnic, 83 Tianyi Street, Chengdu, Sichuan 610000, P. R. China
| |
Collapse
|
46
|
Briling KR, Fabrizio A, Corminboeuf C. Impact of quantum-chemical metrics on the machine learning prediction of electron density. J Chem Phys 2021; 155:024107. [PMID: 34266253 DOI: 10.1063/5.0055393] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Machine learning (ML) algorithms have undergone an explosive development impacting every aspect of computational chemistry. To obtain reliable predictions, one needs to maintain a proper balance between the black-box nature of ML frameworks and the physics of the target properties. One of the most appealing quantum-chemical properties for regression models is the electron density, and some of us recently proposed a transferable and scalable model based on the decomposition of the density onto an atom-centered basis set. The decomposition, as well as the training of the model, is at its core a minimization of some loss function, which can be arbitrarily chosen and may lead to results of different quality. Well-studied in the context of density fitting (DF), the impact of the metric on the performance of ML models has not been analyzed yet. In this work, we compare predictions obtained using the overlap and the Coulomb-repulsion metrics for both decomposition and training. As expected, the Coulomb metric used as both the DF and ML loss functions leads to the best results for the electrostatic potential and dipole moments. The origin of this difference lies in the fact that the model is not constrained to predict densities that integrate to the exact number of electrons N. Since an a posteriori correction for the number of electrons decreases the errors, we proposed a modification of the model, where N is included directly into the kernel function, which allowed lowering of the errors on the test and out-of-sample sets.
Collapse
Affiliation(s)
- Ksenia R Briling
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Alberto Fabrizio
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Clemence Corminboeuf
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| |
Collapse
|
47
|
Abstract
The ab initio determination of electronic excited state (ES) properties is the cornerstone of theoretical photochemistry. Yet, traditional ES methods become impractical when applied to fairly large molecules, or when used on thousands of systems. Machine learning (ML) techniques have demonstrated their accuracy at retrieving ES properties of large molecular databases at a reduced computational cost. For these applications, nonlinear algorithms tend to be specialized in targeting individual properties. Learning fundamental quantum objects potentially represents a more efficient, yet complex, alternative as a variety of molecular properties could be extracted through postprocessing. Herein, we report a general framework able to learn three fundamental objects: the hole and particle densities, as well as the transition density. We demonstrate the advantages of targeting those outputs and apply our predictions to obtain properties, including the state character and the exciton topological descriptors, for the two bands (nπ* and ππ*) of 3427 azoheteroarene photoswitches.
Collapse
Affiliation(s)
- Sergi Vela
- Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne (EPFL), Laboratory for Computational Molecular Design, Lausanne, CH-1015, Switzerland
| | - Alberto Fabrizio
- Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne (EPFL), Laboratory for Computational Molecular Design, Lausanne, CH-1015, Switzerland
| | - Ksenia R Briling
- Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne (EPFL), Laboratory for Computational Molecular Design, Lausanne, CH-1015, Switzerland
| | - Clémence Corminboeuf
- Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne (EPFL), Laboratory for Computational Molecular Design, Lausanne, CH-1015, Switzerland
| |
Collapse
|
48
|
Westermayr J, Gastegger M, Schütt KT, Maurer RJ. Perspective on integrating machine learning into computational chemistry and materials science. J Chem Phys 2021; 154:230903. [PMID: 34241249 DOI: 10.1063/5.0047760] [Citation(s) in RCA: 75] [Impact Index Per Article: 18.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
Abstract
Machine learning (ML) methods are being used in almost every conceivable area of electronic structure theory and molecular simulation. In particular, ML has become firmly established in the construction of high-dimensional interatomic potentials. Not a day goes by without another proof of principle being published on how ML methods can represent and predict quantum mechanical properties-be they observable, such as molecular polarizabilities, or not, such as atomic charges. As ML is becoming pervasive in electronic structure theory and molecular simulation, we provide an overview of how atomistic computational modeling is being transformed by the incorporation of ML approaches. From the perspective of the practitioner in the field, we assess how common workflows to predict structure, dynamics, and spectroscopy are affected by ML. Finally, we discuss how a tighter and lasting integration of ML methods with computational chemistry and materials science can be achieved and what it will mean for research practice, software development, and postgraduate training.
Collapse
Affiliation(s)
- Julia Westermayr
- Department of Chemistry, University of Warwick, Gibbet Hill Road, Coventry CV4 7AL, United Kingdom
| | - Michael Gastegger
- Machine Learning Group, Technische Universität Berlin, 10587 Berlin, Germany
| | - Kristof T Schütt
- Machine Learning Group, Technische Universität Berlin, 10587 Berlin, Germany
| | - Reinhard J Maurer
- Department of Chemistry, University of Warwick, Gibbet Hill Road, Coventry CV4 7AL, United Kingdom
| |
Collapse
|
49
|
Cuevas-Zuviría B, Pacios LF. Machine Learning of Analytical Electron Density in Large Molecules Through Message-Passing. J Chem Inf Model 2021; 61:2658-2666. [PMID: 34009970 DOI: 10.1021/acs.jcim.1c00227] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Machine learning milestones in computational chemistry are overshadowed by their unaccountability and the overwhelming zoo of tools for each specific task. A promising path to tackle these problems is using machine learning to reproduce physical magnitudes as a basis to derive many other properties. By using a model of the electron density consisting of an analytical expansion on a linear set of isotropic and anisotropic functions, we implemented in this work a message-passing neural network able to reproduce electron density in molecules with just a 2.5% absolute error in complex cases. We also adapted our methodology to describe electron density in large biomolecules (proteins) and to obtain atomic charges, interaction energies, and DFT energies. We show that electron density learning is a new promising avenue with a variety of forthcoming applications.
Collapse
Affiliation(s)
- Bruno Cuevas-Zuviría
- Centro de Biotecnología y Genómica de Plantas (CBGP, UPM-INIA), Universidad Politécnica de Madrid (UPM), Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA), Campus de Montegancedo-UPM, 28223 Pozuelo de Alarcón, Madrid, Spain
| | - Luis F Pacios
- Centro de Biotecnología y Genómica de Plantas (CBGP, UPM-INIA), Universidad Politécnica de Madrid (UPM), Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA), Campus de Montegancedo-UPM, 28223 Pozuelo de Alarcón, Madrid, Spain.,Departamento de Biotecnología-Biología Vegetal, Escuela Técnica Superior de Ingeniería Agraria, Alimentaria y de Biosistemas (ETSIAAB), Universidad Politécnica de Madrid (UPM), 28040 Madrid, Spain
| |
Collapse
|
50
|
Böselt L, Thürlemann M, Riniker S. Machine Learning in QM/MM Molecular Dynamics Simulations of Condensed-Phase Systems. J Chem Theory Comput 2021; 17:2641-2658. [PMID: 33818085 DOI: 10.1021/acs.jctc.0c01112] [Citation(s) in RCA: 63] [Impact Index Per Article: 15.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Quantum mechanics/molecular mechanics (QM/MM) molecular dynamics (MD) simulations have been developed to simulate molecular systems, where an explicit description of changes in the electronic structure is necessary. However, QM/MM MD simulations are computationally expensive compared to fully classical simulations as all valence electrons are treated explicitly and a self-consistent field (SCF) procedure is required. Recently, approaches have been proposed to replace the QM description with machine-learned (ML) models. However, condensed-phase systems pose a challenge for these approaches due to long-range interactions. Here, we establish a workflow, which incorporates the MM environment as an element type in a high-dimensional neural network potential (HDNNP). The fitted HDNNP describes the potential-energy surface of the QM particles with an electrostatic embedding scheme. Thus, the MM particles feel a force from the polarized QM particles. To achieve chemical accuracy, we find that even simple systems require models with a strong gradient regularization, a large number of data points, and a substantial number of parameters. To address this issue, we extend our approach to a Δ-learning scheme, where the ML model learns the difference between a reference method (density functional theory (DFT)) and a cheaper semiempirical method (density functional tight binding (DFTB)). We show that such a scheme reaches the accuracy of the DFT reference method while requiring significantly less parameters. Furthermore, the Δ-learning scheme is capable of correctly incorporating long-range interactions within a cutoff of 1.4 nm. It is validated by performing MD simulations of retinoic acid in water and the interaction between S-adenoslymethioniat and cytosine in water. The presented results indicate that Δ-learning is a promising approach for (QM)ML/MM MD simulations of condensed-phase systems.
Collapse
Affiliation(s)
- Lennard Böselt
- Laboratory of Physical Chemistry, ETH Zurich, Vladimir-Prelog-Weg 2, 8093 Zurich, Switzerland
| | - Moritz Thürlemann
- Laboratory of Physical Chemistry, ETH Zurich, Vladimir-Prelog-Weg 2, 8093 Zurich, Switzerland
| | - Sereina Riniker
- Laboratory of Physical Chemistry, ETH Zurich, Vladimir-Prelog-Weg 2, 8093 Zurich, Switzerland
| |
Collapse
|