1
|
Ma S, Li D, Li X, Hu G. Neural network-assisted model of interfacial fluids with explicit coarse-grained molecular structures. J Chem Phys 2024; 161:174110. [PMID: 39494790 DOI: 10.1063/5.0230195] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2024] [Accepted: 10/17/2024] [Indexed: 11/05/2024] Open
Abstract
Interfacial fluids are ubiquitous in systems ranging from biological membranes to chemical droplets and exhibit a complex behavior due to their nonlinear, multiphase, and multicomponent nature. The development of accurate coarse-grained (CG) models for such systems poses significant challenges, as these models must effectively capture the intricate many-body interactions, both inter- and intramolecular, arising from atomic-level phenomena, and account for the diverse density distributions and fluctuations at the interface. In this study, we use advanced machine learning techniques incorporating force matching and diffusion probabilistic models to construct a robust CG model of interfacial fluids. We evaluate our model through simulations in various settings, including the water-air interface, bulk decane, and dipalmitoylphosphatidylcholine monolayer membranes. Our results show that our CG model accurately reproduces the essential many-body and interfacial properties of interfacial fluids and proves effective across different CG mapping strategies. This work not only validates the utility of our model for multiscale simulations, but also lays the groundwork for future improvements in the simulation of complex interfacial systems.
Collapse
Affiliation(s)
- Shuhao Ma
- Department of Engineering Mechanics, Zhejiang University, Hangzhou 310027, People's Republic of China
| | - Dechang Li
- Department of Engineering Mechanics, Zhejiang University, Hangzhou 310027, People's Republic of China
| | - Xuejin Li
- Department of Engineering Mechanics, Zhejiang University, Hangzhou 310027, People's Republic of China
| | - Guoqing Hu
- Department of Engineering Mechanics, Zhejiang University, Hangzhou 310027, People's Republic of China
| |
Collapse
|
2
|
Wang T, He X, Li M, Li Y, Bi R, Wang Y, Cheng C, Shen X, Meng J, Zhang H, Liu H, Wang Z, Li S, Shao B, Liu TY. Ab initio characterization of protein molecular dynamics with AI 2BMD. Nature 2024:10.1038/s41586-024-08127-z. [PMID: 39506110 DOI: 10.1038/s41586-024-08127-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2023] [Accepted: 09/26/2024] [Indexed: 11/08/2024]
Abstract
Biomolecular dynamics simulation is a fundamental technology for life sciences research, and its usefulness depends on its accuracy and efficiency1-3. Classical molecular dynamics simulation is fast but lacks chemical accuracy4,5. Quantum chemistry methods such as density functional theory can reach chemical accuracy but cannot scale to support large biomolecules6. Here we introduce an artificial intelligence-based ab initio biomolecular dynamics system (AI2BMD) that can efficiently simulate full-atom large biomolecules with ab initio accuracy. AI2BMD uses a protein fragmentation scheme and a machine learning force field7 to achieve generalizable ab initio accuracy for energy and force calculations for various proteins comprising more than 10,000 atoms. Compared to density functional theory, it reduces the computational time by several orders of magnitude. With several hundred nanoseconds of dynamics simulations, AI2BMD demonstrated its ability to efficiently explore the conformational space of peptides and proteins, deriving accurate 3J couplings that match nuclear magnetic resonance experiments, and showing protein folding and unfolding processes. Furthermore, AI2BMD enables precise free-energy calculations for protein folding, and the estimated thermodynamic properties are well aligned with experiments. AI2BMD could potentially complement wet-lab experiments, detect the dynamic processes of bioactivities and enable biomedical research that is impossible to conduct at present.
Collapse
Affiliation(s)
| | | | | | - Yatao Li
- Microsoft Research, Beijing, China
| | - Ran Bi
- Microsoft Research, Beijing, China
| | | | | | | | | | - He Zhang
- Microsoft Research, Beijing, China
| | | | - Zun Wang
- Microsoft Research, Beijing, China
| | | | - Bin Shao
- Microsoft Research, Beijing, China.
| | | |
Collapse
|
3
|
Zills F, Schäfer MR, Tovey S, Kästner J, Holm C. Machine learning-driven investigation of the structure and dynamics of the BMIM-BF 4 room temperature ionic liquid. Faraday Discuss 2024; 253:129-145. [PMID: 39056186 DOI: 10.1039/d4fd00025k] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/28/2024]
Abstract
Room-temperature ionic liquids are an exciting group of materials with the potential to revolutionize energy storage. Due to their chemical structure and means of interaction, they are challenging to study computationally. Classical descriptions of their inter- and intra-molecular interactions require time intensive parametrization of force-fields which is prone to assumptions. While ab initio molecular dynamics approaches can capture all necessary interactions, they are too slow to achieve the time and length scales required. In this work, we take a step towards addressing these challenges by applying state-of-the-art machine-learned potentials to the simulation of 1-butyl-3-methylimidazolium tetrafluoroborate. We demonstrate a learning-on-the-fly procedure to train machine-learned potentials from single-point density functional theory calculations before performing production molecular dynamics simulations. Obtained structural and dynamical properties are in good agreement with computational and experimental references. Furthermore, our results show that hybrid machine-learned potentials can contribute to an improved prediction accuracy by mitigating the inherent shortsightedness of the models. Given that room-temperature ionic liquids necessitate long simulations to address their slow dynamics, achieving an optimal balance between accuracy and computational cost becomes imperative. To facilitate further investigation of these materials, we have made our IPSuite-based training and simulation workflow publicly accessible, enabling easy replication or adaptation to similar systems.
Collapse
Affiliation(s)
- Fabian Zills
- Institute for Computational Physics, University of Stuttgart, 70569 Stuttgart, Germany.
| | - Moritz René Schäfer
- Institute for Theoretical Chemistry, University of Stuttgart, 70569 Stuttgart, Germany
| | - Samuel Tovey
- Institute for Computational Physics, University of Stuttgart, 70569 Stuttgart, Germany.
| | - Johannes Kästner
- Institute for Theoretical Chemistry, University of Stuttgart, 70569 Stuttgart, Germany
| | - Christian Holm
- Institute for Computational Physics, University of Stuttgart, 70569 Stuttgart, Germany.
| |
Collapse
|
4
|
Cheng G, Gong XG, Yin WJ. An approach for full space inverse materials design by combining universal machine learning potential, universal property model, and optimization algorithm. Sci Bull (Beijing) 2024; 69:3066-3074. [PMID: 39142945 DOI: 10.1016/j.scib.2024.07.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2024] [Revised: 05/12/2024] [Accepted: 06/03/2024] [Indexed: 08/16/2024]
Abstract
We present a full space inverse materials design (FSIMD) approach that fully automates the materials design for target physical properties without the need to provide the atomic composition, chemical stoichiometry, and crystal structure in advance. Here, we used density functional theory reference data to train a universal machine learning potential (UPot) and transfer learning to train a universal bulk modulus model (UBmod). Both UPot and UBmod were able to cover materials systems composed of any element among 42 elements. Interfaced with optimization algorithm and enhanced sampling, the FSIMD approach is applied to find the materials with the largest cohesive energy and the largest bulk modulus, respectively. NaCl-type ZrC was found to be the material with the largest cohesive energy. For bulk modulus, diamond was identified to have the largest value. The FSIMD approach is also applied to design materials with other multi-objective properties with accuracy limited principally by the amount, reliability, and diversity of the training data. The FSIMD approach provides a new way for inverse materials design with other functional properties for practical applications.
Collapse
Affiliation(s)
- Guanjian Cheng
- College of Energy, Soochow Institute for Energy and Materials InnovationS (SIEMIS), and Jiangsu Provincial Key Laboratory for Advanced Carbon Materials and Wearable Energy Technologies, Soochow University, Suzhou 215006, China; Shanghai Qi Zhi Institute, Shanghai 200232, China
| | - Xin-Gao Gong
- Key Laboratory for Computational Physical Sciences (MOE), Institute of Computational Physical Sciences, Fudan University, Shanghai 200438, China; Shanghai Qi Zhi Institute, Shanghai 200232, China
| | - Wan-Jian Yin
- College of Energy, Soochow Institute for Energy and Materials InnovationS (SIEMIS), and Jiangsu Provincial Key Laboratory for Advanced Carbon Materials and Wearable Energy Technologies, Soochow University, Suzhou 215006, China; Shanghai Qi Zhi Institute, Shanghai 200232, China.
| |
Collapse
|
5
|
Tang Z, Li H, Lin P, Gong X, Jin G, He L, Jiang H, Ren X, Duan W, Xu Y. A deep equivariant neural network approach for efficient hybrid density functional calculations. Nat Commun 2024; 15:8815. [PMID: 39394190 PMCID: PMC11470148 DOI: 10.1038/s41467-024-53028-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2024] [Accepted: 09/24/2024] [Indexed: 10/13/2024] Open
Abstract
Hybrid density functional calculations are essential for accurate description of electronic structure, yet their widespread use is restricted by the substantial computational cost. Here we develop DeepH-hybrid, a deep equivariant neural network method for learning the hybrid-functional Hamiltonian as a function of material structure, which circumvents the time-consuming self-consistent field iterations and enables the study of large-scale materials with hybrid-functional accuracy. Our extensive experiments demonstrate good reliability as well as effective transferability and efficiency of the method. As a notable application, DeepH-hybrid is applied to study large-supercell Moiré-twisted materials, offering the first case study on how the inclusion of exact exchange affects flat bands in magic-angle twisted bilayer graphene. The work generalizes deep-learning electronic structure methods to beyond conventional density functional theory, facilitating the development of deep-learning-based ab initio methods.
Collapse
Affiliation(s)
- Zechen Tang
- State Key Laboratory of Low Dimensional Quantum Physics and Department of Physics, Tsinghua University, 100084, Beijing, China
| | - He Li
- State Key Laboratory of Low Dimensional Quantum Physics and Department of Physics, Tsinghua University, 100084, Beijing, China
- Institute for Advanced Study, Tsinghua University, 100084, Beijing, China
| | - Peize Lin
- Beijing National Laboratory for Condensed Matter Physics, Institute of Physics, Chinese Academy of Sciences, 100190, Beijing, China
- Songshan Lake Materials Laboratory, 523808, Dongguan, Guangdong, China
- Institute of Artificial Intelligence, Hefei Comprehensive National Science Center, 230026, Hefei, Anhui, China
| | - Xiaoxun Gong
- State Key Laboratory of Low Dimensional Quantum Physics and Department of Physics, Tsinghua University, 100084, Beijing, China
- School of Physics, Peking University, 100871, Beijing, China
| | - Gan Jin
- Key Laboratory of Quantum Information, University of Science and Technology of China, 230026, Hefei, Anhui, China
| | - Lixin He
- Institute of Artificial Intelligence, Hefei Comprehensive National Science Center, 230026, Hefei, Anhui, China
- Key Laboratory of Quantum Information, University of Science and Technology of China, 230026, Hefei, Anhui, China
| | - Hong Jiang
- College of Chemistry and Molecular Engineering, Peking University, 100871, Beijing, China
| | - Xinguo Ren
- Beijing National Laboratory for Condensed Matter Physics, Institute of Physics, Chinese Academy of Sciences, 100190, Beijing, China.
- Songshan Lake Materials Laboratory, 523808, Dongguan, Guangdong, China.
| | - Wenhui Duan
- State Key Laboratory of Low Dimensional Quantum Physics and Department of Physics, Tsinghua University, 100084, Beijing, China.
- Institute for Advanced Study, Tsinghua University, 100084, Beijing, China.
- Frontier Science Center for Quantum Information, Beijing, China.
| | - Yong Xu
- State Key Laboratory of Low Dimensional Quantum Physics and Department of Physics, Tsinghua University, 100084, Beijing, China.
- Frontier Science Center for Quantum Information, Beijing, China.
- RIKEN Center for Emergent Matter Science (CEMS), Wako, Saitama, 351-0198, Japan.
| |
Collapse
|
6
|
Klarbring J, Walsh A. Na Vacancy-Driven Phase Transformation and Fast Ion Conduction in W-Doped Na 3SbS 4 from Machine Learning Force Fields. CHEMISTRY OF MATERIALS : A PUBLICATION OF THE AMERICAN CHEMICAL SOCIETY 2024; 36:9406-9413. [PMID: 39398370 PMCID: PMC11467836 DOI: 10.1021/acs.chemmater.4c00936] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/29/2024] [Revised: 09/04/2024] [Accepted: 09/05/2024] [Indexed: 10/15/2024]
Abstract
Solid-state sodium batteries require effective electrolytes that conduct at room temperature. The Na3PnCh4 (Pn = P, Sb; Ch = S, Se) family has been studied for their high Na ion conductivity. The population of Na vacancies, which mediate ion diffusion in these materials, can be enhanced through aliovalent doping on the pnictogen site. To probe the microscopic role of extrinsic doping and its impact on diffusion and phase stability, we trained a machine learning force field for Na3-x W x Sb1-x S4 based on an equivariant graph neural network. Analysis of large-scale molecular dynamics trajectories shows that an increased Na vacancy population stabilizes the global cubic phase at lower temperatures with enhanced Na ion diffusion and that the explicit role of the substitutional W dopants is limited. In the global cubic phase, we observe large and long-lived deviations of atoms from the averaged symmetry, echoing recent experimental suggestions. Evidence of correlated Na ion diffusion is also presented that underpins the suggested superionic nature of these materials.
Collapse
Affiliation(s)
- Johan Klarbring
- Department
of Materials, Imperial College London, Exhibition Road, London SW7 2AZ, U.K.
- Department
of Physics, Chemistry and Biology (IFM), Linköping University, SE-581 83 Linköping, Sweden
| | - Aron Walsh
- Department
of Materials, Imperial College London, Exhibition Road, London SW7 2AZ, U.K.
| |
Collapse
|
7
|
Eastman P, Pritchard BP, Chodera JD, Markland TE. Nutmeg and SPICE: Models and Data for Biomolecular Machine Learning. J Chem Theory Comput 2024; 20:8583-8593. [PMID: 39318326 DOI: 10.1021/acs.jctc.4c00794] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/26/2024]
Abstract
We describe version 2 of the SPICE data set, a collection of quantum chemistry calculations for training machine learning potentials. It expands on the original data set by adding much more sampling of chemical space and more data on noncovalent interactions. We train a set of potential energy functions called Nutmeg on it. They are based on the TensorNet architecture. They use a novel mechanism to improve performance on charged and polar molecules, injecting precomputed partial charges into the model to provide a reference for the large-scale charge distribution. Evaluation of the new models shows that they do an excellent job of reproducing energy differences between conformations even on highly charged molecules or ones that are significantly larger than the molecules in the training set. They also produce stable molecular dynamics trajectories and are fast enough to be useful for routine simulation of small molecules.
Collapse
Affiliation(s)
- Peter Eastman
- Department of Chemistry, Stanford University, Stanford, California 94305, United States
| | - Benjamin P Pritchard
- Molecular Sciences Software Institute, Virginia Polytechnic Institute and State University, Blacksburg, Virginia 24060, United States
| | - John D Chodera
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, New York 10065, United States
| | - Thomas E Markland
- Department of Chemistry, Stanford University, Stanford, California 94305, United States
| |
Collapse
|
8
|
Nikidis E, Kyriakopoulos N, Tohid R, Kachrimanis K, Kioseoglou J. Harnessing machine learning for efficient large-scale interatomic potential for sildenafil and pharmaceuticals containing H, C, N, O, and S. NANOSCALE 2024; 16:18014-18026. [PMID: 39252581 DOI: 10.1039/d4nr00929k] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/11/2024]
Abstract
In this study a cutting-edge approach to producing accurate and computationally efficient interatomic potentials using machine learning algorithms is presented. Specifically, the study focuses on the application of Allegro, a novel machine learning algorithm, running on high-performance GPUs for training potentials. The choice of training parameters plays a pivotal role in the quality of the potential functions. To enable this methodology, the "Solvated Protein Fragments" dataset, containing nearly 2.7 million Density Functional Theory (DFT) calculations for many-body intermolecular interactions involving protein fragments and water molecules, encompassing H, C, N, O, and S elements, is considered as the training dataset. The project optimizes computational efficiency by reducing the initial dataset size according to the intended application. To assess the efficacy of the approach, the sildenafil citrate, iso-sildenafil, aspirin, ibuprofen, mebendazole and urea, representing all five relevant elements, serve as the test bed. The results of the Allegro-trained potentials demonstrate outstanding performance, benefiting from the combination of an appropriate training dataset and parameter selection. This notably enhanced computational efficiency when compared to the computationally intensive DFT method aided by GPU acceleration. Validation of the produced interatomic potentials is achieved through Allegro's own evaluation mechanism, yielding exceptional accuracy. Further verification is carried out through LAMMPS molecular dynamics simulations. Structural optimization by energy minimization and NPT Molecular Dynamics simulations are performed for each potential, assessing relaxation processes and energy reduction. Additional structures, including urea, ammonia, uracil, oxalic acid, and acetic acid, are tested, highlighting the potential's versatility in describing systems containing the aforementioned elements. Visualization of the results confirms the scientific accuracy of each structure's relaxation. The findings of this study demonstrate strong scaling and the potential for applications in pharmaceutical research, allowing the exploration of larger molecular structures not previously amenable to computational analysis at this level of accuracy The success of the machine learning approach underscores its potential to revolutionize computational solid-state physics.
Collapse
Affiliation(s)
- E Nikidis
- Physics Department, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece.
- Center for Interdisciplinary Research & Innovation, Aristotle University of Thessaloniki, Greece
| | - N Kyriakopoulos
- Physics Department, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece.
- Center for Interdisciplinary Research & Innovation, Aristotle University of Thessaloniki, Greece
| | - R Tohid
- Center of Computation and Technology, Louisiana State University, 70803 Baton Rouge, USA
| | - K Kachrimanis
- Center for Interdisciplinary Research & Innovation, Aristotle University of Thessaloniki, Greece
- Pharmaceutical Technology Department, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece
| | - J Kioseoglou
- Physics Department, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece.
- Center for Interdisciplinary Research & Innovation, Aristotle University of Thessaloniki, Greece
| |
Collapse
|
9
|
Xu Y, Jin Y, García Sánchez JS, Pérez-Lemus GR, Zubieta Rico PF, Delferro M, de Pablo JJ. A Molecular View of Methane Activation on Ni(111) through Enhanced Sampling and Machine Learning. J Phys Chem Lett 2024; 15:9852-9862. [PMID: 39298736 DOI: 10.1021/acs.jpclett.4c02237] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/22/2024]
Abstract
A combination of machine learned interatomic potentials (MLIPs) and enhanced sampling simulations is used to investigate the activation of methane on a Ni(111) surface. The work entails the development and iterative refinement of MLIPs, initially trained on a data set constructed via ab initio molecular dynamics simulations, supplemented by adaptive biasing forces, to enrich the sampling of catalytically relevant configurations. Our results reveal that upon incorporation of collective variables that capture the behavior of the reactant molecule, as well as additional frames that describe the dynamic response of the catalytic surface, it is possible to enhance considerably the accuracy of predicted energies and forces. By employing enhanced sampling schemes in the refinement of the MLIP, we systematically explore the potential energy surface, leading to a refined MLIP capable of predicting density functional theory-level energies and forces and replicating key geometric characteristics of the catalytic system. The resulting free energy landscapes at several temperatures provide a detailed view of the thermodynamics and dynamics of methane activation. Specifically, as methane approaches and dissociates on the catalytic surface, the process involves the dynamic interplay of CH4 and the Ni catalyst that includes both enthalpic and entropic contributions. The progression toward the transition state involves a CH4 moiety that is increasingly restrained in its ability to rotate or translate, while the stage following the transition state is characterized by a notable rise of the Ni atom that interacts with the cleaved C-H bond. This leads to an increase in the mobility of the adsorbed species, a feature that becomes more pronounced at higher temperatures.
Collapse
Affiliation(s)
- Yinan Xu
- Pritzker School of Molecular Engineering, The University of Chicago, 640 South Ellis Avenue, Chicago, Illinois 60637, United States
| | - Yezhi Jin
- Pritzker School of Molecular Engineering, The University of Chicago, 640 South Ellis Avenue, Chicago, Illinois 60637, United States
| | - Jireh S García Sánchez
- Pritzker School of Molecular Engineering, The University of Chicago, 640 South Ellis Avenue, Chicago, Illinois 60637, United States
| | - Gustavo R Pérez-Lemus
- Pritzker School of Molecular Engineering, The University of Chicago, 640 South Ellis Avenue, Chicago, Illinois 60637, United States
| | - Pablo F Zubieta Rico
- Pritzker School of Molecular Engineering, The University of Chicago, 640 South Ellis Avenue, Chicago, Illinois 60637, United States
| | - Massimiliano Delferro
- Chemical Sciences and Engineering Division, Argonne National Laboratory, 9700 South Cass Avenue, Lemont, Illinois 60439, United States
| | - Juan J de Pablo
- Pritzker School of Molecular Engineering, The University of Chicago, 640 South Ellis Avenue, Chicago, Illinois 60637, United States
- Materials Science Division, Argonne National Laboratory, 9700 South Cass Avenue, Lemont, Illinois 60439, United States
| |
Collapse
|
10
|
Zhu Y, Peng J, Xu C, Lan Z. Unsupervised Machine Learning in the Analysis of Nonadiabatic Molecular Dynamics Simulation. J Phys Chem Lett 2024; 15:9601-9619. [PMID: 39270134 DOI: 10.1021/acs.jpclett.4c01751] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/15/2024]
Abstract
The all-atomic full-dimensional-level simulations of nonadiabatic molecular dynamics (NAMD) in large realistic systems has received high research interest in recent years. However, such NAMD simulations normally generate an enormous amount of time-dependent high-dimensional data, leading to a significant challenge in result analyses. Based on unsupervised machine learning (ML) methods, considerable efforts were devoted to developing novel and easy-to-use analysis tools for the identification of photoinduced reaction channels and the comprehensive understanding of complicated molecular motions in NAMD simulations. Here, we tried to survey recent advances in this field, particularly to focus on how to use unsupervised ML methods to analyze the trajectory-based NAMD simulation results. Our purpose is to offer a comprehensive discussion on several essential components of this analysis protocol, including the selection of ML methods, the construction of molecular descriptors, the establishment of analytical frameworks, their advantages and limitations, and persistent challenges.
Collapse
Affiliation(s)
- Yifei Zhu
- MOE Key Laboratory of Environmental Theoretical Chemistry, SCNU Environmental Research Institute, Guangdong Provincial Key Laboratory of Chemical Pollution and Environmental Safety, School of Environment, South China Normal University, Guangzhou 510006, P. R. China
| | - Jiawei Peng
- MOE Key Laboratory of Environmental Theoretical Chemistry, SCNU Environmental Research Institute, Guangdong Provincial Key Laboratory of Chemical Pollution and Environmental Safety, School of Environment, South China Normal University, Guangzhou 510006, P. R. China
| | - Chao Xu
- MOE Key Laboratory of Environmental Theoretical Chemistry, SCNU Environmental Research Institute, Guangdong Provincial Key Laboratory of Chemical Pollution and Environmental Safety, School of Environment, South China Normal University, Guangzhou 510006, P. R. China
| | - Zhenggang Lan
- MOE Key Laboratory of Environmental Theoretical Chemistry, SCNU Environmental Research Institute, Guangdong Provincial Key Laboratory of Chemical Pollution and Environmental Safety, School of Environment, South China Normal University, Guangzhou 510006, P. R. China
| |
Collapse
|
11
|
Chong S, Bigi F, Grasselli F, Loche P, Kellner M, Ceriotti M. Prediction rigidities for data-driven chemistry. Faraday Discuss 2024. [PMID: 39319702 PMCID: PMC11423580 DOI: 10.1039/d4fd00101j] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2024] [Accepted: 08/22/2024] [Indexed: 09/26/2024]
Abstract
The widespread application of machine learning (ML) to the chemical sciences is making it very important to understand how the ML models learn to correlate chemical structures with their properties, and what can be done to improve the training efficiency whilst guaranteeing interpretability and transferability. In this work, we demonstrate the wide utility of prediction rigidities, a family of metrics derived from the loss function, in understanding the robustness of ML model predictions. We show that the prediction rigidities allow the assessment of the model not only at the global level, but also on the local or the component-wise level at which the intermediate (e.g. atomic, body-ordered, or range-separated) predictions are made. We leverage these metrics to understand the learning behavior of different ML models, and to guide efficient dataset construction for model training. We finally implement the formalism for a ML model targeting a coarse-grained system to demonstrate the applicability of the prediction rigidities to an even broader class of atomistic modeling problems.
Collapse
Affiliation(s)
- Sanggyu Chong
- Laboratory of Computational Science and Modeling, Institute of Materials, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland.
| | - Filippo Bigi
- Laboratory of Computational Science and Modeling, Institute of Materials, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland.
| | - Federico Grasselli
- Laboratory of Computational Science and Modeling, Institute of Materials, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland.
| | - Philip Loche
- Laboratory of Computational Science and Modeling, Institute of Materials, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland.
| | - Matthias Kellner
- Laboratory of Computational Science and Modeling, Institute of Materials, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland.
| | - Michele Ceriotti
- Laboratory of Computational Science and Modeling, Institute of Materials, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland.
| |
Collapse
|
12
|
Gupta AK, Stulajter MM, Shaidu Y, Neaton JB, de Jong WA. Equivariant Neural Networks Utilizing Molecular Clusters for Accurate Molecular Crystal Lattice Energy Predictions. ACS OMEGA 2024; 9:40269-40282. [PMID: 39346862 PMCID: PMC11425815 DOI: 10.1021/acsomega.4c07434] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/13/2024] [Revised: 08/27/2024] [Accepted: 09/02/2024] [Indexed: 10/01/2024]
Abstract
Equivariant neural networks have emerged as prominent models in advancing the construction of interatomic potentials due to their remarkable data efficiency and generalization capabilities for out-of-distribution data. Here, we expand the utility of these networks to the prediction of crystal structures consisting of organic molecules. Traditional methods for computing crystal structure properties, such as plane-wave quantum chemical methods based on density functional theory (DFT), are prohibitively resource-intensive, often necessitating compromises in accuracy and the choice of exchange-correlation functional. We present an approach that leverages the efficiency, and transferability of equivariant neural networks, specifically Allegro, to predict molecular crystal structure energies at a reduced computational cost. Our neural network is trained on molecular clusters using a highly accurate Gaussian-type orbital (GTO)-based method as the target level of theory, eliminating the need for costly periodic DFT calculations, while providing access to all families of exchange-corelation functionals and post-Hartree-Fock methods. The trained model exhibits remarkable accuracy in predicting lattice energies, aligning closely with those computed by plane-wave based DFT methods, thus representing significant cost reductions. Furthermore, the Allegro network was seamlessly integrated with the USPEX framework, accelerating the discovery of low-energy crystal structures during crystal structure prediction.
Collapse
Affiliation(s)
- Ankur K Gupta
- Applied Mathematics and Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States
| | - Miko M Stulajter
- Applied Mathematics and Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States
| | - Yusuf Shaidu
- Department of Physics, University of California Berkeley, Berkeley, California 94720, United States
- Materials Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States
| | - Jeffrey B Neaton
- Department of Physics, University of California Berkeley, Berkeley, California 94720, United States
- Materials Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States
- Kavli Energy NanoSciences Institute at Berkeley, Berkeley, California 94720, United States
| | - Wibe A de Jong
- Applied Mathematics and Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States
| |
Collapse
|
13
|
Yang Z, Zhao YM, Wang X, Liu X, Zhang X, Li Y, Lv Q, Chen CYC, Shen L. Scalable crystal structure relaxation using an iteration-free deep generative model with uncertainty quantification. Nat Commun 2024; 15:8148. [PMID: 39289379 PMCID: PMC11408520 DOI: 10.1038/s41467-024-52378-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2024] [Accepted: 09/02/2024] [Indexed: 09/19/2024] Open
Abstract
In computational molecular and materials science, determining equilibrium structures is the crucial first step for accurate subsequent property calculations. However, the recent discovery of millions of new crystals and super large twisted structures has challenged traditional computational methods, both ab initio and machine-learning-based, due to their computationally intensive iterative processes. To address these scalability issues, here we introduce DeepRelax, a deep generative model capable of performing geometric crystal structure relaxation rapidly and without iterations. DeepRelax learns the equilibrium structural distribution, enabling it to predict relaxed structures directly from their unrelaxed ones. The ability to perform structural relaxation at the millisecond level per structure, combined with the scalability of parallel processing, makes DeepRelax particularly useful for large-scale virtual screening. We demonstrate DeepRelax's reliability and robustness by applying it to five diverse databases, including oxides, Materials Project, two-dimensional materials, van der Waals crystals, and crystals with point defects. DeepRelax consistently shows high accuracy and efficiency, validated by density functional theory calculations. Finally, we enhance its trustworthiness by integrating uncertainty quantification. This work significantly accelerates computational workflows, offering a robust and trustworthy machine-learning method for material discovery and advancing the application of AI for science.
Collapse
Affiliation(s)
- Ziduo Yang
- Department of Mechanical Engineering, National University of Singapore, Singapore, Singapore
- Artificial Intelligence Medical Research Center, School of Intelligent Systems Engineering, Shenzhen Campus of Sun Yat-sen University, Shenzhen, China
- AI for Science (AI4S)-Preferred Program, School of Electronic and Computer Engineering, Peking University Shenzhen Graduate School, Shenzhen, China
| | - Yi-Ming Zhao
- Department of Mechanical Engineering, National University of Singapore, Singapore, Singapore
| | - Xian Wang
- Department of Physics, National University of Singapore, Singapore, Singapore
| | - Xiaoqing Liu
- Department of Mechanical Engineering, National University of Singapore, Singapore, Singapore
| | - Xiuying Zhang
- Department of Mechanical Engineering, National University of Singapore, Singapore, Singapore
| | - Yifan Li
- Department of Mechanical Engineering, National University of Singapore, Singapore, Singapore
| | - Qiujie Lv
- Department of Mechanical Engineering, National University of Singapore, Singapore, Singapore
- Artificial Intelligence Medical Research Center, School of Intelligent Systems Engineering, Shenzhen Campus of Sun Yat-sen University, Shenzhen, China
| | - Calvin Yu-Chian Chen
- AI for Science (AI4S)-Preferred Program, School of Electronic and Computer Engineering, Peking University Shenzhen Graduate School, Shenzhen, China.
- State Key Laboratory of Chemical Oncogenomics, School of Chemical Biology and Biotechnology, Peking University Shenzhen Graduate School, Shenzhen, China.
- Department of Medical Research, China Medical University Hospital, Taichung, Taiwan.
- Department of Bioinformatics and Medical Engineering, Asia University, Taichung, Taiwan.
- Guangdong L-Med Biotechnology Co., Ltd., Meizhou, Guangdong, China.
| | - Lei Shen
- Department of Mechanical Engineering, National University of Singapore, Singapore, Singapore.
- National University of Singapore (Chongqing) Research Institute, Chongqing, China.
| |
Collapse
|
14
|
Fang YB, Shang C, Liu ZP, Gong XG. Structural transitions in liquid semiconductor alloys: A molecular dynamics study with a neural network potential. J Chem Phys 2024; 161:104504. [PMID: 39258571 DOI: 10.1063/5.0223453] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2024] [Accepted: 08/26/2024] [Indexed: 09/12/2024] Open
Abstract
Liquid-liquid phase transitions hold a unique and profound significance within condensed matter physics. These transitions, while conceptually intriguing, often pose formidable computational challenges. However, recent advances in neural network (NN) potentials offer a promising avenue to effectively address these challenges. In this paper, we delve into the structural transitions of liquid CdTe, CdS, and their alloy systems using molecular dynamics simulations, harnessing the power of an NN potential named LaspNN. Our investigations encompass both pressure and temperature effects. Through our simulations, we uncover three primary liquid structures around melting points that emerge as pressure increases: tetrahedral, rock salt, and close-packed structures, which greatly resemble those of solid states. In the high-temperature regime, we observe the formation of Te chains and S dimers, providing a deeper understanding of the liquid's atomic arrangements. When examining CdSxTe1-x alloys, our findings indicate that a small substitution of S by Te atoms for S-rich alloys (x > 0.5) exhibits a structural transition much different from CdS, while a large substitution of Te by S atoms for Te-rich alloys (x < 0.5) barely exhibits a structural transition similar to CdTe. We construct a schematic diagram for liquid alloys that considers both temperature and pressure, providing a comprehensive overview of the alloy system's behavior. The local aggregation of Te atoms demonstrates a linear relationship with alloy composition x, whereas that of S atoms exhibits a nonlinear one, shedding light on the composition-dependent structural changes.
Collapse
Affiliation(s)
- Yi-Bin Fang
- Key Laboratory for Computational Physical Sciences (MOE), State Key Laboratory of Surface Physics, Department of Physics, Fudan University, Shanghai 200433, China
- Shanghai Qi Zhi Institute, Shanghai 200232, China
| | - Cheng Shang
- Shanghai Qi Zhi Institute, Shanghai 200232, China
- Collaborative Innovation Center of Chemistry for Energy Materials (iChEM), Shanghai Key Laboratory of Molecular Catalysis and Innovative Materials, Key Laboratory of Computational Physical Science, Department of Chemistry, Fudan University, Shanghai 200433, China
| | - Zhi-Pan Liu
- Shanghai Qi Zhi Institute, Shanghai 200232, China
- Collaborative Innovation Center of Chemistry for Energy Materials (iChEM), Shanghai Key Laboratory of Molecular Catalysis and Innovative Materials, Key Laboratory of Computational Physical Science, Department of Chemistry, Fudan University, Shanghai 200433, China
| | - Xin-Gao Gong
- Key Laboratory for Computational Physical Sciences (MOE), State Key Laboratory of Surface Physics, Department of Physics, Fudan University, Shanghai 200433, China
- Shanghai Qi Zhi Institute, Shanghai 200232, China
| |
Collapse
|
15
|
Xu L, Jiang J. Synergistic Integration of Physical Embedding and Machine Learning Enabling Precise and Reliable Force Field. J Chem Theory Comput 2024. [PMID: 39264358 DOI: 10.1021/acs.jctc.4c00618] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/13/2024]
Abstract
Machine-learning force fields have achieved significant strides in accurately reproducing the potential energy surface with quantum chemical accuracy. However, this approach still faces several challenges, e.g., extrapolating to uncharted chemical spaces, interpreting long-range electrostatics, and mapping complex macroscopic properties. To address these issues, we advocate for a synergistic integration of physical principles and machine learning techniques within the framework of a physically informed neural network (PINN). This approach involves incorporating physical knowledge into the parameters of the neural network, coupled with an efficient global optimizer, the Tabu-Adam algorithm, proposed in this work to augment optimization under strict physical constraint. We choose the AMOEBA+ force field as the physics-based model for embedding and then train and test it using the diethylene glycol dimethyl ether (DEGDME) data set as a case study. The results reveal a breakthrough in constructing a precise and noise-robust machine learning force field. Utilizing two training sets with hundreds of samples, our model exhibits remarkable generalization and density functional theory (DFT) accuracy in describing molecular interactions and enables a precise prediction of the macroscopic properties such as the diffusion coefficient with minimal cost. This work provides valuable insight into establishing a fundamental framework of the PINN force field.
Collapse
Affiliation(s)
- Lifeng Xu
- Beijing National Laboratory for Molecular Sciences, State Key Laboratory of Polymer Physics and Chemistry, Institute of Chemistry, Chinese Academy of Sciences, Beijing 100190, P. R. China
- University of Chinese Academy of Sciences, Beijing 100049, P. R. China
| | - Jian Jiang
- Beijing National Laboratory for Molecular Sciences, State Key Laboratory of Polymer Physics and Chemistry, Institute of Chemistry, Chinese Academy of Sciences, Beijing 100190, P. R. China
- University of Chinese Academy of Sciences, Beijing 100049, P. R. China
| |
Collapse
|
16
|
Mausenberger S, Müller C, Tkatchenko A, Marquetand P, González L, Westermayr J. SpaiNN: equivariant message passing for excited-state nonadiabatic molecular dynamics. Chem Sci 2024:d4sc04164j. [PMID: 39282652 PMCID: PMC11391904 DOI: 10.1039/d4sc04164j] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2024] [Accepted: 09/01/2024] [Indexed: 09/19/2024] Open
Abstract
Excited-state molecular dynamics simulations are crucial for understanding processes like photosynthesis, vision, and radiation damage. However, the computational complexity of quantum chemical calculations restricts their scope. Machine learning offers a solution by delivering high-accuracy properties at lower computational costs. We present SpaiNN, an open-source Python software for ML-driven surface hopping nonadiabatic molecular dynamics simulations. SpaiNN combines the invariant and equivariant neural network architectures of SchNetPack with SHARC for surface hopping dynamics. Its modular design allows users to implement and adapt modules easily. We compare rotationally-invariant and equivariant representations in fitting potential energy surfaces of multiple electronic states and properties arising from the interaction of two electronic states. Simulations of the methyleneimmonium cation and various alkenes demonstrate the superior performance of equivariant SpaiNN models, improving accuracy, generalization, and efficiency in both training and inference.
Collapse
Affiliation(s)
- Sascha Mausenberger
- Faculty of Chemistry, Institute of Theoretical Chemistry, University of Vienna Währinger Str. 17 1090 Vienna Austria
- Vienna Doctoral School in Chemistry (DosChem), University of Vienna Währinger Straße 42 1090 Vienna Austria
| | - Carolin Müller
- Department Chemistry and Pharmacy, Computer-Chemistry-Center, Friedrich-Alexander-Universität Erlangen-Nürnberg Nägelsbachstraße 25 91052 Erlangen Germany
- Department of Physics and Materials Science, University of Luxembourg 162 A, Avenue de la Faïencerie L-1511 Luxembourg Luxembourg
| | - Alexandre Tkatchenko
- Department of Physics and Materials Science, University of Luxembourg 162 A, Avenue de la Faïencerie L-1511 Luxembourg Luxembourg
| | - Philipp Marquetand
- Faculty of Chemistry, Institute of Theoretical Chemistry, University of Vienna Währinger Str. 17 1090 Vienna Austria
| | - Leticia González
- Faculty of Chemistry, Institute of Theoretical Chemistry, University of Vienna Währinger Str. 17 1090 Vienna Austria
| | - Julia Westermayr
- Wilhelm Ostwald Institute for Physical and Theoretical Chemistry, Leipzig University Linnéstraße 2 04103 Leipzig Germany
- Center for Scalable Data Analytics and Artificial Intelligence (ScaDS.AI) Dresden/Leipzig Germany
| |
Collapse
|
17
|
Devergne T, Huet L, Pietrucci F, Saitta AM. Efficient machine learning approach for accurate free-energy profiles and kinetic rates. Phys Rev E 2024; 110:L033301. [PMID: 39425316 DOI: 10.1103/physreve.110.l033301] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Accepted: 08/01/2024] [Indexed: 10/21/2024]
Abstract
The computational exploration of reactive processes is challenging due to the requirement of thorough sampling across the free energy landscape using accurate ab initio methods. To address these constraints, machine learning potentials are employed, yet their training for this kind of problem is still a laborious and tedious task. In this study, we present an efficient approach to train these potentials by cleverly using a single batch of unbiased trajectories that avoid the pitfalls of trajectories artificially biased along a suboptimal collective variable. This strategy, when integrated with current enhanced sampling techniques, allows to obtain free energy profiles and kinetic rates of ab initio quality, yet dramatically reducing the computational cost.
Collapse
Affiliation(s)
- Timothée Devergne
- Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, UMR 7590 CNRS, Sorbonne Université, Muséum National d'Histoire Naturelle - Paris 75005, France; Atomistic Simulations, Italian Institute of Technology, 16142 Genoa, Italy; and Computational Statistics and Machine Learning, Italian Institute of Technology, 16142 Genoa, Italy
| | | | | | | |
Collapse
|
18
|
Jin Y, Perez-Lemus GR, Zubieta Rico PF, de Pablo JJ. Improving Machine Learned Force Fields for Complex Fluids through Enhanced Sampling: A Liquid Crystal Case Study. J Phys Chem A 2024; 128:7257-7268. [PMID: 39150905 DOI: 10.1021/acs.jpca.4c01546] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/18/2024]
Abstract
Machine learned force fields offer the potential for faster execution times while retaining the accuracy of traditional DFT calculations, making them promising candidates for molecular simulations in cases where reliable classical force fields are not available. Some of the challenges associated with machine learned force fields include simulation stability over extended periods of time and ensuring that the statistical and dynamical properties of the underlying simulated systems are correctly captured. In this work, we propose a systematic training pipeline for such force fields that leads to improved model quality, compared to that achieved by traditional data generation and training approaches. That pipeline relies on the use of enhanced sampling techniques, and it is demonstrated here in the context of a liquid crystal, which exemplifies many of the challenges that are encountered in fluids and materials with complex free energy landscapes. Our results indicate that, whereas the majority of traditional machine learned force field training approaches lead to molecular dynamics simulations that are only stable over hundred-picosecond trajectories, our approach allows for stable simulations over tens of nanoseconds for organic molecular systems comprising thousands of atoms.
Collapse
Affiliation(s)
- Yezhi Jin
- Pritzker School of Molecular Engineering, University of Chicago, Chicago, Illinois 60637-1476, United States
| | - Gustavo R Perez-Lemus
- Pritzker School of Molecular Engineering, University of Chicago, Chicago, Illinois 60637-1476, United States
| | - Pablo F Zubieta Rico
- Pritzker School of Molecular Engineering, University of Chicago, Chicago, Illinois 60637-1476, United States
| | - Juan J de Pablo
- Pritzker School of Molecular Engineering, University of Chicago, Chicago, Illinois 60637-1476, United States
| |
Collapse
|
19
|
Kahle L, Minisini B, Bui T, First JT, Buda C, Goldman T, Wimmer E. A dual-cutoff machine-learned potential for condensed organic systems obtained via uncertainty-guided active learning. Phys Chem Chem Phys 2024; 26:22665-22680. [PMID: 39158948 DOI: 10.1039/d4cp01980f] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/20/2024]
Abstract
Machine-learned potentials (MLPs) trained on ab initio data combine the computational efficiency of classical interatomic potentials with the accuracy and generality of the first-principles method used in the creation of the respective training set. In this work, we implement and train a MLP to obtain an accurate description of the potential energy surface and property predictions for organic compounds, as both single molecules and in the condensed phase. We devise a dual descriptor, based on the atomic cluster expansion (ACE), that couples an information-rich short-range description with a coarser long-range description that captures weak intermolecular interactions. We employ uncertainty-guided active learning for the training set generation, creating a dataset that is comparatively small for the breadth of application and consists of alcohols, alkanes, and an adipate. Utilizing that MLP, we calculate densities of those systems of varying chain lengths as a function of temperature, obtaining a discrepancy of less than 4% compared with experiment. Vibrational frequencies calculated with the MLP have a root mean square error of less than 1 THz compared to DFT. The heat capacities of condensed systems are within 11% of experimental findings, which is strong evidence that the dual descriptor provides an accurate framework for the prediction of both short-range intramolecular and long-range intermolecular interactions.
Collapse
Affiliation(s)
- Leonid Kahle
- Materials Design SARL, 42 avenue Verdier, 92120 Montrouge, France.
| | - Benoit Minisini
- Materials Design SARL, 42 avenue Verdier, 92120 Montrouge, France.
| | - Tai Bui
- bp Exploration Operating Co. Ltd, Chertsey Road, Sunbury-on-Thames TW16 7LN, UK
| | - Jeremy T First
- bp, Center for High Performance Computing, 225 Westlake Park Blvd, Houston, TX 77079, USA
| | - Corneliu Buda
- bp Exploration Operating Co. Ltd, Chertsey Road, Sunbury-on-Thames TW16 7LN, UK
| | - Thomas Goldman
- bp Exploration Operating Co. Ltd, Chertsey Road, Sunbury-on-Thames TW16 7LN, UK
| | - Erich Wimmer
- Materials Design SARL, 42 avenue Verdier, 92120 Montrouge, France.
| |
Collapse
|
20
|
Hattori S, Zhu Q. Revisiting Aspirin Polymorphic Stability Using a Machine Learning Potential. ACS OMEGA 2024; 9:36589-36599. [PMID: 39220495 PMCID: PMC11360032 DOI: 10.1021/acsomega.4c04782] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/21/2024] [Revised: 07/26/2024] [Accepted: 07/30/2024] [Indexed: 09/04/2024]
Abstract
In this study, we present a systematic computational investigation to analyze the long-debated free energy stability of two well-known aspirin polymorphs, denoted as Form I and Form II. Specifically, we developed a strategy to collect training configurations covering diverse interatomic interactions between representative functional groups in aspirin crystals. Utilizing a state-of-the-art neural network interatomic potential (NNIP) model, we trained an accurate machine learning potential to simulate aspirin crystal dynamics under finite temperature conditions with ∼0.46 kJ/mol/molecule accuracy. Employing the trained NNIP model, we performed thermodynamic integration to assess the free energy difference between aspirins I and II, accounting for the anharmonic effects in a large supercell consisting of 512 molecules. For the first time, our results convincingly demonstrated that Form I is more stable than Form II at 300 K, ranging from 0.74 to 1.83 kJ/mol/molecule, aligning with experimental observations. Unlike the majority of previous simulations based on (quasi)harmonic approximations in a small super cell, which often found degenerate energies between aspirins I and II, our findings underscore the importance of anharmonic effects in determining polymorphic stability ranking. Furthermore, we proposed the use of the rotational degrees of freedom of methyl and ester/phenyl groups in aspirin crystals as characteristic motions to highlight rotational entropic contribution that favors the stability of Form I. From the structural perspective, we also found that the subtle free energy difference can be used to explain the distinct thermal expansion responses as observed in both experimental and simulation data. Beyond the aspirin polymorphism, we anticipate that such entropy-driven stabilization can be broadly applicable to many other organic systems, suggesting that our approach holds great promise for stability studies in small-molecule drug design.
Collapse
Affiliation(s)
- Shinnosuke Hattori
- Advanced
Research Laboratory, Research Platform, Sony Group Corporation, 4−14−1 Asahi-cho, Atsugi-shi 243−0014, Japan
| | - Qiang Zhu
- Department
of Mechanical Engineering and Engineering Science, University of North Carolina at Charlotte, Charlotte, North Carolina 28223, United States
| |
Collapse
|
21
|
Heid E, Schörghuber J, Wanzenböck R, Madsen GKH. Spatially Resolved Uncertainties for Machine Learning Potentials. J Chem Inf Model 2024; 64:6377-6387. [PMID: 39110874 DOI: 10.1021/acs.jcim.4c00904] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/27/2024]
Abstract
Machine learning potentials have become an essential tool for atomistic simulations, yielding results close to ab initio simulations at a fraction of computational cost. With recent improvements on the achievable accuracies, the focus has now shifted on the data set composition itself. The reliable identification of erroneously predicted configurations to extend a given data set is therefore of high priority. Yet, uncertainty estimation techniques have achieved mixed results for machine learning potentials. Consequently, a general and versatile method to correlate energy or atomic force uncertainties with the model error has remained elusive to date. In the current work, we show that epistemic uncertainty cannot correlate with model error by definition but can be aggregated over groups of atoms to yield a strong correlation. We demonstrate that our method correctly estimates prediction errors both globally per structure and locally resolved per atom. The direct correlation of local uncertainty and local error is used to design an active learning framework based on identifying local subregions of a large simulation cell and performing ab initio calculations only for the subregion subsequently. We successfully utilized this method to perform active learning in the low-data regime for liquid water.
Collapse
Affiliation(s)
- Esther Heid
- Institute of Materials Chemistry, TU Wien, A-1060 Vienna, Austria
| | | | - Ralf Wanzenböck
- Institute of Materials Chemistry, TU Wien, A-1060 Vienna, Austria
| | - Georg K H Madsen
- Institute of Materials Chemistry, TU Wien, A-1060 Vienna, Austria
| |
Collapse
|
22
|
Rothchild D, Rosen AS, Taw E, Robinson C, Gonzalez JE, Krishnapriyan AS. Investigating the behavior of diffusion models for accelerating electronic structure calculations. Chem Sci 2024; 15:13506-13522. [PMID: 39183908 PMCID: PMC11339969 DOI: 10.1039/d3sc05877h] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2023] [Accepted: 07/11/2024] [Indexed: 08/27/2024] Open
Abstract
We present an investigation of diffusion models for molecular generation, with the aim of better understanding how their predictions compare to the results of physics-based calculations. The investigation into these models is driven by their potential to significantly accelerate electronic structure calculations using machine learning, without requiring expensive first-principles datasets for training interatomic potentials. We find that the inference process of a popular diffusion model for de novo molecular generation is divided into an exploration phase, where the model chooses the atomic species, and a relaxation phase, where it adjusts the atomic coordinates to find a low-energy geometry. As training proceeds, we show that the model initially learns about the first-order structure of the potential energy surface, and then later learns about higher-order structure. We also find that the relaxation phase of the diffusion model can be re-purposed to sample the Boltzmann distribution over conformations and to carry out structure relaxations. For structure relaxations, the model finds geometries with ∼10× lower energy than those produced by a classical force field for small organic molecules. Initializing a density functional theory (DFT) relaxation at the diffusion-produced structures yields a >2× speedup to the DFT relaxation when compared to initializing at structures relaxed with a classical force field.
Collapse
Affiliation(s)
- Daniel Rothchild
- Department of Electrical Engineering and Computer Science, University of California Berkeley USA
| | - Andrew S Rosen
- Department of Materials Science and Engineering, University of California Berkeley USA
- Materials Science Division, Lawrence Berkeley National Laboratory USA
| | - Eric Taw
- Department of Chemical and Biomolecular Engineering, University of California Berkeley USA
- Materials Science Division, Lawrence Berkeley National Laboratory USA
| | - Connie Robinson
- Department of Chemistry, University of California Berkeley USA
| | - Joseph E Gonzalez
- Department of Electrical Engineering and Computer Science, University of California Berkeley USA
| | - Aditi S Krishnapriyan
- Department of Electrical Engineering and Computer Science, University of California Berkeley USA
- Department of Chemical and Biomolecular Engineering, University of California Berkeley USA
| |
Collapse
|
23
|
van Gerwen P, Briling KR, Bunne C, Somnath VR, Laplaza R, Krause A, Corminboeuf C. 3DReact: Geometric Deep Learning for Chemical Reactions. J Chem Inf Model 2024; 64:5771-5785. [PMID: 39007724 PMCID: PMC11323278 DOI: 10.1021/acs.jcim.4c00104] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2024] [Revised: 07/03/2024] [Accepted: 07/08/2024] [Indexed: 07/16/2024]
Abstract
Geometric deep learning models, which incorporate the relevant molecular symmetries within the neural network architecture, have considerably improved the accuracy and data efficiency of predictions of molecular properties. Building on this success, we introduce 3DReact, a geometric deep learning model to predict reaction properties from three-dimensional structures of reactants and products. We demonstrate that the invariant version of the model is sufficient for existing reaction data sets. We illustrate its competitive performance on the prediction of activation barriers on the GDB7-22-TS, Cyclo-23-TS, and Proparg-21-TS data sets in different atom-mapping regimes. We show that, compared to existing models for reaction property prediction, 3DReact offers a flexible framework that exploits atom-mapping information, if available, as well as geometries of reactants and products (in an invariant or equivariant fashion). Accordingly, it performs systematically well across different data sets, atom-mapping regimes, as well as both interpolation and extrapolation tasks.
Collapse
Affiliation(s)
- Puck van Gerwen
- Laboratory
for Computational Molecular Design, Institute of Chemical Sciences
and Engineering, École Polytechnique
Fédérale de Lausanne, 1015 Lausanne, Switzerland
- National
Center for Competence in Research − Catalysis (NCCR-Catalysis), École Polytechnique Fédérale
de Lausanne, 1015 Lausanne, Switzerland
| | - Ksenia R. Briling
- Laboratory
for Computational Molecular Design, Institute of Chemical Sciences
and Engineering, École Polytechnique
Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Charlotte Bunne
- National
Center for Competence in Research − Catalysis (NCCR-Catalysis), École Polytechnique Fédérale
de Lausanne, 1015 Lausanne, Switzerland
- Learning
& Adaptive Systems Group, Department of Computer Science, ETH Zurich, 8092 Zurich, Switzerland
| | - Vignesh Ram Somnath
- National
Center for Competence in Research − Catalysis (NCCR-Catalysis), École Polytechnique Fédérale
de Lausanne, 1015 Lausanne, Switzerland
- Learning
& Adaptive Systems Group, Department of Computer Science, ETH Zurich, 8092 Zurich, Switzerland
| | - Ruben Laplaza
- Laboratory
for Computational Molecular Design, Institute of Chemical Sciences
and Engineering, École Polytechnique
Fédérale de Lausanne, 1015 Lausanne, Switzerland
- National
Center for Competence in Research − Catalysis (NCCR-Catalysis), École Polytechnique Fédérale
de Lausanne, 1015 Lausanne, Switzerland
| | - Andreas Krause
- National
Center for Competence in Research − Catalysis (NCCR-Catalysis), École Polytechnique Fédérale
de Lausanne, 1015 Lausanne, Switzerland
- Learning
& Adaptive Systems Group, Department of Computer Science, ETH Zurich, 8092 Zurich, Switzerland
| | - Clemence Corminboeuf
- Laboratory
for Computational Molecular Design, Institute of Chemical Sciences
and Engineering, École Polytechnique
Fédérale de Lausanne, 1015 Lausanne, Switzerland
- National
Center for Competence in Research − Catalysis (NCCR-Catalysis), École Polytechnique Fédérale
de Lausanne, 1015 Lausanne, Switzerland
| |
Collapse
|
24
|
Frank JT, Unke OT, Müller KR, Chmiela S. A Euclidean transformer for fast and stable machine learned force fields. Nat Commun 2024; 15:6539. [PMID: 39107296 PMCID: PMC11303804 DOI: 10.1038/s41467-024-50620-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2023] [Accepted: 07/10/2024] [Indexed: 08/10/2024] Open
Abstract
Recent years have seen vast progress in the development of machine learned force fields (MLFFs) based on ab-initio reference calculations. Despite achieving low test errors, the reliability of MLFFs in molecular dynamics (MD) simulations is facing growing scrutiny due to concerns about instability over extended simulation timescales. Our findings suggest a potential connection between robustness to cumulative inaccuracies and the use of equivariant representations in MLFFs, but the computational cost associated with these representations can limit this advantage in practice. To address this, we propose a transformer architecture called SO3KRATES that combines sparse equivariant representations (Euclidean variables) with a self-attention mechanism that separates invariant and equivariant information, eliminating the need for expensive tensor products. SO3KRATES achieves a unique combination of accuracy, stability, and speed that enables insightful analysis of quantum properties of matter on extended time and system size scales. To showcase this capability, we generate stable MD trajectories for flexible peptides and supra-molecular structures with hundreds of atoms. Furthermore, we investigate the PES topology for medium-sized chainlike molecules (e.g., small peptides) by exploring thousands of minima. Remarkably, SO3KRATES demonstrates the ability to strike a balance between the conflicting demands of stability and the emergence of new minimum-energy conformations beyond the training data, which is crucial for realistic exploration tasks in the field of biochemistry.
Collapse
Affiliation(s)
- J Thorben Frank
- Machine Learning Group, TU Berlin, Berlin, Germany
- BIFOLD, Berlin Institute for the Foundations of Learning and Data, Berlin, Germany
| | | | - Klaus-Robert Müller
- Machine Learning Group, TU Berlin, Berlin, Germany.
- BIFOLD, Berlin Institute for the Foundations of Learning and Data, Berlin, Germany.
- Google DeepMind, Berlin, Germany.
- Department of Artificial Intelligence, Korea University, Seoul, Korea.
- Max Planck Institut für Informatik, Saarbrücken, Germany.
| | - Stefan Chmiela
- Machine Learning Group, TU Berlin, Berlin, Germany.
- BIFOLD, Berlin Institute for the Foundations of Learning and Data, Berlin, Germany.
| |
Collapse
|
25
|
Pyzer-Knapp EO, Curioni A. Advancing biomolecular simulation through exascale HPC, AI and quantum computing. Curr Opin Struct Biol 2024; 87:102826. [PMID: 38733863 DOI: 10.1016/j.sbi.2024.102826] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2023] [Revised: 04/11/2024] [Accepted: 04/11/2024] [Indexed: 05/13/2024]
Abstract
Biomolecular simulation can act as both a digital microscope and a crystal ball; offering the potential for a deeper understanding of experimental observations whilst also presenting a forward-looking avenue for the in silico design and evaluation of hitherto unsynthesized compounds. Indeed, as the intricacy of our scientific inquiries has grown, so too has the computational prowess we seek to deploy in our pursuit of answers. As we enter the Exascale era, this mini-review surveys the computational landscape from both the point of view of the development of new and ever more powerful systems, and the simulations that are run on them. Moreover, as we stand on the cusp of a transformative phase in computational biology, this article offers a contemplative glance into the future, speculating on the profound implications of artificial intelligence and quantum computing for large-scale biomolecular simulations.
Collapse
|
26
|
Goodwin ZAH, Wenny MB, Yang JH, Cepellotti A, Ding J, Bystrom K, Duschatko BR, Johansson A, Sun L, Batzner S, Musaelian A, Mason JA, Kozinsky B, Molinari N. Transferability and Accuracy of Ionic Liquid Simulations with Equivariant Machine Learning Interatomic Potentials. J Phys Chem Lett 2024; 15:7539-7547. [PMID: 39023916 DOI: 10.1021/acs.jpclett.4c01942] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/20/2024]
Abstract
Ionic liquids (ILs) are an exciting class of electrolytes finding applications in many areas from energy storage to solvents, where they have been touted as "designer solvents" as they can be mixed to precisely tailor the physiochemical properties. As using machine learning interatomic potentials (MLIPs) to simulate ILs is still relatively unexplored, several questions need to be answered to see if MLIPs can be transformative for ILs. Since ILs are often not pure, but are either mixed together or contain additives, we first demonstrate that a MLIP can be trained to be compositionally transferable; i.e., the MLIP can be applied to mixtures of ions not directly trained on, while only being trained on a few mixtures of the same ions. We also investigated the accuracy of MLIPs for a novel IL, which we experimentally synthesize and characterize. Our MLIP trained on ∼200 DFT frames is in reasonable agreement with our experiments and DFT.
Collapse
Affiliation(s)
- Zachary A H Goodwin
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, Massachusetts 02138, United States
| | - Malia B Wenny
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts 02138, United States
| | - Julia H Yang
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, Massachusetts 02138, United States
- Harvard University Center for the Environment, 26 Oxford St., Cambridge, Massachusetts 02138, United States
| | - Andrea Cepellotti
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, Massachusetts 02138, United States
| | - Jingxuan Ding
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, Massachusetts 02138, United States
| | - Kyle Bystrom
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, Massachusetts 02138, United States
| | - Blake R Duschatko
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, Massachusetts 02138, United States
| | - Anders Johansson
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, Massachusetts 02138, United States
| | - Lixin Sun
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, Massachusetts 02138, United States
| | - Simon Batzner
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, Massachusetts 02138, United States
| | - Albert Musaelian
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, Massachusetts 02138, United States
| | - Jarad A Mason
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts 02138, United States
| | - Boris Kozinsky
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, Massachusetts 02138, United States
- Research and Technology Center, Robert Bosch LLC, Cambridge, Massachusetts 02142, United States
| | - Nicola Molinari
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, Massachusetts 02138, United States
- Research and Technology Center, Robert Bosch LLC, Cambridge, Massachusetts 02142, United States
| |
Collapse
|
27
|
Bigi F, Pozdnyakov SN, Ceriotti M. Wigner kernels: Body-ordered equivariant machine learning without a basis. J Chem Phys 2024; 161:044116. [PMID: 39056390 DOI: 10.1063/5.0208746] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2024] [Accepted: 06/10/2024] [Indexed: 07/28/2024] Open
Abstract
Machine-learning models based on a point-cloud representation of a physical object are ubiquitous in scientific applications and particularly well-suited to the atomic-scale description of molecules and materials. Among the many different approaches that have been pursued, the description of local atomic environments in terms of their discretized neighbor densities has been used widely and very successfully. We propose a novel density-based method, which involves computing "Wigner kernels." These are fully equivariant and body-ordered kernels that can be computed iteratively at a cost that is independent of the basis used to discretize the density and grows only linearly with the maximum body-order considered. Wigner kernels represent the infinite-width limit of feature-space models, whose dimensionality and computational cost instead scale exponentially with the increasing order of correlations. We present several examples of the accuracy of models based on Wigner kernels in chemical applications, for both scalar and tensorial targets, reaching an accuracy that is competitive with state-of-the-art deep-learning architectures. We discuss the broader relevance of these findings to equivariant geometric machine-learning.
Collapse
Affiliation(s)
- Filippo Bigi
- Laboratory of Computational Science and Modeling, Institut des Matériaux, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Sergey N Pozdnyakov
- Laboratory of Computational Science and Modeling, Institut des Matériaux, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Michele Ceriotti
- Laboratory of Computational Science and Modeling, Institut des Matériaux, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| |
Collapse
|
28
|
Plé T, Adjoua O, Lagardère L, Piquemal JP. FeNNol: An efficient and flexible library for building force-field-enhanced neural network potentials. J Chem Phys 2024; 161:042502. [PMID: 39051830 DOI: 10.1063/5.0217688] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2024] [Accepted: 06/28/2024] [Indexed: 07/27/2024] Open
Abstract
Neural network interatomic potentials (NNPs) have recently proven to be powerful tools to accurately model complex molecular systems while bypassing the high numerical cost of ab initio molecular dynamics simulations. In recent years, numerous advances in model architectures as well as the development of hybrid models combining machine-learning (ML) with more traditional, physically motivated, force-field interactions have considerably increased the design space of ML potentials. In this paper, we present FeNNol, a new library for building, training, and running force-field-enhanced neural network potentials. It provides a flexible and modular system for building hybrid models, allowing us to easily combine state-of-the-art embeddings with ML-parameterized physical interaction terms without the need for explicit programming. Furthermore, FeNNol leverages the automatic differentiation and just-in-time compilation features of the Jax Python library to enable fast evaluation of NNPs, shrinking the performance gap between ML potentials and standard force-fields. This is demonstrated with the popular ANI-2x model reaching simulation speeds nearly on par with the AMOEBA polarizable force-field on commodity GPUs (graphics processing units). We hope that FeNNol will facilitate the development and application of new hybrid NNP architectures for a wide range of molecular simulation problems.
Collapse
Affiliation(s)
- Thomas Plé
- Sorbonne Université, LCT, UMR 7616 CNRS, 75005 Paris, France
| | - Olivier Adjoua
- Sorbonne Université, LCT, UMR 7616 CNRS, 75005 Paris, France
| | - Louis Lagardère
- Sorbonne Université, LCT, UMR 7616 CNRS, 75005 Paris, France
| | | |
Collapse
|
29
|
Yang ZX, Xie XT, Kang PL, Wang ZX, Shang C, Liu ZP. Many-Body Function Corrected Neural Network with Atomic Attention (MBNN-att) for Molecular Property Prediction. J Chem Theory Comput 2024. [PMID: 39034686 DOI: 10.1021/acs.jctc.4c00660] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/23/2024]
Abstract
Recent years have seen a surge of machine learning (ML) in chemistry for predicting chemical properties, but a low-cost, general-purpose, and high-performance model, desirable to be accessible on central processing unit (CPU) devices, remains not available. For this purpose, here we introduce an atomic attention mechanism into many-body function corrected neural network (MBNN), namely, MBNN-att ML model, to predict both the extensive and intensive properties of molecules and materials. The MBNN-att uses explicit function descriptors as the inputs for the atom-based feed-forward neural network (NN). The output of the NN is designed to be a vector to implement the multihead self-attention mechanism. This vector is split into two parts: the atomic attention weight part and the many-body-function part. The final property is obtained by summing the products of each atomic attention weight and the corresponding many-body function. We show that MBNN-att performs well on all QM9 properties, i.e., errors on all properties, below chemical accuracy, and, in particular, achieves the top performance for the energy-related extensive properties. By systematically comparing with other explicit-function-type descriptor ML models and the graph representation ML models, we demonstrate that the many-body-function framework and atomic attention mechanism are key ingredients for the high performance and the good transferability of MBNN-att in molecular property prediction.
Collapse
Affiliation(s)
- Zheng-Xin Yang
- Collaborative Innovation Center of Chemistry for Energy Material, Shanghai Key Laboratory of Molecular Catalysis and Innovative Materials, Key Laboratory of Computational Physical Science, Department of Chemistry, Fudan University, Shanghai 200433, China
| | - Xin-Tian Xie
- Collaborative Innovation Center of Chemistry for Energy Material, Shanghai Key Laboratory of Molecular Catalysis and Innovative Materials, Key Laboratory of Computational Physical Science, Department of Chemistry, Fudan University, Shanghai 200433, China
| | - Pei-Lin Kang
- Collaborative Innovation Center of Chemistry for Energy Material, Shanghai Key Laboratory of Molecular Catalysis and Innovative Materials, Key Laboratory of Computational Physical Science, Department of Chemistry, Fudan University, Shanghai 200433, China
| | - Zhen-Xiong Wang
- Collaborative Innovation Center of Chemistry for Energy Material, Shanghai Key Laboratory of Molecular Catalysis and Innovative Materials, Key Laboratory of Computational Physical Science, Department of Chemistry, Fudan University, Shanghai 200433, China
| | - Cheng Shang
- Collaborative Innovation Center of Chemistry for Energy Material, Shanghai Key Laboratory of Molecular Catalysis and Innovative Materials, Key Laboratory of Computational Physical Science, Department of Chemistry, Fudan University, Shanghai 200433, China
- Shanghai Qi Zhi Institution, Shanghai 200030, China
| | - Zhi-Pan Liu
- Collaborative Innovation Center of Chemistry for Energy Material, Shanghai Key Laboratory of Molecular Catalysis and Innovative Materials, Key Laboratory of Computational Physical Science, Department of Chemistry, Fudan University, Shanghai 200433, China
- Key Laboratory of Synthetic and Self-Assembly Chemistry for Organic Functional Molecules, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, Shanghai 200032, China
- Shanghai Qi Zhi Institution, Shanghai 200030, China
| |
Collapse
|
30
|
Shi P, Xu Z. Exploring fracture of H-BN and graphene by neural network force fields. JOURNAL OF PHYSICS. CONDENSED MATTER : AN INSTITUTE OF PHYSICS JOURNAL 2024; 36:415401. [PMID: 38925133 DOI: 10.1088/1361-648x/ad5c31] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/29/2023] [Accepted: 06/26/2024] [Indexed: 06/28/2024]
Abstract
Extreme mechanical processes such as strong lattice distortion and bond breakage during fracture often lead to catastrophic failure of materials and structures. Understanding the nucleation and growth of cracks is challenged by their multiscale characteristics spanning from atomic-level structures at the crack tip to the structural features where the load is applied. Atomistic simulations offer 'first-principles' tools to resolve the progressive microstructural changes at crack fronts and are widely used to explore the underlying processes of mechanical energy dissipation, crack path selection, and dynamic instabilities (e.g. kinking, branching). Empirical force fields developed based on atomic-level structural descriptors based on atomic positions and the bond orders do not yield satisfying predictions of fracture, especially for the nonlinear, anisotropic stress-strain relations and the energy densities of edges. High-fidelity force fields thus should include the tensorial nature of strain and the energetics of bond-breaking and (re)formation events during fracture, which, unfortunately, have not been taken into account in either the state-of-the-art empirical or machine-learning force fields. Based on data generated by density functional theory calculations, we report a neural network-based force field for fracture (NN-F3) constructed by using the end-to-end symmetry preserving framework of deep potential-smooth edition (DeepPot-SE). The workflow combines pre-sampling of the space of strain states and active-learning techniques to explore the transition states at critical bonding distances. The capability of NN-F3is demonstrated by studying the rupture of hexagonal boron nitride (h-BN) and twisted bilayer graphene as model problems. The simulation results elucidate the roughening physics of fracture defined by the lattice asymmetry in h-BN, explaining recent experimental findings, and predict the interaction between cross-layer cracks in twisted graphene bilayers, which leads to a toughening effect.
Collapse
Affiliation(s)
- Pengjie Shi
- Applied Mechanics Laboratory and Department of Engineering Mechanics, Tsinghua University, Beijing 100084, People's Republic of China
| | - Zhiping Xu
- Applied Mechanics Laboratory and Department of Engineering Mechanics, Tsinghua University, Beijing 100084, People's Republic of China
| |
Collapse
|
31
|
Cheng Z, Bi H, Liu S, Chen J, Misquitta AJ, Yu K. Developing a Differentiable Long-Range Force Field for Proteins with E(3) Neural Network-Predicted Asymptotic Parameters. J Chem Theory Comput 2024; 20:5598-5608. [PMID: 38888427 DOI: 10.1021/acs.jctc.4c00337] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/20/2024]
Abstract
Accurately describing long-range interactions is a significant challenge in molecular dynamics (MD) simulations of proteins. High-quality long-range potential is also an important component of the range-separated machine learning force field. This study introduces a comprehensive asymptotic parameter database encompassing atomic multipole moments, polarizabilities, and dispersion coefficients. Leveraging active learning, our database comprehensively represents protein fragments with up to 8 heavy atoms, capturing their conformational diversity with merely 78,000 data points. Additionally, the E(3) neural network (E3NN) is employed to predict the asymptotic parameters directly from the local geometry. The E3NN models demonstrate exceptional accuracy and transferability across all asymptotic parameters, achieving an R2 of 0.999 for both protein fragments and 20 amino acid dipeptide test sets. The long-range electrostatic and dispersion energies can be obtained using the E3NN-predicted parameters, with an error of 0.07 and 0.02 kcal/mol, respectively, when compared to symmetry-adapted perturbation theory (SAPT). Therefore, our force fields demonstrate the capability to accurately describe long-range interactions in proteins, paving the way for next-generation protein force fields.
Collapse
Affiliation(s)
- Zheng Cheng
- School of Mathematical Sciences, Peking University, Beijing 100871, China
- AI for Science Institute, Beijing 100084, P. R. China
| | - Hangrui Bi
- School of Mathematical Sciences, Peking University, Beijing 100871, China
- DP Technology, Beijing 100080, P. R. China
| | - Siyuan Liu
- DP Technology, Beijing 100080, P. R. China
| | - Junmin Chen
- Tsinghua-Berkeley Shenzhen Institute, Shenzhen 518055, Guangdong, P. R. China
- Tsinghua Shenzhen International Graduate School, Shenzhen 518055, Guangdong, P. R. China
| | - Alston J Misquitta
- School of Physics and Astronomy, Queen Mary, University of London, London E1 4NS, U.K
| | - Kuang Yu
- Tsinghua-Berkeley Shenzhen Institute, Shenzhen 518055, Guangdong, P. R. China
- Tsinghua Shenzhen International Graduate School, Shenzhen 518055, Guangdong, P. R. China
| |
Collapse
|
32
|
Medrano Sandonas L, Van Rompaey D, Fallani A, Hilfiker M, Hahn D, Perez-Benito L, Verhoeven J, Tresadern G, Kurt Wegner J, Ceulemans H, Tkatchenko A. Dataset for quantum-mechanical exploration of conformers and solvent effects in large drug-like molecules. Sci Data 2024; 11:742. [PMID: 38972891 PMCID: PMC11228031 DOI: 10.1038/s41597-024-03521-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2024] [Accepted: 06/13/2024] [Indexed: 07/09/2024] Open
Abstract
We here introduce the Aquamarine (AQM) dataset, an extensive quantum-mechanical (QM) dataset that contains the structural and electronic information of 59,783 low-and high-energy conformers of 1,653 molecules with a total number of atoms ranging from 2 to 92 (mean: 50.9), and containing up to 54 (mean: 28.2) non-hydrogen atoms. To gain insights into the solvent effects as well as collective dispersion interactions for drug-like molecules, we have performed QM calculations supplemented with a treatment of many-body dispersion (MBD) interactions of structures and properties in the gas phase and implicit water. Thus, AQM contains over 40 global and local physicochemical properties (including ground-state and response properties) per conformer computed at the tightly converged PBE0+MBD level of theory for gas-phase molecules, whereas PBE0+MBD with the modified Poisson-Boltzmann (MPB) model of water was used for solvated molecules. By addressing both molecule-solvent and dispersion interactions, AQM dataset can serve as a challenging benchmark for state-of-the-art machine learning methods for property modeling and de novo generation of large (solvated) molecules with pharmaceutical and biological relevance.
Collapse
Affiliation(s)
- Leonardo Medrano Sandonas
- Department of Physics and Materials Science, University of Luxembourg, L-1511, Luxembourg City, Luxembourg.
- Institute for Materials Science and Max Bergmann Center of Biomaterials, TU Dresden, 01062, Dresden, Germany.
| | - Dries Van Rompaey
- Drug Discovery Data Sciences (D3S), Janssen Pharmaceutica NV, Turnhoutseweg 30, 2340, Beerse, Belgium.
| | - Alessio Fallani
- Department of Physics and Materials Science, University of Luxembourg, L-1511, Luxembourg City, Luxembourg
- Drug Discovery Data Sciences (D3S), Janssen Pharmaceutica NV, Turnhoutseweg 30, 2340, Beerse, Belgium
| | - Mathias Hilfiker
- Department of Physics and Materials Science, University of Luxembourg, L-1511, Luxembourg City, Luxembourg
| | - David Hahn
- Computational Chemistry, Janssen Pharmaceutica NV, Turnhoutseweg 30, 2340, Beerse, Belgium
| | - Laura Perez-Benito
- Computational Chemistry, Janssen Pharmaceutica NV, Turnhoutseweg 30, 2340, Beerse, Belgium
| | - Jonas Verhoeven
- Drug Discovery Data Sciences (D3S), Janssen Pharmaceutica NV, Turnhoutseweg 30, 2340, Beerse, Belgium
| | - Gary Tresadern
- Computational Chemistry, Janssen Pharmaceutica NV, Turnhoutseweg 30, 2340, Beerse, Belgium
| | - Joerg Kurt Wegner
- Drug Discovery Data Sciences (D3S), Janssen Pharmaceutica NV, Turnhoutseweg 30, 2340, Beerse, Belgium
- Drug Discovery Data Sciences (D3S), Johnson & Johnson Innovative Medicine, 301 Binney Street, MA 02142, Cambridge, USA
| | - Hugo Ceulemans
- Drug Discovery Data Sciences (D3S), Janssen Pharmaceutica NV, Turnhoutseweg 30, 2340, Beerse, Belgium
| | - Alexandre Tkatchenko
- Department of Physics and Materials Science, University of Luxembourg, L-1511, Luxembourg City, Luxembourg.
| |
Collapse
|
33
|
Zhang X, Li C, Ye HZ, Berkelbach TC, Chan GKL. Performant automatic differentiation of local coupled cluster theories: Response properties and ab initio molecular dynamics. J Chem Phys 2024; 161:014109. [PMID: 38949583 DOI: 10.1063/5.0212274] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2024] [Accepted: 06/12/2024] [Indexed: 07/02/2024] Open
Abstract
In this work, we introduce a differentiable implementation of the local natural orbital coupled cluster (LNO-CC) method within the automatic differentiation framework of the PySCFAD package. The implementation is comprehensively tuned for enhanced performance, which enables the calculation of first-order static response properties on medium-sized molecular systems using coupled cluster theory with single, double, and perturbative triple excitations [CCSD(T)]. We evaluate the accuracy of our method by benchmarking it against the canonical CCSD(T) reference for nuclear gradients, dipole moments, and geometry optimizations. In addition, we demonstrate the possibility of property calculations for chemically interesting systems through the computation of bond orders and Mössbauer spectroscopy parameters for a [NiFe]-hydrogenase active site model, along with the simulation of infrared spectra via ab initio LNO-CC molecular dynamics for a protonated water hexamer.
Collapse
Affiliation(s)
- Xing Zhang
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, USA
| | - Chenghan Li
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, USA
| | - Hong-Zhou Ye
- Department of Chemistry, Columbia University, New York, New York 10027, USA
| | | | - Garnet Kin-Lic Chan
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, USA
| |
Collapse
|
34
|
Aldossary A, Campos-Gonzalez-Angulo JA, Pablo-García S, Leong SX, Rajaonson EM, Thiede L, Tom G, Wang A, Avagliano D, Aspuru-Guzik A. In Silico Chemical Experiments in the Age of AI: From Quantum Chemistry to Machine Learning and Back. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2024; 36:e2402369. [PMID: 38794859 DOI: 10.1002/adma.202402369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/15/2024] [Revised: 04/28/2024] [Indexed: 05/26/2024]
Abstract
Computational chemistry is an indispensable tool for understanding molecules and predicting chemical properties. However, traditional computational methods face significant challenges due to the difficulty of solving the Schrödinger equations and the increasing computational cost with the size of the molecular system. In response, there has been a surge of interest in leveraging artificial intelligence (AI) and machine learning (ML) techniques to in silico experiments. Integrating AI and ML into computational chemistry increases the scalability and speed of the exploration of chemical space. However, challenges remain, particularly regarding the reproducibility and transferability of ML models. This review highlights the evolution of ML in learning from, complementing, or replacing traditional computational chemistry for energy and property predictions. Starting from models trained entirely on numerical data, a journey set forth toward the ideal model incorporating or learning the physical laws of quantum mechanics. This paper also reviews existing computational methods and ML models and their intertwining, outlines a roadmap for future research, and identifies areas for improvement and innovation. Ultimately, the goal is to develop AI architectures capable of predicting accurate and transferable solutions to the Schrödinger equation, thereby revolutionizing in silico experiments within chemistry and materials science.
Collapse
Affiliation(s)
- Abdulrahman Aldossary
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
| | | | - Sergio Pablo-García
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- Department of Computer Science, University of Toronto, 40 St. George Street, Toronto, ON, M5S 2E4, Canada
| | - Shi Xuan Leong
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
| | - Ella Miray Rajaonson
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- Vector Institute for Artificial Intelligence, 661 University Ave. Suite 710, Toronto, ON, M5G 1M1, Canada
| | - Luca Thiede
- Department of Computer Science, University of Toronto, 40 St. George Street, Toronto, ON, M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, 661 University Ave. Suite 710, Toronto, ON, M5G 1M1, Canada
| | - Gary Tom
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- Vector Institute for Artificial Intelligence, 661 University Ave. Suite 710, Toronto, ON, M5G 1M1, Canada
| | - Andrew Wang
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
| | - Davide Avagliano
- Chimie ParisTech, PSL University, CNRS, Institute of Chemistry for Life and Health Sciences (iCLeHS UMR 8060), Paris, F-75005, France
| | - Alán Aspuru-Guzik
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- Department of Computer Science, University of Toronto, 40 St. George Street, Toronto, ON, M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, 661 University Ave. Suite 710, Toronto, ON, M5G 1M1, Canada
- Department of Materials Science & Engineering, University of Toronto, 184 College St., Toronto, ON, M5S 3E4, Canada
- Department of Chemical Engineering & Applied Chemistry, University of Toronto, 200 College St., Toronto, ON, M5S 3E5, Canada
- Lebovic Fellow, Canadian Institute for Advanced Research (CIFAR), 66118 University Ave., Toronto, M5G 1M1, Canada
- Acceleration Consortium, 80 St George St, Toronto, M5S 3H6, Canada
| |
Collapse
|
35
|
Sivaraman G, Benmore CJ. Deciphering diffuse scattering with machine learning and the equivariant foundation model: the case of molten FeO. JOURNAL OF PHYSICS. CONDENSED MATTER : AN INSTITUTE OF PHYSICS JOURNAL 2024; 36:381501. [PMID: 38866028 DOI: 10.1088/1361-648x/ad577b] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/29/2024] [Accepted: 06/12/2024] [Indexed: 06/14/2024]
Abstract
Bridging the gap between diffuse x-ray or neutron scattering measurements and predicted structures derived from atom-atom pair potentials in disordered materials, has been a longstanding challenge in condensed matter physics. This perspective gives a brief overview of the traditional approaches employed over the past several decades. Namely, the use of approximate interatomic pair potentials that relate three-dimensional structural models to the measured structure factor and its' associated pair distribution function. The use of machine learned interatomic potentials has grown in the past few years, and has been particularly successful in the cases of ionic and oxide systems. Recent advances in large scale sampling, along with a direct integration of scattering measurements into the model development, has provided improved agreement between experiments and large-scale models calculated with quantum mechanical accuracy. However, details of local polyhedral bonding and connectivity in meta-stable disordered systems still require improvement. Here we leverage MACE-MP-0; a newly introduced equivariant foundation model and validate the results against high-quality experimental scattering data for the case of molten iron(II) oxide (FeO). These preliminary results suggest that the emerging foundation model has the potential to surpass the traditional limitations of classical interatomic potentials.
Collapse
Affiliation(s)
- Ganesh Sivaraman
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, IL 61801, United States of America
- C-STEEL Center for Steel Electrification by Electrosynthesis, Argonne National Laboratory, Argonne, IL 60438, United States of America
| | - Chris J Benmore
- C-STEEL Center for Steel Electrification by Electrosynthesis, Argonne National Laboratory, Argonne, IL 60438, United States of America
- X-Ray Science Division, Advanced Photon Source, Argonne National Laboratory, Argonne, IL 60438, United States of America
| |
Collapse
|
36
|
Tao Y, Giese TJ, Ekesan Ş, Zeng J, Aradi B, Hourahine B, Aktulga HM, Götz AW, Merz KM, York DM. Amber free energy tools: Interoperable software for free energy simulations using generalized quantum mechanical/molecular mechanical and machine learning potentials. J Chem Phys 2024; 160:224104. [PMID: 38856060 DOI: 10.1063/5.0211276] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2024] [Accepted: 05/15/2024] [Indexed: 06/11/2024] Open
Abstract
We report the development and testing of new integrated cyberinfrastructure for performing free energy simulations with generalized hybrid quantum mechanical/molecular mechanical (QM/MM) and machine learning potentials (MLPs) in Amber. The Sander molecular dynamics program has been extended to leverage fast, density-functional tight-binding models implemented in the DFTB+ and xTB packages, and an interface to the DeePMD-kit software enables the use of MLPs. The software is integrated through application program interfaces that circumvent the need to perform "system calls" and enable the incorporation of long-range Ewald electrostatics into the external software's self-consistent field procedure. The infrastructure provides access to QM/MM models that may serve as the foundation for QM/MM-ΔMLP potentials, which supplement the semiempirical QM/MM model with a MLP correction trained to reproduce ab initio QM/MM energies and forces. Efficient optimization of minimum free energy pathways is enabled through a new surface-accelerated finite-temperature string method implemented in the FE-ToolKit package. Furthermore, we interfaced Sander with the i-PI software by implementing the socket communication protocol used in the i-PI client-server model. The new interface with i-PI allows for the treatment of nuclear quantum effects with semiempirical QM/MM-ΔMLP models. The modular interoperable software is demonstrated on proton transfer reactions in guanine-thymine mispairs in a B-form deoxyribonucleic acid helix. The current work represents a considerable advance in the development of modular software for performing free energy simulations of chemical reactions that are important in a wide range of applications.
Collapse
Affiliation(s)
- Yujun Tao
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey 08854, USA
| | - Timothy J Giese
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey 08854, USA
| | - Şölen Ekesan
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey 08854, USA
| | - Jinzhe Zeng
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey 08854, USA
| | - Bálint Aradi
- Bremen Center for Computational Materials Science, University of Bremen, D-28334 Bremen, Germany
| | - Ben Hourahine
- SUPA, Department of Physics, University of Strathclyde, Glasgow G4 0NG, United Kingdom
| | - Hasan Metin Aktulga
- Department of Chemistry, Michigan State University, East Lansing, Michigan 48824, USA
| | - Andreas W Götz
- San Diego Supercomputer Center, University of California San Diego, La Jolla, California 92093, USA
| | - Kenneth M Merz
- Department of Chemistry, Michigan State University, East Lansing, Michigan 48824, USA
| | - Darrin M York
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey 08854, USA
| |
Collapse
|
37
|
Park Y, Kim J, Hwang S, Han S. Scalable Parallel Algorithm for Graph Neural Network Interatomic Potentials in Molecular Dynamics Simulations. J Chem Theory Comput 2024; 20:4857-4868. [PMID: 38813770 DOI: 10.1021/acs.jctc.4c00190] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/31/2024]
Abstract
Message-passing graph neural network interatomic potentials (GNN-IPs), particularly those with equivariant representations such as NequIP, are attracting significant attention due to their data efficiency and high accuracy. However, parallelizing GNN-IPs poses challenges because multiple message-passing layers complicate data communication within the spatial decomposition method, which is preferred by many molecular dynamics (MD) packages. In this article, we propose an efficient parallelization scheme compatible with GNN-IPs and develop a package, SevenNet (Scalable EquiVariance-Enabled Neural NETwork), based on the NequIP architecture. For MD simulations, SevenNet interfaces with the LAMMPS package. Through benchmark tests on a 32-GPU cluster with examples of SiO2, SevenNet achieves over 80% parallel efficiency in weak-scaling scenarios and exhibits nearly ideal strong-scaling performance as long as GPUs are fully utilized. However, the strong-scaling performance significantly declines with suboptimal GPU utilization, particularly affecting parallel efficiency in cases involving lightweight models or simulations with small numbers of atoms. We also pretrain SevenNet with a vast data set from the Materials Project (dubbed "SevenNet-0") and assess its performance on generating amorphous Si3N4 containing more than 100,000 atoms. By developing scalable GNN-IPs, this work aims to bridge the gap between advanced machine-learning models and large-scale MD simulations, offering researchers a powerful tool to explore complex material systems with high accuracy and efficiency.
Collapse
Affiliation(s)
- Yutack Park
- Department of Materials Science and Engineering and Research Institute of Advanced Materials, Seoul National University, Seoul 08826, Korea
| | - Jaesun Kim
- Department of Materials Science and Engineering and Research Institute of Advanced Materials, Seoul National University, Seoul 08826, Korea
| | - Seungwoo Hwang
- Department of Materials Science and Engineering and Research Institute of Advanced Materials, Seoul National University, Seoul 08826, Korea
| | - Seungwu Han
- Department of Materials Science and Engineering and Research Institute of Advanced Materials, Seoul National University, Seoul 08826, Korea
- Korea Institute for Advanced Study, Seoul 02455, Korea
| |
Collapse
|
38
|
Shu Y, Truhlar DG. Generalized Semiclassical Ehrenfest Method: A Route to Wave Function-Free Photochemistry and Nonadiabatic Dynamics with Only Potential Energies and Gradients. J Chem Theory Comput 2024; 20:4396-4426. [PMID: 38819014 DOI: 10.1021/acs.jctc.4c00424] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/01/2024]
Abstract
We reconsider recent methods by which direct dynamics calculations of electronically nonadiabatic processes can be carried out while requiring only adiabatic potential energies and their gradients. We show that these methods can be understood in terms of a new generalization of the well-known semiclassical Ehrenfest method. This is convenient because it eliminates the need to evaluate electronic wave functions and their matrix elements along the mixed quantum-classical trajectories. The new approximations and procedures enabling this advance are the curvature-driven approximation to the time-derivative coupling, the generalized semiclassical Ehrenfest method, and a new gradient correction scheme called the time-derivative matrix (TDM) scheme. When spin-orbit coupling is present, one can carry out dynamics calculations in the fully adiabatic basis using potential energies and gradients calculated without spin-orbit coupling plus the spin-orbit coupling matrix elements. Even when spin-orbit coupling is neglected, the method is useful because it allows calculations by electronic structure methods for which nonadiabatic coupling vectors are unavailable. In order to place the new considerations in context, the article starts out with a review of background material on trajectory surface hopping, the semiclassical Ehrenfest scheme, and methods for incorporating decoherence. We consider both internal conversion and intersystem crossing. We also review several examples from our group of successful applications of the curvature-driven approximation.
Collapse
Affiliation(s)
- Yinan Shu
- Department of Chemistry, Chemical Theory Center, and Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota 55455-0431, United States
| | - Donald G Truhlar
- Department of Chemistry, Chemical Theory Center, and Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota 55455-0431, United States
| |
Collapse
|
39
|
Tiwary P. Modeling prebiotic chemistries with quantum accuracy at classical costs. Proc Natl Acad Sci U S A 2024; 121:e2408742121. [PMID: 38809708 PMCID: PMC11161769 DOI: 10.1073/pnas.2408742121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/31/2024] Open
Affiliation(s)
- Pratyush Tiwary
- Institute for Physical Science and Technology, University of Maryland, College Park, MD20742
- Department of Chemistry and Biochemistry, University of Maryland, College Park, MD20742
- University of Maryland Institute for Health Computing, Bethesda, MD20852
| |
Collapse
|
40
|
Pelaez RP, Simeon G, Galvelis R, Mirarchi A, Eastman P, Doerr S, Thölke P, Markland TE, De Fabritiis G. TorchMD-Net 2.0: Fast Neural Network Potentials for Molecular Simulations. J Chem Theory Comput 2024; 20:4076-4087. [PMID: 38743033 DOI: 10.1021/acs.jctc.4c00253] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/16/2024]
Abstract
Achieving a balance between computational speed, prediction accuracy, and universal applicability in molecular simulations has been a persistent challenge. This paper presents substantial advancements in TorchMD-Net software, a pivotal step forward in the shift from conventional force fields to neural network-based potentials. The evolution of TorchMD-Net into a more comprehensive and versatile framework is highlighted, incorporating cutting-edge architectures such as TensorNet. This transformation is achieved through a modular design approach, encouraging customized applications within the scientific community. The most notable enhancement is a significant improvement in computational efficiency, achieving a very remarkable acceleration in the computation of energy and forces for TensorNet models, with performance gains ranging from 2× to 10× over previous, nonoptimized, iterations. Other enhancements include highly optimized neighbor search algorithms that support periodic boundary conditions and smooth integration with existing molecular dynamics frameworks. Additionally, the updated version introduces the capability to integrate physical priors, further enriching its application spectrum and utility in research. The software is available at https://github.com/torchmd/torchmd-net.
Collapse
Affiliation(s)
- Raul P Pelaez
- Computational Science Laboratory, Universitat Pompeu Fabra, Barcelona Biomedical Research Park (PRBB), C Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Guillem Simeon
- Computational Science Laboratory, Universitat Pompeu Fabra, Barcelona Biomedical Research Park (PRBB), C Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Raimondas Galvelis
- Computational Science Laboratory, Universitat Pompeu Fabra, Barcelona Biomedical Research Park (PRBB), C Dr. Aiguader 88, 08003 Barcelona, Spain
- Acellera Labs, C Dr Trueta 183, 08005 Barcelona, Spain
| | - Antonio Mirarchi
- Computational Science Laboratory, Universitat Pompeu Fabra, Barcelona Biomedical Research Park (PRBB), C Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Peter Eastman
- Department of Chemistry, Stanford University, Stanford, California 94305, United States
| | - Stefan Doerr
- Acellera Labs, C Dr Trueta 183, 08005 Barcelona, Spain
| | | | - Thomas E Markland
- Department of Chemistry, Stanford University, Stanford, California 94305, United States
| | - Gianni De Fabritiis
- Computational Science Laboratory, Universitat Pompeu Fabra, Barcelona Biomedical Research Park (PRBB), C Dr. Aiguader 88, 08003 Barcelona, Spain
- Acellera Labs, C Dr Trueta 183, 08005 Barcelona, Spain
- Institució Catalana de Recerca i Estudis Avançats (ICREA), Passeig Lluis Companys 23, 08010 Barcelona, Spain
| |
Collapse
|
41
|
Žugec I, Tetenoire A, Muzas AS, Zhang Y, Jiang B, Alducin M, Juaristi JI. Understanding the Photoinduced Desorption and Oxidation of CO on Ru(0001) Using a Neural Network Potential Energy Surface. JACS AU 2024; 4:1997-2004. [PMID: 38818055 PMCID: PMC11134377 DOI: 10.1021/jacsau.4c00197] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/04/2024] [Revised: 04/25/2024] [Accepted: 04/29/2024] [Indexed: 06/01/2024]
Abstract
The study of ultrafast photoinduced dynamics of adsorbates on metal surfaces requires thorough investigation of laser-excited electrons and, in many cases, the highly excited surface lattice. While ab initio molecular dynamics with electronic friction and thermostats (Te, Tl)-AIMDEF addresses such complex modeling, it imposes severe computational costs, hindering quantitative comparison with experimental desorption probabilities. In order to bypass this limitation, we utilize the embedded atom neural network method to construct a potential energy surface (PES) for the coadsorption of CO and O on Ru(0001). Our results demonstrate that this PES not only reproduces the short-time ab initio dynamics but is also able to yield statistically significant data for long lasting trajectories that correlate well with experimental findings. Furthermore, the analysis of the laser-induced dynamics reveals the existence of a dynamic trapping state that acts as a precursor for CO desorption, and it is not observed under thermal conditions. Altogether, our results validate the underlying theoretical framework, providing robust support for the description of not only the photoinduced desorption but also the oxidation of CO in terms of nonequilibrated but thermal hot electrons and phonons.
Collapse
Affiliation(s)
- Ivan Žugec
- Centro
de Física de Materiales CFM/MPC (CSIC-UPV/EHU), Paseo Manuel de Lardizabal 5, 20018 Donostia-San Sebastián, Spain
| | - Auguste Tetenoire
- Donostia
International Physics Center (DIPC), Paseo Manuel de Lardizabal 4, 20018 Donostia-San Sebastián, Spain
| | - Alberto S. Muzas
- Centro
de Física de Materiales CFM/MPC (CSIC-UPV/EHU), Paseo Manuel de Lardizabal 5, 20018 Donostia-San Sebastián, Spain
- Departamento
de Polímeros y Materiales Avanzados: Física, Química
y Tecnología, Facultad de Química, UPV/EHU, Apartado 1072, 20018 Donostia-San Sebastián, Spain
| | - Yaolong Zhang
- Key
Laboratory of Precision and Intelligent Chemistry Department of Chemical
Physics, University of Science and Technology
of China, Hefei, Anhui 230026, China
| | - Bin Jiang
- Key
Laboratory of Precision and Intelligent Chemistry Department of Chemical
Physics, University of Science and Technology
of China, Hefei, Anhui 230026, China
| | - Maite Alducin
- Centro
de Física de Materiales CFM/MPC (CSIC-UPV/EHU), Paseo Manuel de Lardizabal 5, 20018 Donostia-San Sebastián, Spain
- Donostia
International Physics Center (DIPC), Paseo Manuel de Lardizabal 4, 20018 Donostia-San Sebastián, Spain
| | - J. Iñaki Juaristi
- Centro
de Física de Materiales CFM/MPC (CSIC-UPV/EHU), Paseo Manuel de Lardizabal 5, 20018 Donostia-San Sebastián, Spain
- Donostia
International Physics Center (DIPC), Paseo Manuel de Lardizabal 4, 20018 Donostia-San Sebastián, Spain
- Departamento
de Polímeros y Materiales Avanzados: Física, Química
y Tecnología, Facultad de Química, UPV/EHU, Apartado 1072, 20018 Donostia-San Sebastián, Spain
| |
Collapse
|
42
|
Morehead A, Cheng J. Geometry-Complete Diffusion for 3D Molecule Generation and Optimization. ARXIV 2024:arXiv:2302.04313v6. [PMID: 36798459 PMCID: PMC9934735] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/18/2023]
Abstract
Motivation Generative deep learning methods have recently been proposed for generating 3D molecules using equivariant graph neural networks (GNNs) within a denoising diffusion framework. However, such methods are unable to learn important geometric properties of 3D molecules, as they adopt molecule-agnostic and non-geometric GNNs as their 3D graph denoising networks, which notably hinders their ability to generate valid large 3D molecules. Results In this work, we address these gaps by introducing the Geometry-Complete Diffusion Model (GCDM) for 3D molecule generation, which outperforms existing 3D molecular diffusion models by significant margins across conditional and unconditional settings for the QM9 dataset and the larger GEOM-Drugs dataset, respectively. Importantly, we demonstrate that GCDM's generative denoising process enables the model to generate a significant proportion of valid and energetically-stable large molecules at the scale of GEOM-Drugs, whereas previous methods fail to do so with the features they learn. Additionally, we show that extensions of GCDM can not only effectively design 3D molecules for specific protein pockets but can be repurposed to consistently optimize the geometry and chemical composition of existing 3D molecules for molecular stability and property specificity, demonstrating new versatility of molecular diffusion models. Availability Code and data are freely available on GitHub.
Collapse
Affiliation(s)
- Alex Morehead
- Electrical Engineering & Computer Science, NextGen Precision Health, University of Missouri, Columbia, 65211, Missouri, USA
| | - Jianlin Cheng
- Electrical Engineering & Computer Science, NextGen Precision Health, University of Missouri, Columbia, 65211, Missouri, USA
| |
Collapse
|
43
|
Pelaez RP, Simeon G, Galvelis R, Mirarchi A, Eastman P, Doerr S, Thölke P, Markland TE, De Fabritiis G. TorchMD-Net 2.0: Fast Neural Network Potentials for Molecular Simulations. ARXIV 2024:arXiv:2402.17660v3. [PMID: 38463504 PMCID: PMC10925388] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 03/12/2024]
Abstract
Achieving a balance between computational speed, prediction accuracy, and universal applicability in molecular simulations has been a persistent challenge. This paper presents substantial advancements in the TorchMD-Net software, a pivotal step forward in the shift from conventional force fields to neural network-based potentials. The evolution of TorchMD-Net into a more comprehensive and versatile framework is highlighted, incorporating cutting-edge architectures such as TensorNet. This transformation is achieved through a modular design approach, encouraging customized applications within the scientific community. The most notable enhancement is a significant improvement in computational efficiency, achieving a very remarkable acceleration in the computation of energy and forces for Tensor-Net models, with performance gains ranging from 2x to 10x over previous, non-optimized, iterations. Other enhancements include highly optimized neighbor search algorithms that support periodic boundary conditions and smooth integration with existing molecular dynamics frameworks. Additionally, the updated version introduces the capability to integrate physical priors, further enriching its application spectrum and utility in research. The software is available at https://github.com/torchmd/torchmd-net.
Collapse
Affiliation(s)
- Raul P Pelaez
- Computational Science Laboratory, Universitat Pompeu Fabra, Barcelona Biomedical Research Park (PRBB), C Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Guillem Simeon
- Computational Science Laboratory, Universitat Pompeu Fabra, Barcelona Biomedical Research Park (PRBB), C Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Raimondas Galvelis
- Computational Science Laboratory, Universitat Pompeu Fabra, Barcelona Biomedical Research Park (PRBB), C Dr. Aiguader 88, 08003 Barcelona, Spain
- Acellera Labs, C Dr Trueta 183, 08005, Barcelona, Spain
| | - Antonio Mirarchi
- Computational Science Laboratory, Universitat Pompeu Fabra, Barcelona Biomedical Research Park (PRBB), C Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Peter Eastman
- Department of Chemistry, Stanford University, Stanford, CA 94305, USA
| | - Stefan Doerr
- Acellera Labs, C Dr Trueta 183, 08005, Barcelona, Spain
| | | | - Thomas E Markland
- Department of Chemistry, Stanford University, Stanford, CA 94305, USA
| | - Gianni De Fabritiis
- Computational Science Laboratory, Universitat Pompeu Fabra, Barcelona Biomedical Research Park (PRBB), C Dr. Aiguader 88, 08003 Barcelona, Spain
- Acellera Labs, C Dr Trueta 183, 08005, Barcelona, Spain
- Institució Catalana de Recerca i Estudis Avançats (ICREA), Passeig Lluis Companys 23, 08010 Barcelona, Spain
| |
Collapse
|
44
|
Duignan TT. The Potential of Neural Network Potentials. ACS PHYSICAL CHEMISTRY AU 2024; 4:232-241. [PMID: 38800721 PMCID: PMC11117678 DOI: 10.1021/acsphyschemau.4c00004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/18/2024] [Revised: 03/04/2024] [Accepted: 03/05/2024] [Indexed: 05/29/2024]
Abstract
In the next half-century, physical chemistry will likely undergo a profound transformation, driven predominantly by the combination of recent advances in quantum chemistry and machine learning (ML). Specifically, equivariant neural network potentials (NNPs) are a breakthrough new tool that are already enabling us to simulate systems at the molecular scale with unprecedented accuracy and speed, relying on nothing but fundamental physical laws. The continued development of this approach will realize Paul Dirac's 80-year-old vision of using quantum mechanics to unify physics with chemistry and providing invaluable tools for understanding materials science, biology, earth sciences, and beyond. The era of highly accurate and efficient first-principles molecular simulations will provide a wealth of training data that can be used to build automated computational methodologies, using tools such as diffusion models, for the design and optimization of systems at the molecular scale. Large language models (LLMs) will also evolve into increasingly indispensable tools for literature review, coding, idea generation, and scientific writing.
Collapse
|
45
|
Wang G, Wang C, Zhang X, Li Z, Zhou J, Sun Z. Machine learning interatomic potential: Bridge the gap between small-scale models and realistic device-scale simulations. iScience 2024; 27:109673. [PMID: 38646181 PMCID: PMC11033164 DOI: 10.1016/j.isci.2024.109673] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/23/2024] Open
Abstract
Machine learning interatomic potential (MLIP) overcomes the challenges of high computational costs in density-functional theory and the relatively low accuracy in classical large-scale molecular dynamics, facilitating more efficient and precise simulations in materials research and design. In this review, the current state of the four essential stages of MLIP is discussed, including data generation methods, material structure descriptors, six unique machine learning algorithms, and available software. Furthermore, the applications of MLIP in various fields are investigated, notably in phase-change memory materials, structure searching, material properties predicting, and the pre-trained universal models. Eventually, the future perspectives, consisting of standard datasets, transferability, generalization, and trade-off between accuracy and complexity in MLIPs, are reported.
Collapse
Affiliation(s)
- Guanjie Wang
- School of Materials Science and Engineering, Beihang University, Beijing 100191, China
- School of Integrated Circuit Science and Engineering, Beihang University, Beijing 100191, China
| | - Changrui Wang
- School of Materials Science and Engineering, Beihang University, Beijing 100191, China
| | - Xuanguang Zhang
- School of Materials Science and Engineering, Beihang University, Beijing 100191, China
| | - Zefeng Li
- School of Materials Science and Engineering, Beihang University, Beijing 100191, China
| | - Jian Zhou
- School of Materials Science and Engineering, Beihang University, Beijing 100191, China
| | - Zhimei Sun
- School of Materials Science and Engineering, Beihang University, Beijing 100191, China
| |
Collapse
|
46
|
Linker TM, Krishnamoorthy A, Daemen LL, Ramirez-Cuesta AJ, Nomura K, Nakano A, Cheng YQ, Hicks WR, Kolesnikov AI, Vashishta PD. Neutron scattering and neural-network quantum molecular dynamics investigation of the vibrations of ammonia along the solid-to-liquid transition. Nat Commun 2024; 15:3911. [PMID: 38724541 PMCID: PMC11082248 DOI: 10.1038/s41467-024-48246-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2023] [Accepted: 04/25/2024] [Indexed: 05/12/2024] Open
Abstract
Vibrational spectroscopy allows us to understand complex physical and chemical interactions of molecular crystals and liquids such as ammonia, which has recently emerged as a strong hydrogen fuel candidate to support a sustainable society. We report inelastic neutron scattering measurement of vibrational properties of ammonia along the solid-to-liquid phase transition with high enough resolution for direct comparisons to ab-initio simulations. Theoretical analysis reveals the essential role of nuclear quantum effects (NQEs) for correctly describing the intermolecular spectrum as well as high energy intramolecular N-H stretching modes. This is achieved by training neural network models using ab-initio path-integral molecular dynamics (PIMD) simulations, thereby encompassing large spatiotemporal trajectories required to resolve low energy dynamics while retaining NQEs. Our results not only establish the role of NQEs in ammonia but also provide general computational frameworks to study complex molecular systems with NQEs.
Collapse
Affiliation(s)
- T M Linker
- Collaboratory for Advanced Computing and Simulations, University of Southern California, Los Angeles, CA, 90089-0242, USA
- Stanford PULSE Institute, SLAC National Accelerator Laboratory, Menlo Park, California, 94025, USA
| | - A Krishnamoorthy
- Department of Mechanical Engineering Texas A&M, 400 Bizzell St, College Station, TX, 77843, USA
| | - L L Daemen
- Neutron Scattering Division, Oak Ridge National Laboratory, Oak Ridge, TN, 37831, USA
| | - A J Ramirez-Cuesta
- Neutron Scattering Division, Oak Ridge National Laboratory, Oak Ridge, TN, 37831, USA
| | - K Nomura
- Collaboratory for Advanced Computing and Simulations, University of Southern California, Los Angeles, CA, 90089-0242, USA
| | - A Nakano
- Collaboratory for Advanced Computing and Simulations, University of Southern California, Los Angeles, CA, 90089-0242, USA
| | - Y Q Cheng
- Neutron Scattering Division, Oak Ridge National Laboratory, Oak Ridge, TN, 37831, USA.
| | - W R Hicks
- Neutron Scattering Division, Oak Ridge National Laboratory, Oak Ridge, TN, 37831, USA
| | - A I Kolesnikov
- Neutron Scattering Division, Oak Ridge National Laboratory, Oak Ridge, TN, 37831, USA.
| | - P D Vashishta
- Collaboratory for Advanced Computing and Simulations, University of Southern California, Los Angeles, CA, 90089-0242, USA.
| |
Collapse
|
47
|
Owen CJ, Xie Y, Johansson A, Sun L, Kozinsky B. Low-index mesoscopic surface reconstructions of Au surfaces using Bayesian force fields. Nat Commun 2024; 15:3790. [PMID: 38710679 DOI: 10.1038/s41467-024-48192-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Accepted: 04/23/2024] [Indexed: 05/08/2024] Open
Abstract
Metal surfaces have long been known to reconstruct, significantly influencing their structural and catalytic properties. Many key mechanistic aspects of these subtle transformations remain poorly understood due to limitations of previous simulation approaches. Using active learning of Bayesian machine-learned force fields trained from ab initio calculations, we enable large-scale molecular dynamics simulations to describe the thermodynamics and time evolution of the low-index mesoscopic surface reconstructions of Au (e.g., the Au(111)-'Herringbone,' Au(110)-(1 × 2)-'Missing-Row,' and Au(100)-'Quasi-Hexagonal' reconstructions). This capability yields direct atomistic understanding of the dynamic emergence of these surface states from their initial facets, providing previously inaccessible information such as nucleation kinetics and a complete mechanistic interpretation of reconstruction under the effects of strain and local deviations from the original stoichiometry. We successfully reproduce previous experimental observations of reconstructions on pristine surfaces and provide quantitative predictions of the emergence of spinodal decomposition and localized reconstruction in response to strain at non-ideal stoichiometries. A unified mechanistic explanation is presented of the kinetic and thermodynamic factors driving surface reconstruction. Furthermore, we study surface reconstructions on Au nanoparticles, where characteristic (111) and (100) reconstructions spontaneously appear on a variety of high-symmetry particle morphologies.
Collapse
Affiliation(s)
- Cameron J Owen
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA, USA.
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, MA, USA.
| | - Yu Xie
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, MA, USA
| | - Anders Johansson
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, MA, USA
| | - Lixin Sun
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, MA, USA
- Microsoft Research, Cambridge, UK
| | - Boris Kozinsky
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, MA, USA.
- Robert Bosch LLC Research and Technology Center, Watertown, MA, USA.
| |
Collapse
|
48
|
Zhang S, Makoś MZ, Jadrich RB, Kraka E, Barros K, Nebgen BT, Tretiak S, Isayev O, Lubbers N, Messerly RA, Smith JS. Exploring the frontiers of condensed-phase chemistry with a general reactive machine learning potential. Nat Chem 2024; 16:727-734. [PMID: 38454071 PMCID: PMC11087274 DOI: 10.1038/s41557-023-01427-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Accepted: 12/12/2023] [Indexed: 03/09/2024]
Abstract
Atomistic simulation has a broad range of applications from drug design to materials discovery. Machine learning interatomic potentials (MLIPs) have become an efficient alternative to computationally expensive ab initio simulations. For this reason, chemistry and materials science would greatly benefit from a general reactive MLIP, that is, an MLIP that is applicable to a broad range of reactive chemistry without the need for refitting. Here we develop a general reactive MLIP (ANI-1xnr) through automated sampling of condensed-phase reactions. ANI-1xnr is then applied to study five distinct systems: carbon solid-phase nucleation, graphene ring formation from acetylene, biofuel additives, combustion of methane and the spontaneous formation of glycine from early earth small molecules. In all studies, ANI-1xnr closely matches experiment (when available) and/or previous studies using traditional model chemistry methods. As such, ANI-1xnr proves to be a highly general reactive MLIP for C, H, N and O elements in the condensed phase, enabling high-throughput in silico reactive chemistry experimentation.
Collapse
Affiliation(s)
- Shuhao Zhang
- Department of Chemistry, Mellon College of Science, Carnegie Mellon University, Pittsburgh, PA, USA
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - Małgorzata Z Makoś
- Computational and Theoretical Chemistry Group, Department of Chemistry, Southern Methodist University, Dallas, TX, USA
- Computer, Computational, and Statistical Sciences Division, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - Ryan B Jadrich
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM, USA
- Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - Elfi Kraka
- Computational and Theoretical Chemistry Group, Department of Chemistry, Southern Methodist University, Dallas, TX, USA
| | - Kipton Barros
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM, USA
- Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - Benjamin T Nebgen
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - Sergei Tretiak
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM, USA
- Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - Olexandr Isayev
- Department of Chemistry, Mellon College of Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Nicholas Lubbers
- Computer, Computational, and Statistical Sciences Division, Los Alamos National Laboratory, Los Alamos, NM, USA.
| | - Richard A Messerly
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM, USA.
| | - Justin S Smith
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM, USA.
- NVIDIA Corp., Santa Clara, CA, USA.
| |
Collapse
|
49
|
Xie J, Zhou Y, Faizan M, Li Z, Li T, Fu Y, Wang X, Zhang L. Designing semiconductor materials and devices in the post-Moore era by tackling computational challenges with data-driven strategies. NATURE COMPUTATIONAL SCIENCE 2024; 4:322-333. [PMID: 38783137 DOI: 10.1038/s43588-024-00632-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/16/2023] [Accepted: 04/18/2024] [Indexed: 05/25/2024]
Abstract
In the post-Moore's law era, the progress of electronics relies on discovering superior semiconductor materials and optimizing device fabrication. Computational methods, augmented by emerging data-driven strategies, offer a promising alternative to the traditional trial-and-error approach. In this Perspective, we highlight data-driven computational frameworks for enhancing semiconductor discovery and device development by elaborating on their advances in exploring the materials design space, predicting semiconductor properties and optimizing device fabrication, with a concluding discussion on the challenges and opportunities in these areas.
Collapse
Affiliation(s)
- Jiahao Xie
- State Key Laboratory of Integrated Optoelectronics, Key Laboratory of Automobile Materials of MOE, Key Laboratory of Material Simulation Methods & Software of MOE, and School of Materials Science and Engineering, Jilin University, Changchun, China
| | - Yansong Zhou
- State Key Laboratory of Superhard Materials, International Center of Computational Method and Software, School of Physics, Jilin University, Changchun, China
| | - Muhammad Faizan
- State Key Laboratory of Integrated Optoelectronics, Key Laboratory of Automobile Materials of MOE, Key Laboratory of Material Simulation Methods & Software of MOE, and School of Materials Science and Engineering, Jilin University, Changchun, China
| | - Zewei Li
- State Key Laboratory of Integrated Optoelectronics, Key Laboratory of Automobile Materials of MOE, Key Laboratory of Material Simulation Methods & Software of MOE, and School of Materials Science and Engineering, Jilin University, Changchun, China
| | - Tianshu Li
- State Key Laboratory of Integrated Optoelectronics, Key Laboratory of Automobile Materials of MOE, Key Laboratory of Material Simulation Methods & Software of MOE, and School of Materials Science and Engineering, Jilin University, Changchun, China
| | - Yuhao Fu
- State Key Laboratory of Superhard Materials, International Center of Computational Method and Software, School of Physics, Jilin University, Changchun, China
| | - Xinjiang Wang
- State Key Laboratory of Integrated Optoelectronics, Key Laboratory of Automobile Materials of MOE, Key Laboratory of Material Simulation Methods & Software of MOE, and School of Materials Science and Engineering, Jilin University, Changchun, China.
| | - Lijun Zhang
- State Key Laboratory of Integrated Optoelectronics, Key Laboratory of Automobile Materials of MOE, Key Laboratory of Material Simulation Methods & Software of MOE, and School of Materials Science and Engineering, Jilin University, Changchun, China.
| |
Collapse
|
50
|
Wan K, He J, Shi X. Construction of High Accuracy Machine Learning Interatomic Potential for Surface/Interface of Nanomaterials-A Review. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2024; 36:e2305758. [PMID: 37640376 DOI: 10.1002/adma.202305758] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Revised: 08/24/2023] [Indexed: 08/31/2023]
Abstract
The inherent discontinuity and unique dimensional attributes of nanomaterial surfaces and interfaces bestow them with various exceptional properties. These properties, however, also introduce difficulties for both experimental and computational studies. The advent of machine learning interatomic potential (MLIP) addresses some of the limitations associated with empirical force fields, presenting a valuable avenue for accurate simulations of these surfaces/interfaces of nanomaterials. Central to this approach is the idea of capturing the relationship between system configuration and potential energy, leveraging the proficiency of machine learning (ML) to precisely approximate high-dimensional functions. This review offers an in-depth examination of MLIP principles and their execution and elaborates on their applications in the realm of nanomaterial surface and interface systems. The prevailing challenges faced by this potent methodology are also discussed.
Collapse
Affiliation(s)
- Kaiwei Wan
- Laboratory of Theoretical and Computational Nanoscience, National Center for Nanoscience and Technology, Chinese Academy of Sciences, Beijing, 100190, China
- University of Chinese Academy of Sciences, No. 19A Yuquan Road, Beijing, 100049, China
| | - Jianxin He
- Laboratory of Theoretical and Computational Nanoscience, National Center for Nanoscience and Technology, Chinese Academy of Sciences, Beijing, 100190, China
- University of Chinese Academy of Sciences, No. 19A Yuquan Road, Beijing, 100049, China
| | - Xinghua Shi
- Laboratory of Theoretical and Computational Nanoscience, National Center for Nanoscience and Technology, Chinese Academy of Sciences, Beijing, 100190, China
- University of Chinese Academy of Sciences, No. 19A Yuquan Road, Beijing, 100049, China
| |
Collapse
|