1
|
Gallmetzer J, Gamper J, Kröll S, Hofer TS. Comparative Study of UMCM-9 Polymorphs: Structural, Dynamic, and Hydrogen Storage Properties via Atomistic Simulations. THE JOURNAL OF PHYSICAL CHEMISTRY. C, NANOMATERIALS AND INTERFACES 2025; 129:5645-5655. [PMID: 40134511 PMCID: PMC11931535 DOI: 10.1021/acs.jpcc.4c07872] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/21/2024] [Revised: 01/23/2025] [Accepted: 02/25/2025] [Indexed: 03/27/2025]
Abstract
The structural and dynamic properties of two polymorphs of the metal-organic framework UMCM-9 (UMCM-9-α and -β) have been studied via molecular dynamics (MD) simulations in conjunction with density functional tight binding (DFTB) as well as the newly developed MACE-MP neural network potential (NNP). Based on these calculations, a novel UMCM-9-β polymorph is proposed that exhibits reduced linker strain and increased flexibility compared to UMCM-9-α, which is shown to be energetically less stable. UMCM-9-β exhibits enhanced diffusion of molecular hydrogen due to weaker host-guest interactions, whereas UMCM-9-α exhibits stronger interactions, leading to improved hydrogen adsorption. The results suggest that synthesis conditions may control the formation of both polymorphs: UMCM-9-β is likely to be the thermodynamic product, forming under stable conditions, while UMCM-9-α may be the kinetic product, forming under accelerated synthesis conditions. This study highlights the potential for optimizing MOFs for specific gas storage applications to achieve the desired structural and associated gas storage properties.
Collapse
Affiliation(s)
- Josef
M. Gallmetzer
- Institute of General, Inorganic and
Theoretical Chemistry, University of Innsbruck, Innrain 80-82, 6020 Innsbruck, Austria
| | - Jakob Gamper
- Institute of General, Inorganic and
Theoretical Chemistry, University of Innsbruck, Innrain 80-82, 6020 Innsbruck, Austria
| | - Stefanie Kröll
- Institute of General, Inorganic and
Theoretical Chemistry, University of Innsbruck, Innrain 80-82, 6020 Innsbruck, Austria
| | - Thomas S. Hofer
- Institute of General, Inorganic and
Theoretical Chemistry, University of Innsbruck, Innrain 80-82, 6020 Innsbruck, Austria
| |
Collapse
|
2
|
Lee M, Ucak UV, Jeong J, Ashyrmamatov I, Lee J, Sim E. Automated and Efficient Sampling of Chemical Reaction Space. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2025; 12:e2409009. [PMID: 39804946 PMCID: PMC11884589 DOI: 10.1002/advs.202409009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/01/2024] [Revised: 11/26/2024] [Indexed: 01/16/2025]
Abstract
Machine learning interatomic potentials (MLIPs) promise quantum-level accuracy at classical force field speeds, but their performance hinges on the quality and diversity of training data. An efficient and fully automated approach to sample chemical reaction space without relying on human intuition, addressing a critical gap in MLIP development is presented. The method combines the speed of tight-binding calculations with selective high-level refinement, generating diverse datasets that capture both equilibrium and reactive regions of potential energy surfaces. By employing single-ended growing string and nudged elastic band methods, reaction pathways previously underrepresented in MLIP training sets, particularly near transition states are systematically explored. This approach yields datasets with rich structural and chemical diversity, essential for robust MLIP development. Open-source code is provided for the entire workflow, facilitating the integration of the approach into existing MLIP development pipelines.
Collapse
Affiliation(s)
- Minhyeok Lee
- Department of ChemistryYonsei University50 Yonsei‐ro, Seodaemun‐guSeoul03722Republic of Korea
| | - Umit V. Ucak
- Research Institute of Pharmaceutical Science, College of PharmacySeoul National University1 Gwanak‐ro, Gwanak‐guSeoul08826Republic of Korea
| | - Jinyoung Jeong
- Department of ChemistryYonsei University50 Yonsei‐ro, Seodaemun‐guSeoul03722Republic of Korea
| | - Islambek Ashyrmamatov
- College of PharmacySeoul National University1 Gwanak‐ro, Gwanak‐guSeoul08826Republic of Korea
| | - Juyong Lee
- Research Institute of Pharmaceutical Science, College of PharmacySeoul National University1 Gwanak‐ro, Gwanak‐guSeoul08826Republic of Korea
- College of PharmacySeoul National University1 Gwanak‐ro, Gwanak‐guSeoul08826Republic of Korea
- Department of Molecular Medicine and Biopharmaceutical Sciences, Graduate School of Convergence Science and TechnologySeoul National University1 Gwanak‐ro, Gwanak‐guSeoul08826Republic of Korea
- Arontier Co.241, Gangnam‐daero, Seocho‐guSeoul06735Republic of Korea
| | - Eunji Sim
- Department of ChemistryYonsei University50 Yonsei‐ro, Seodaemun‐guSeoul03722Republic of Korea
| |
Collapse
|
3
|
Zlobin A, Maslova V, Beliaeva J, Meiler J, Golovin A. Long-Range Electrostatics in Serine Proteases: Machine Learning-Driven Reaction Sampling Yields Insights for Enzyme Design. J Chem Inf Model 2025; 65:2003-2013. [PMID: 39928564 PMCID: PMC11863386 DOI: 10.1021/acs.jcim.4c01827] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2024] [Revised: 01/30/2025] [Accepted: 01/31/2025] [Indexed: 02/12/2025]
Abstract
Computational enzyme design is a promising technique for producing novel enzymes for industrial and clinical needs. A key challenge that this technique faces is to consistently achieve the desired activity. Fundamental studies of natural enzymes revealed critical contributions from second-shell - and even more distant - residues to their remarkable efficiency. In particular, such residues organize the internal electrostatic field to promote the reaction. Engineering such fields computationally proved to be a promising strategy, which, however, has some limitations. Charged residues necessarily form specific patterns of local interactions that may be exploited for structural integrity. As a result, it is impossible to probe the electrostatic field alone by substituting amino acids. We hypothesize that an approach that isolates the influences of residues' charges from other influences could yield deeper insights. We use molecular modeling with AI-enhanced QM/MM reaction sampling to implement such an approach and apply it to a model serine protease subtilisin. We find that the negative charge 8 Å away from the catalytic site is crucial to achieving the enzyme's catalytic efficiency, contributing more than 2 kcal/mol to lowering the barrier. In contrast, a positive charge from the second-closest charged residue opposes the efficiency of the reaction by raising the barrier by 0.8 kcal/mol. This result invites discussion into the role of this residue and trade-offs that might have taken place in the evolution of such enzymes. Our approach is transferable and can help investigate the evolution of electrostatic preorganization in other enzymes. We believe that the study and engineering of electrostatic fields in enzymes is a promising direction to advance both fundamental and applied enzymology and lead to the design of new powerful biocatalysts.
Collapse
Affiliation(s)
- Alexander Zlobin
- Institute for Drug
Discovery, Leipzig University Medical School, Brüderstraße 34, Leipzig 04103, Germany
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Leninskie Gory 1, building 73, Moscow 119234, Russia
| | - Valentina Maslova
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Leninskie Gory 1, building 73, Moscow 119234, Russia
| | - Julia Beliaeva
- Institute for Drug
Discovery, Leipzig University Medical School, Brüderstraße 34, Leipzig 04103, Germany
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Leninskie Gory 1, building 73, Moscow 119234, Russia
- Institute for Medical Physics and Biophysics, Leipzig University Medical School, Härtelstr. 16-18, Leipzig 04107, Germany
| | - Jens Meiler
- Institute for Drug
Discovery, Leipzig University Medical School, Brüderstraße 34, Leipzig 04103, Germany
- Department of Chemistry, Vanderbilt University, 1234 Stevenson Center Lane, Nashville, Tennessee 37240, United States
- Center
for Structural Biology, Vanderbilt University, PMB 407917, Nashville, Tennessee 37240-7917, United States
- Center for Scalable Data Analytics and
Artificial Intelligence (ScaDS.AI), Leipzig 04081, Germany
| | - Andrey Golovin
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Leninskie Gory 1, building 73, Moscow 119234, Russia
- Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Miklukho-Maklaya 16/10, Moscow 117997, Russia
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Leninskie Gory 1, building 40, Moscow 119992, Russia
| |
Collapse
|
4
|
Poltavsky I, Charkin-Gorbulin A, Puleva M, Fonseca G, Batatia I, Browning NJ, Chmiela S, Cui M, Frank JT, Heinen S, Huang B, Käser S, Kabylda A, Khan D, Müller C, Price AJA, Riedmiller K, Töpfer K, Ko TW, Meuwly M, Rupp M, Csányi G, von Lilienfeld OA, Margraf JT, Müller KR, Tkatchenko A. Crash testing machine learning force fields for molecules, materials, and interfaces: model analysis in the TEA Challenge 2023. Chem Sci 2025; 16:3720-3737. [PMID: 39935506 PMCID: PMC11809572 DOI: 10.1039/d4sc06529h] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2024] [Accepted: 12/25/2024] [Indexed: 02/13/2025] Open
Abstract
Atomistic simulations are routinely employed in academia and industry to study the behavior of molecules, materials, and their interfaces. Central to these simulations are force fields (FFs), whose development is challenged by intricate interatomic interactions at different spatio-temporal scales and the vast expanse of chemical space. Machine learning (ML) FFs, trained on quantum-mechanical energies and forces, have shown the capacity to achieve sub-kcal (mol-1 Å-1) accuracy while maintaining computational efficiency. The TEA Challenge 2023 rigorously evaluated commonly used MLFFs across diverse applications, highlighting their strengths and weaknesses. Participants trained their models using provided datasets, and the results were systematically analyzed to assess the ability of MLFFs to reproduce potential energy surfaces, handle incomplete reference data, manage multi-component systems, and model complex periodic structures. This publication describes the datasets, outlines the proposed challenges, and presents a detailed analysis of the accuracy, stability, and efficiency of the MACE, SO3krates, sGDML, SOAP/GAP, and FCHL19* architectures in molecular dynamics simulations. The models represent the MLFF developers who participated in the TEA Challenge 2023. All results presented correspond to the state of the ML architectures as of October 2023. A comprehensive analysis of the molecular dynamics results obtained with different MLFFs will be presented in the second part of this manuscript.
Collapse
Affiliation(s)
- Igor Poltavsky
- Department of Physics and Materials Science, University of Luxembourg L-1511 Luxembourg Luxembourg
| | - Anton Charkin-Gorbulin
- Department of Physics and Materials Science, University of Luxembourg L-1511 Luxembourg Luxembourg
- Laboratory for Chemistry of Novel Materials, University of Mons B-7000 Mons Belgium
| | - Mirela Puleva
- Department of Physics and Materials Science, University of Luxembourg L-1511 Luxembourg Luxembourg
- Institute for Advanced Studies, University of Luxembourg Campus Belval L-4365 Esch-sur-Alzette Luxembourg
| | - Grégory Fonseca
- Department of Physics and Materials Science, University of Luxembourg L-1511 Luxembourg Luxembourg
| | - Ilyes Batatia
- Department of Engineering, University of Cambridge Trumpington Street Cambridge CB2 1PZ UK
| | | | - Stefan Chmiela
- Machine Learning Group, Technical University Berlin Berlin Germany
- BIFOLD, Berlin Institute for the Foundations of Learning and Data Berlin Germany
| | - Mengnan Cui
- Fritz-Haber-Institut der Max-Planck-Gesellschaft Berlin Germany
| | - J Thorben Frank
- Machine Learning Group, Technical University Berlin Berlin Germany
- BIFOLD, Berlin Institute for the Foundations of Learning and Data Berlin Germany
| | - Stefan Heinen
- Vector Institute for Artificial Intelligence Toronto ON M5S 1M1 Canada
| | - Bing Huang
- Wuhan University, Department of Chemistry and Molecular Sciences 430072 Wuhan China
| | - Silvan Käser
- Department of Chemistry, University of Basel Klingelbergstrasse 80 CH-4056 Basel Switzerland
| | - Adil Kabylda
- Department of Physics and Materials Science, University of Luxembourg L-1511 Luxembourg Luxembourg
| | - Danish Khan
- Vector Institute for Artificial Intelligence Toronto ON M5S 1M1 Canada
- Chemical Physics Theory Group, Department of Chemistry, University of Toronto St George Campus Toronto ON Canada
| | - Carolin Müller
- Friedrich-Alexander-Universität Erlangen-Nürnberg, Computer-Chemistry-Center Nägelsbachstraße 25 91052 Erlangen Germany
| | - Alastair J A Price
- Department of Chemistry, University of Toronto St George Campus Toronto ON Canada
- Acceleration Consortium, University of Toronto 80 St George St Toronto ON M5S 3H6 Canada
| | - Kai Riedmiller
- Heidelberg Institute for Theoretical Studies Heidelberg Germany
| | - Kai Töpfer
- Department of Chemistry, University of Basel Klingelbergstrasse 80 CH-4056 Basel Switzerland
| | - Tsz Wai Ko
- Department of NanoEngineering, University of California San Diego 9500 Gilman Dr, Mail Code 0448 La Jolla CA 92093-0448 USA
| | - Markus Meuwly
- Department of Chemistry, University of Basel Klingelbergstrasse 80 CH-4056 Basel Switzerland
| | - Matthias Rupp
- Luxembourg Institute of Science and Technology (LIST) L-4362 Esch-sur-Alzette Luxembourg
| | - Gábor Csányi
- Department of Engineering, University of Cambridge Trumpington Street Cambridge CB2 1PZ UK
| | - O Anatole von Lilienfeld
- Machine Learning Group, Technical University Berlin Berlin Germany
- BIFOLD, Berlin Institute for the Foundations of Learning and Data Berlin Germany
- Vector Institute for Artificial Intelligence Toronto ON M5S 1M1 Canada
- Department of Chemistry, University of Toronto St George Campus Toronto ON Canada
- Acceleration Consortium, University of Toronto 80 St George St Toronto ON M5S 3H6 Canada
- Department of Materials Science and Engineering, University of Toronto St George Campus Toronto ON Canada
- Department of Physics, University of Toronto St George Campus Toronto ON Canada
| | - Johannes T Margraf
- University of Bayreuth, Bavarian Center for Battery Technology (BayBatt) Bayreuth Germany
| | - Klaus-Robert Müller
- Machine Learning Group, Technical University Berlin Berlin Germany
- BIFOLD, Berlin Institute for the Foundations of Learning and Data Berlin Germany
- Department of Artificial Intelligence, Korea University Seoul South Korea
- Max Planck Institut für Informatik Saarbrücken Germany
- Google DeepMind Berlin Germany
| | - Alexandre Tkatchenko
- Department of Physics and Materials Science, University of Luxembourg L-1511 Luxembourg Luxembourg
- Institute for Advanced Studies, University of Luxembourg Campus Belval L-4365 Esch-sur-Alzette Luxembourg
| |
Collapse
|
5
|
Vazquez-Salazar LI, Käser S, Meuwly M. Outlier-detection for reactive machine learned potential energy surfaces. NPJ COMPUTATIONAL MATERIALS 2025; 11:33. [PMID: 39963264 PMCID: PMC11829830 DOI: 10.1038/s41524-024-01473-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/22/2024] [Accepted: 11/17/2024] [Indexed: 02/20/2025]
Abstract
Uncertainty quantification (UQ) to detect samples with large expected errors (outliers) is applied to reactive molecular potential energy surfaces (PESs). Three methods-Ensembles, deep evidential regression (DER), and Gaussian Mixture Models (GMM)-were applied to the H-transfer reaction between syn-Criegee and vinyl hydroxyperoxide. The results indicate that ensemble models provide the best results for detecting outliers, followed by GMM. For example, from a pool of 1000 structures with the largest uncertainty, the detection quality for outliers is ~90% and ~50%, respectively, if 25 or 1000 structures with large errors are sought. On the contrary, the limitations of the statistical assumptions of DER greatly impact its prediction capabilities. Finally, a structure-based indicator was found to be correlated with large average error, which may help to rapidly classify new structures into those that provide an advantage for refining the neural network.
Collapse
Affiliation(s)
| | - Silvan Käser
- Department of Chemistry, University of Basel, Basel, Switzerland
| | - Markus Meuwly
- Department of Chemistry, University of Basel, Basel, Switzerland
| |
Collapse
|
6
|
Zhang K, Wu T, Shen L, Wu Q, Chen W, Ye C, He X. Carbon Dioxide Sensing Based on Off-Axis Integrated Cavity Absorption Spectroscopy Combined with the Informer and Multilayer Perceptron Models. Anal Chem 2025; 97:3019-3025. [PMID: 39882837 DOI: 10.1021/acs.analchem.4c06057] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2025]
Abstract
Off-axis integrated cavity output spectroscopy (OA-ICOS) allows the laser to be reflected multiple times inside the cavity, increasing the effective absorption path length and thus improving sensitivity. However, OA-ICOS systems are affected by various types of noise, and traditional filtering methods offer low processing efficiency and perform limited feature extraction. Deep learning models enable us to extract important features from large-scale, complex spectral data and analyze them efficiently and accurately. We propose a carbon dioxide (CO2) sensor operating in the near-infrared spectral region (1.602 μm) based on OA-ICOS and deep learning models. A radiofrequency (RF) noise source is employed to reduce the cavity-mode noise in OA-ICOS and thus improve the signal-to-noise ratio (SNR). A time-series-based neural network, known as the informer, is employed for filtering CO2 spectral time series. After filtering, spectral features are directly extracted from the filtered spectral data and CO2 concentrations are predicted using a multilayer perceptron (MLP) model. Our results showed that the SNR attained using informer filtering approximately double those obtained using traditional filtering methods (Savitzky-Golay filtering, Kalman filtering, and wavelet threshold). The linear correlation coefficient (R2) between measured concentrations and standard concentrations was increased from 79.74% (obtained by using the absorption-peak-fitting method) to 98.52% (obtained by using the proposed MLP model). Moreover, the detection limit of the CO2 sensor using the MLP model reached 1.38 ppm at 224.4 s, a 3.79-fold improvement compared to that obtained by using the absorption-peak-fitting method. Our results demonstrate the feasibility of integrating deep learning methods in the field of spectroscopy-based sensing and provide a promising approach for spectral data processing.
Collapse
Affiliation(s)
- Kehao Zhang
- Key Laboratory of Nondestructive Test (Ministry of Education), Nanchang Hangkong University, Nanchang 330063, China
| | - Tao Wu
- Key Laboratory of Nondestructive Test (Ministry of Education), Nanchang Hangkong University, Nanchang 330063, China
| | - Linlin Shen
- School of Computer Science and Software Engineering, Shenzhen University, Shenzhen 518060, China
| | - Qiang Wu
- Department of Mathematics, Physics and Electrical Engineering, Northumbria University, Newcastle upon Tyne NE1 8ST, U.K
| | - Weidong Chen
- Laboratoire de Physicochimie de l'Atmosphère, Université du Littoral Côte d'Opale 189A, Av. Maurice Schumann, 59140 Dunkerque, France
| | - Chenwen Ye
- Key Laboratory of Nondestructive Test (Ministry of Education), Nanchang Hangkong University, Nanchang 330063, China
| | - Xingdao He
- Key Laboratory of Nondestructive Test (Ministry of Education), Nanchang Hangkong University, Nanchang 330063, China
| |
Collapse
|
7
|
Hao Y, Lu X, Fu B, Zhang DH. New Algorithms to Generate Permutationally Invariant Polynomials and Fundamental Invariants for Potential Energy Surface Fitting. J Chem Theory Comput 2025; 21:1046-1053. [PMID: 39841118 DOI: 10.1021/acs.jctc.4c01447] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2025]
Abstract
Symmetric functions, such as Permutationally Invariant Polynomials (PIPs) and Fundamental Invariants (FIs), are effective and concise descriptors for incorporating permutation symmetry into neural network (NN) potential energy surface (PES) fitting. The traditional algorithm for generating such symmetric polynomials has a factorial time complexity of N!, where N is the number of identical atoms, posing a significant challenge to applying symmetric polynomials as descriptors of NN PESs for larger systems, particularly with more than 10 atoms. Herein, we report a new algorithm which has only linear time complexity for identical atoms. It can tremendously accelerate generation process of symmetric polynomials for molecular systems. The proposed algorithm is based on graph connectivity analysis following the action of the generation set of molecular permutational group. For instance, in the case of calculating the invariant polynomials for a 15-atom molecule, such as tropolone, our algorithm is approximately 2 million times faster than the previous method. The efficiency of the new algorithm can be further enhanced with increasing molecular size and number of identical atoms, making the FI-NN approach feasible for systems with over 10 atoms and high symmetry demands.
Collapse
Affiliation(s)
- Yiping Hao
- State Key Laboratory of Molecular Reaction Dynamics, Dalian Institute of Chemical Physics, Chinese Academy of Science, Dalian 116023, People's Republic of China
- School of Chemical Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Xiaoxiao Lu
- State Key Laboratory of Molecular Reaction Dynamics, Dalian Institute of Chemical Physics, Chinese Academy of Science, Dalian 116023, People's Republic of China
| | - Bina Fu
- State Key Laboratory of Molecular Reaction Dynamics, Dalian Institute of Chemical Physics, Chinese Academy of Science, Dalian 116023, People's Republic of China
- School of Chemical Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
- Hefei National Laboratory, Hefei 230088, China
| | - Dong H Zhang
- State Key Laboratory of Molecular Reaction Dynamics, Dalian Institute of Chemical Physics, Chinese Academy of Science, Dalian 116023, People's Republic of China
- School of Chemical Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
- Hefei National Laboratory, Hefei 230088, China
| |
Collapse
|
8
|
Wang Y, Samarasinghe DSND, Deng H, Liu B, Aikens CM. Gaussian Process Approach to Constructing Transferable Force Fields for Thiolate-Protected Gold Nanoclusters. J Chem Inf Model 2025. [PMID: 39876563 DOI: 10.1021/acs.jcim.4c01495] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2025]
Abstract
Gold nanoparticles can exhibit unique physical and chemical properties, such as plasmon resonances or photoluminescence. These nanoparticles have many atoms, which leads to high computational costs for density functional theory (DFT) calculations. In this work, we used the FLARE++ (fast learning of atomistic rare events) code and incorporated an active learning algorithm to construct force fields for gold thiolate-protected nanoclusters. We started training the force field using Au20(SCH3)16 as the initial structure and then applied the trained force field to perform molecular dynamics (MD) simulations. We then validated the machine learning force field using different types of gold nanoclusters as testing models. The test results were integrated into the existing database and retrained again. The final force fields show success in predicting energies for nanoclusters not only in the training database but also outside the database. These tests revealed that the force field has achieved quantum mechanical level accuracy in some key performance metrics.
Collapse
Affiliation(s)
- Yuchen Wang
- Department of Chemistry, Kansas State University, Manhattan, Kansas 66506, United States
| | | | - Hao Deng
- Tim Taylor Department of Chemical Engineering, Kansas State University, Manhattan, Kansas 66506, United States
| | - Bin Liu
- Tim Taylor Department of Chemical Engineering, Kansas State University, Manhattan, Kansas 66506, United States
| | - Christine M Aikens
- Department of Chemistry, Kansas State University, Manhattan, Kansas 66506, United States
| |
Collapse
|
9
|
Alavi SF, Chen Y, Hou YF, Ge F, Zheng P, Dral PO. ANI-1ccx-gelu Universal Interatomic Potential and Its Fine-Tuning: Toward Accurate and Efficient Anharmonic Vibrational Frequencies. J Phys Chem Lett 2025; 16:483-493. [PMID: 39748511 DOI: 10.1021/acs.jpclett.4c03031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2025]
Abstract
Calculating anharmonic vibrational modes of molecules for interpreting experimental spectra is one of the most interesting challenges of contemporary computational chemistry. However, the traditional QM methods are costly for this application. Machine learning techniques have emerged as a powerful tool for substituting the traditional QM methods. Universal interatomic potentials (UIPs) hold a particular promise to deliver accurate results at a fraction of the cost of the traditional QM methods, but the performance of UIPs for calculating anharmonic vibrational frequencies remains hitherto unknown. Here we show that despite a known excellent performance of the representative UIP ANI-1ccx for thermochemical properties, it fails for the anharmonic frequencies due to the original unfortunate choice of the activation function. Hence, we recommend evaluating new UIPs on anharmonic frequencies as an additional important quality test. To remedy the shortcomings of ANI-1ccx, we introduce its reformulation ANI-1ccx-gelu with the GELU activation function, which is capable of calculating IR anharmonic frequencies with reasonable accuracy (close to B3LYP/6-31G*). We also show that our new UIP can be fine-tuned to obtain very accurate anharmonic frequencies for some specific molecules but more effort is needed to improve the overall quality of UIP and its capability for fine-tuning. The new UIP will be included as part of our universal and updatable AI-enhanced QM methods (UAIQM) platform and is available together with usage and fine-tuning tutorials in open-source MLatom at https://github.com/dralgroup/mlatom. The calculations can also be performed via a web browser at https://XACScloud.com.
Collapse
Affiliation(s)
- Seyedeh Fatemeh Alavi
- State Key Laboratory of Physical Chemistry of Solid Surfaces, Fujian Provincial Key Laboratory of Theoretical and Computational Chemistry, Department of Chemistry, and College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China
| | - Yuxinxin Chen
- State Key Laboratory of Physical Chemistry of Solid Surfaces, Fujian Provincial Key Laboratory of Theoretical and Computational Chemistry, Department of Chemistry, and College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China
| | - Yi-Fan Hou
- State Key Laboratory of Physical Chemistry of Solid Surfaces, Fujian Provincial Key Laboratory of Theoretical and Computational Chemistry, Department of Chemistry, and College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China
| | - Fuchun Ge
- State Key Laboratory of Physical Chemistry of Solid Surfaces, Fujian Provincial Key Laboratory of Theoretical and Computational Chemistry, Department of Chemistry, and College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China
| | - Peikun Zheng
- State Key Laboratory of Physical Chemistry of Solid Surfaces, Fujian Provincial Key Laboratory of Theoretical and Computational Chemistry, Department of Chemistry, and College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China
| | - Pavlo O Dral
- State Key Laboratory of Physical Chemistry of Solid Surfaces, Fujian Provincial Key Laboratory of Theoretical and Computational Chemistry, Department of Chemistry, and College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China
- Institute of Physics, Faculty of Physics, Astronomy, and Informatics, Nicolaus Copernicus University in Torun, ul. Grudziądzka 5, 87-100 Torun, Poland
| |
Collapse
|
10
|
David R, de la Puente M, Gomez A, Anton O, Stirnemann G, Laage D. ArcaNN: automated enhanced sampling generation of training sets for chemically reactive machine learning interatomic potentials. DIGITAL DISCOVERY 2025; 4:54-72. [PMID: 39553851 PMCID: PMC11563209 DOI: 10.1039/d4dd00209a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/30/2024] [Accepted: 10/21/2024] [Indexed: 11/19/2024]
Abstract
The emergence of artificial intelligence is profoundly impacting computational chemistry, particularly through machine-learning interatomic potentials (MLIPs). Unlike traditional potential energy surface representations, MLIPs overcome the conventional computational scaling limitations by offering an effective combination of accuracy and efficiency for calculating atomic energies and forces to be used in molecular simulations. These MLIPs have significantly enhanced molecular simulations across various applications, including large-scale simulations of materials, interfaces, chemical reactions, and beyond. Despite these advances, the construction of training datasets-a critical component for the accuracy of MLIPs-has not received proportional attention, especially in the context of chemical reactivity, which depends on rare barrier-crossing events that are not easily included in the datasets. Here we address this gap by introducing ArcaNN, a comprehensive framework designed for generating training datasets for reactive MLIPs. ArcaNN employs a concurrent learning approach combined with advanced sampling techniques to ensure an accurate representation of high-energy geometries. The framework integrates automated processes for iterative training, exploration, new configuration selection, and energy and force labeling, all while ensuring reproducibility and documentation. We demonstrate ArcaNN's capabilities through two paradigm reactions: a nucleophilic substitution and a Diels-Alder reaction. These examples showcase its effectiveness, the uniformly low error of the resulting MLIP everywhere along the chemical reaction coordinate, and its potential for broad applications in reactive molecular dynamics. Finally, we provide guidelines for assessing the quality of MLIPs in reactive systems.
Collapse
Affiliation(s)
- Rolf David
- PASTEUR, Département de Chimie, École Normale Supérieure, PSL University, Sorbonne Université, CNRS 75005 Paris France
| | - Miguel de la Puente
- PASTEUR, Département de Chimie, École Normale Supérieure, PSL University, Sorbonne Université, CNRS 75005 Paris France
| | - Axel Gomez
- PASTEUR, Département de Chimie, École Normale Supérieure, PSL University, Sorbonne Université, CNRS 75005 Paris France
| | - Olaia Anton
- PASTEUR, Département de Chimie, École Normale Supérieure, PSL University, Sorbonne Université, CNRS 75005 Paris France
| | - Guillaume Stirnemann
- PASTEUR, Département de Chimie, École Normale Supérieure, PSL University, Sorbonne Université, CNRS 75005 Paris France
| | - Damien Laage
- PASTEUR, Département de Chimie, École Normale Supérieure, PSL University, Sorbonne Université, CNRS 75005 Paris France
| |
Collapse
|
11
|
Hassan MM, Xu Y, Sayada J, Zareef M, Shoaib M, Chen X, Li H, Chen Q. Progress of machine learning-based biosensors for the monitoring of food safety: A review. Biosens Bioelectron 2025; 267:116782. [PMID: 39288707 DOI: 10.1016/j.bios.2024.116782] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2024] [Revised: 08/20/2024] [Accepted: 09/12/2024] [Indexed: 09/19/2024]
Abstract
Rapid urbanization and growing food demand caused people to be concerned about food safety. Biosensors have gained considerable attention for assessing food safety due to selectivity, and sensitivity but poor stability inherently limits their application. The emergence of machine learning (ML) has enhanced the efficiency of different sensors for food safety assessment. The ML combined with various noninvasive biosensors has been implemented efficiently to monitor food safety by considering the stability of bio-recognition molecules. This review comprehensively summarizes the application of ML-powered biosensors to investigate food safety. Initially, different detector-based biosensors using biological molecules with their advantages and disadvantages and biosensor-related various ML algorithms for food safety monitoring have been discussed. Next, the application of ML-powered biosensors to detect antibiotics, foodborne microorganisms, mycotoxins, pesticides, heavy metals, anions, and persistent organic pollutants has been highlighted for the last five years. The challenges and prospects have also been deliberated. This review provides a new prospect in developing various biosensors for multi-food contaminants powered by suitable ML algorithms to monitor in-situ food safety.
Collapse
Affiliation(s)
- Md Mehedi Hassan
- College of Ocean Food and Biological Engineering, Jimei University, Xiamen, 361021, PR China
| | - Yi Xu
- College of Ocean Food and Biological Engineering, Jimei University, Xiamen, 361021, PR China
| | - Jannatul Sayada
- College of Ocean Food and Biological Engineering, Jimei University, Xiamen, 361021, PR China
| | - Muhammad Zareef
- School of Food and Biological Engineering, Jiangsu University, Zhenjiang, 212013, PR China
| | - Muhammad Shoaib
- School of Food and Biological Engineering, Jiangsu University, Zhenjiang, 212013, PR China
| | - Xiaomei Chen
- College of Ocean Food and Biological Engineering, Jimei University, Xiamen, 361021, PR China
| | - Huanhuan Li
- School of Food and Biological Engineering, Jiangsu University, Zhenjiang, 212013, PR China
| | - Quansheng Chen
- College of Ocean Food and Biological Engineering, Jimei University, Xiamen, 361021, PR China; School of Food and Biological Engineering, Jiangsu University, Zhenjiang, 212013, PR China.
| |
Collapse
|
12
|
Watanabe N, Hori Y, Sugisawa H, Ida T, Shoji M, Shigeta Y. A machine learning potential construction based on radial distribution function sampling. J Comput Chem 2024; 45:2949-2958. [PMID: 39225311 DOI: 10.1002/jcc.27497] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2024] [Revised: 08/09/2024] [Accepted: 08/15/2024] [Indexed: 09/04/2024]
Abstract
Sampling reference data is crucial in machine learning potential (MLP) construction. Inadequate coverage of local configurations in reference data may lead to unphysical behaviors in MLP-based molecular dynamics (MLP-MD) simulations. To address this problem, this study proposes a new on-the-fly reference data sampling method called radial distribution function (RDF)-based data sampling for MLP construction. This method detects and extracts anomalous structures from the trajectories of MLP-MD simulations by focusing on the shapes of RDFs. The detected structures are added to the reference data to improve the accuracy of the MLP. This method allows us to realize a reasonable MLP construction for liquid water with minimal additional data. We prepare data from an H2O molecular cluster system and verify whether the constructed MLPs are practical for bulk water systems. MLP-MD simulations without RDF-based data sampling show unphysical behaviors, such as atomic collisions. In contrast, after applying this method, we obtain MLP-MD trajectories with features, such as RDF shapes and angle distributions, that are comparable to those of ab initio MD simulations. Our simulation results demonstrate that the RDF-based data sampling approach is useful for constructing MLPs that are robust to extrapolations from molecular cluster systems to bulk systems without any specialized know-how.
Collapse
Affiliation(s)
- Natsuki Watanabe
- Center for Computational Sciences, University of Tsukuba, Tsukuba, Japan
- Graduate School of Pure and Applied Sciences, University of Tsukuba, Tsukuba, Japan
| | - Yuta Hori
- Center for Computational Sciences, University of Tsukuba, Tsukuba, Japan
| | - Hiroki Sugisawa
- Science & Innovation Center, Mitsubishi Chemical Corporation, Yokohama, Japan
| | - Tomonori Ida
- Division of Material Chemistry, Graduate School of Natural Science and Technology, Kanazawa University, Kanazawa, Japan
| | - Mitsuo Shoji
- Center for Computational Sciences, University of Tsukuba, Tsukuba, Japan
| | - Yasuteru Shigeta
- Center for Computational Sciences, University of Tsukuba, Tsukuba, Japan
| |
Collapse
|
13
|
Hsu PJ, Mizuide A, Kuo JL, Fujii A. Hydrogen bond network structures of protonated 2,2,2-trifluoroethanol/ethanol mixed clusters probed by infrared spectroscopy combined with a deep-learning structure sampling approach: the origin of the linear type network preference in protonated fluoroalcohol clusters. Phys Chem Chem Phys 2024; 26:27751-27762. [PMID: 39470069 DOI: 10.1039/d4cp03534h] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/30/2024]
Abstract
While preferential hydrogen bond network structures of cold protonated alcohol clusters H+(ROH)n are generally switched from a linear type to a cyclic one at n = 4-5, those of protonated 2,2,2-trifluoroethanol (TFE) clusters maintain linear type structures at least in the size range of n = 3-7. To explore the origin of the strong linear type network preference of H+(TFE)n, infrared spectra of protonated mixed clusters H+(TFE)m(ethanol)n (m + n = 5) were measured. An efficient structure sampling technique using parallelized basin-hopping algorithms and deep-learning neural network potentials is developed to search for essential isomers of the mixed clusters. Vibrational simulations based on the harmonic superposition approximation were compared with the observed spectra to identify the major isomer component at each mixing ratio. It was found that the formation of the cyclic structure occurs only in n ≥ 3 of the mixed clusters, in which the proton solvating sites and the double acceptor site are occupied by ethanol. The crucial role of the stability of the double acceptor site in the cyclic structure formation is discussed.
Collapse
Affiliation(s)
- Po-Jen Hsu
- Institute of Atomic and Molecular Sciences, Academia Sinica, Taipei 10617, Taiwan.
| | - Atsuya Mizuide
- Department of Chemistry, Graduate School of Science, Tohoku University, Sendai 980-8578, Japan.
| | - Jer-Lai Kuo
- Institute of Atomic and Molecular Sciences, Academia Sinica, Taipei 10617, Taiwan.
| | - Asuka Fujii
- Department of Chemistry, Graduate School of Science, Tohoku University, Sendai 980-8578, Japan.
| |
Collapse
|
14
|
Singh K, Lee KH, Peláez D, Bande A. Accelerating wavepacket propagation with machine learning. J Comput Chem 2024; 45:2360-2373. [PMID: 39031712 DOI: 10.1002/jcc.27443] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2023] [Revised: 05/13/2024] [Accepted: 05/16/2024] [Indexed: 07/22/2024]
Abstract
In this work, we discuss the use of a recently introduced machine learning (ML) technique known as Fourier neural operators (FNO) as an efficient alternative to the traditional solution of the time-dependent Schrödinger equation (TDSE). FNOs are ML models which are employed in the approximated solution of partial differential equations. For a wavepacket propagating in an anharmonic potential and for a tunneling system, we show that the FNO approach can accurately and faithfully model wavepacket propagation via the density. Additionally, we demonstrate that FNOs can be a suitable replacement for traditional TDSE solvers in cases where the results of the quantum dynamical simulation are required repeatedly such as in the case of parameter optimization problems (e.g., control). The speed-up from the FNO method allows for its combination with the Markov-chain Monte Carlo approach in applications that involve solving inverse problems such as optimal and coherent laser control of the outcome of dynamical processes.
Collapse
Affiliation(s)
- Kanishka Singh
- Theory of Electron Dynamics and Spectroscopy, Helmholtz-Zentrum Berlin für Materialien und Energie GmbH, Berlin, Germany
- Institute of Chemistry and Biochemistry, Freie Universität Berlin, Berlin, Germany
| | - Ka Hei Lee
- Theory of Electron Dynamics and Spectroscopy, Helmholtz-Zentrum Berlin für Materialien und Energie GmbH, Berlin, Germany
- Fachbereich Physik, Freie Universität Berlin, Berlin, Germany
| | - Daniel Peláez
- CNRS, Institut des Sciences Moléculaires d'Orsay, Université Paris-Saclay, Orsay, France
| | - Annika Bande
- Theory of Electron Dynamics and Spectroscopy, Helmholtz-Zentrum Berlin für Materialien und Energie GmbH, Berlin, Germany
- Institute of Inorganic Chemistry, Leibniz University Hannover, Hannover, Germany
- Cluster of Excellence PhoenixD, Leibniz University Hannover, Hannover, Germany
| |
Collapse
|
15
|
Aldossary A, Campos-Gonzalez-Angulo JA, Pablo-García S, Leong SX, Rajaonson EM, Thiede L, Tom G, Wang A, Avagliano D, Aspuru-Guzik A. In Silico Chemical Experiments in the Age of AI: From Quantum Chemistry to Machine Learning and Back. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2024; 36:e2402369. [PMID: 38794859 DOI: 10.1002/adma.202402369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/15/2024] [Revised: 04/28/2024] [Indexed: 05/26/2024]
Abstract
Computational chemistry is an indispensable tool for understanding molecules and predicting chemical properties. However, traditional computational methods face significant challenges due to the difficulty of solving the Schrödinger equations and the increasing computational cost with the size of the molecular system. In response, there has been a surge of interest in leveraging artificial intelligence (AI) and machine learning (ML) techniques to in silico experiments. Integrating AI and ML into computational chemistry increases the scalability and speed of the exploration of chemical space. However, challenges remain, particularly regarding the reproducibility and transferability of ML models. This review highlights the evolution of ML in learning from, complementing, or replacing traditional computational chemistry for energy and property predictions. Starting from models trained entirely on numerical data, a journey set forth toward the ideal model incorporating or learning the physical laws of quantum mechanics. This paper also reviews existing computational methods and ML models and their intertwining, outlines a roadmap for future research, and identifies areas for improvement and innovation. Ultimately, the goal is to develop AI architectures capable of predicting accurate and transferable solutions to the Schrödinger equation, thereby revolutionizing in silico experiments within chemistry and materials science.
Collapse
Affiliation(s)
- Abdulrahman Aldossary
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
| | | | - Sergio Pablo-García
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- Department of Computer Science, University of Toronto, 40 St. George Street, Toronto, ON, M5S 2E4, Canada
| | - Shi Xuan Leong
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
| | - Ella Miray Rajaonson
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- Vector Institute for Artificial Intelligence, 661 University Ave. Suite 710, Toronto, ON, M5G 1M1, Canada
| | - Luca Thiede
- Department of Computer Science, University of Toronto, 40 St. George Street, Toronto, ON, M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, 661 University Ave. Suite 710, Toronto, ON, M5G 1M1, Canada
| | - Gary Tom
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- Vector Institute for Artificial Intelligence, 661 University Ave. Suite 710, Toronto, ON, M5G 1M1, Canada
| | - Andrew Wang
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
| | - Davide Avagliano
- Chimie ParisTech, PSL University, CNRS, Institute of Chemistry for Life and Health Sciences (iCLeHS UMR 8060), Paris, F-75005, France
| | - Alán Aspuru-Guzik
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- Department of Computer Science, University of Toronto, 40 St. George Street, Toronto, ON, M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, 661 University Ave. Suite 710, Toronto, ON, M5G 1M1, Canada
- Department of Materials Science & Engineering, University of Toronto, 184 College St., Toronto, ON, M5S 3E4, Canada
- Department of Chemical Engineering & Applied Chemistry, University of Toronto, 200 College St., Toronto, ON, M5S 3E5, Canada
- Lebovic Fellow, Canadian Institute for Advanced Research (CIFAR), 66118 University Ave., Toronto, M5G 1M1, Canada
- Acceleration Consortium, 80 St George St, Toronto, M5S 3H6, Canada
| |
Collapse
|
16
|
Fu W, Mo Y, Xiao Y, Liu C, Zhou F, Wang Y, Zhou J, Zhang YJ. Enhancing Molecular Energy Predictions with Physically Constrained Modifications to the Neural Network Potential. J Chem Theory Comput 2024; 20:4533-4544. [PMID: 38828925 DOI: 10.1021/acs.jctc.3c01181] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/05/2024]
Abstract
Exclusively prioritizing the precision of energy prediction frequently proves inadequate in satisfying multifaceted requirements. A heightened focus is warranted on assessing the rationality of potential energy curves predicted by machine learning-based force fields (MLFFs), alongside evaluating the pragmatic utility of these MLFFs. This study introduces SWANI, an optimized neural network potential stemming from the ANI framework. Through the incorporation of supplementary physical constraints, SWANI aligns more cohesively with chemical expectations, yielding rational potential energy profiles. It also exhibits superior predictive precision compared with that of the ANI model. Additionally, a comprehensive comparison is conducted between SWANI and a prominent graph neural network-based model. The findings indicate that SWANI outperforms the latter, particularly for molecules exceeding the dimensions of the training set. This outcome underscores SWANI's exceptional capacity for generalization and its proficiency in handling larger molecular systems.
Collapse
Affiliation(s)
- Weiqiang Fu
- Beijing StoneWise Technology Co., Ltd., Haidian Street 15, Haidian District, Beijing 100080, China
| | - Yujie Mo
- Beijing StoneWise Technology Co., Ltd., Haidian Street 15, Haidian District, Beijing 100080, China
| | - Yi Xiao
- Beijing StoneWise Technology Co., Ltd., Haidian Street 15, Haidian District, Beijing 100080, China
| | - Chang Liu
- Beijing StoneWise Technology Co., Ltd., Haidian Street 15, Haidian District, Beijing 100080, China
| | - Feng Zhou
- Beijing StoneWise Technology Co., Ltd., Haidian Street 15, Haidian District, Beijing 100080, China
| | - Yang Wang
- Beijing StoneWise Technology Co., Ltd., Haidian Street 15, Haidian District, Beijing 100080, China
| | - Jielong Zhou
- Beijing StoneWise Technology Co., Ltd., Haidian Street 15, Haidian District, Beijing 100080, China
| | - Yingsheng J Zhang
- Beijing StoneWise Technology Co., Ltd., Haidian Street 15, Haidian District, Beijing 100080, China
| |
Collapse
|
17
|
Yang Y, Zhang S, Ranasinghe KD, Isayev O, Roitberg AE. Machine Learning of Reactive Potentials. Annu Rev Phys Chem 2024; 75:371-395. [PMID: 38941524 DOI: 10.1146/annurev-physchem-062123-024417] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/30/2024]
Abstract
In the past two decades, machine learning potentials (MLPs) have driven significant developments in chemical, biological, and material sciences. The construction and training of MLPs enable fast and accurate simulations and analysis of thermodynamic and kinetic properties. This review focuses on the application of MLPs to reaction systems with consideration of bond breaking and formation. We review the development of MLP models, primarily with neural network and kernel-based algorithms, and recent applications of reactive MLPs (RMLPs) to systems at different scales. We show how RMLPs are constructed, how they speed up the calculation of reactive dynamics, and how they facilitate the study of reaction trajectories, reaction rates, free energy calculations, and many other calculations. Different data sampling strategies applied in building RMLPs are also discussed with a focus on how to collect structures for rare events and how to further improve their performance with active learning.
Collapse
Affiliation(s)
- Yinuo Yang
- Department of Chemistry, University of Florida, Gainesville, Florida;
| | - Shuhao Zhang
- Department of Chemistry, Carnegie Mellon University, Pittsburgh, Pennsylvania;
| | | | - Olexandr Isayev
- Department of Chemistry, Carnegie Mellon University, Pittsburgh, Pennsylvania;
| | - Adrian E Roitberg
- Department of Chemistry, University of Florida, Gainesville, Florida;
| |
Collapse
|
18
|
Laskar MR, Bhattacharya A, Dasgputa K. Efficient simulation of potential energy operators on quantum hardware: a study on sodium iodide (NaI). Sci Rep 2024; 14:10831. [PMID: 38734700 PMCID: PMC11582323 DOI: 10.1038/s41598-024-60605-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2023] [Accepted: 04/25/2024] [Indexed: 05/13/2024] Open
Abstract
This study introduces a conceptually novel polynomial encoding algorithm for simulating potential energy operators encoded in diagonal unitary forms in a quantum computing machine. The current trend in quantum computational chemistry is effective experimentation to achieve high-precision quantum computational advantage. However, high computational gate complexity and fidelity loss are some of the impediments to the realization of this advantage in a real quantum hardware. In this study, we address the challenges of building a diagonal Hamiltonian operator having exponential functional form, and its implementation in the context of the time evolution problem (Hamiltonian simulation and encoding). Potential energy operators when represented in the first quantization form is an example of such types of operators. Through systematic decomposition and construction, we demonstrate the efficacy of the proposed polynomial encoding method in reducing gate complexity from O ( 2 n ) to O ∑ i = 1 r n C r (for some r ≪ n ). This offers a solution with lower complexity in comparison to the conventional Hadamard basis encoding approach. The effectiveness of the proposed algorithm was validated with its implementation in the IBM quantum simulator and IBM quantum hardware. This study demonstrates the proposed approach by taking the example of the potential energy operator of the sodium iodide molecule (NaI) in the first quantization form. The numerical results demonstrate the potential applicability of the proposed method in quantum chemistry problems, while the analytical bound for error analysis and computational gate complexity discussed, throw light on issues regarding its implementation.
Collapse
Affiliation(s)
- Mostafizur Rahaman Laskar
- IBM Research, Bangalore, India.
- G. S. Sanyal School of Telecommunications, Indian Institute of Technology Kharagpur, Kharagpur, India.
| | | | | |
Collapse
|
19
|
Wan K, He J, Shi X. Construction of High Accuracy Machine Learning Interatomic Potential for Surface/Interface of Nanomaterials-A Review. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2024; 36:e2305758. [PMID: 37640376 DOI: 10.1002/adma.202305758] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Revised: 08/24/2023] [Indexed: 08/31/2023]
Abstract
The inherent discontinuity and unique dimensional attributes of nanomaterial surfaces and interfaces bestow them with various exceptional properties. These properties, however, also introduce difficulties for both experimental and computational studies. The advent of machine learning interatomic potential (MLIP) addresses some of the limitations associated with empirical force fields, presenting a valuable avenue for accurate simulations of these surfaces/interfaces of nanomaterials. Central to this approach is the idea of capturing the relationship between system configuration and potential energy, leveraging the proficiency of machine learning (ML) to precisely approximate high-dimensional functions. This review offers an in-depth examination of MLIP principles and their execution and elaborates on their applications in the realm of nanomaterial surface and interface systems. The prevailing challenges faced by this potent methodology are also discussed.
Collapse
Affiliation(s)
- Kaiwei Wan
- Laboratory of Theoretical and Computational Nanoscience, National Center for Nanoscience and Technology, Chinese Academy of Sciences, Beijing, 100190, China
- University of Chinese Academy of Sciences, No. 19A Yuquan Road, Beijing, 100049, China
| | - Jianxin He
- Laboratory of Theoretical and Computational Nanoscience, National Center for Nanoscience and Technology, Chinese Academy of Sciences, Beijing, 100190, China
- University of Chinese Academy of Sciences, No. 19A Yuquan Road, Beijing, 100049, China
| | - Xinghua Shi
- Laboratory of Theoretical and Computational Nanoscience, National Center for Nanoscience and Technology, Chinese Academy of Sciences, Beijing, 100190, China
- University of Chinese Academy of Sciences, No. 19A Yuquan Road, Beijing, 100049, China
| |
Collapse
|
20
|
Song K, Upadhyay M, Meuwly M. OH-Formation following vibrationally induced reaction dynamics of H 2COO. Phys Chem Chem Phys 2024; 26:12698-12708. [PMID: 38602285 DOI: 10.1039/d4cp00739e] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/12/2024]
Abstract
The reaction dynamics of H2COO to form HCOOH and dioxirane as first steps for OH-elimination is quantitatively investigated. Using a machine learned potential energy surface (PES) at the CASPT2/aug-cc-pVTZ level of theory vibrational excitation along the CH-normal mode νCH with energies up to 40.0 kcal mol-1 (∼5νCH) leads almost exclusively to HCOOH which further decomposes into OH + HCO. Although the barrier to form dioxirane is only 21.4 kcal mol-1 the reaction probability to form dioxirane is two orders of magnitude lower if the CH-stretch mode is excited. Following the dioxirane-formation pathway is facile, however, if the COO-bend vibration is excited together with energies equivalent to ∼2νCH or ∼3νCOO. For OH-formation in the atmosphere the pathway through HCOOH is probably most relevant because the alternative pathways (through dioxirane or formic acid) involve several intermediates that can de-excite through collisions, relax via internal vibrational relaxation (IVR), or pass through loose and vulnerable transition states (formic acid). This work demonstrates how, by selectively exciting particular vibrational modes, it is possible to dial into desired reaction channels with a high degree of specificity.
Collapse
Affiliation(s)
- Kaisheng Song
- Department of Chemistry, University of Basel, Klingelbergstrasse 80, CH-4056 Basel, Switzerland.
- School of Chemistry and Chemical Engineering & Chongqing Key Laboratory of Theoretical and Computational Chemistry, Chongqing University, Chongqing 401331, China
| | - Meenu Upadhyay
- Department of Chemistry, University of Basel, Klingelbergstrasse 80, CH-4056 Basel, Switzerland.
| | - Markus Meuwly
- Department of Chemistry, University of Basel, Klingelbergstrasse 80, CH-4056 Basel, Switzerland.
| |
Collapse
|
21
|
Tkaczyk S, Karwounopoulos J, Schöller A, Woodcock HL, Langer T, Boresch S, Wieder M. Reweighting from Molecular Mechanics Force Fields to the ANI-2x Neural Network Potential. J Chem Theory Comput 2024; 20:2719-2728. [PMID: 38527958 DOI: 10.1021/acs.jctc.3c01274] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/27/2024]
Abstract
To achieve chemical accuracy in free energy calculations, it is necessary to accurately describe the system's potential energy surface and efficiently sample configurations from its Boltzmann distribution. While neural network potentials (NNPs) have shown significantly higher accuracy than classical molecular mechanics (MM) force fields, they have a limited range of applicability and are considerably slower than MM potentials, often by orders of magnitude. To address this challenge, Rufa et al. [Rufa et al. bioRxiv 2020, 10.1101/2020.07.29.227959.] suggested a two-stage approach that uses a fast and established MM alchemical energy protocol, followed by reweighting the results using NNPs, known as endstate correction or indirect free energy calculation. This study systematically investigates the accuracy and robustness of reweighting from an MM reference to a neural network target potential (ANI-2x) for an established data set in vacuum, using single-step free-energy perturbation (FEP) and nonequilibrium (NEQ) switching simulation. We assess the influence of longer switching lengths and the impact of slow degrees of freedom on outliers in the work distribution and compare the results to those of multistate equilibrium free energy simulations. Our results demonstrate that free energy calculations between NNPs and MM potentials should be preferably performed using NEQ switching simulations to obtain accurate free energy estimates. NEQ switching simulations between the MM potentials and NNPs are efficient, robust, and trivial to implement.
Collapse
Affiliation(s)
- Sara Tkaczyk
- Department of Pharmaceutical Sciences, Pharmaceutical Chemistry Division, University of Vienna, Josef-Holaubek-Platz 2, 1090 Vienna, Austria
- Vienna Doctoral School of Pharmaceutical, Nutritional and Sport Sciences (PhaNuSpo), University of Vienna, 1090 Vienna, Austria
| | - Johannes Karwounopoulos
- Faculty of Chemistry, Institute of Computational Biological Chemistry, University of Vienna, Währingerstrasse 17, 1090 Vienna, Austria
- Vienna Doctoral School of Chemistry (DoSChem), University of Vienna, Währingerstrasse 42, 1090 Vienna, Austria
| | - Andreas Schöller
- Faculty of Chemistry, Institute of Computational Biological Chemistry, University of Vienna, Währingerstrasse 17, 1090 Vienna, Austria
- Vienna Doctoral School of Chemistry (DoSChem), University of Vienna, Währingerstrasse 42, 1090 Vienna, Austria
| | - H Lee Woodcock
- Department of Chemistry, University of South Florida, 4202 E. Fowler Ave., CHE205, Tampa, Florida 33620-5250, United States
| | - Thierry Langer
- Department of Pharmaceutical Sciences, Pharmaceutical Chemistry Division, University of Vienna, Josef-Holaubek-Platz 2, 1090 Vienna, Austria
| | - Stefan Boresch
- Faculty of Chemistry, Institute of Computational Biological Chemistry, University of Vienna, Währingerstrasse 17, 1090 Vienna, Austria
| | - Marcus Wieder
- Faculty of Chemistry, Institute of Computational Biological Chemistry, University of Vienna, Währingerstrasse 17, 1090 Vienna, Austria
| |
Collapse
|
22
|
Käser S, Meuwly M. Numerical Accuracy Matters: Applications of Machine Learned Potential Energy Surfaces. J Phys Chem Lett 2024:3419-3424. [PMID: 38506827 DOI: 10.1021/acs.jpclett.3c03405] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/21/2024]
Abstract
The role of numerical accuracy in training and evaluating neural network-based potential energy surfaces is examined for different experimental observables. For observables that require third- and fourth-order derivatives of the potential energy with respect to Cartesian coordinates single-precision arithmetics as is typically used in ML-based approaches is insufficient and leads to roughness of the underlying PES as is explicitly demonstrated. Increasing the numerical accuracy to double-precision gives a smooth PES with higher-order derivatives that are numerically stable and yield meaningful anharmonic frequencies and tunneling splitting as is demonstrated for H2CO and malonaldehyde. For molecular dynamics simulations, which only require first-order derivatives, single-precision arithmetics appears to be sufficient, though.
Collapse
Affiliation(s)
- Silvan Käser
- Department of Chemistry, University of Basel, Klingelbergstrasse 80, CH-4056 Basel, Switzerland
| | - Markus Meuwly
- Department of Chemistry, University of Basel, Klingelbergstrasse 80, CH-4056 Basel, Switzerland
| |
Collapse
|
23
|
Horn KP, Vazquez-Salazar LI, Koch CP, Meuwly M. Improving potential energy surfaces using measured Feshbach resonance states. SCIENCE ADVANCES 2024; 10:eadi6462. [PMID: 38427733 PMCID: PMC10906917 DOI: 10.1126/sciadv.adi6462] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/10/2023] [Accepted: 01/29/2024] [Indexed: 03/03/2024]
Abstract
The structure and dynamics of a molecular system is governed by its potential energy surface (PES), representing the total energy as a function of the nuclear coordinates. Obtaining accurate potential energy surfaces is limited by the exponential scaling of Hilbert space, restricting quantitative predictions of experimental observables from first principles to small molecules with just a few electrons. Here, we present an explicitly physics-informed approach for improving and assessing the quality of families of PESs by modifying them through linear coordinate transformations based on experimental data. We demonstrate this "morphing" of the PES for the He - H2+ complex using recent comprehensive Feshbach resonance (FR) measurements for reference PESs at three different levels of quantum chemistry. In all cases, the positions and intensities of peaks in the energy distributions are improved. We find these observables to be mainly sensitive to the long-range part of the PES.
Collapse
Affiliation(s)
- Karl P. Horn
- Dahlem Center for Complex Quantum Systems and Fachbereich Physik, Freie Universität Berlin, Arnimallee 14, D-14195 Berlin, Germany
| | | | - Christiane P. Koch
- Dahlem Center for Complex Quantum Systems and Fachbereich Physik, Freie Universität Berlin, Arnimallee 14, D-14195 Berlin, Germany
| | - Markus Meuwly
- Department of Chemistry, University of Basel, Klingelbergstrasse 80, CH-4056 Basel, Switzerland
| |
Collapse
|
24
|
Upadhyay M, Töpfer K, Meuwly M. Molecular Simulation for Atmospheric Reactions: Non-Equilibrium Dynamics, Roaming, and Glycolaldehyde Formation following Photoinduced Decomposition of syn-Acetaldehyde Oxide. J Phys Chem Lett 2024; 15:90-96. [PMID: 38147042 DOI: 10.1021/acs.jpclett.3c03131] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2023]
Abstract
The decomposition dynamics of vibrationally excited syn-CH3CHOO to form vinoxy + hydroxyl (CH2CHO + OH) radicals or to recombine to form glycolaldehyde (CH2OHCHO) are characterized using statistically significant numbers of molecular dynamics simulations using a full-dimensional neural-network-based potential energy surface at the CASPT2 level of theory. The computed final OH-translational and rotational state distributions agree well with experiments and probe the still unknown O-O bond strength DeOO for which best values from 22 to 25 kcal/mol are found. OH-elimination rates are consistent with experiments and do not vary appreciably with DeOO due to the non-equilibrium nature of the process. In addition to the OH-elimination pathway, OH roaming is observed following O-O scission, which leads to glycolaldehyde formation on the picosecond time scale. Together with recent work involving the methyl-ethyl-substituted Criegee intermediate, we conclude that OH roaming is a general pathway to be included in molecular-level modeling of atmospheric processes. This work demonstrates that atomistic simulations with machine-learned energy functions provide a viable route for exploring the chemistry and reaction dynamics of atmospheric reactions.
Collapse
Affiliation(s)
- Meenu Upadhyay
- Department of Chemistry, University of Basel, Klingelbergstrasse 80, CH-4056 Basel, Switzerland
| | - Kai Töpfer
- Department of Chemistry, University of Basel, Klingelbergstrasse 80, CH-4056 Basel, Switzerland
| | - Markus Meuwly
- Department of Chemistry, University of Basel, Klingelbergstrasse 80, CH-4056 Basel, Switzerland
| |
Collapse
|
25
|
Abedi M, Behler J, Goldsmith CF. High-Dimensional Neural Network Potentials for Accurate Prediction of Equation of State: A Case Study of Methane. J Chem Theory Comput 2023; 19:7825-7832. [PMID: 37902963 DOI: 10.1021/acs.jctc.3c00469] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/01/2023]
Abstract
Machine learning-based interatomic potentials, such as those provided by neural networks, are increasingly important in molecular dynamics simulations. In the present work, we consider the applicability and robustness of machine learning molecular dynamics to predict the equation of state properties of methane by using high-dimensional neural network potentials (HDNNPs). We investigate two different strategies for generating training data: one strategy based upon bulk representations using periodic cells and another strategy based upon clusters of molecules. We assess the accuracy of the trained potentials by predicting the equilibrium mass density for a wide range of thermodynamic conditions to characterize the liquid phase, supercritical fluid, and gas phase, as well as the liquid-vapor coexistence curve. Our results show an excellent agreement with reference phase diagrams, with an average error below ∼2% for all studied phases. Moreover, we confirm the applicability of models trained on cluster data sets for producing accurate and reliable results.
Collapse
Affiliation(s)
- Mostafa Abedi
- School of Engineering, Brown University, Providence, Rhode Island 02906, United States
| | - Jörg Behler
- Lehrstuhl für Theoretische Chemie II, Ruhr-Universität Bochum, 44780 Bochum, Germany
- Research Center Chemical Sciences and Sustainability, Research Alliance Ruhr, 44780 Bochum, Germany
| | - C Franklin Goldsmith
- School of Engineering, Brown University, Providence, Rhode Island 02906, United States
| |
Collapse
|
26
|
Casetti N, Alfonso-Ramos JE, Coley CW, Stuyver T. Combining Molecular Quantum Mechanical Modeling and Machine Learning for Accelerated Reaction Screening and Discovery. Chemistry 2023; 29:e202301957. [PMID: 37526059 DOI: 10.1002/chem.202301957] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2023] [Revised: 07/30/2023] [Accepted: 07/31/2023] [Indexed: 08/02/2023]
Abstract
Molecular quantum mechanical modeling, accelerated by machine learning, has opened the door to high-throughput screening campaigns of complex properties, such as the activation energies of chemical reactions and absorption/emission spectra of materials and molecules; in silico. Here, we present an overview of the main principles, concepts, and design considerations involved in such hybrid computational quantum chemistry/machine learning screening workflows, with a special emphasis on some recent examples of their successful application. We end with a brief outlook of further advances that will benefit the field.
Collapse
Affiliation(s)
- Nicholas Casetti
- Department of Chemical Engineering, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts, 02139, United States
| | - Javier E Alfonso-Ramos
- Ecole Nationale Supérieure de Chimie de Paris, Université PSL, CNRS, Institute of Chemistry for Life and Health Sciences, 75005, Paris, France
| | - Connor W Coley
- Department of Chemical Engineering, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts, 02139, United States
- Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts, 02139, United States
| | - Thijs Stuyver
- Ecole Nationale Supérieure de Chimie de Paris, Université PSL, CNRS, Institute of Chemistry for Life and Health Sciences, 75005, Paris, France
| |
Collapse
|
27
|
Eckhoff M, Reiher M. Lifelong Machine Learning Potentials. J Chem Theory Comput 2023; 19:3509-3525. [PMID: 37288932 PMCID: PMC10308836 DOI: 10.1021/acs.jctc.3c00279] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2023] [Indexed: 06/09/2023]
Abstract
Machine learning potentials (MLPs) trained on accurate quantum chemical data can retain the high accuracy, while inflicting little computational demands. On the downside, they need to be trained for each individual system. In recent years, a vast number of MLPs have been trained from scratch because learning additional data typically requires retraining on all data to not forget previously acquired knowledge. Additionally, most common structural descriptors of MLPs cannot represent efficiently a large number of different chemical elements. In this work, we tackle these problems by introducing element-embracing atom-centered symmetry functions (eeACSFs), which combine structural properties and element information from the periodic table. These eeACSFs are key for our development of a lifelong machine learning potential (lMLP). Uncertainty quantification can be exploited to transgress a fixed, pretrained MLP to arrive at a continuously adapting lMLP, because a predefined level of accuracy can be ensured. To extend the applicability of an lMLP to new systems, we apply continual learning strategies to enable autonomous and on-the-fly training on a continuous stream of new data. For the training of deep neural networks, we propose the continual resilient (CoRe) optimizer and incremental learning strategies relying on rehearsal of data, regularization of parameters, and the architecture of the model.
Collapse
Affiliation(s)
- Marco Eckhoff
- ETH Zürich, Departement Chemie und Angewandte Biowissenschaften, 8093 Zürich, Switzerland
| | - Markus Reiher
- ETH Zürich, Departement Chemie und Angewandte Biowissenschaften, 8093 Zürich, Switzerland
| |
Collapse
|
28
|
Staub R, Gantzer P, Harabuchi Y, Maeda S, Varnek A. Challenges for Kinetics Predictions via Neural Network Potentials: A Wilkinson's Catalyst Case. Molecules 2023; 28:molecules28114477. [PMID: 37298952 DOI: 10.3390/molecules28114477] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2023] [Revised: 05/23/2023] [Accepted: 05/26/2023] [Indexed: 06/12/2023] Open
Abstract
Ab initio kinetic studies are important to understand and design novel chemical reactions. While the Artificial Force Induced Reaction (AFIR) method provides a convenient and efficient framework for kinetic studies, accurate explorations of reaction path networks incur high computational costs. In this article, we are investigating the applicability of Neural Network Potentials (NNP) to accelerate such studies. For this purpose, we are reporting a novel theoretical study of ethylene hydrogenation with a transition metal complex inspired by Wilkinson's catalyst, using the AFIR method. The resulting reaction path network was analyzed by the Generative Topographic Mapping method. The network's geometries were then used to train a state-of-the-art NNP model, to replace expensive ab initio calculations with fast NNP predictions during the search. This procedure was applied to run the first NNP-powered reaction path network exploration using the AFIR method. We discovered that such explorations are particularly challenging for general purpose NNP models, and we identified the underlying limitations. In addition, we are proposing to overcome these challenges by complementing NNP models with fast semiempirical predictions. The proposed solution offers a generally applicable framework, laying the foundations to further accelerate ab initio kinetic studies with Machine Learning Force Fields, and ultimately explore larger systems that are currently inaccessible.
Collapse
Affiliation(s)
- Ruben Staub
- Institute for Chemical Reaction Design and Discovery (WPI-ICReDD), Hokkaido University, Kita 21, Nishi 10, Kita-ku, Sapporo 001-0021, Japan
| | - Philippe Gantzer
- Institute for Chemical Reaction Design and Discovery (WPI-ICReDD), Hokkaido University, Kita 21, Nishi 10, Kita-ku, Sapporo 001-0021, Japan
| | - Yu Harabuchi
- Institute for Chemical Reaction Design and Discovery (WPI-ICReDD), Hokkaido University, Kita 21, Nishi 10, Kita-ku, Sapporo 001-0021, Japan
- Japan Science and Technology Agency (JST), ERATO Maeda Artificial Intelligence in Chemical Reaction Design and Discovery Project, Kita 10, Nishi 8, Kita-ku, Sapporo 060-0810, Japan
- Department of Chemistry, Faculty of Science, Hokkaido University, Kita 10, Nishi 8, Kita-ku, Sapporo 060-0810, Japan
| | - Satoshi Maeda
- Institute for Chemical Reaction Design and Discovery (WPI-ICReDD), Hokkaido University, Kita 21, Nishi 10, Kita-ku, Sapporo 001-0021, Japan
- Japan Science and Technology Agency (JST), ERATO Maeda Artificial Intelligence in Chemical Reaction Design and Discovery Project, Kita 10, Nishi 8, Kita-ku, Sapporo 060-0810, Japan
- Department of Chemistry, Faculty of Science, Hokkaido University, Kita 10, Nishi 8, Kita-ku, Sapporo 060-0810, Japan
- Research and Services Division of Materials Data and Integrated System (MaDIS), National Institute for Materials Science (NIMS), 1-1 Namiki, Tsukuba 305-0044, Japan
| | - Alexandre Varnek
- Institute for Chemical Reaction Design and Discovery (WPI-ICReDD), Hokkaido University, Kita 21, Nishi 10, Kita-ku, Sapporo 001-0021, Japan
- Laboratory of Chemoinformatics, UMR 7140, CNRS, University of Strasbourg, 67081 Strasbourg, France
| |
Collapse
|