1
|
Pang HW, Dong X, Johnson MS, Green WH. Subgraph Isomorphic Decision Tree to Predict Radical Thermochemistry with Bounded Uncertainty Estimation. J Phys Chem A 2024; 128:2891-2907. [PMID: 38536892 DOI: 10.1021/acs.jpca.4c00569] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/12/2024]
Abstract
Detailed chemical kinetic models offer valuable mechanistic insights into industrial applications. Automatic generation of reliable kinetic models requires fast and accurate radical thermochemistry estimation. Kineticists often prefer hydrogen bond increment (HBI) corrections from a closed-shell molecule to the corresponding radical for their interpretability, physical meaning, and facilitation of error cancellation as a relative quantity. Tree estimators, used due to limited data, currently rely on expert knowledge and manual construction, posing challenges in maintenance and improvement. In this work, we extend the subgraph isomorphic decision tree (SIDT) algorithm originally developed for rate estimation to estimate HBI corrections. We introduce a physics-aware splitting criterion, explore a bounded weighted uncertainty estimation method, and evaluate aleatoric uncertainty-based and model variance reduction-based prepruning methods. Moreover, we compile a data set of thermochemical parameters for 2210 radicals involving C, O, N, and H based on quantum chemical calculations from recently published works. We leverage the collected data set to train the SIDT model. Compared to existing empirical tree estimators, the SIDT model (1) offers an automatic approach to generating and extending the tree estimator for thermochemistry, (2) has better accuracy and R2, (3) provides significantly more realistic uncertainty estimates, and (4) has a tree structure much more advantageous in descent speed. Overall, the SIDT estimator marks a great leap in kinetic modeling, offering more precise, reliable, and scalable predictions for radical thermochemistry.
Collapse
Affiliation(s)
- Hao-Wei Pang
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Xiaorui Dong
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Matthew S Johnson
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - William H Green
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| |
Collapse
|
2
|
Johnson MS, Dong X, Grinberg Dana A, Chung Y, Farina D, Gillis RJ, Liu M, Yee NW, Blondal K, Mazeau E, Grambow CA, Payne AM, Spiekermann KA, Pang HW, Goldsmith CF, West RH, Green WH. RMG Database for Chemical Property Prediction. J Chem Inf Model 2022; 62:4906-4915. [PMID: 36222558 DOI: 10.1021/acs.jcim.2c00965] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The Reaction Mechanism Generator (RMG) database for chemical property prediction is presented. The RMG database consists of curated datasets and estimators for accurately predicting the parameters necessary for constructing a wide variety of chemical kinetic mechanisms. These datasets and estimators are mostly published and enable prediction of thermodynamics, kinetics, solvation effects, and transport properties. For thermochemistry prediction, the RMG database contains 45 libraries of thermochemical parameters with a combination of 4564 entries and a group additivity scheme with 9 types of corrections including radical, polycyclic, and surface absorption corrections with 1580 total curated groups and parameters for a graph convolutional neural network trained using transfer learning from a set of >130 000 DFT calculations to 10 000 high-quality values. Correction schemes for solvent-solute effects, important for thermochemistry in the liquid phase, are available. They include tabulated values for 195 pure solvents and 152 common solutes and a group additivity scheme for predicting the properties of arbitrary solutes. For kinetics estimation, the database contains 92 libraries of kinetic parameters containing a combined 21 000 reactions and contains rate rule schemes for 87 reaction classes trained on 8655 curated training reactions. Additional libraries and estimators are available for transport properties. All of this information is easily accessible through the graphical user interface at https://rmg.mit.edu. Bulk or on-the-fly use can be facilitated by interfacing directly with the RMG Python package which can be installed from Anaconda. The RMG database provides kineticists with easy access to estimates of the many parameters they need to model and analyze kinetic systems. This helps to speed up and facilitate kinetic analysis by enabling easy hypothesis testing on pathways, by providing parameters for model construction, and by providing checks on kinetic parameters from other sources.
Collapse
Affiliation(s)
- Matthew S Johnson
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts02139, United States
| | - Xiaorui Dong
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts02139, United States
| | - Alon Grinberg Dana
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts02139, United States.,The Wolfson Department of Chemical Engineering, Grand Technion Energy Program (GTEP), Technion─Israel Institute of Technology, Haifa3200003, Israel
| | - Yunsie Chung
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts02139, United States
| | - David Farina
- Department of Chemical Engineering, Northeastern University, Boston, Massachusetts02115, United States
| | - Ryan J Gillis
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts02139, United States
| | - Mengjie Liu
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts02139, United States
| | - Nathan W Yee
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts02139, United States
| | - Katrin Blondal
- School of Engineering, Brown University, Providence, Rhode Island02912, United States
| | - Emily Mazeau
- Department of Chemical Engineering, Northeastern University, Boston, Massachusetts02115, United States
| | - Colin A Grambow
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts02139, United States
| | - A Mark Payne
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts02139, United States
| | - Kevin A Spiekermann
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts02139, United States
| | - Hao-Wei Pang
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts02139, United States
| | - C Franklin Goldsmith
- School of Engineering, Brown University, Providence, Rhode Island02912, United States
| | - Richard H West
- Department of Chemical Engineering, Northeastern University, Boston, Massachusetts02115, United States
| | - William H Green
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts02139, United States
| |
Collapse
|
3
|
Ismail I, Chantreau Majerus R, Habershon S. Graph-Driven Reaction Discovery: Progress, Challenges, and Future Opportunities. J Phys Chem A 2022; 126:7051-7069. [PMID: 36190262 PMCID: PMC9574932 DOI: 10.1021/acs.jpca.2c06408] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Graph-based descriptors, such as bond-order matrices and adjacency matrices, offer a simple and compact way of categorizing molecular structures; furthermore, such descriptors can be readily used to catalog chemical reactions (i.e., bond-making and -breaking). As such, a number of graph-based methodologies have been developed with the goal of automating the process of generating chemical reaction network models describing the possible mechanistic chemistry in a given set of reactant species. Here, we outline the evolution of these graph-based reaction discovery schemes, with particular emphasis on more recent methods incorporating graph-based methods with semiempirical and ab initio electronic structure calculations, minimum-energy path refinements, and transition state searches. Using representative examples from homogeneous catalysis and interstellar chemistry, we highlight how these schemes increasingly act as "virtual reaction vessels" for interrogating mechanistic questions. Finally, we highlight where challenges remain, including issues of chemical accuracy and calculation speeds, as well as the inherent challenge of dealing with the vast size of accessible chemical reaction space.
Collapse
Affiliation(s)
- Idil Ismail
- Department of Chemistry, University of Warwick, CoventryCV4 7AL, United Kingdom
| | | | - Scott Habershon
- Department of Chemistry, University of Warwick, CoventryCV4 7AL, United Kingdom
| |
Collapse
|
4
|
Spiekermann K, Pattanaik L, Green WH. High accuracy barrier heights, enthalpies, and rate coefficients for chemical reactions. Sci Data 2022; 9:417. [PMID: 35851390 PMCID: PMC9293986 DOI: 10.1038/s41597-022-01529-6] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2022] [Accepted: 06/30/2022] [Indexed: 12/13/2022] Open
Abstract
Quantitative chemical reaction data, including activation energies and reaction rates, are crucial for developing detailed kinetic mechanisms and accurately predicting reaction outcomes. However, such data are often difficult to find, and high-quality datasets are especially rare. Here, we use CCSD(T)-F12a/cc-pVDZ-F12//ωB97X-D3/def2-TZVP to obtain high-quality single point calculations for nearly 22,000 unique stable species and transition states. We report the results from these quantum chemistry calculations and extract the barrier heights and reaction enthalpies to create a kinetics dataset of nearly 12,000 gas-phase reactions. These reactions involve H, C, N, and O, contain up to seven heavy atoms, and have cleaned atom-mapped SMILES. Our higher-accuracy coupled-cluster barrier heights differ significantly (RMSE of ∼5 kcal mol−1) relative to those calculated at ωB97X-D3/def2-TZVP. We also report accurate transition state theory rate coefficients \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${k}_{\infty }(T)$$\end{document}k∞(T) between 300 K and 2000 K and the corresponding Arrhenius parameters for a subset of rigid reactions. We believe this data will accelerate development of automated and reliable methods for quantitative reaction prediction. Measurement(s) | Barrier Heights • Enthalpies • Rate Coefficients | Technology Type(s) | ab initio quantum chemistry computational method |
Collapse
Affiliation(s)
- Kevin Spiekermann
- Department of Chemical Engineering, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA, 02139, USA
| | - Lagnajit Pattanaik
- Department of Chemical Engineering, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA, 02139, USA
| | - William H Green
- Department of Chemical Engineering, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA, 02139, USA.
| |
Collapse
|
5
|
Ismail I, Robertson C, Habershon S. Successes and challenges in using machine-learned activation energies in kinetic simulations. J Chem Phys 2022; 157:014109. [DOI: 10.1063/5.0096027] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The prediction of the thermodynamic and kinetic properties of chemical reactions is increasingly being addressed by machine-learning (ML) methods such as artificial neural networks (ANNs). While a number of recent studies have reported success in predicting chemical reaction activation energies, less attention has focused on how the accuracy of ML predictions filter through to predictions of macroscopic observables. Here, we consider the impact of the uncertainty associated with ML prediction of activation energies on observable properties of chemical reaction networks, as given by microkinetics simulations based on ML-predicted reaction rates. After training an ANN to predict activation energies given standard molecular descriptors for reactants and products alone, we performed microkinetics simulations of three different prototypical reaction networks: formamide decomposition, aldol reactions and decomposition of 3-hydroperoxypropanal. We find that the kinetic modelling predictions can be in excellent agreement with corresponding simulations performed with ab initio calculations, but this is dependent on the inherent energetic landscape of the networks. We use these simulations to suggest some guidelines for when ML-based activation energies can be reliable, and when one should take more care in applications to kinetics modelling.
Collapse
Affiliation(s)
| | | | - Scott Habershon
- Department of Chemistry, University of Warwick, United Kingdom
| |
Collapse
|
6
|
Pio G, Mocellin P, Vianello C, Salzano E. A detailed kinetic model for the thermal decomposition of hydroxylamine. JOURNAL OF HAZARDOUS MATERIALS 2021; 416:125641. [PMID: 33823478 DOI: 10.1016/j.jhazmat.2021.125641] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/29/2021] [Revised: 03/08/2021] [Accepted: 03/09/2021] [Indexed: 06/12/2023]
Abstract
Hydroxylamine may decompose explosively if processed and stored in certain conditions, posing critical safety issues that need to be carefully addressed. A key aspect is related to the characterization of chemical aspects involved in the explosive decomposition of hydroxylamine (HA), requiring accurate and detailed kinetic mechanisms. This work was devoted to the experimental and numerical characterization of the thermal decomposition of aqueous solutions of HA included in the range of 10%w to 50%w. The onset temperatures of thermal decomposition were determined in the range of 143-198 °C under heating rates of 2 and 5 °C min-1, respectively. A reduced mechanism listing 13 species and 11 reactions involving nitrogen-containing species was produced and validated against experimental measurements. Reaction pathways ruling the decomposition of HA were identified. The hydrogen abstraction toward HNOH and H2NO dominates the primary steps of NH2OH decomposition. The generated mechanism was adopted for the definition of a dimensionless stability diagram for the safe use of HA. Finally, results show a self-accelerating behaviour for any temperature larger than 186 °C, defining a monitoring criterion for safe storage of hydroxylamine-solutions.
Collapse
Affiliation(s)
- Gianmaria Pio
- DICAM - Università di Bologna, Via Terracini 28, 40131 Bologna, Italy
| | - Paolo Mocellin
- Dipartimento Ingegneria Civile Edile e Ambientale - Università di Padova, Via Marzolo 9, 35131 Padova, Italy
| | - Chiara Vianello
- Dipartimento Ingegneria Civile Edile e Ambientale - Università di Padova, Via Marzolo 9, 35131 Padova, Italy
| | - Ernesto Salzano
- DICAM - Università di Bologna, Via Terracini 28, 40131 Bologna, Italy.
| |
Collapse
|
7
|
Liu M, Grinberg Dana A, Johnson MS, Goldman MJ, Jocher A, Payne AM, Grambow CA, Han K, Yee NW, Mazeau EJ, Blondal K, West RH, Goldsmith CF, Green WH. Reaction Mechanism Generator v3.0: Advances in Automatic Mechanism Generation. J Chem Inf Model 2021; 61:2686-2696. [PMID: 34048230 DOI: 10.1021/acs.jcim.0c01480] [Citation(s) in RCA: 64] [Impact Index Per Article: 21.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
In chemical kinetics research, kinetic models containing hundreds of species and tens of thousands of elementary reactions are commonly used to understand and predict the behavior of reactive chemical systems. Reaction Mechanism Generator (RMG) is a software suite developed to automatically generate such models by incorporating and extrapolating from a database of known thermochemical and kinetic parameters. Here, we present the recent version 3 release of RMG and highlight improvements since the previously published description of RMG v1.0. Most notably, RMG can now generate heterogeneous catalysis models in addition to the previously available gas- and liquid-phase capabilities. For model analysis, new methods for local and global uncertainty analysis have been implemented to supplement first-order sensitivity analysis. The RMG database of thermochemical and kinetic parameters has been significantly expanded to cover more types of chemistry. The present release includes parallelization for faster model generation and a new molecule isomorphism approach to improve computational performance. RMG has also been updated to use Python 3, ensuring compatibility with the latest cheminformatics and machine learning packages. Overall, RMG v3.0 includes many changes which improve the accuracy of the generated chemical mechanisms and allow for exploration of a wider range of chemical systems.
Collapse
Affiliation(s)
- Mengjie Liu
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Alon Grinberg Dana
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States.,Wolfson Department of Chemical Engineering, Technion-Israel Institute of Technology, Haifa 3200003, Israel
| | - Matthew S Johnson
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Mark J Goldman
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Agnes Jocher
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - A Mark Payne
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Colin A Grambow
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Kehang Han
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Nathan W Yee
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Emily J Mazeau
- Department of Chemical Engineering, Northeastern University, Boston, Massachusetts 02115, United States
| | - Katrin Blondal
- School of Engineering, Brown University, Providence, Rhode Island 02912, United States
| | - Richard H West
- Department of Chemical Engineering, Northeastern University, Boston, Massachusetts 02115, United States
| | - C Franklin Goldsmith
- School of Engineering, Brown University, Providence, Rhode Island 02912, United States
| | - William H Green
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| |
Collapse
|
8
|
Affiliation(s)
- William H. Green
- Department of Chemical Engineering Massachusetts Institute of Technology Cambridge Massachusetts USA
| |
Collapse
|