1
|
Türtscher PL, Reiher M. Pathfinder─Navigating and Analyzing Chemical Reaction Networks with an Efficient Graph-Based Approach. J Chem Inf Model 2023; 63:147-160. [PMID: 36515968 PMCID: PMC9832502 DOI: 10.1021/acs.jcim.2c01136] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
While the field of first-principles explorations into chemical reaction space has been continuously growing, the development of strategies for analyzing resulting chemical reaction networks (CRNs) is lagging behind. A CRN consists of compounds linked by reactions. Analyzing how these compounds are transformed into one another based on kinetic modeling is a nontrivial task. Here, we present the graph-optimization-driven algorithm and program Pathfinder to allow for such an analysis of a CRN. The CRN for this work has been obtained with our open-source Chemoton reaction network exploration software. Chemoton probes reactive combinations of compounds for elementary steps and sorts them into reactions. By encoding these reactions of the CRN as a graph consisting of compound and reaction vertices and adding information about activation barriers as well as required reagents to the edges of the graph yields a complete graph-theoretical representation of the CRN. Since the probabilities of the formation of compounds depend on the starting conditions, the consumption of any compound during a reaction must be accounted for to reflect the availability of reagents. To account for this, we introduce compound costs to reflect compound availability. Simultaneously, the determined compound costs rank the compounds in the CRN in terms of their probability to be formed. This ranking then allows us to probe easily accessible compounds in the CRN first for further explorations into yet unexplored terrain. We first illustrate the working principle on an abstract small CRN. Afterward, Pathfinder is demonstrated in the example of the disproportionation of iodine with water and the comproportionation of iodic acid and hydrogen iodide. Both processes are analyzed within the same CRN, which we construct with our autonomous first-principles CRN exploration software Chemoton [Unsleber, J. P.; J. Chem. Theory Comput. 2022, 18, 5393-5409] guided by Pathfinder.
Collapse
|
2
|
Wen M, Spotte-Smith EWC, Blau SM, McDermott MJ, Krishnapriyan AS, Persson KA. Chemical reaction networks and opportunities for machine learning. NATURE COMPUTATIONAL SCIENCE 2023; 3:12-24. [PMID: 38177958 DOI: 10.1038/s43588-022-00369-z] [Citation(s) in RCA: 18] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/24/2022] [Accepted: 11/08/2022] [Indexed: 01/06/2024]
Abstract
Chemical reaction networks (CRNs), defined by sets of species and possible reactions between them, are widely used to interrogate chemical systems. To capture increasingly complex phenomena, CRNs can be leveraged alongside data-driven methods and machine learning (ML). In this Perspective, we assess the diverse strategies available for CRN construction and analysis in pursuit of a wide range of scientific goals, discuss ML techniques currently being applied to CRNs and outline future CRN-ML approaches, presenting scientific and technical challenges to overcome.
Collapse
Affiliation(s)
- Mingjian Wen
- Chemical and Biomolecular Engineering, University of Houston, Houston, TX, USA
- Energy Technologies Area, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Evan Walter Clark Spotte-Smith
- Materials Science Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- Materials Science and Engineering, University of California, Berkeley, Berkeley, CA, USA
| | - Samuel M Blau
- Energy Technologies Area, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Matthew J McDermott
- Materials Science Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- Materials Science and Engineering, University of California, Berkeley, Berkeley, CA, USA
| | - Aditi S Krishnapriyan
- Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- Chemical and Biomolecular Engineering, University of California, Berkeley, Berkeley, CA, USA
- Electrical Engineering and Computer Science, University of California, Berkeley, Berkeley, CA, USA
| | - Kristin A Persson
- Materials Science and Engineering, University of California, Berkeley, Berkeley, CA, USA.
- Molecular Foundry, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
| |
Collapse
|
3
|
Ismail I, Chantreau Majerus R, Habershon S. Graph-Driven Reaction Discovery: Progress, Challenges, and Future Opportunities. J Phys Chem A 2022; 126:7051-7069. [PMID: 36190262 PMCID: PMC9574932 DOI: 10.1021/acs.jpca.2c06408] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Graph-based descriptors, such as bond-order matrices and adjacency matrices, offer a simple and compact way of categorizing molecular structures; furthermore, such descriptors can be readily used to catalog chemical reactions (i.e., bond-making and -breaking). As such, a number of graph-based methodologies have been developed with the goal of automating the process of generating chemical reaction network models describing the possible mechanistic chemistry in a given set of reactant species. Here, we outline the evolution of these graph-based reaction discovery schemes, with particular emphasis on more recent methods incorporating graph-based methods with semiempirical and ab initio electronic structure calculations, minimum-energy path refinements, and transition state searches. Using representative examples from homogeneous catalysis and interstellar chemistry, we highlight how these schemes increasingly act as "virtual reaction vessels" for interrogating mechanistic questions. Finally, we highlight where challenges remain, including issues of chemical accuracy and calculation speeds, as well as the inherent challenge of dealing with the vast size of accessible chemical reaction space.
Collapse
Affiliation(s)
- Idil Ismail
- Department of Chemistry, University of Warwick, CoventryCV4 7AL, United Kingdom
| | | | - Scott Habershon
- Department of Chemistry, University of Warwick, CoventryCV4 7AL, United Kingdom
| |
Collapse
|
4
|
Ismail I, Robertson C, Habershon S. Successes and challenges in using machine-learned activation energies in kinetic simulations. J Chem Phys 2022; 157:014109. [DOI: 10.1063/5.0096027] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The prediction of the thermodynamic and kinetic properties of chemical reactions is increasingly being addressed by machine-learning (ML) methods such as artificial neural networks (ANNs). While a number of recent studies have reported success in predicting chemical reaction activation energies, less attention has focused on how the accuracy of ML predictions filter through to predictions of macroscopic observables. Here, we consider the impact of the uncertainty associated with ML prediction of activation energies on observable properties of chemical reaction networks, as given by microkinetics simulations based on ML-predicted reaction rates. After training an ANN to predict activation energies given standard molecular descriptors for reactants and products alone, we performed microkinetics simulations of three different prototypical reaction networks: formamide decomposition, aldol reactions and decomposition of 3-hydroperoxypropanal. We find that the kinetic modelling predictions can be in excellent agreement with corresponding simulations performed with ab initio calculations, but this is dependent on the inherent energetic landscape of the networks. We use these simulations to suggest some guidelines for when ML-based activation energies can be reliable, and when one should take more care in applications to kinetics modelling.
Collapse
Affiliation(s)
| | | | - Scott Habershon
- Department of Chemistry, University of Warwick, United Kingdom
| |
Collapse
|
5
|
Garay-Ruiz D, Álvarez-Moreno M, Bo C, Martínez-Núñez E. New Tools for Taming Complex Reaction Networks: The Unimolecular Decomposition of Indole Revisited. ACS PHYSICAL CHEMISTRY AU 2022; 2:225-236. [PMID: 36855573 PMCID: PMC9718323 DOI: 10.1021/acsphyschemau.1c00051] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The level of detail attained in the computational description of reaction mechanisms can be vastly improved through tools for automated chemical space exploration, particularly for systems of small to medium size. Under this approach, the unimolecular decomposition landscape for indole was explored through the automated reaction mechanism discovery program AutoMeKin. Nevertheless, the sheer complexity of the obtained mechanisms might be a hindrance regarding their chemical interpretation. In this spirit, the new Python library amk-tools has been designed to read and manipulate complex reaction networks, greatly simplifying their overall analysis. The package provides interactive dashboards featuring visualizations of the network, the three-dimensional (3D) molecular structures and vibrational normal modes of all chemical species, and the corresponding energy profiles for selected pathways. The combination of the joined mechanism generation and postprocessing workflow with the rich chemistry of indole decomposition enabled us to find new details of the reaction (obtained at the CCSD(T)/aug-cc-pVTZ//M06-2X/MG3S level of theory) that were not reported before: (i) 16 pathways leading to the formation of HCN and NH3 (via amino radical); (ii) a barrierless reaction between methylene radical and phenyl isocyanide, which might be an operative mechanism under the conditions of the interstellar medium; and (iii) reaction channels leading to both hydrogen cyanide and hydrogen isocyanide, of potential astrochemical interest as the computed HNC/HCN ratios greatly exceed the calculated equilibrium value at very low temperatures. The reported reaction networks can be very valuable to supplement databases of kinetic data, which is of remarkable interest for pyrolysis and astrochemical studies.
Collapse
Affiliation(s)
- Diego Garay-Ruiz
- Institute
of Chemical Research of Catalonia (ICIQ), Barcelona Institute of Science & Technology (BIST), Avinguda Països Catalans,
16, 43007 Tarragona, Spain,Departament
de Química Física i Inorgànica, Universitat Rovira i Virgili (URV), Marcel·lí Domingo s/n, 43007 Tarragona, Spain
| | - Moises Álvarez-Moreno
- Institute
of Chemical Research of Catalonia (ICIQ), Barcelona Institute of Science & Technology (BIST), Avinguda Països Catalans,
16, 43007 Tarragona, Spain
| | - Carles Bo
- Institute
of Chemical Research of Catalonia (ICIQ), Barcelona Institute of Science & Technology (BIST), Avinguda Països Catalans,
16, 43007 Tarragona, Spain,Departament
de Química Física i Inorgànica, Universitat Rovira i Virgili (URV), Marcel·lí Domingo s/n, 43007 Tarragona, Spain,
| | - Emilio Martínez-Núñez
- Departmento
de Química Física, Facultade de Química, Universidade de Santiago de Compostela, 15782 Santiago
de Compostela, Spain,
| |
Collapse
|
6
|
Martínez-Núñez E, Barnes GL, Glowacki DR, Kopec S, Peláez D, Rodríguez A, Rodríguez-Fernández R, Shannon RJ, Stewart JJP, Tahoces PG, Vazquez SA. AutoMeKin2021: An open-source program for automated reaction discovery. J Comput Chem 2021; 42:2036-2048. [PMID: 34387374 DOI: 10.1002/jcc.26734] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2021] [Revised: 07/16/2021] [Accepted: 07/27/2021] [Indexed: 01/10/2023]
Abstract
AutoMeKin2021 is an updated version of tsscds2018, a program for the automated discovery of reaction mechanisms (J. Comput. Chem. 2018, 39, 1922). This release features a number of new capabilities: rare-event molecular dynamics simulations to enhance reaction discovery, extension of the original search algorithm to study van der Waals complexes, use of chemical knowledge, a new search algorithm based on bond-order time series analysis, statistics of the chemical reaction networks, a web application to submit jobs, and other features. The source code, manual, installation instructions and the website link are available at: https://rxnkin.usc.es/index.php/AutoMeKin.
Collapse
Affiliation(s)
- Emilio Martínez-Núñez
- Department of Physical Chemistry, University of Santiago de Compostela, Santiago de Compostela, Spain
| | - George L Barnes
- Department of Chemistry and Biochemistry, Siena College, Loudonville, New York, USA
| | - David R Glowacki
- Centre for Computational Chemistry, School of Chemistry, University of Bristol, Bristol, UK
| | - Sabine Kopec
- Institut de Sciences Moléculaires d'Orsay, UMR 8214, Université Paris-Sud - Université Paris-Saclay, Orsay, France
| | - Daniel Peláez
- Institut de Sciences Moléculaires d'Orsay, UMR 8214, Université Paris-Sud - Université Paris-Saclay, Orsay, France
| | - Aurelio Rodríguez
- Galicia Supercomputing Center (CESGA), Santiago de Compostela, Spain
| | | | - Robin J Shannon
- Centre for Computational Chemistry, School of Chemistry, University of Bristol, Bristol, UK
| | | | - Pablo G Tahoces
- Department of Electronics and Computer Science, University of Santiago de Compostela, Santiago de Compostela, Spain
| | - Saulo A Vazquez
- Department of Physical Chemistry, University of Santiago de Compostela, Santiago de Compostela, Spain
| |
Collapse
|
7
|
Xie X, Clark Spotte-Smith EW, Wen M, Patel HD, Blau SM, Persson KA. Data-Driven Prediction of Formation Mechanisms of Lithium Ethylene Monocarbonate with an Automated Reaction Network. J Am Chem Soc 2021; 143:13245-13258. [PMID: 34379977 DOI: 10.1021/jacs.1c05807] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Interfacial reactions are notoriously difficult to characterize, and robust prediction of the chemical evolution and associated functionality of the resulting surface film is one of the grand challenges of materials chemistry. The solid-electrolyte interphase (SEI), critical to Li-ion batteries (LIBs), exemplifies such a surface film, and despite decades of work, considerable controversy remains regarding the major components of the SEI as well as their formation mechanisms. Here we use a reaction network to investigate whether lithium ethylene monocarbonate (LEMC) or lithium ethylene dicarbonate (LEDC) is the major organic component of the LIB SEI. Our data-driven, automated methodology is based on a systematic generation of relevant species using a general fragmentation/recombination procedure which provides the basis for a vast thermodynamic reaction landscape, calculated with density functional theory. The shortest pathfinding algorithms are employed to explore the reaction landscape and obtain previously proposed formation mechanisms of LEMC as well as several new reaction pathways and intermediates. For example, we identify two novel LEMC formation mechanisms: one which involves LiH generation and another that involves breaking the (CH2)O-C(═O)OLi bond in LEDC. Most importantly, we find that all identified paths, which are also kinetically favorable under the explored conditions, require water as a reactant. This condition severely limits the amount of LEMC that can form, as compared with LEDC, a conclusion that has direct impact on the SEI formation in Li-ion energy storage systems. Finally, the data-driven framework presented here is generally applicable to any electrochemical system and expected to improve our understanding of surface passivation.
Collapse
Affiliation(s)
- Xiaowei Xie
- Department of Chemistry, University of California, Berkeley, California 94720, United States.,Materials Science Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States
| | - Evan Walter Clark Spotte-Smith
- Materials Science Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States.,Department of Materials Science and Engineering, University of California, Berkeley, California 94720, United States
| | - Mingjian Wen
- Energy Technologies Area, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States
| | - Hetal D Patel
- Materials Science Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States.,Department of Materials Science and Engineering, University of California, Berkeley, California 94720, United States
| | - Samuel M Blau
- Energy Technologies Area, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States
| | - Kristin A Persson
- Department of Materials Science and Engineering, University of California, Berkeley, California 94720, United States.,Molecular Foundry, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States
| |
Collapse
|
8
|
Robertson C, Hyland R, Lacey AJD, Havens S, Habershon S. Identifying Barrierless Mechanisms for Benzene Formation in the Interstellar Medium Using Permutationally Invariant Reaction Discovery. J Chem Theory Comput 2021; 17:2307-2322. [DOI: 10.1021/acs.jctc.1c00046] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
| | - Ross Hyland
- Department of Chemistry, University of Warwick, Coventry CV4 7AL, United Kingdom
| | - Andrew J. D. Lacey
- Department of Chemistry, University of Warwick, Coventry CV4 7AL, United Kingdom
| | - Sebastian Havens
- Department of Chemistry, University of Warwick, Coventry CV4 7AL, United Kingdom
| | - Scott Habershon
- Department of Chemistry, University of Warwick, Coventry CV4 7AL, United Kingdom
| |
Collapse
|
9
|
Rasmussen MH, Jensen JH. Fast and automatic estimation of transition state structures using tight binding quantum chemical calculations. PEERJ PHYSICAL CHEMISTRY 2020. [DOI: 10.7717/peerj-pchem.15] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
We present a method for the automatic determination of transition states (TSs) that is based on Grimme’s RMSD-PP semiempirical tight binding reaction path method (J. Chem. Theory Comput. 2019, 15, 2847–2862), where the maximum energy structure along the path serves as an initial guess for DFT TS searches. The method is tested on 100 elementary reactions and located a total of 89 TSs correctly. Of the 11 remaining reactions, nine are shown not to be elementary reactions after all and for one of the two true failures the problem is shown to be the semiempirical tight binding model itself. Furthermore, we show that the GFN2-xTB RMSD-PP barrier is a good approximation for the corresponding DFT barrier for reactions with DFT barrier heights up to about 30 kcal/mol. Thus, GFN2-xTB RMSD-PP barrier heights, which can be estimated at the cost of a single energy minimisation, can be used to quickly identify reactions with low barriers, although it will also produce some false positives.
Collapse
Affiliation(s)
| | - Jan H. Jensen
- Department of Chemistry, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|