1
|
Aldossary A, Campos-Gonzalez-Angulo JA, Pablo-García S, Leong SX, Rajaonson EM, Thiede L, Tom G, Wang A, Avagliano D, Aspuru-Guzik A. In Silico Chemical Experiments in the Age of AI: From Quantum Chemistry to Machine Learning and Back. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2024; 36:e2402369. [PMID: 38794859 DOI: 10.1002/adma.202402369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/15/2024] [Revised: 04/28/2024] [Indexed: 05/26/2024]
Abstract
Computational chemistry is an indispensable tool for understanding molecules and predicting chemical properties. However, traditional computational methods face significant challenges due to the difficulty of solving the Schrödinger equations and the increasing computational cost with the size of the molecular system. In response, there has been a surge of interest in leveraging artificial intelligence (AI) and machine learning (ML) techniques to in silico experiments. Integrating AI and ML into computational chemistry increases the scalability and speed of the exploration of chemical space. However, challenges remain, particularly regarding the reproducibility and transferability of ML models. This review highlights the evolution of ML in learning from, complementing, or replacing traditional computational chemistry for energy and property predictions. Starting from models trained entirely on numerical data, a journey set forth toward the ideal model incorporating or learning the physical laws of quantum mechanics. This paper also reviews existing computational methods and ML models and their intertwining, outlines a roadmap for future research, and identifies areas for improvement and innovation. Ultimately, the goal is to develop AI architectures capable of predicting accurate and transferable solutions to the Schrödinger equation, thereby revolutionizing in silico experiments within chemistry and materials science.
Collapse
Affiliation(s)
- Abdulrahman Aldossary
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
| | | | - Sergio Pablo-García
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- Department of Computer Science, University of Toronto, 40 St. George Street, Toronto, ON, M5S 2E4, Canada
| | - Shi Xuan Leong
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
| | - Ella Miray Rajaonson
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- Vector Institute for Artificial Intelligence, 661 University Ave. Suite 710, Toronto, ON, M5G 1M1, Canada
| | - Luca Thiede
- Department of Computer Science, University of Toronto, 40 St. George Street, Toronto, ON, M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, 661 University Ave. Suite 710, Toronto, ON, M5G 1M1, Canada
| | - Gary Tom
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- Vector Institute for Artificial Intelligence, 661 University Ave. Suite 710, Toronto, ON, M5G 1M1, Canada
| | - Andrew Wang
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
| | - Davide Avagliano
- Chimie ParisTech, PSL University, CNRS, Institute of Chemistry for Life and Health Sciences (iCLeHS UMR 8060), Paris, F-75005, France
| | - Alán Aspuru-Guzik
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- Department of Computer Science, University of Toronto, 40 St. George Street, Toronto, ON, M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, 661 University Ave. Suite 710, Toronto, ON, M5G 1M1, Canada
- Department of Materials Science & Engineering, University of Toronto, 184 College St., Toronto, ON, M5S 3E4, Canada
- Department of Chemical Engineering & Applied Chemistry, University of Toronto, 200 College St., Toronto, ON, M5S 3E5, Canada
- Lebovic Fellow, Canadian Institute for Advanced Research (CIFAR), 66118 University Ave., Toronto, M5G 1M1, Canada
- Acceleration Consortium, 80 St George St, Toronto, M5S 3H6, Canada
| |
Collapse
|
2
|
Okada H, Maeda S. On Accelerating Substrate Optimization Using Computational Gibbs Energy Barriers: A Numerical Consideration Utilizing a Computational Data Set. ACS OMEGA 2024; 9:7123-7131. [PMID: 38371820 PMCID: PMC10870292 DOI: 10.1021/acsomega.3c09066] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Revised: 01/05/2024] [Accepted: 01/16/2024] [Indexed: 02/20/2024]
Abstract
Substrate optimization is a time- and resource-consuming step in organic synthesis. Recent advances in chemo- and materials-informatics provide systematic and efficient procedures utilizing tools such as Bayesian optimization (BO). This study explores the possibility of reducing the required experiments further by utilizing computational Gibbs energy barriers. To thoroughly validate the impact of using computational Gibbs energy barriers in BO-assisted substrate optimization, this study employs a computational Gibbs energy barrier data set in the literature and performs an extensive numerical investigation virtually regarding the Gibbs energy barriers as virtual experimental results and those with systematic and random noises as virtual computational results. The present numerical investigation shows that even the computational reactivity affected by noises of as much as 20 kJ/mol helps reduce the number of required experiments.
Collapse
Affiliation(s)
- Hiroaki Okada
- Graduate
School of Chemical Sciences and Engineering, Hokkaido University, Sapporo, Hokkaido 060-8628, Japan
| | - Satoshi Maeda
- Department
of Chemistry, Graduate School of Science, Hokkaido University, Sapporo, Hokkaido 060-0810, Japan
- Institute
for Chemical Reaction Design and Discovery (WPI-ICReDD), Hokkaido University, Sapporo, Hokkaido 001-0021, Japan
- ERATO
Maeda Artificial Intelligence for Chemical Reaction Design and Discovery
Project, Hokkaido University, Sapporo, Hokkaido 060-0810, Japan
- Research
and Services Division of Materials Data and Integrated System (MaDIS), National Institute for Materials Science (NIMS), Tsukuba, Ibaraki 305-0044, Japan
| |
Collapse
|
3
|
Li SW, Xu LC, Zhang C, Zhang SQ, Hong X. Reaction performance prediction with an extrapolative and interpretable graph model based on chemical knowledge. Nat Commun 2023; 14:3569. [PMID: 37322041 DOI: 10.1038/s41467-023-39283-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2022] [Accepted: 05/31/2023] [Indexed: 06/17/2023] Open
Abstract
Accurate prediction of reactivity and selectivity provides the desired guideline for synthetic development. Due to the high-dimensional relationship between molecular structure and synthetic function, it is challenging to achieve the predictive modelling of synthetic transformation with the required extrapolative ability and chemical interpretability. To meet the gap between the rich domain knowledge of chemistry and the advanced molecular graph model, herein we report a knowledge-based graph model that embeds the digitalized steric and electronic information. In addition, a molecular interaction module is developed to enable the learning of the synergistic influence of reaction components. In this study, we demonstrate that this knowledge-based graph model achieves excellent predictions of reaction yield and stereoselectivity, whose extrapolative ability is corroborated by additional scaffold-based data splittings and experimental verifications with new catalysts. Because of the embedding of local environment, the model allows the atomic level of interpretation of the steric and electronic influence on the overall synthetic performance, which serves as a useful guide for the molecular engineering towards the target synthetic function. This model offers an extrapolative and interpretable approach for reaction performance prediction, pointing out the importance of chemical knowledge-constrained reaction modelling for synthetic purpose.
Collapse
Affiliation(s)
- Shu-Wen Li
- Center of Chemistry for Frontier Technologies, Department of Chemistry, State Key Laboratory of Clean Energy Utilization, Zhejiang University, Hangzhou, 310027, China
| | - Li-Cheng Xu
- Center of Chemistry for Frontier Technologies, Department of Chemistry, State Key Laboratory of Clean Energy Utilization, Zhejiang University, Hangzhou, 310027, China
| | - Cheng Zhang
- Department of Chemistry, University of Science and Technology of China, Hefei, China
| | - Shuo-Qing Zhang
- Center of Chemistry for Frontier Technologies, Department of Chemistry, State Key Laboratory of Clean Energy Utilization, Zhejiang University, Hangzhou, 310027, China.
| | - Xin Hong
- Center of Chemistry for Frontier Technologies, Department of Chemistry, State Key Laboratory of Clean Energy Utilization, Zhejiang University, Hangzhou, 310027, China.
- Beijing National Laboratory for Molecular Sciences, Zhongguancun North First Street No. 2, Beijing, 100190, PR China.
- Key Laboratory of Precise Synthesis of Functional Molecules of Zhejiang Province, School of Science, Westlake University, 18 Shilongshan Road, Hangzhou, 310024, Zhejiang Province, China.
| |
Collapse
|
4
|
Li H, Zou L, Kowah JAH, He D, Liu Z, Ding X, Wen H, Wang L, Yuan M, Liu X. A compact review of progress and prospects of deep learning in drug discovery. J Mol Model 2023; 29:117. [PMID: 36976427 DOI: 10.1007/s00894-023-05492-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2022] [Accepted: 02/27/2023] [Indexed: 03/29/2023]
Abstract
BACKGROUND Drug discovery processes, such as new drug development, drug synergy, and drug repurposing, consume significant yearly resources. Computer-aided drug discovery can effectively improve the efficiency of drug discovery. Traditional computer methods such as virtual screening and molecular docking have achieved many gratifying results in drug development. However, with the rapid growth of computer science, data structures have changed considerably; with more extensive and dimensional data and more significant amounts of data, traditional computer methods can no longer be applied well. Deep learning methods are based on deep neural network structures that can handle high-dimensional data very well, so they are used in current drug development. RESULTS This review summarized the applications of deep learning methods in drug discovery, such as drug target discovery, drug de novo design, drug recommendation, drug synergy, and drug response prediction. While applying deep learning methods to drug discovery suffers from a lack of data, transfer learning is an excellent solution to this problem. Furthermore, deep learning methods can extract deeper features and have higher predictive power than other machine learning methods. Deep learning methods have great potential in drug discovery and are expected to facilitate drug discovery development.
Collapse
Affiliation(s)
- Huijun Li
- College of Medicine, Guangxi University, Nanning, 530004, China
| | - Lin Zou
- College of Medicine, Guangxi University, Nanning, 530004, China
| | | | - Dongqiong He
- College of Chemistry and Chemical Engineering, Guangxi University, Nanning, 530004, China
| | - Zifan Liu
- College of Medicine, Guangxi University, Nanning, 530004, China
| | - Xuejie Ding
- College of Medicine, Guangxi University, Nanning, 530004, China
| | - Hao Wen
- College of Chemistry and Chemical Engineering, Guangxi University, Nanning, 530004, China
| | - Lisheng Wang
- College of Medicine, Guangxi University, Nanning, 530004, China
| | - Mingqing Yuan
- College of Medicine, Guangxi University, Nanning, 530004, China
| | - Xu Liu
- College of Medicine, Guangxi University, Nanning, 530004, China.
| |
Collapse
|
5
|
Noto N, Yada A, Yanai T, Saito S. Machine-Learning Classification for the Prediction of Catalytic Activity of Organic Photosensitizers in the Nickel(II)-Salt-Induced Synthesis of Phenols. Angew Chem Int Ed Engl 2023; 62:e202219107. [PMID: 36645619 DOI: 10.1002/anie.202219107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2022] [Revised: 01/15/2023] [Accepted: 01/16/2023] [Indexed: 01/17/2023]
Abstract
Catalytic systems using a small amount of organic photosensitizer for the activation of an inorganic (on-demand ligand-free) nickel(II) salt represent a cost-effective method for cross-coupling reactions, while C(sp2 )-O bond formation remains less developed. Herein, we report a strategy for the synthesis of phenols with a nickel(II) salt and an organic photosensitizer, which was identified via an investigation into the catalytic activity of 60 organic photosensitizers consisting of various electron donor and acceptor moieties. To examine the effect of multiple intractable parameters on the catalytic activity of photosensitizers, machine-learning (ML) models were developed, wherein we embedded descriptors representing their physical and structural properties, which were obtained from DFT calculations and RDKit, respectively. The study clarified that integrating both DFT- and RDKit-derived descriptors in ML models balances higher "precision" and "recall" across a wide range of search space relative to using only one of the two descriptor sets.
Collapse
Affiliation(s)
- Naoki Noto
- Integrated Research Consortium on Chemical Sciences (IRCCS), Nagoya University, Nagoya, Aichi, 464-8602, Japan
| | - Akira Yada
- Interdisciplinary Research Center for Catalytic Chemistry, National Institute of Advanced Industrial Science and Technology (AIST), 1-1-1 Higashi, Tsukuba, Ibaraki, 305-8565, Japan
| | - Takeshi Yanai
- Institute of Transformative Bio-Molecules (WPI-ITbM) and Graduate School of Science, Nagoya University, Nagoya, Aichi, 464-8602, Japan
| | - Susumu Saito
- Integrated Research Consortium on Chemical Sciences (IRCCS) and Graduate School of Science, Nagoya University, Nagoya, Aichi, 464-8602, Japan
| |
Collapse
|
6
|
Chen Y, Ou Y, Zheng P, Huang Y, Ge F, Dral PO. Benchmark of general-purpose machine learning-based quantum mechanical method AIQM1 on reaction barrier heights. J Chem Phys 2023; 158:074103. [PMID: 36813722 DOI: 10.1063/5.0137101] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open
Abstract
Artificial intelligence-enhanced quantum mechanical method 1 (AIQM1) is a general-purpose method that was shown to achieve high accuracy for many applications with a speed close to its baseline semiempirical quantum mechanical (SQM) method ODM2*. Here, we evaluate the hitherto unknown performance of out-of-the-box AIQM1 without any refitting for reaction barrier heights on eight datasets, including a total of ∼24 thousand reactions. This evaluation shows that AIQM1's accuracy strongly depends on the type of transition state and ranges from excellent for rotation barriers to poor for, e.g., pericyclic reactions. AIQM1 clearly outperforms its baseline ODM2* method and, even more so, a popular universal potential, ANI-1ccx. Overall, however, AIQM1 accuracy largely remains similar to SQM methods (and B3LYP/6-31G* for most reaction types) suggesting that it is desirable to focus on improving AIQM1 performance for barrier heights in the future. We also show that the built-in uncertainty quantification helps in identifying confident predictions. The accuracy of confident AIQM1 predictions is approaching the level of popular density functional theory methods for most reaction types. Encouragingly, AIQM1 is rather robust for transition state optimizations, even for the type of reactions it struggles with the most. Single-point calculations with high-level methods on AIQM1-optimized geometries can be used to significantly improve barrier heights, which cannot be said for its baseline ODM2* method.
Collapse
Affiliation(s)
- Yuxinxin Chen
- State Key Laboratory of Physical Chemistry of Solid Surfaces, Fujian Provincial Key Laboratory of Theoretical and Computational Chemistry, Department of Chemistry, and College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China
| | - Yanchi Ou
- State Key Laboratory of Physical Chemistry of Solid Surfaces, Fujian Provincial Key Laboratory of Theoretical and Computational Chemistry, Department of Chemistry, and College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China
| | - Peikun Zheng
- State Key Laboratory of Physical Chemistry of Solid Surfaces, Fujian Provincial Key Laboratory of Theoretical and Computational Chemistry, Department of Chemistry, and College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China
| | - Yaohuang Huang
- State Key Laboratory of Physical Chemistry of Solid Surfaces, Fujian Provincial Key Laboratory of Theoretical and Computational Chemistry, Department of Chemistry, and College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China
| | - Fuchun Ge
- State Key Laboratory of Physical Chemistry of Solid Surfaces, Fujian Provincial Key Laboratory of Theoretical and Computational Chemistry, Department of Chemistry, and College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China
| | - Pavlo O Dral
- State Key Laboratory of Physical Chemistry of Solid Surfaces, Fujian Provincial Key Laboratory of Theoretical and Computational Chemistry, Department of Chemistry, and College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China
| |
Collapse
|
7
|
Zhang P. Mechanical Transmission Model and Numerical Simulation Based on Machine Learning. INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGIES AND SYSTEMS APPROACH 2023. [DOI: 10.4018/ijitsa.318457] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/18/2023]
Abstract
Mechanical transmission is one of the earliest transmission modes in human society. With the continuous progress of science and technology, effective simulation and calculation research on mechanical transmission has gradually become an important link in the study of mechanical transmission. In the actual engineering practice, reliable and accurate data are difficult to obtain due to the complexity and low accuracy of the traditional mechanical transmission process. Machine learning (ML), a model trained by data, was used to analyze the response of the system through different parameters and drew scientific and reasonable conclusions. ML is more intuitive, easier to operate, and faster in calculation than the traditional methods. In many mechanical structures, due to the large number of processing parts and data, numerical simulation of this important equipment requires a considerable time to adjust and optimize accordingly.
Collapse
Affiliation(s)
- Pan Zhang
- College of Mechanical Engineering, Jinjiang College, Sichuan University, China
| |
Collapse
|
8
|
Maley SM, Lief GR, Buck RM, Sydora OL, Yang Q, Bischof SM, Ess DH. Density functional theory and CCSD(T) evaluation of ionization potentials, redox potentials, and bond energies related to zirconocene polymerization catalysts. J Comput Chem 2023; 44:506-515. [PMID: 35662063 DOI: 10.1002/jcc.26890] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2021] [Revised: 03/28/2022] [Accepted: 04/22/2022] [Indexed: 01/07/2023]
Abstract
Quantum-mechanical-based computational design of molecular catalysts requires accurate and fast electronic structure calculations to determine and predict properties of transition-metal complexes. For Zr-based molecular complexes related to polyethylene catalysis, previous evaluation of density functional theory (DFT) and wavefunction methods only examined oxides and halides or select reaction barrier heights. In this work, we evaluate the performance of DFT against experimental redox potentials and bond dissociation enthalpies (BDEs) for zirconocene complexes directly relevant to ethylene polymerization catalysis. We also examined the ability of DFT to compute the fourth atomic ionization potential of zirconium and the effect the basis set selection has on the ionization potential computed with CCSD(T). Generally, the atomic ionization potential and redox potentials are very well reproduced by DFT, but we discovered relatively large deviations of DFT-calculated BDEs compared to experiment. However, evaluation of BDEs with CCSD(T) suggests that experimental values should be revisited, and our CCSD(T) values should be taken as most accurate.
Collapse
Affiliation(s)
- Steven M Maley
- Department of Chemistry and Biochemistry, Brigham Young University, Provo, Utah, USA
| | - Graham R Lief
- Research and Technology, Chevron Phillips Chemical Company, Bartlesville, Oklahoma, USA
| | - Richard M Buck
- Research and Technology, Chevron Phillips Chemical Company, Bartlesville, Oklahoma, USA
| | - Orson L Sydora
- Research and Technology, Chevron Phillips Chemical Company, Kingwood, Texas, USA
| | - Qing Yang
- Research and Technology, Chevron Phillips Chemical Company, Bartlesville, Oklahoma, USA
| | - Steven M Bischof
- Research and Technology, Chevron Phillips Chemical Company, Kingwood, Texas, USA
| | - Daniel H Ess
- Department of Chemistry and Biochemistry, Brigham Young University, Provo, Utah, USA
| |
Collapse
|
9
|
Computer-assisted design of asymmetric PNP ligands for ethylene tri-/tetramerization: A combined DFT and artificial neural network approach. J Catal 2023. [DOI: 10.1016/j.jcat.2023.01.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
|
10
|
Miller E, Mai BK, Read JA, Bell WC, Derrick JS, Liu P, Toste FD. A Combined DFT, Energy Decomposition, and Data Analysis Approach to Investigate the Relationship Between Noncovalent Interactions and Selectivity in a Flexible DABCOnium/Chiral Anion Catalyst System. ACS Catal 2022; 12:12369-12385. [PMID: 37215160 PMCID: PMC10195112 DOI: 10.1021/acscatal.2c03077] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Developing strategies to study reactivity and selectivity in flexible catalyst systems has become an important topic of research. Herein, we report a combined experimental and computational study aimed at understanding the mechanistic role of an achiral DABCOnium cofactor in a regio- and enantiodivergent bromocyclization reaction. It was found that electron-deficient aryl substituents enable rigidified transition states via an anion-π interaction with the catalyst, which drives the selectivity of the reaction. In contrast, electron-rich aryl groups on the DABCOnium result in significantly more flexible transition states, where interactions between the catalyst and substrate are more important. An analysis of not only the lowest-energy transition state structures but also an ensemble of low-energy transition state conformers via energy decomposition analysis and machine learning was crucial to revealing the dominant noncovalent interactions responsible for observed changes in selectivity in this flexible system.
Collapse
Affiliation(s)
- Edward Miller
- Department of Chemistry, University of California, Berkeley, California 94720, United States
| | - Binh Khanh Mai
- Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| | - Jacquelyne A Read
- Department of Chemistry, University of Utah, Salt Lake City, Utah 84112, United States
| | - William C Bell
- Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| | - Jeffrey S Derrick
- Department of Chemistry, University of California, Berkeley, California 94720, United States
| | - Peng Liu
- Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| | - F Dean Toste
- Department of Chemistry, University of California, Berkeley, California 94720, United States
| |
Collapse
|
11
|
Ismail I, Chantreau Majerus R, Habershon S. Graph-Driven Reaction Discovery: Progress, Challenges, and Future Opportunities. J Phys Chem A 2022; 126:7051-7069. [PMID: 36190262 PMCID: PMC9574932 DOI: 10.1021/acs.jpca.2c06408] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Graph-based descriptors, such as bond-order matrices and adjacency matrices, offer a simple and compact way of categorizing molecular structures; furthermore, such descriptors can be readily used to catalog chemical reactions (i.e., bond-making and -breaking). As such, a number of graph-based methodologies have been developed with the goal of automating the process of generating chemical reaction network models describing the possible mechanistic chemistry in a given set of reactant species. Here, we outline the evolution of these graph-based reaction discovery schemes, with particular emphasis on more recent methods incorporating graph-based methods with semiempirical and ab initio electronic structure calculations, minimum-energy path refinements, and transition state searches. Using representative examples from homogeneous catalysis and interstellar chemistry, we highlight how these schemes increasingly act as "virtual reaction vessels" for interrogating mechanistic questions. Finally, we highlight where challenges remain, including issues of chemical accuracy and calculation speeds, as well as the inherent challenge of dealing with the vast size of accessible chemical reaction space.
Collapse
Affiliation(s)
- Idil Ismail
- Department of Chemistry, University of Warwick, CoventryCV4 7AL, United Kingdom
| | | | - Scott Habershon
- Department of Chemistry, University of Warwick, CoventryCV4 7AL, United Kingdom
| |
Collapse
|
12
|
Small BL, Milner MF. Insights on the Mechanism for Ethylene Tetramerization. Organometallics 2022. [DOI: 10.1021/acs.organomet.2c00285] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Brooke L. Small
- Research & Technology, Chevron Phillips Chemical, 1862 Kingwood Drive, Kingwood, Texas 77339, United States
| | - Matthew F. Milner
- Research & Technology, Chevron Phillips Chemical, 1862 Kingwood Drive, Kingwood, Texas 77339, United States
| |
Collapse
|
13
|
Baek JW, Ko JH, Park JH, Park JY, Lee HJ, Seo YH, Lee J, Lee BY. α-Olefin Trimerization for Lubricant Base Oils with Modified Chevron–Phillips Ethylene Trimerization Catalysts. Organometallics 2022. [DOI: 10.1021/acs.organomet.2c00249] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Jun Won Baek
- Department of Molecular Science and Technology, Ajou University, Suwon 16499, South Korea
| | - Ji Hyeong Ko
- Department of Molecular Science and Technology, Ajou University, Suwon 16499, South Korea
| | - Jun Hyeong Park
- Department of Molecular Science and Technology, Ajou University, Suwon 16499, South Korea
| | - Ju Yong Park
- Department of Molecular Science and Technology, Ajou University, Suwon 16499, South Korea
| | - Hyun Ju Lee
- Department of Molecular Science and Technology, Ajou University, Suwon 16499, South Korea
| | - Yeong Hyun Seo
- Department of Molecular Science and Technology, Ajou University, Suwon 16499, South Korea
| | - Junseong Lee
- Department of Chemistry, Chonnam National University, Gwangju 61186, South Korea
| | - Bun Yeoul Lee
- Department of Molecular Science and Technology, Ajou University, Suwon 16499, South Korea
| |
Collapse
|
14
|
Fey N, Lynam JM. Computational mechanistic study in organometallic catalysis: Why prediction is still a challenge. WIRES COMPUTATIONAL MOLECULAR SCIENCE 2022. [DOI: 10.1002/wcms.1590] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Affiliation(s)
- Natalie Fey
- School of Chemistry University of Bristol, Cantock's Close Bristol UK
| | | |
Collapse
|
15
|
Lewis‐Atwell T, Townsend PA, Grayson MN. Machine learning activation energies of chemical reactions. WIRES COMPUTATIONAL MOLECULAR SCIENCE 2022. [DOI: 10.1002/wcms.1593] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Affiliation(s)
- Toby Lewis‐Atwell
- Department of Computer Science, Faculty of Science University of Bath Bath UK
| | - Piers A. Townsend
- Department of Chemistry, Faculty of Science University of Bath Bath UK
| | | |
Collapse
|
16
|
Nandy A, Duan C, Kulik HJ. Audacity of huge: overcoming challenges of data scarcity and data quality for machine learning in computational materials discovery. Curr Opin Chem Eng 2022. [DOI: 10.1016/j.coche.2021.100778] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
|
17
|
Esterhuizen JA, Goldsmith BR, Linic S. Interpretable machine learning for knowledge generation in heterogeneous catalysis. Nat Catal 2022. [DOI: 10.1038/s41929-022-00744-z] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
|
18
|
Matsuoka W, Harabuchi Y, Maeda S. Virtual Ligand-Assisted Screening Strategy to Discover Enabling Ligands for Transition Metal Catalysis. ACS Catal 2022. [DOI: 10.1021/acscatal.2c00267] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Wataru Matsuoka
- Department of Chemistry, Faculty of Science, Hokkaido University, Sapporo, Hokkaido 060-0810, Japan
- ERATO Maeda Artificial Intelligence for Chemical Reaction Design and Discovery Project, Hokkaido University, Sapporo, Hokkaido 060-0810, Japan
| | - Yu Harabuchi
- Department of Chemistry, Faculty of Science, Hokkaido University, Sapporo, Hokkaido 060-0810, Japan
- Institute for Chemical Reaction Design and Discovery (WPI-ICReDD), Hokkaido University, Sapporo, Hokkaido 001-0021, Japan
- ERATO Maeda Artificial Intelligence for Chemical Reaction Design and Discovery Project, Hokkaido University, Sapporo, Hokkaido 060-0810, Japan
| | - Satoshi Maeda
- Department of Chemistry, Faculty of Science, Hokkaido University, Sapporo, Hokkaido 060-0810, Japan
- Institute for Chemical Reaction Design and Discovery (WPI-ICReDD), Hokkaido University, Sapporo, Hokkaido 001-0021, Japan
- ERATO Maeda Artificial Intelligence for Chemical Reaction Design and Discovery Project, Hokkaido University, Sapporo, Hokkaido 060-0810, Japan
- Research and Services Division of Materials Data and Integrated System (MaDIS), National Institute for Materials Science (NIMS), Tsukuba, Ibaraki 305-0044, Japan
| |
Collapse
|
19
|
Maley SM, Steagall R, Lief GR, Buck RM, Yang Q, Sydora OL, Bischof SM, Ess DH. Computational Evaluation and Design of Polyethylene Zirconocene Catalysts with Noncovalent Dispersion Interactions. Organometallics 2022. [DOI: 10.1021/acs.organomet.1c00670] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Affiliation(s)
- Steven M. Maley
- Department of Chemistry and Biochemistry, Brigham Young University, Provo, Utah 84602, United States
| | - Robert Steagall
- Department of Chemistry and Biochemistry, Brigham Young University, Provo, Utah 84602, United States
| | - Graham R. Lief
- Research and Technology, Chevron Phillips Chemical Company LP, Highways 60 & 123, Bartlesville, Oklahoma 74003, United States
| | - Richard M. Buck
- Research and Technology, Chevron Phillips Chemical Company LP, Highways 60 & 123, Bartlesville, Oklahoma 74003, United States
| | - Qing Yang
- Research and Technology, Chevron Phillips Chemical Company LP, Highways 60 & 123, Bartlesville, Oklahoma 74003, United States
| | - Orson L. Sydora
- Research and Technology, Chevron Phillips Chemical Company LP, 1862, Kingwood Drive, Kingwood, Texas 77339, United States
| | - Steven M. Bischof
- Research and Technology, Chevron Phillips Chemical Company LP, 1862, Kingwood Drive, Kingwood, Texas 77339, United States
| | - Daniel H. Ess
- Department of Chemistry and Biochemistry, Brigham Young University, Provo, Utah 84602, United States
| |
Collapse
|
20
|
Duan C, Nandy A, Kulik HJ. Machine Learning for the Discovery, Design, and Engineering of Materials. Annu Rev Chem Biomol Eng 2022; 13:405-429. [PMID: 35320698 DOI: 10.1146/annurev-chembioeng-092320-120230] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Machine learning (ML) has become a part of the fabric of high-throughput screening and computational discovery of materials. Despite its increasingly central role, challenges remain in fully realizing the promise of ML. This is especially true for the practical acceleration of the engineering of robust materials and the development of design strategies that surpass trial and error or high-throughput screening alone. Depending on the quantity being predicted and the experimental data available, ML can either outperform physics-based modes, be used to accelerate such models, or be integrated with them to improve their performance. We cover recent advances in algorithms and in their application that are starting to make inroads toward (a) the discovery of new materials through large-scale enumerative screening, (b) the design of materials through identification of rules and principles that govern materials properties, and (c) the engineering of practical materials by satisfying multiple objectives. We conclude with opportunities for further advancement to realize ML as a widespread tool for practical computational materials design. Expected final online publication date for the Annual Review of Chemical and Biomolecular Engineering, Volume 13 is October 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
Collapse
Affiliation(s)
- Chenru Duan
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA; , , .,Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
| | - Aditya Nandy
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA; , , .,Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
| | - Heather J Kulik
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA; , ,
| |
Collapse
|
21
|
|
22
|
Harper DR, Nandy A, Arunachalam N, Duan C, Janet JP, Kulik HJ. Representations and strategies for transferable machine learning Improve model performance in chemical discovery. J Chem Phys 2022; 156:074101. [DOI: 10.1063/5.0082964] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Affiliation(s)
- Daniel R Harper
- Massachusetts Institute of Technology, United States of America
| | - Aditya Nandy
- Massachusetts Institute of Technology, United States of America
| | | | - Chenru Duan
- Massachusetts Institute of Technology, United States of America
| | | | - Heather J. Kulik
- Dept of Chemical Engineering, Massachusetts Institute of Technology, United States of America
| |
Collapse
|
23
|
Mukai M, Nagao K, Yamaguchi S, Ohmiya H. Molecular Field Analysis Using Computational-Screening Data in Asymmetric N-Heterocyclic Carbene-Copper Catalysis toward Data-driven in silico Catalyst Optimization. BULLETIN OF THE CHEMICAL SOCIETY OF JAPAN 2022. [DOI: 10.1246/bcsj.20210349] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Affiliation(s)
- Masakiyo Mukai
- Division of Pharmaceutical Sciences, Graduate School of Medical Sciences, Kanazawa University, Kakuma-machi, Kanazawa 920-1192, Japan
| | - Kazunori Nagao
- Division of Pharmaceutical Sciences, Graduate School of Medical Sciences, Kanazawa University, Kakuma-machi, Kanazawa 920-1192, Japan
| | - Shigeru Yamaguchi
- RIKEN Center for Sustainable Resource Science, RIKEN, 2-1 Hirosawa, Wako, Saitama 351-0198, Japan
| | - Hirohisa Ohmiya
- Division of Pharmaceutical Sciences, Graduate School of Medical Sciences, Kanazawa University, Kakuma-machi, Kanazawa 920-1192, Japan
- JST, PRESTO, 4-1-8 Honcho, Kawaguchi, Saitama 332-0012, Japan
| |
Collapse
|
24
|
Petit J, Magna L, Mézailles N. Alkene oligomerization via metallacycles: Recent advances and mechanistic insights. Coord Chem Rev 2022. [DOI: 10.1016/j.ccr.2021.214227] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
|
25
|
Farrar EHE, Grayson MN. Machine learning and semi-empirical calculations: a synergistic approach to rapid, accurate, and mechanism-based reaction barrier prediction. Chem Sci 2022; 13:7594-7603. [PMID: 35872815 PMCID: PMC9242013 DOI: 10.1039/d2sc02925a] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2022] [Accepted: 06/08/2022] [Indexed: 11/21/2022] Open
Abstract
A synergistic approach that combines machine learning with semi-empirical methods enables the fast and accurate prediction of DFT-quality reaction barriers, with mechanistic insights available from semi-empirical transition state geometries.
Collapse
Affiliation(s)
- Elliot H. E. Farrar
- Department of Chemistry, University of Bath, Claverton Down, Bath, BA2 7AY, UK
| | - Matthew N. Grayson
- Department of Chemistry, University of Bath, Claverton Down, Bath, BA2 7AY, UK
| |
Collapse
|
26
|
Morán‐González L, Pedregal JR, Besora M, Maseras F. Understanding the Binding Properties of N‐heterocyclic Carbenes through BDE Matrix App. Eur J Inorg Chem 2021. [DOI: 10.1002/ejic.202100932] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Affiliation(s)
- Lucía Morán‐González
- Institute of Chemical Research of Catalonia (ICIQ) The Barcelona Institute of Science and Technology Avgda. Països Catalans, 16 Tarragona 43007 Catalonia Spain
| | - Jaime Rodríguez‐Guerra Pedregal
- Institute of Chemical Research of Catalonia (ICIQ) The Barcelona Institute of Science and Technology Avgda. Països Catalans, 16 Tarragona 43007 Catalonia Spain
| | - Maria Besora
- Departament de Química Física i Inorgànica Universitat Rovira i Virgili c/Marcel⋅lí Domingo s/n Tarragona 43007 Catalonia Spain
| | - Feliu Maseras
- Institute of Chemical Research of Catalonia (ICIQ) The Barcelona Institute of Science and Technology Avgda. Països Catalans, 16 Tarragona 43007 Catalonia Spain
| |
Collapse
|
27
|
Alam F, Fan H, Dong C, Zhang J, Ma J, Chen Y, Jiang T. Chromium catalysts stabilized by alkylphosphanyl PNP ligands for selective ethylene tri-/tetramerization. J Catal 2021. [DOI: 10.1016/j.jcat.2021.09.025] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
28
|
Towards Data‐Driven Design of Asymmetric Hydrogenation of Olefins: Database and Hierarchical Learning. Angew Chem Int Ed Engl 2021. [DOI: 10.1002/ange.202106880] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
|
29
|
Preparation of Extremely Active Ethylene Tetramerization Catalyst [iPrN(PAr2)2−CrCl2]+[B(C6F5)4]– (Ar = −C6H4-p-SiR3). Catalysts 2021. [DOI: 10.3390/catal11091122] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Numerous efforts have been made to achieve “on-purpose” 1-octene production since Sasol discovered a Cr-based selective ethylene tetramerization catalyst in the early 2000s. By preparing a series of bis(phosphine) ligands iPrN(PAr2)2 where the Ar contains a bulky –SiR3 substituent (Ar = −C6H4-p-Si(nBu)3 (1), −C6H4-p-Si(1-hexyl)3 (2), −C6H4-p-Si(1-octyl)3 (3), −C6H4-p-Si(2-ethylhexyl)3 (4), −C6H4-p-Si(3,7-dimethyloctyl)3 (5)), we obtained an extremely active catalyst that meets the criteria for commercial utilization. The Cr complexes [iPrN(PAr2)2−CrCl2]+[B(C6F5)4]–, obtained by reacting ligands 1–5 with [(CH3CN)4CrCl2]+[B(C6F5)4]–, showed high activity exceeding 6000 kg/g-Cr/h, when combined with the inexpensive iBu3Al, thus avoiding the use of expensive modified methylaluminoxane (MMAO). The bulky –SiR3 substituents played a key role in the success of catalysis by blocking the formation of inactive species (Cr complexes coordinated by two iPrN(PAr2)2 ligands, that is, [(iPrN(PAr2)2)2−CrCl2]+[B(C6F5)4]–). Among the complexes prepared, [3−CrCl2]+[B(C6F5)4]– exhibited the highest activity (11,100 kg/g-Cr/h, 100 kg/g-catalyst) with high 1-octene selectivity (75 wt%) and, moreover, mitigated the generation of undesired > C10 fractions (10.7 wt%). A 10-g-scale synthesis of 3 was developed, as well as a facile and low-cost synthetic method for [(CH3CN)4CrCl2]+[B(C6F5)4]–.
Collapse
|
30
|
Automated Construction and Optimization Combined with Machine Learning to Generate Pt(II) Methane C–H Activation Transition States. Top Catal 2021. [DOI: 10.1007/s11244-021-01506-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
|
31
|
Nandy A, Duan C, Taylor MG, Liu F, Steeves AH, Kulik HJ. Computational Discovery of Transition-metal Complexes: From High-throughput Screening to Machine Learning. Chem Rev 2021; 121:9927-10000. [PMID: 34260198 DOI: 10.1021/acs.chemrev.1c00347] [Citation(s) in RCA: 70] [Impact Index Per Article: 23.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Transition-metal complexes are attractive targets for the design of catalysts and functional materials. The behavior of the metal-organic bond, while very tunable for achieving target properties, is challenging to predict and necessitates searching a wide and complex space to identify needles in haystacks for target applications. This review will focus on the techniques that make high-throughput search of transition-metal chemical space feasible for the discovery of complexes with desirable properties. The review will cover the development, promise, and limitations of "traditional" computational chemistry (i.e., force field, semiempirical, and density functional theory methods) as it pertains to data generation for inorganic molecular discovery. The review will also discuss the opportunities and limitations in leveraging experimental data sources. We will focus on how advances in statistical modeling, artificial intelligence, multiobjective optimization, and automation accelerate discovery of lead compounds and design rules. The overall objective of this review is to showcase how bringing together advances from diverse areas of computational chemistry and computer science have enabled the rapid uncovering of structure-property relationships in transition-metal chemistry. We aim to highlight how unique considerations in motifs of metal-organic bonding (e.g., variable spin and oxidation state, and bonding strength/nature) set them and their discovery apart from more commonly considered organic molecules. We will also highlight how uncertainty and relative data scarcity in transition-metal chemistry motivate specific developments in machine learning representations, model training, and in computational chemistry. Finally, we will conclude with an outlook of areas of opportunity for the accelerated discovery of transition-metal complexes.
Collapse
Affiliation(s)
- Aditya Nandy
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States.,Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Chenru Duan
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States.,Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Michael G Taylor
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Fang Liu
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Adam H Steeves
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Heather J Kulik
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| |
Collapse
|
32
|
Xu LC, Zhang SQ, Li X, Tang MJ, Xie PP, Hong X. Towards Data-driven Design of Asymmetric Hydrogenation of Olefins: Database and Hierarchical Learning. Angew Chem Int Ed Engl 2021; 60:22804-22811. [PMID: 34370892 DOI: 10.1002/anie.202106880] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2021] [Revised: 07/14/2021] [Indexed: 11/09/2022]
Abstract
Asymmetric hydrogenation of olefins is one of the most powerful asymmetric transformations in molecular synthesis. Although several privileged catalyst scaffolds are available, the catalyst development for asymmetric hydrogenation is still a time- and resource-consuming process due to the lack of predictive catalyst design strategy. Targeting the data-driven design of asymmetric catalysis, we herein report the development of a standardized database that contains the detailed information of over 12000 literature asymmetric hydrogenations of olefins. This database provides a valuable platform for the machine learning applications in asymmetric catalysis. Based on this database, we developed a hierarchical learning approach to achieve predictive machine leaning model using only dozens of enantioselectivity data with the target olefin, which offers a useful solution for the few-shot learning problem and will facilitate the reaction optimization with new olefin substrate in catalysis screening.
Collapse
Affiliation(s)
- Li-Cheng Xu
- Zhejiang University, Department of Chemistry, CHINA
| | | | - Xin Li
- Zhejiang University, Department of Chemistry, CHINA
| | | | - Pei-Pei Xie
- Zhejiang University, Department of Chemistry, CHINA
| | - Xin Hong
- Zhejiang University, Department of Chemistry, 38 Zheda Road, 310028, Hangzhou, CHINA
| |
Collapse
|
33
|
Abstract
Computational methods have emerged as a powerful tool to augment traditional experimental molecular catalyst design by providing useful predictions of catalyst performance and decreasing the time needed for catalyst screening. In this perspective, we discuss three approaches for computational molecular catalyst design: (i) the reaction mechanism-based approach that calculates all relevant elementary steps, finds the rate and selectivity determining steps, and ultimately makes predictions on catalyst performance based on kinetic analysis, (ii) the descriptor-based approach where physical/chemical considerations are used to find molecular properties as predictors of catalyst performance, and (iii) the data-driven approach where statistical analysis as well as machine learning (ML) methods are used to obtain relationships between available data/features and catalyst performance. Following an introduction to these approaches, we cover their strengths and weaknesses and highlight some recent key applications. Furthermore, we present an outlook on how the currently applied approaches may evolve in the near future by addressing how recent developments in building automated computational workflows and implementing advanced ML models hold promise for reducing human workload, eliminating human bias, and speeding up computational catalyst design at the same time. Finally, we provide our viewpoint on how some of the challenges associated with the up-and-coming approaches driven by automation and ML may be resolved.
Collapse
Affiliation(s)
- Ademola Soyemi
- Department of Chemical and Biological Engineering, The University of Alabama, Tuscaloosa, AL 35487, USA.
| | - Tibor Szilvási
- Department of Chemical and Biological Engineering, The University of Alabama, Tuscaloosa, AL 35487, USA.
| |
Collapse
|
34
|
Pablo‐García S, García‐Muelas R, Sabadell‐Rendón A, López N. Dimensionality reduction of complex reaction networks in heterogeneous catalysis: From l
inear‐scaling
relationships to statistical learning techniques. WIRES COMPUTATIONAL MOLECULAR SCIENCE 2021. [DOI: 10.1002/wcms.1540] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Affiliation(s)
- Sergio Pablo‐García
- Institute of Chemical Research of Catalonia The Barcelona Institute of Science and Technology Tarragona Spain
| | - Rodrigo García‐Muelas
- Institute of Chemical Research of Catalonia The Barcelona Institute of Science and Technology Tarragona Spain
| | - Albert Sabadell‐Rendón
- Institute of Chemical Research of Catalonia The Barcelona Institute of Science and Technology Tarragona Spain
| | - Núria López
- Institute of Chemical Research of Catalonia The Barcelona Institute of Science and Technology Tarragona Spain
| |
Collapse
|
35
|
Gallarati S, Fabregat R, Laplaza R, Bhattacharjee S, Wodrich MD, Corminboeuf C. Reaction-based machine learning representations for predicting the enantioselectivity of organocatalysts. Chem Sci 2021; 12:6879-6889. [PMID: 34123316 PMCID: PMC8153079 DOI: 10.1039/d1sc00482d] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2021] [Accepted: 04/01/2021] [Indexed: 12/12/2022] Open
Abstract
Hundreds of catalytic methods are developed each year to meet the demand for high-purity chiral compounds. The computational design of enantioselective organocatalysts remains a significant challenge, as catalysts are typically discovered through experimental screening. Recent advances in combining quantum chemical computations and machine learning (ML) hold great potential to propel the next leap forward in asymmetric catalysis. Within the context of quantum chemical machine learning (QML, or atomistic ML), the ML representations used to encode the three-dimensional structure of molecules and evaluate their similarity cannot easily capture the subtle energy differences that govern enantioselectivity. Here, we present a general strategy for improving molecular representations within an atomistic machine learning model to predict the DFT-computed enantiomeric excess of asymmetric propargylation organocatalysts solely from the structure of catalytic cycle intermediates. Mean absolute errors as low as 0.25 kcal mol-1 were achieved in predictions of the activation energy with respect to DFT computations. By virtue of its design, this strategy is generalisable to other ML models, to experimental data and to any catalytic asymmetric reaction, enabling the rapid screening of structurally diverse organocatalysts from available structural information.
Collapse
Affiliation(s)
- Simone Gallarati
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, Ecole Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne Switzerland
| | - Raimon Fabregat
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, Ecole Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne Switzerland
| | - Rubén Laplaza
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, Ecole Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne Switzerland
- National Center for Competence in Research-Catalysis (NCCR-Catalysis), Ecole Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne Switzerland
| | - Sinjini Bhattacharjee
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, Ecole Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne Switzerland
- Indian Institute of Science Education and Research Dr Homi Bhabha Rd, Ward No. 8, NCL Colony, Pashan Pune Maharashtra 411008 India
| | - Matthew D Wodrich
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, Ecole Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne Switzerland
- National Center for Competence in Research-Catalysis (NCCR-Catalysis), Ecole Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne Switzerland
| | - Clemence Corminboeuf
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, Ecole Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne Switzerland
- National Center for Competence in Research-Catalysis (NCCR-Catalysis), Ecole Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne Switzerland
- National Center for Computational Design and Discovery of Novel Materials (MARVEL), Ecole Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne Switzerland
| |
Collapse
|
36
|
Blann K, Bollmann A, Brown GM, Dixon JT, Elsegood MRJ, Raw CR, Smith MB, Tenza K, Willemse JA, Zweni P. Ethylene oligomerisation chromium catalysts with unsymmetrical PCNP ligands. Dalton Trans 2021; 50:4345-4354. [PMID: 33690749 DOI: 10.1039/d1dt00287b] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Chromium(iii) complexes of chelating diphosphines, with PNP or PCNCP backbones, are excellent catalysts for ethylene tetra- and/or trimerisations. A missing link within this ligand series are unsymmetric chelating diphosphines based on a PCNP scaffold. New bidentate PCNP ligands of the type Ph2PCH2N(R)PPh2 (R = 1-naphthyl or 5-quinoline groups, 2a-d) have been synthesised and shown to be extremely effective ligands for ethylene tri-/tetramerisations. Three representative tetracarbonyl Cr0 complexes bearing a single PN(R)P (5), PCN(R)P (6), or PCN(R)CP (7) diphosphine (R = 1-naphthyl) have been prepared from Cr(CO)4(η4-nbd) (nbd = norbornadiene). Furthermore we report a single crystal X-ray diffraction study of these compounds and discuss their structural parameters.
Collapse
Affiliation(s)
- Kevin Blann
- R & D Division, Sasol Technology (Pty) Ltd., 1 Klasie Havenga Road, Sasolburg, South Africa
| | - Annette Bollmann
- R & D Division, Sasol Technology (Pty) Ltd., 1 Klasie Havenga Road, Sasolburg, South Africa
| | - Gavin M Brown
- Department of Chemistry, Loughborough University, Loughborough, Leics LE11 3TU, UK.
| | - John T Dixon
- R & D Division, Sasol Technology (Pty) Ltd., 1 Klasie Havenga Road, Sasolburg, South Africa
| | - Mark R J Elsegood
- Department of Chemistry, Loughborough University, Loughborough, Leics LE11 3TU, UK.
| | - Christopher R Raw
- Department of Chemistry, Loughborough University, Loughborough, Leics LE11 3TU, UK.
| | - Martin B Smith
- Department of Chemistry, Loughborough University, Loughborough, Leics LE11 3TU, UK.
| | - Kenny Tenza
- R & D Division, Sasol Technology (Pty) Ltd., 1 Klasie Havenga Road, Sasolburg, South Africa
| | - J Alexander Willemse
- R & D Division, Sasol Technology (Pty) Ltd., 1 Klasie Havenga Road, Sasolburg, South Africa
| | - Pumza Zweni
- R & D Division, Sasol Technology (Pty) Ltd., 1 Klasie Havenga Road, Sasolburg, South Africa
| |
Collapse
|
37
|
Gallegos LC, Luchini G, St. John PC, Kim S, Paton RS. Importance of Engineered and Learned Molecular Representations in Predicting Organic Reactivity, Selectivity, and Chemical Properties. Acc Chem Res 2021; 54:827-836. [PMID: 33534534 DOI: 10.1021/acs.accounts.0c00745] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
Machine-readable chemical structure representations are foundational in all attempts to harness machine learning for the prediction of reactivities, selectivities, and chemical properties directly from molecular structure. The featurization of discrete chemical structures into a continuous vector space is a critical phase undertaken before model selection, and the development of new ways to quantitatively encode molecules is an active area of research. In this Account, we highlight the application and suitability of different representations, from expert-guided "engineered" descriptors to automatically "learned" features, in different prediction tasks relevant to organic and organometallic chemistry, where differing amounts of training data are available. These tasks include statistical models of stereo- and enantioselectivity, thermochemistry, and kinetics developed using experimental and quantum chemical data.The use of expert-guided molecular descriptors provides an opportunity to incorporate chemical knowledge, domain expertise, and physical constraints into statistical modeling. In applications to stereoselective organic and organometallic catalysis, where data sets may be relatively small and 3D-geometries and conformations play an important role, mechanistically informed features can be used successfully to obtain predictive statistical models that are also chemically interpretable. We provide an overview of several recent applications of this approach to obtain quantitative models for reactivity and selectivity, where topological descriptors, quantum mechanical calculations of electronic and steric properties, along with conformational ensembles, all feature as essential ingredients of the molecular representations used.Alternatively, more flexible, general-purpose molecular representations such as attributed molecular graphs can be used with machine learning approaches to learn the complex relationship between a structure and prediction target. This approach has the potential to out-perform more traditional representation methods such as "hand-crafted" molecular descriptors, particularly as data set sizes grow. One area where this is particularly relevant is in the use of large sets of quantum mechanical data to train quantitative structure-property relationships. A general approach toward curating useful data sets and training highly accurate graph neural network models is discussed in the context of organic bond dissociation enthalpies, where this strategy outperforms regression using precomputed descriptors.Finally, we describe how graph neural network predictions can be incorporated into mechanistically informed statistical models of chemical reactivity and selectivity. Once trained, this approach avoids the expensive computational overhead associated with quantum mechanical calculations, while maintaining chemical interpretability. We illustrate examples for which fast predictions of bond dissociation enthalpy and of the identities of radicals formed through cleavage of a molecule's weakest bond are used in simple physical models of site-selectivity and reactivity.
Collapse
Affiliation(s)
- Liliana C. Gallegos
- Department of Chemistry, Colorado State University, Fort Collins, Colorado 80523, United States
| | - Guilian Luchini
- Department of Chemistry, Colorado State University, Fort Collins, Colorado 80523, United States
| | - Peter C. St. John
- Biosciences Center, National Renewable Energy Laboratory, 15103 Denver West Parkway, Golden, Colorado 80401, United States
| | - Seonah Kim
- Biosciences Center, National Renewable Energy Laboratory, 15103 Denver West Parkway, Golden, Colorado 80401, United States
| | - Robert S. Paton
- Department of Chemistry, Colorado State University, Fort Collins, Colorado 80523, United States
| |
Collapse
|
38
|
Wang Z, Liu L, Ma X, Liu Y, Mi P, Liu Z, Zhang J. Effect of an additional donor on decene formation in ethylene oligomerization catalyzed by a Cr/PCCP system: a combined experimental and DFT study. Catal Sci Technol 2021. [DOI: 10.1039/d1cy00423a] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Cr catalyst based on a PCCP ligand shows high activity in ethylene oligomerization, giving 1-hexene and considerable C10 fraction. DFT calculation results are consistent with the experimental observations on the distribution of C10 isomers.
Collapse
Affiliation(s)
- Zhichao Wang
- Key Laboratory for Advanced Materials and Joint International Research Laboratory of Precision Chemistry and Molecular Engineering
- Feringa Nobel Prize Scientist Joint Research Center
- School of Chemistry and Molecular Engineering
- East China University of Science and Technology
- Shanghai 200237
| | - Lin Liu
- School of Chemical Engineering
- East China University of Science and Technology
- Shanghai 200237
- China
| | - Xufeng Ma
- Key Laboratory for Advanced Materials and Joint International Research Laboratory of Precision Chemistry and Molecular Engineering
- Feringa Nobel Prize Scientist Joint Research Center
- School of Chemistry and Molecular Engineering
- East China University of Science and Technology
- Shanghai 200237
| | - Yao Liu
- Key Laboratory for Advanced Materials and Joint International Research Laboratory of Precision Chemistry and Molecular Engineering
- Feringa Nobel Prize Scientist Joint Research Center
- School of Chemistry and Molecular Engineering
- East China University of Science and Technology
- Shanghai 200237
| | - Puke Mi
- Key Laboratory for Ultrafine Materials of Ministry of Education
- School of Materials Science and Engineering
- East China University of Science and Technology
- Shanghai 200237
- China
| | - Zhen Liu
- School of Chemical Engineering
- East China University of Science and Technology
- Shanghai 200237
- China
| | - Jun Zhang
- Key Laboratory for Advanced Materials and Joint International Research Laboratory of Precision Chemistry and Molecular Engineering
- Feringa Nobel Prize Scientist Joint Research Center
- School of Chemistry and Molecular Engineering
- East China University of Science and Technology
- Shanghai 200237
| |
Collapse
|
39
|
Yang LC, Li X, Zhang SQ, Hong X. Machine learning prediction of hydrogen atom transfer reactivity in photoredox-mediated C–H functionalization. Org Chem Front 2021. [DOI: 10.1039/d1qo01325d] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]
Abstract
DFT-computed structure–activity relationship data and physical organic descriptors create accurate machine learning model for HAT barrier prediction in photoredox-mediated HAT catalysis.
Collapse
Affiliation(s)
- Li-Cheng Yang
- Center of Chemistry for Frontier Technologies, Department of Chemistry, State Key Laboratory of Clean Energy Utilization, Zhejiang University, Hangzhou 310027, China
| | - Xin Li
- Center of Chemistry for Frontier Technologies, Department of Chemistry, State Key Laboratory of Clean Energy Utilization, Zhejiang University, Hangzhou 310027, China
| | - Shuo-Qing Zhang
- Center of Chemistry for Frontier Technologies, Department of Chemistry, State Key Laboratory of Clean Energy Utilization, Zhejiang University, Hangzhou 310027, China
| | - Xin Hong
- Center of Chemistry for Frontier Technologies, Department of Chemistry, State Key Laboratory of Clean Energy Utilization, Zhejiang University, Hangzhou 310027, China
| |
Collapse
|