1
|
Schoepfer A, Laplaza R, Wodrich MD, Waser J, Corminboeuf C. Reaction-Agnostic Featurization of Bidentate Ligands for Bayesian Ridge Regression of Enantioselectivity. ACS Catal 2024; 14:9302-9312. [PMID: 38933467 PMCID: PMC11197013 DOI: 10.1021/acscatal.4c02452] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2024] [Revised: 05/22/2024] [Accepted: 05/22/2024] [Indexed: 06/28/2024]
Abstract
Chiral ligands are important components in asymmetric homogeneous catalysis, but their synthesis and screening can be both time-consuming and resource-intensive. Data-driven approaches, in contrast to screening procedures based on intuition, have the potential to reduce the time and resources needed for reaction optimization by more rapidly identifying an ideal catalyst. These approaches, however, are often nontransferable and cannot be applied across different reactions. To overcome this drawback, we introduce a general featurization strategy for bidentate ligands that is coupled with an automated feature selection pipeline and Bayesian ridge regression to perform multivariate linear regression modeling. This approach, which is applicable to any reaction, incorporates electronic, steric, and topological features (rigidity/flexibility, branching, geometry, and constitution) and is well-suited for early stage ligand optimization. Using only small data sets, our workflow capably predicts the enantioselectivity of four metal-catalyzed asymmetric reactions. Uncertainty estimates provided by Bayesian ridge regression permit the use of Bayesian optimization to efficiently explore pools of prospective ligands. Finally, we constructed the BDL-Cu-2023 data set, composed of 312 bidentate ligands extracted from the Cambridge Structural Database, and screened it with this procedure to identify ligand candidates for a challenging asymmetric oxy-alkynylation reaction.
Collapse
Affiliation(s)
- Alexandre
A. Schoepfer
- Laboratory
for Computational Molecular Design, Institute of Chemical Sciences
and Engineering, École Polytechnique
Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland
- Laboratory
of Catalysis and Organic Synthesis, Institute of Chemical Sciences
and Engineering, École Polytechnique
Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland
- National
Center for Competence in Research-Catalysis (NCCR-Catalysis), École Polytechnique Fédérale
de Lausanne, 1015 Lausanne, Switzerland
| | - Ruben Laplaza
- Laboratory
for Computational Molecular Design, Institute of Chemical Sciences
and Engineering, École Polytechnique
Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland
- National
Center for Competence in Research-Catalysis (NCCR-Catalysis), École Polytechnique Fédérale
de Lausanne, 1015 Lausanne, Switzerland
| | - Matthew D. Wodrich
- Laboratory
for Computational Molecular Design, Institute of Chemical Sciences
and Engineering, École Polytechnique
Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland
- National
Center for Competence in Research-Catalysis (NCCR-Catalysis), École Polytechnique Fédérale
de Lausanne, 1015 Lausanne, Switzerland
| | - Jerome Waser
- Laboratory
of Catalysis and Organic Synthesis, Institute of Chemical Sciences
and Engineering, École Polytechnique
Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland
- National
Center for Competence in Research-Catalysis (NCCR-Catalysis), École Polytechnique Fédérale
de Lausanne, 1015 Lausanne, Switzerland
| | - Clemence Corminboeuf
- Laboratory
for Computational Molecular Design, Institute of Chemical Sciences
and Engineering, École Polytechnique
Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland
- National
Center for Competence in Research-Catalysis (NCCR-Catalysis), École Polytechnique Fédérale
de Lausanne, 1015 Lausanne, Switzerland
| |
Collapse
|
2
|
Tsutsui Y, Yanaka I, Takeda K, Kondo M, Takizawa S, Kojima R, Konishi A, Yasuda M. Selective recognition between aromatics and aliphatics by cage-shaped borates supported by a machine learning approach. Org Biomol Chem 2024; 22:4283-4291. [PMID: 38602393 DOI: 10.1039/d4ob00408f] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/12/2024]
Abstract
Selective recognition between hydrocarbon moieties is a longstanding issue. Although we developed a π-pocket Lewis acid catalyst with high selectivity for aromatic aldehydes over aliphatic ones, a general strategy for catalyst design remains elusive. As an approach that transfers the molecular recognition based on multiple cooperative non-covalent interactions within the π-pocket to a rational catalyst design, herein, we demonstrate Lewis acid catalysts showing improved selectivity through the support of an ensemble algorithm with random forest, Ada Boost, and XG Boost as a machine learning (ML) approach. Using 7963 explanatory variables extracted from model hetero-Diels-Alder reactions, the ensemble algorithm predicted the chemoselectivity of unlearned catalysts. Experiments confirmed the prediction. The proposed catalyst shows the highest selective recognition, reminiscing enzymatic catalytic activity. Additionally, a SHapley Additive exPlanations (SHAP) method suggested that the selectivity originates from the polarizability and three-dimensional size of the catalyst. This insight leads to rational design guidelines for Lewis acid catalysts with dispersion forces.
Collapse
Affiliation(s)
- Yuya Tsutsui
- Department of Applied Chemistry, Graduate School of Engineering, Osaka University, Suita, 565-0871, Japan.
| | - Issei Yanaka
- Department of Engineering, Graduate School of Integrated Science and Technology, Shizuoka University, Hamamatsu, 432-8561, Japan.
| | - Kazuhiro Takeda
- Department of Engineering, Graduate School of Integrated Science and Technology, Shizuoka University, Hamamatsu, 432-8561, Japan.
| | - Masaru Kondo
- School of Pharmaceutical Sciences, University of Shizuoka, 52-1 Yada, Suruga-ku, Shizuoka 422-8526, Japan
| | | | - Ryosuke Kojima
- Department of Biomedical Data Intelligence, Graduate School of Medicine, Kyoto University, Sakyo-ku, 606-8507, Japan
| | - Akihito Konishi
- Department of Applied Chemistry, Graduate School of Engineering, Osaka University, Suita, 565-0871, Japan.
- Innovative Catalysis Science Division, Institute for Open and Transdisciplinary Research Initiatives (ICS-OTRI), Osaka University, Suita, 565-0871, Japan
| | - Makoto Yasuda
- Department of Applied Chemistry, Graduate School of Engineering, Osaka University, Suita, 565-0871, Japan.
- Innovative Catalysis Science Division, Institute for Open and Transdisciplinary Research Initiatives (ICS-OTRI), Osaka University, Suita, 565-0871, Japan
| |
Collapse
|
3
|
Li B, Su S, Zhu C, Lin J, Hu X, Su L, Yu Z, Liao K, Chen H. A deep learning framework for accurate reaction prediction and its application on high-throughput experimentation data. J Cheminform 2023; 15:72. [PMID: 37568183 PMCID: PMC10422736 DOI: 10.1186/s13321-023-00732-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Accepted: 06/30/2023] [Indexed: 08/13/2023] Open
Abstract
In recent years, it has been seen that artificial intelligence (AI) starts to bring revolutionary changes to chemical synthesis. However, the lack of suitable ways of representing chemical reactions and the scarceness of reaction data has limited the wider application of AI to reaction prediction. Here, we introduce a novel reaction representation, GraphRXN, for reaction prediction. It utilizes a universal graph-based neural network framework to encode chemical reactions by directly taking two-dimension reaction structures as inputs. The GraphRXN model was evaluated by three publically available chemical reaction datasets and gave on-par or superior results compared with other baseline models. To further evaluate the effectiveness of GraphRXN, wet-lab experiments were carried out for the purpose of generating reaction data. GraphRXN model was then built on high-throughput experimentation data and a decent accuracy (R2 of 0.712) was obtained on our in-house data. This highlights that the GraphRXN model can be deployed in an integrated workflow which combines robotics and AI technologies for forward reaction prediction.
Collapse
Affiliation(s)
- Baiqing Li
- Guangzhou Laboratory, Guangzhou, 510005, Guangdong, China
| | - Shimin Su
- Guangzhou Laboratory, Guangzhou, 510005, Guangdong, China
| | - Chan Zhu
- Guangzhou Laboratory, Guangzhou, 510005, Guangdong, China
| | - Jie Lin
- Guangzhou Laboratory, Guangzhou, 510005, Guangdong, China
| | - Xinyue Hu
- Guangzhou Laboratory, Guangzhou, 510005, Guangdong, China
| | - Lebin Su
- Guangzhou Laboratory, Guangzhou, 510005, Guangdong, China
| | - Zhunzhun Yu
- Guangzhou Laboratory, Guangzhou, 510005, Guangdong, China
| | - Kuangbiao Liao
- Guangzhou Laboratory, Guangzhou, 510005, Guangdong, China.
| | - Hongming Chen
- Guangzhou Laboratory, Guangzhou, 510005, Guangdong, China.
| |
Collapse
|
4
|
Sanders MA, Chittari SS, Sherman N, Foley JR, Knight AS. Versatile Triphenylphosphine-Containing Polymeric Catalysts and Elucidation of Structure-Function Relationships. J Am Chem Soc 2023; 145:9686-9692. [PMID: 37079910 DOI: 10.1021/jacs.3c01092] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/22/2023]
Abstract
Synthetic polymers are a modular solution to bridging the two most common classes of catalysts: proteins and small molecules. Polymers offer the synthetic versatility of small-molecule catalysts while simultaneously having the ability to construct microenvironments mimicking those of natural proteins. We synthesized a panel of polymeric catalysts containing a novel triphenylphosphine acrylamide monomer and investigated how their properties impact the rate of a model Suzuki-Miyaura cross-coupling reaction. Systematic variation of polymer properties, such as the molecular weight, functional density, and comonomer identity, led to tunable reaction rates and solvent compatibility, including full conversion in an aqueous medium. Studies with bulkier substrates revealed connections between polymer parameters and reaction conditions that were further elucidated with a regression analysis. Some connections were substrate-specific, highlighting the value of the rapidly tunable polymer catalyst. Collectively, these results aid in building structure-function relationships to guide the development of polymer catalysts with tunable substrates and environmental compatibility.
Collapse
Affiliation(s)
- Matthew A Sanders
- Department of Chemistry, The University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| | - Supraja S Chittari
- Department of Chemistry, The University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| | - Nicole Sherman
- Department of Chemistry, The University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| | - Jack R Foley
- Department of Chemistry, The University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| | - Abigail S Knight
- Department of Chemistry, The University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| |
Collapse
|
5
|
Wang JW, Li Z, Liu D, Zhang JY, Lu X, Fu Y. Nickel-Catalyzed Remote Asymmetric Hydroalkylation of Alkenyl Ethers to Access Ethers of Chiral Dialkyl Carbinols. J Am Chem Soc 2023; 145:10411-10421. [PMID: 37127544 DOI: 10.1021/jacs.3c02950] [Citation(s) in RCA: 15] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
Site- and enantio-selective alkyl-alkyl bond formation is privileged in the retrosynthetic analysis due to the universality of sp3-hybridized carbon atoms in organic molecules. Herein, we report a nickel-catalyzed remote asymmetric hydroalkylation of alkenyl ethers via synchronous implementation of alkene isomerization and enantioselective C(sp3)-C(sp3) bond formation. Regression analysis of catalyst structure-activity relationships accelerates the rational ligand modification through modular regulation. This reaction has several advantages for synthesizing chiral dialkyl carbinols and their ether derivatives, including the broad substrate scope, good functional group tolerance, excellent regioselectivity (>20:1 regioisomeric ratio), and high enantioselectivity (up to 95% enantiomeric excess).
Collapse
Affiliation(s)
- Jia-Wang Wang
- Hefei National Research Center for Physical Sciences at the Microscale, iChEM, CAS Key Laboratory of Urban Pollutant Conversion, Anhui Province Key Laboratory of Biomass Clean Energy, University of Science and Technology of China, 230026 Hefei, China
- School of Plant Protection, Anhui Agricultural University, Hefei, Anhui 230036, China
| | - Zhen Li
- Hefei National Research Center for Physical Sciences at the Microscale, iChEM, CAS Key Laboratory of Urban Pollutant Conversion, Anhui Province Key Laboratory of Biomass Clean Energy, University of Science and Technology of China, 230026 Hefei, China
| | - Deguang Liu
- Hefei National Research Center for Physical Sciences at the Microscale, iChEM, CAS Key Laboratory of Urban Pollutant Conversion, Anhui Province Key Laboratory of Biomass Clean Energy, University of Science and Technology of China, 230026 Hefei, China
| | - Jun-Yang Zhang
- Hefei National Research Center for Physical Sciences at the Microscale, iChEM, CAS Key Laboratory of Urban Pollutant Conversion, Anhui Province Key Laboratory of Biomass Clean Energy, University of Science and Technology of China, 230026 Hefei, China
| | - Xi Lu
- Key Laboratory of Precision and Intelligent Chemistry, School of Chemistry and Materials Science, University of Science and Technology of China, Hefei, Anhui 230026, China
| | - Yao Fu
- Hefei National Research Center for Physical Sciences at the Microscale, iChEM, CAS Key Laboratory of Urban Pollutant Conversion, Anhui Province Key Laboratory of Biomass Clean Energy, University of Science and Technology of China, 230026 Hefei, China
| |
Collapse
|
6
|
Liu D, Xu Z, Lu X, Yu H, Fu Y. Linear Regression Model for Predicting Allyl Alcohol C–O Bond Activity under Palladium Catalysis. ACS Catal 2022. [DOI: 10.1021/acscatal.2c03847] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- DeGuang Liu
- Hefei National Laboratory for Physical Sciences at the Microscale, CAS Key Laboratory of Urban Pollutant Conversion, Anhui Province Key Laboratory of Biomass Clean Energy, iChEM, Institute of Energy, Hefei Comprehensive National Science Center, University of Science and Technology of China, Hefei230026, China
| | - ZheYuan Xu
- Hefei National Laboratory for Physical Sciences at the Microscale, CAS Key Laboratory of Urban Pollutant Conversion, Anhui Province Key Laboratory of Biomass Clean Energy, iChEM, Institute of Energy, Hefei Comprehensive National Science Center, University of Science and Technology of China, Hefei230026, China
| | - Xi Lu
- Hefei National Laboratory for Physical Sciences at the Microscale, CAS Key Laboratory of Urban Pollutant Conversion, Anhui Province Key Laboratory of Biomass Clean Energy, iChEM, Institute of Energy, Hefei Comprehensive National Science Center, University of Science and Technology of China, Hefei230026, China
| | - HaiZhu Yu
- Department of Chemistry, Center for Atomic Engineering of Advanced Materials, Anhui Provence Key Laboratory of Chemistry for Inorganic/Organic Hybrid Functionalized Materials, Anhui University, Hefei230601, China
| | - Yao Fu
- Hefei National Laboratory for Physical Sciences at the Microscale, CAS Key Laboratory of Urban Pollutant Conversion, Anhui Province Key Laboratory of Biomass Clean Energy, iChEM, Institute of Energy, Hefei Comprehensive National Science Center, University of Science and Technology of China, Hefei230026, China
| |
Collapse
|
7
|
Apolinar O, Kang T, Alturaifi TM, Bedekar PG, Rubel CZ, Derosa J, Sanchez BB, Wong QN, Sturgell EJ, Chen JS, Wisniewski SR, Liu P, Engle KM. Three-Component Asymmetric Ni-Catalyzed 1,2-Dicarbofunctionalization of Unactivated Alkenes via Stereoselective Migratory Insertion. J Am Chem Soc 2022; 144:19337-19343. [PMID: 36222701 DOI: 10.1021/jacs.2c06636] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
An asymmetric 1,2-dicarbofunctionalization of unactivated alkenes with aryl iodides and aryl/alkenylboronic esters under nickel/bioxazoline catalysis is disclosed. A wide array of aryl and alkenyl nucleophiles are tolerated, furnishing the products in good yield and with high enantioselectivity. In addition to terminal alkenes, 1,2-disubstituted internal alkenes participate in the reaction, establishing two contiguous stereocenters with high diastereoselectivity and moderate enantioselectivity. A combination of experimental and computational techniques shed light on the mechanism of the catalytic transformation, pointing to a closed-shell pathway with an enantiodetermining migratory insertion step, where stereoinduction arises from synergistic interactions between the sterically bulky achiral sulfonamide directing group and the hemilabile bidentate ligand.
Collapse
Affiliation(s)
- Omar Apolinar
- Department of Chemistry, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, California 92037, United States
| | - Taeho Kang
- Department of Chemistry, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, California 92037, United States
| | - Turki M Alturaifi
- Department of Chemistry, University of Pittsburgh, 219 Parkman Avenue, Pittsburgh, Pennsylvania 15260, United States
| | - Pranali G Bedekar
- Department of Chemistry, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, California 92037, United States
| | - Camille Z Rubel
- Department of Chemistry, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, California 92037, United States
| | - Joseph Derosa
- Department of Chemistry, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, California 92037, United States
| | - Brittany B Sanchez
- Department of Chemistry, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, California 92037, United States
| | - Quynh Nguyen Wong
- Department of Chemistry, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, California 92037, United States
| | - Emily J Sturgell
- Department of Chemistry, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, California 92037, United States
| | - Jason S Chen
- Department of Chemistry, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, California 92037, United States
| | - Steven R Wisniewski
- Chemical Process Development Bristol Myers Squibb, One Squibb Drive, New Brunswick, New Jersey 08903, United States
| | - Peng Liu
- Department of Chemistry, University of Pittsburgh, 219 Parkman Avenue, Pittsburgh, Pennsylvania 15260, United States
| | - Keary M Engle
- Department of Chemistry, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, California 92037, United States
| |
Collapse
|
8
|
Prediction and Analysis of Financial Default Loan Behavior Based on Machine Learning Model. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:7907210. [PMID: 36238663 PMCID: PMC9552691 DOI: 10.1155/2022/7907210] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/13/2022] [Revised: 08/21/2022] [Accepted: 08/25/2022] [Indexed: 11/18/2022]
Abstract
In recent years, the increase of customer loan risk and the aggravation of the epidemic have led to the increase of customer default risk. Therefore, identifying high-risk customers has become an important research hotspot for banks. The customer's credit is the standard to evaluate the loan amount and interest rate, and the ability to quickly identify customer information has become a research hotspot. Based on the bank credit application scenario, this paper realizes function extraction and data processing for customer basic attribute data and download transaction data. Then, a linear regression model with penalty and a neural network prediction model are proposed to improve the accuracy of bankruptcy assessment and achieve local optimization. In this way, the implicit risk prediction and control of customer credit are improved, and the default risk of bank loans is significantly reduced. According to the characteristics of the collected sample data, the most appropriate penalty linear regression prediction algorithm is selected and the experimental analysis is carried out to improve the risk management level of banks. The experimental results show that the improved logistic regression and neural network model has obvious advantages in the prediction effect for four models.
Collapse
|
9
|
Mondal P, Pal R, Pal AK, Das S, Misra A, Datta A. Understanding the Regioselectivity of Ion-Pair-Assisted Meta-Selective C(sp 2)-H Activation in Conformationally Flexible Arylammonium Salts. J Org Chem 2022; 87:9222-9231. [PMID: 35771188 DOI: 10.1021/acs.joc.2c00957] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The lack of directionality and the long-range nature of Coulomb interactions have been a bottleneck to achieve chemically precise C-H activation using ion-pairs. Recent report by Phipps and co-workers of the ion-pair-directed regioselective Iridium-catalyzed borylation opens a new direction toward harnessing noncovalent interactions for C-H activation. In this article, the mechanism and specific role of ion-pairing are investigated using density functional theory (DFT). Computational studies reveal that meta C-H activation is kinetically more favorable than the para analogue due to stronger electrostatic interactions between the ion-pairs in closer proximity [d(NMe3+···SO3-)TSP1m = 3.93 Å versus d(NMe3+···SO3-)TSP1p = 4.30 Å]. The electrostatic interactions overwhelm the Pauli repulsion and distortion interactions incurred in bringing the oppositely charged ions in close contact for the rate-limiting meta transition state (TSP1m). Multiple linear regression shows that the free energies of activation correlate well with descriptors like the charge densities on the meta carbon and Ir atom along with that on the cation and anion with R2 = 0.74. Tuned range-separated DFT calculations demonstrate accurately the localization of charge separation in the reactant complex and transition state for the meta selectivity.
Collapse
Affiliation(s)
- Partha Mondal
- School of Chemical Sciences, Indian Association for the Cultivation of Science, Jadavpur, Kolkata 700032, West Bengal, India
| | - Rapti Pal
- Department of Chemistry, Indian Institute of Science Education and Research (IISER), Pune 411008, India
| | - Arun K Pal
- School of Chemical Sciences, Indian Association for the Cultivation of Science, Jadavpur, Kolkata 700032, West Bengal, India
| | - Soumik Das
- Department of Chemistry, University of North Bengal, Dist-Darjeeling 734013, India
| | - Anirban Misra
- Department of Chemistry, University of North Bengal, Dist-Darjeeling 734013, India
| | - Ayan Datta
- School of Chemical Sciences, Indian Association for the Cultivation of Science, Jadavpur, Kolkata 700032, West Bengal, India
| |
Collapse
|
10
|
Lustosa DM, Milo A. Mechanistic Inference from Statistical Models at Different Data-Size Regimes. ACS Catal 2022. [DOI: 10.1021/acscatal.2c01741] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Danilo M. Lustosa
- Department of Chemistry, Ben-Gurion University of the Negev, Beer Sheva 84105, Israel
| | - Anat Milo
- Department of Chemistry, Ben-Gurion University of the Negev, Beer Sheva 84105, Israel
| |
Collapse
|
11
|
Yang L, Zhu L, Zhang S, Hong X. Machine Learning Prediction of
Structure‐Performance
Relationship in Organic Synthesis. CHINESE J CHEM 2022. [DOI: 10.1002/cjoc.202200039] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Affiliation(s)
- Li‐Cheng Yang
- Center of Chemistry for Frontier Technologies, Department of Chemistry, State Key Laboratory of Clean Energy Utilization, Zhejiang University Hangzhou Zhejiang 310027 China
| | - Lu‐Jing Zhu
- Center of Chemistry for Frontier Technologies, Department of Chemistry, State Key Laboratory of Clean Energy Utilization, Zhejiang University Hangzhou Zhejiang 310027 China
| | - Shuo‐Qing Zhang
- Center of Chemistry for Frontier Technologies, Department of Chemistry, State Key Laboratory of Clean Energy Utilization, Zhejiang University Hangzhou Zhejiang 310027 China
| | - Xin Hong
- Center of Chemistry for Frontier Technologies, Department of Chemistry, State Key Laboratory of Clean Energy Utilization, Zhejiang University Hangzhou Zhejiang 310027 China
- Beijing National Laboratory for Molecular Sciences, Zhongguancun North First Street NO. 2 Beijing 100190 China
- Key Laboratory of Precise Synthesis of Functional Molecules of Zhejiang Province, School of Science, Westlake University, 18 Shilongshan Road Hangzhou Zhejiang 310024 China
| |
Collapse
|
12
|
Betinol IO, Reid JP. A predictive and mechanistic statistical modelling workflow for improving decision making in organic synthesis and catalysis. Org Biomol Chem 2022; 20:6012-6018. [PMID: 35389396 DOI: 10.1039/d2ob00272h] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
The application of multivariate linear regression models has been widely utilized as a strategy to streamline the reaction optimization process. While these tools likely provide relatively safe predictions, embedding a method for forecasting the probability of achieving the desired reaction outcome would be valuable for streamlining the identification of promising structures with the best chance of success. Herein, we present a workflow that predicts the probability that a reaction will be successful and is easy and quick to apply. We show that this probabilistic framework can effectively differentiate between predictions often indistinguishable by multivariate linear regression analysis. Moreover, these techniques can enhance the development of mechanistically informative correlations by producing more direct pathways for molecular development and design. Overall, we anticipate this protocol will be generally applicable and useful for accelerating successful chemical discovery.
Collapse
Affiliation(s)
- Isaiah O Betinol
- Department of Chemistry, University of British Columbia, Vancouver, British Columbia V6T 1Z1, Canada.
| | - Jolene P Reid
- Department of Chemistry, University of British Columbia, Vancouver, British Columbia V6T 1Z1, Canada.
| |
Collapse
|
13
|
Lee W, Kim J, Kim H, Back S. Catalytic Activity Trends of Pyrite Transition Metal Dichalcogenides for Oxygen Reduction and Evolution. Phys Chem Chem Phys 2022; 24:19911-19918. [DOI: 10.1039/d2cp01518h] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Transition metal dichalcogenides (TMDs) have been considered as promising materials for oxygen reduction reaction (ORR) and oxygen evolution reaction (OER) electrocatalysis. While there have been numerous studies focusing on layered...
Collapse
|
14
|
Morán-González L, Besora M, Maseras F. Seeking the Optimal Descriptor for S N2 Reactions through Statistical Analysis of Density Functional Theory Results. J Org Chem 2021; 87:363-372. [PMID: 34935370 DOI: 10.1021/acs.joc.1c02387] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Bimolecular nucleophilic substitution is one of the fundamental reactions in organic chemistry, yet there is still knowledge to be gained on the role of the nucleophile and the substrate. A statistical treatment of over 600 density functional theory (DFT)-computed barriers for bimolecular nucleophilic substitution at methyl derivatives (SN2@C) leads to the identification of numerical descriptors that best represent the entering and leaving ability of 26 different nucleophiles. The treatment is based on singular value decomposition (SVD) of a matrix of computed energy barriers. The current work represents the extension to a problem of reactivity of the hidden descriptor methodology that we had previously developed for the thermodynamic problem of bond dissociation energies in transition-metal complexes. The analysis of the results shows that a single descriptor is sufficient. This hidden descriptor has different values for nucleophilic and leaving abilities and, contrary to expectation, does not correlate especially well with either frontier molecular orbital descriptors or solvation descriptors. In contrast, it correlates with other thermodynamic and geometric parameters. This statistical procedure can be in principle extended to additional chemical fragments and other reactions.
Collapse
Affiliation(s)
- Lucía Morán-González
- Institute of Chemical Research of Catalonia (ICIQ), The Barcelona Institute of Science and Technology, Avgda. Països Catalans, 16, 43007 Tarragona, Catalonia, Spain
| | - Maria Besora
- Departament de Química Física i Inorgànica, Universitat Rovira i Virgili, c/Marcel·lí Domingo s/n, 43007 Tarragona, Catalonia, Spain
| | - Feliu Maseras
- Institute of Chemical Research of Catalonia (ICIQ), The Barcelona Institute of Science and Technology, Avgda. Països Catalans, 16, 43007 Tarragona, Catalonia, Spain
| |
Collapse
|
15
|
A Smart Helmet-Based PLS-BPNN Error Compensation Model for Infrared Body Temperature Measurement of Construction Workers during COVID-19. MATHEMATICS 2021. [DOI: 10.3390/math9212808] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
In the context of the long-term coexistence between COVID-19 and human society, the implementation of personnel health monitoring in construction sites has become one of the urgent needs of current construction management. The installation of infrared temperature sensors on the helmets required to be worn by construction personnel to track and monitor their body temperature has become a relatively inexpensive and reliable means of epidemic prevention and control, but the accuracy of measuring body temperature has always been a problem. This study developed a smart helmet equipped with an infrared temperature sensor and conducted a simulated construction experiment to collect data of temperature and its influencing factors in indoor and outdoor construction operation environments. Then, a Partial Least Square–Back Propagation Neural Network (PLS-BPNN) temperature error compensation model was established to correct the temperature measurement results of the smart helmet. The temperature compensation effects of different models were also compared, including PLS-BPNN with Least Square Regression (LSR), Partial Least Square Regression (PLSR), and single Back Propagation Neural Network (BPNN) models. The results showed that the PLS-BPNN model had higher accuracy and reliability, and the determination coefficient of the model was 0.99377. After using PLS-BPNN model for compensation, the relative average error of infrared body temperature was reduced by 2.745 °C and RMSE was reduced by 0.9849. The relative error range of infrared body temperature detection was only 0.005~0.143 °C.
Collapse
|
16
|
Towards Data‐Driven Design of Asymmetric Hydrogenation of Olefins: Database and Hierarchical Learning. Angew Chem Int Ed Engl 2021. [DOI: 10.1002/ange.202106880] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
|
17
|
Abstract
Given the importance of catalysts in the chemical industry, they have been extensively investigated by experimental and numerical methods. With the development of computational algorithms and computer hardware, large-scale simulations have enabled influential studies with more atomic details reflecting microscopic mechanisms. This review provides a comprehensive summary of recent developments in molecular dynamics, including ab initio molecular dynamics and reaction force-field molecular dynamics. Recent research on both approaches to catalyst calculations is reviewed, including growth, dehydrogenation, hydrogenation, oxidation reactions, bias, and recombination of carbon materials that can guide catalyst calculations. Machine learning has attracted increasing interest in recent years, and its combination with the field of catalysts has inspired promising development approaches. Its applications in machine learning potential, catalyst design, performance prediction, structure optimization, and classification have been summarized in detail. This review hopes to shed light and perspective on ML approaches in catalysts.
Collapse
|
18
|
Shoja A, Zhai J, Reid JP. Comprehensive Stereochemical Models for Selectivity Prediction in Diverse Chiral Phosphate-Catalyzed Reaction Space. ACS Catal 2021. [DOI: 10.1021/acscatal.1c03520] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Affiliation(s)
- Ali Shoja
- Department of Chemistry, University of British Columbia, 2036 Main Mall, Vancouver, British Columbia V6T 1Z1, Canada
| | - Jianyu Zhai
- Department of Chemistry, University of British Columbia, 2036 Main Mall, Vancouver, British Columbia V6T 1Z1, Canada
| | - Jolene P. Reid
- Department of Chemistry, University of British Columbia, 2036 Main Mall, Vancouver, British Columbia V6T 1Z1, Canada
| |
Collapse
|
19
|
Xu LC, Zhang SQ, Li X, Tang MJ, Xie PP, Hong X. Towards Data-driven Design of Asymmetric Hydrogenation of Olefins: Database and Hierarchical Learning. Angew Chem Int Ed Engl 2021; 60:22804-22811. [PMID: 34370892 DOI: 10.1002/anie.202106880] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2021] [Revised: 07/14/2021] [Indexed: 11/09/2022]
Abstract
Asymmetric hydrogenation of olefins is one of the most powerful asymmetric transformations in molecular synthesis. Although several privileged catalyst scaffolds are available, the catalyst development for asymmetric hydrogenation is still a time- and resource-consuming process due to the lack of predictive catalyst design strategy. Targeting the data-driven design of asymmetric catalysis, we herein report the development of a standardized database that contains the detailed information of over 12000 literature asymmetric hydrogenations of olefins. This database provides a valuable platform for the machine learning applications in asymmetric catalysis. Based on this database, we developed a hierarchical learning approach to achieve predictive machine leaning model using only dozens of enantioselectivity data with the target olefin, which offers a useful solution for the few-shot learning problem and will facilitate the reaction optimization with new olefin substrate in catalysis screening.
Collapse
Affiliation(s)
- Li-Cheng Xu
- Zhejiang University, Department of Chemistry, CHINA
| | | | - Xin Li
- Zhejiang University, Department of Chemistry, CHINA
| | | | - Pei-Pei Xie
- Zhejiang University, Department of Chemistry, CHINA
| | - Xin Hong
- Zhejiang University, Department of Chemistry, 38 Zheda Road, 310028, Hangzhou, CHINA
| |
Collapse
|
20
|
Crawford JM, Kingston C, Toste FD, Sigman MS. Data Science Meets Physical Organic Chemistry. Acc Chem Res 2021; 54:10.1021/acs.accounts.1c00285. [PMID: 34351757 PMCID: PMC9078128 DOI: 10.1021/acs.accounts.1c00285] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
ConspectusAt the heart of synthetic chemistry is the holy grail of predictable catalyst design. In particular, researchers involved in reaction development in asymmetric catalysis have pursued a variety of strategies toward this goal. This is driven by both the pragmatic need to achieve high selectivities and the inability to readily identify why a certain catalyst is effective for a given reaction. While empiricism and intuition have dominated the field of asymmetric catalysis since its inception, enantioselectivity offers a mechanistically rich platform to interrogate catalyst-structure response patterns that explain the performance of a particular catalyst or substrate.In the early stages of an asymmetric reaction development campaign, the overarching mechanism of the reaction, catalyst speciation, the turnover limiting step, and many other details are unknown or posited based on related reactions. Considering the unclear details leading to a successful reaction, initial enantioselectivity data are often used to intuitively guide the ultimate direction of optimization. However, if the conditions of the Curtin-Hammett principle are satisfied, then measured enantioselectivity can be directly connected to the ensemble of diastereomeric transition states (TSs) that lead to the enantiomeric products, and the associated free energy difference between competing TSs (ΔΔG⧧ = -RT ln[(S)/(R)], where (S) and (R) represent the concentrations of the enantiomeric products). We, and others, speculated that this important piece of information can be leveraged to guide reaction optimization in a quantitative way.Although traditional linear free energy relationships (LFERs), such as Hammett plots, have been used to illuminate important mechanistic features, we sought to develop data science derived tools to expand the power of LFERs in order to describe complex reactions frequently encountered in modern asymmetric catalysis. Specifically, we investigated whether enantioselectivity data from a reaction can be quantitatively connected to the attributes of reaction components, such as catalyst and substrate structural features, to harness data for asymmetric catalyst design.In this context, we developed a workflow to relate computationally derived features of reaction components to enantioselectivity using data science tools. The mathematical representation of molecules can incorporate many aspects of a transformation, such as molecular features from substrate, product, catalyst, and proposed transition states. Statistical models relating these features to reaction outputs can be used for various tasks, such as performance prediction of untested molecules. Perhaps most importantly, statistical models can guide the generation of mechanistic hypotheses that are embedded within complex patterns of reaction responses. Overall, merging traditional physical organic experiments with statistical modeling techniques creates a feedback loop that enables both evaluation of multiple mechanistic hypotheses and future catalyst design. In this Account, we highlight the evolution and application of this approach in the context of a collaborative program based on chiral phosphoric acid catalysts (CPAs) in asymmetric catalysis.
Collapse
Affiliation(s)
- Jennifer M Crawford
- Department of Chemistry, University of Utah, 315 S. 1400 E., Salt Lake City, Utah 84112, United States
| | - Cian Kingston
- Department of Chemistry, University of Utah, 315 S. 1400 E., Salt Lake City, Utah 84112, United States
| | - F Dean Toste
- Department of Chemistry, University of California, Berkeley, California 94720, United States
| | - Matthew S Sigman
- Department of Chemistry, University of Utah, 315 S. 1400 E., Salt Lake City, Utah 84112, United States
| |
Collapse
|
21
|
Abstract
Computational methods have emerged as a powerful tool to augment traditional experimental molecular catalyst design by providing useful predictions of catalyst performance and decreasing the time needed for catalyst screening. In this perspective, we discuss three approaches for computational molecular catalyst design: (i) the reaction mechanism-based approach that calculates all relevant elementary steps, finds the rate and selectivity determining steps, and ultimately makes predictions on catalyst performance based on kinetic analysis, (ii) the descriptor-based approach where physical/chemical considerations are used to find molecular properties as predictors of catalyst performance, and (iii) the data-driven approach where statistical analysis as well as machine learning (ML) methods are used to obtain relationships between available data/features and catalyst performance. Following an introduction to these approaches, we cover their strengths and weaknesses and highlight some recent key applications. Furthermore, we present an outlook on how the currently applied approaches may evolve in the near future by addressing how recent developments in building automated computational workflows and implementing advanced ML models hold promise for reducing human workload, eliminating human bias, and speeding up computational catalyst design at the same time. Finally, we provide our viewpoint on how some of the challenges associated with the up-and-coming approaches driven by automation and ML may be resolved.
Collapse
Affiliation(s)
- Ademola Soyemi
- Department of Chemical and Biological Engineering, The University of Alabama, Tuscaloosa, AL 35487, USA.
| | - Tibor Szilvási
- Department of Chemical and Biological Engineering, The University of Alabama, Tuscaloosa, AL 35487, USA.
| |
Collapse
|