1
|
Schoepfer A, Laplaza R, Wodrich MD, Waser J, Corminboeuf C. Reaction-Agnostic Featurization of Bidentate Ligands for Bayesian Ridge Regression of Enantioselectivity. ACS Catal 2024; 14:9302-9312. [PMID: 38933467 PMCID: PMC11197013 DOI: 10.1021/acscatal.4c02452] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2024] [Revised: 05/22/2024] [Accepted: 05/22/2024] [Indexed: 06/28/2024]
Abstract
Chiral ligands are important components in asymmetric homogeneous catalysis, but their synthesis and screening can be both time-consuming and resource-intensive. Data-driven approaches, in contrast to screening procedures based on intuition, have the potential to reduce the time and resources needed for reaction optimization by more rapidly identifying an ideal catalyst. These approaches, however, are often nontransferable and cannot be applied across different reactions. To overcome this drawback, we introduce a general featurization strategy for bidentate ligands that is coupled with an automated feature selection pipeline and Bayesian ridge regression to perform multivariate linear regression modeling. This approach, which is applicable to any reaction, incorporates electronic, steric, and topological features (rigidity/flexibility, branching, geometry, and constitution) and is well-suited for early stage ligand optimization. Using only small data sets, our workflow capably predicts the enantioselectivity of four metal-catalyzed asymmetric reactions. Uncertainty estimates provided by Bayesian ridge regression permit the use of Bayesian optimization to efficiently explore pools of prospective ligands. Finally, we constructed the BDL-Cu-2023 data set, composed of 312 bidentate ligands extracted from the Cambridge Structural Database, and screened it with this procedure to identify ligand candidates for a challenging asymmetric oxy-alkynylation reaction.
Collapse
Affiliation(s)
- Alexandre
A. Schoepfer
- Laboratory
for Computational Molecular Design, Institute of Chemical Sciences
and Engineering, École Polytechnique
Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland
- Laboratory
of Catalysis and Organic Synthesis, Institute of Chemical Sciences
and Engineering, École Polytechnique
Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland
- National
Center for Competence in Research-Catalysis (NCCR-Catalysis), École Polytechnique Fédérale
de Lausanne, 1015 Lausanne, Switzerland
| | - Ruben Laplaza
- Laboratory
for Computational Molecular Design, Institute of Chemical Sciences
and Engineering, École Polytechnique
Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland
- National
Center for Competence in Research-Catalysis (NCCR-Catalysis), École Polytechnique Fédérale
de Lausanne, 1015 Lausanne, Switzerland
| | - Matthew D. Wodrich
- Laboratory
for Computational Molecular Design, Institute of Chemical Sciences
and Engineering, École Polytechnique
Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland
- National
Center for Competence in Research-Catalysis (NCCR-Catalysis), École Polytechnique Fédérale
de Lausanne, 1015 Lausanne, Switzerland
| | - Jerome Waser
- Laboratory
of Catalysis and Organic Synthesis, Institute of Chemical Sciences
and Engineering, École Polytechnique
Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland
- National
Center for Competence in Research-Catalysis (NCCR-Catalysis), École Polytechnique Fédérale
de Lausanne, 1015 Lausanne, Switzerland
| | - Clemence Corminboeuf
- Laboratory
for Computational Molecular Design, Institute of Chemical Sciences
and Engineering, École Polytechnique
Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland
- National
Center for Competence in Research-Catalysis (NCCR-Catalysis), École Polytechnique Fédérale
de Lausanne, 1015 Lausanne, Switzerland
| |
Collapse
|
2
|
Cao X, Huang J, Du K, Tian Y, Hu Z, Luo Z, Wang J, Guo Y. Machine-Learning-Assisted Descriptors Identification for Indoor Formaldehyde Oxidation Catalysts. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2024; 58:8372-8379. [PMID: 38691628 DOI: 10.1021/acs.est.4c01691] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2024]
Abstract
The development of highly efficient catalysts for formaldehyde (HCHO) oxidation is of significant interest for the improvement of indoor air quality. Up to 400 works relating to the catalytic oxidation of HCHO have been published to date; however, their analysis for collective inference through conventional literature search is still a challenging task. A machine learning (ML) framework was presented to predict catalyst performance from experimental descriptors based on an HCHO oxidation catalysts database. MnOx, CeO2, Co3O4, TiO2, FeOx, ZrO2, Al2O3, SiO2, and carbon-based catalysts with different promoters were compiled from the literature. Notably, 20 descriptors including reaction catalyst composition, reaction conditions, and catalyst physical properties were collected for data mining (2263 data points). Furthermore, the eXtreme Gradient Boosting algorithm was employed, which successfully predicted the conversion efficiency of HCHO with an R-square value of 0.81. Shapley additive analysis suggested Pt/MnO2 and Ag/Ce-Co3O4 exhibited excellent catalytic performance of HCHO oxidation based on the analysis of the entire database. Validated by experimental tests and theoretical simulations, the key descriptor identified by ML, i.e., the first promoter, was further described as metal-support interactions. This study highlights ML as a useful tool for database establishment and the catalyst rational design strategy based on the importance of analysis between experimental descriptors and the performance of complex catalytic systems.
Collapse
Affiliation(s)
- Xinyuan Cao
- College of Chemistry, Central China Normal University, Wuhan, Hubei 430079, P. R. China
| | - Jisi Huang
- College of Chemistry, Central China Normal University, Wuhan, Hubei 430079, P. R. China
| | - Kexin Du
- College of Chemistry, Central China Normal University, Wuhan, Hubei 430079, P. R. China
| | - Yawen Tian
- College of Chemistry, Central China Normal University, Wuhan, Hubei 430079, P. R. China
| | - Zhixin Hu
- College of Chemistry, Central China Normal University, Wuhan, Hubei 430079, P. R. China
| | - Zhu Luo
- College of Chemistry, Central China Normal University, Wuhan, Hubei 430079, P. R. China
| | - Jinlong Wang
- College of Chemistry, Central China Normal University, Wuhan, Hubei 430079, P. R. China
- Wuhan Institute of Photochemistry and Technology, Wuhan, Hubei 430083, P. R. China
- Engineering Research Center of Photoenergy Utilization for Pollution Control and Carbon Reduction, Ministry of Education, Wuhan 430079, P. R. China
| | - Yanbing Guo
- College of Chemistry, Central China Normal University, Wuhan, Hubei 430079, P. R. China
- Wuhan Institute of Photochemistry and Technology, Wuhan, Hubei 430083, P. R. China
- Engineering Research Center of Photoenergy Utilization for Pollution Control and Carbon Reduction, Ministry of Education, Wuhan 430079, P. R. China
| |
Collapse
|
3
|
Rana D, Pflüger PM, Hölter NP, Tan G, Glorius F. Standardizing Substrate Selection: A Strategy toward Unbiased Evaluation of Reaction Generality. ACS CENTRAL SCIENCE 2024; 10:899-906. [PMID: 38680564 PMCID: PMC11046462 DOI: 10.1021/acscentsci.3c01638] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/29/2023] [Revised: 03/14/2024] [Accepted: 03/18/2024] [Indexed: 05/01/2024]
Abstract
With over 10,000 new reaction protocols arising every year, only a handful of these procedures transition from academia to application. A major reason for this gap stems from the lack of comprehensive knowledge about a reaction's scope, i.e., to which substrates the protocol can or cannot be applied. Even though chemists invest substantial effort to assess the scope of new protocols, the resulting scope tables involve significant biases, reducing their expressiveness. Herein we report a standardized substrate selection strategy designed to mitigate these biases and evaluate the applicability, as well as the limits, of any chemical reaction. Unsupervised learning is utilized to map the chemical space of industrially relevant molecules. Subsequently, potential substrate candidates are projected onto this universal map, enabling the selection of a structurally diverse set of substrates with optimal relevance and coverage. By testing our methodology on different chemical reactions, we were able to demonstrate its effectiveness in finding general reactivity trends by using a few highly representative examples. The developed methodology empowers chemists to showcase the unbiased applicability of novel methodologies, facilitating their practical applications. We hope that this work will trigger interdisciplinary discussions about biases in synthetic chemistry, leading to improved data quality.
Collapse
Affiliation(s)
- Debanjan Rana
- Universität Münster,
Organisch-Chemisches Institut, Corrensstraße 36, 48149 Münster, Germany
| | - Philipp M. Pflüger
- Universität Münster,
Organisch-Chemisches Institut, Corrensstraße 36, 48149 Münster, Germany
| | - Niklas P. Hölter
- Universität Münster,
Organisch-Chemisches Institut, Corrensstraße 36, 48149 Münster, Germany
| | - Guangying Tan
- Universität Münster,
Organisch-Chemisches Institut, Corrensstraße 36, 48149 Münster, Germany
| | - Frank Glorius
- Universität Münster,
Organisch-Chemisches Institut, Corrensstraße 36, 48149 Münster, Germany
| |
Collapse
|
4
|
Wagner F, Sagmeister P, Jusner CE, Tampone TG, Manee V, Buono FG, Williams JD, Kappe CO. A Slug Flow Platform with Multiple Process Analytics Facilitates Flexible Reaction Optimization. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2024; 11:e2308034. [PMID: 38273711 PMCID: PMC10987115 DOI: 10.1002/advs.202308034] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/30/2023] [Revised: 12/21/2023] [Indexed: 01/27/2024]
Abstract
Flow processing offers many opportunities to optimize reactions in a rapid and automated manner, yet often requires relatively large quantities of input materials. To combat this, the use of a flexible slug flow reactor, equipped with two analytical instruments, for low-volume optimization experiments are reported. A Buchwald-Hartwig amination toward the drug olanzapine, with 6 independent optimizable variables, is optimized using three different automated approaches: self-optimization, design of experiments, and kinetic modeling. These approaches are complementary and provide differing information on the reaction: pareto optimal operating points, response surface models, and mechanistic models, respectively. The results are achieved using <10% of the material that would be required for standard flow operation. Finally, a chemometric model is built utilizing automated data handling and three subsequent validation experiments demonstrate good agreement between the slug flow reactor and a standard (larger scale) flow reactor.
Collapse
Affiliation(s)
- Florian Wagner
- Center for Continuous Flow Synthesis and Processing (CC FLOW)Research Center Pharmaceutical Engineering GmbH (RCPE)Inffeldgasse 13Graz8010Austria
- Institute of ChemistryUniversity of GrazNAWI Graz, Heinrichstrasse 28Graz8010Austria
| | - Peter Sagmeister
- Center for Continuous Flow Synthesis and Processing (CC FLOW)Research Center Pharmaceutical Engineering GmbH (RCPE)Inffeldgasse 13Graz8010Austria
- Institute of ChemistryUniversity of GrazNAWI Graz, Heinrichstrasse 28Graz8010Austria
| | - Clemens E. Jusner
- Center for Continuous Flow Synthesis and Processing (CC FLOW)Research Center Pharmaceutical Engineering GmbH (RCPE)Inffeldgasse 13Graz8010Austria
- Institute of ChemistryUniversity of GrazNAWI Graz, Heinrichstrasse 28Graz8010Austria
| | - Thomas G. Tampone
- Boehringer Ingelheim Pharmaceuticals, Inc900 Ridgebury RoadRidgefieldCT06877USA
| | - Vidhyadhar Manee
- Boehringer Ingelheim Pharmaceuticals, Inc900 Ridgebury RoadRidgefieldCT06877USA
| | - Frederic G. Buono
- Boehringer Ingelheim Pharmaceuticals, Inc900 Ridgebury RoadRidgefieldCT06877USA
| | - Jason D. Williams
- Center for Continuous Flow Synthesis and Processing (CC FLOW)Research Center Pharmaceutical Engineering GmbH (RCPE)Inffeldgasse 13Graz8010Austria
- Institute of ChemistryUniversity of GrazNAWI Graz, Heinrichstrasse 28Graz8010Austria
| | - C. Oliver Kappe
- Center for Continuous Flow Synthesis and Processing (CC FLOW)Research Center Pharmaceutical Engineering GmbH (RCPE)Inffeldgasse 13Graz8010Austria
- Institute of ChemistryUniversity of GrazNAWI Graz, Heinrichstrasse 28Graz8010Austria
| |
Collapse
|
5
|
Gallarati S, van Gerwen P, Laplaza R, Brey L, Makaveev A, Corminboeuf C. A genetic optimization strategy with generality in asymmetric organocatalysis as a primary target. Chem Sci 2024; 15:3640-3660. [PMID: 38455002 PMCID: PMC10915838 DOI: 10.1039/d3sc06208b] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2023] [Accepted: 01/30/2024] [Indexed: 03/09/2024] Open
Abstract
A catalyst possessing a broad substrate scope, in terms of both turnover and enantioselectivity, is sometimes called "general". Despite their great utility in asymmetric synthesis, truly general catalysts are difficult or expensive to discover via traditional high-throughput screening and are, therefore, rare. Existing computational tools accelerate the evaluation of reaction conditions from a pre-defined set of experiments to identify the most general ones, but cannot generate entirely new catalysts with enhanced substrate breadth. For these reasons, we report an inverse design strategy based on the open-source genetic algorithm NaviCatGA and on the OSCAR database of organocatalysts to simultaneously probe the catalyst and substrate scope and optimize generality as a primary target. We apply this strategy to the Pictet-Spengler condensation, for which we curate a database of 820 reactions, used to train statistical models of selectivity and activity. Starting from OSCAR, we define a combinatorial space of millions of catalyst possibilities, and perform evolutionary experiments on a diverse substrate scope that is representative of the whole chemical space of tetrahydro-β-carboline products. While privileged catalysts emerge, we show how genetic optimization can address the broader question of generality in asymmetric synthesis, extracting structure-performance relationships from the challenging areas of chemical space.
Collapse
Affiliation(s)
- Simone Gallarati
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, Ecole Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne Switzerland
| | - Puck van Gerwen
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, Ecole Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne Switzerland
- National Center for Competence in Research - Catalysis (NCCR-Catalysis), Ecole Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne Switzerland
| | - Ruben Laplaza
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, Ecole Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne Switzerland
- National Center for Competence in Research - Catalysis (NCCR-Catalysis), Ecole Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne Switzerland
| | - Lucien Brey
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, Ecole Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne Switzerland
| | - Alexander Makaveev
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, Ecole Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne Switzerland
| | - Clemence Corminboeuf
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, Ecole Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne Switzerland
- National Center for Competence in Research - Catalysis (NCCR-Catalysis), Ecole Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne Switzerland
- National Center for Computational Design and Discovery of Novel Materials (MARVEL), Ecole Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne Switzerland
| |
Collapse
|
6
|
Eckhoff M, Diedrich JV, Mücke M, Proppe J. Quantitative Structure-Reactivity Relationships for Synthesis Planning: The Benzhydrylium Case. J Phys Chem A 2024; 128:343-354. [PMID: 38113457 PMCID: PMC10788916 DOI: 10.1021/acs.jpca.3c07289] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2023] [Revised: 11/28/2023] [Accepted: 12/01/2023] [Indexed: 12/21/2023]
Abstract
Selective and feasible reactions are among the top targets in synthesis planning. Mayr's approach to quantifying chemical reactivity has greatly facilitated the planning process, but reactivity parameters for new compounds require time-consuming experiments. In the past decade, data-driven modeling has been gaining momentum in the field, as it shows promise in terms of efficient reactivity prediction. However, state-of-the-art models use quantum chemical data as input, which prevent access to real-time planning in organic synthesis. Here, we present a novel data-driven workflow for predicting reactivity parameters of molecules that takes only structural information as input, enabling de facto real-time reactivity predictions. We use the well-understood chemical space of benzhydrylium ions as an example to demonstrate the functionality of our approach and the performance of the resulting quantitative structure-reactivity relationships (QSRRs). Our results suggest that it is straightforward to build low-cost QSRR models that are accurate, interpretable, and transferable to unexplored systems within a given scope of application. Moreover, our QSRR approach suggests that Hammett σ parameters are only approximately additive.
Collapse
Affiliation(s)
- Maike Eckhoff
- Institute
of Physical and Theoretical Chemistry, TU
Braunschweig, Braunschweig 38106, Germany
| | - Johannes V. Diedrich
- Institute
of Physical and Theoretical Chemistry, TU
Braunschweig, Braunschweig 38106, Germany
- Institute
of Physical Chemistry, University of Göttingen, Göttingen 37077, Germany
| | - Maike Mücke
- Institute
of Physical and Theoretical Chemistry, TU
Braunschweig, Braunschweig 38106, Germany
- Institute
of Physical Chemistry, University of Göttingen, Göttingen 37077, Germany
| | - Jonny Proppe
- Institute
of Physical and Theoretical Chemistry, TU
Braunschweig, Braunschweig 38106, Germany
| |
Collapse
|
7
|
Raghavan P, Haas BC, Ruos ME, Schleinitz J, Doyle AG, Reisman SE, Sigman MS, Coley CW. Dataset Design for Building Models of Chemical Reactivity. ACS CENTRAL SCIENCE 2023; 9:2196-2204. [PMID: 38161380 PMCID: PMC10755851 DOI: 10.1021/acscentsci.3c01163] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Revised: 11/06/2023] [Accepted: 11/15/2023] [Indexed: 01/03/2024]
Abstract
Models can codify our understanding of chemical reactivity and serve a useful purpose in the development of new synthetic processes via, for example, evaluating hypothetical reaction conditions or in silico substrate tolerance. Perhaps the most determining factor is the composition of the training data and whether it is sufficient to train a model that can make accurate predictions over the full domain of interest. Here, we discuss the design of reaction datasets in ways that are conducive to data-driven modeling, emphasizing the idea that training set diversity and model generalizability rely on the choice of molecular or reaction representation. We additionally discuss the experimental constraints associated with generating common types of chemistry datasets and how these considerations should influence dataset design and model building.
Collapse
Affiliation(s)
- Priyanka Raghavan
- Department
of Chemical Engineering, Massachusetts Institute
of Technology, Cambridge, Massachusetts 02139, United States
| | - Brittany C. Haas
- Department
of Chemistry, University of Utah, Salt Lake City, Utah 84112, United States
| | - Madeline E. Ruos
- Department
of Chemistry & Biochemistry, University
of California, Los Angeles, Los Angeles, California 90095, United States
| | - Jules Schleinitz
- Division
of Chemistry and Chemical Engineering, California
Institute of Technology, Pasadena, California 91125, United States
| | - Abigail G. Doyle
- Department
of Chemistry & Biochemistry, University
of California, Los Angeles, Los Angeles, California 90095, United States
| | - Sarah E. Reisman
- Division
of Chemistry and Chemical Engineering, California
Institute of Technology, Pasadena, California 91125, United States
| | - Matthew S. Sigman
- Department
of Chemistry, University of Utah, Salt Lake City, Utah 84112, United States
| | - Connor W. Coley
- Department
of Chemical Engineering, Massachusetts Institute
of Technology, Cambridge, Massachusetts 02139, United States
- Department
of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| |
Collapse
|
8
|
van Dijk L, Haas BC, Lim NK, Clagg K, Dotson JJ, Treacy SM, Piechowicz KA, Roytman VA, Zhang H, Toste FD, Miller SJ, Gosselin F, Sigman MS. Data Science-Enabled Palladium-Catalyzed Enantioselective Aryl-Carbonylation of Sulfonimidamides. J Am Chem Soc 2023; 145:20959-20967. [PMID: 37656964 DOI: 10.1021/jacs.3c06674] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/03/2023]
Abstract
New methods for the general asymmetric synthesis of sulfonimidamides are of great interest due to their applications in medicinal chemistry, agrochemical discovery, and academic research. We report a palladium-catalyzed cross-coupling method for the enantioselective aryl-carbonylation of sulfonimidamides. Using data science techniques, a virtual library of calculated bisphosphine ligand descriptors was used to guide reaction optimization by effectively sampling the catalyst chemical space. The optimized conditions identified using this approach provided the desired product in excellent yield and enantioselectivity. As the next step, a data science-driven strategy was also used to explore a diverse set of aryl and heteroaryl iodides, providing key information about the scope and limitations of the method. Furthermore, we tested a range of racemic sulfonimidamides for compatibility of this coupling partner. The developed method offers a general and efficient strategy for accessing enantioenriched sulfonimidamides, which should facilitate their application in industrial and academic settings.
Collapse
Affiliation(s)
- Lucy van Dijk
- Department of Chemistry, University of Utah, Salt Lake City, Utah 84112, United States
| | - Brittany C Haas
- Department of Chemistry, University of Utah, Salt Lake City, Utah 84112, United States
| | - Ngiap-Kie Lim
- Department of Small Molecule Process Chemistry, Genentech, Inc., South San Francisco, California 94080, United States
| | - Kyle Clagg
- Department of Small Molecule Process Chemistry, Genentech, Inc., South San Francisco, California 94080, United States
| | - Jordan J Dotson
- Department of Small Molecule Process Chemistry, Genentech, Inc., South San Francisco, California 94080, United States
| | - Sean M Treacy
- Department of Chemistry, University of California, Berkeley, California 94720, United States
| | - Katarzyna A Piechowicz
- Department of Small Molecule Process Chemistry, Genentech, Inc., South San Francisco, California 94080, United States
| | - Vladislav A Roytman
- Department of Chemistry, University of California, Berkeley, California 94720, United States
| | - Haiming Zhang
- Department of Small Molecule Process Chemistry, Genentech, Inc., South San Francisco, California 94080, United States
| | - F Dean Toste
- Department of Chemistry, University of California, Berkeley, California 94720, United States
| | - Scott J Miller
- Department of Chemistry, Yale University, New Haven, Connecticut 06511, United States
| | - Francis Gosselin
- Department of Small Molecule Process Chemistry, Genentech, Inc., South San Francisco, California 94080, United States
| | - Matthew S Sigman
- Department of Chemistry, University of Utah, Salt Lake City, Utah 84112, United States
| |
Collapse
|
9
|
Synthesis, α-glucosidase inhibitory activity, and molecular docking of cinnamamides. Med Chem Res 2023. [DOI: 10.1007/s00044-023-03032-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/18/2023]
|
10
|
Abstract
Reactivity scales are useful research tools for chemists, both experimental and computational. However, to determine the reactivity of a single molecule, multiple measurements need to be carried out, which is a time-consuming and resource-intensive task. In this Tutorial Review, we present alternative approaches for the efficient generation of quantitative structure-reactivity relationships that are based on quantum chemistry, supervised learning, and uncertainty quantification. First published in 2002, we observe a tendency for these relationships to become not only more predictive but also more interpretable over time.
Collapse
Affiliation(s)
- Maike Vahl
- Institute of Physical and Theoretical Chemistry, Technische Universität Braunschweig, Gaußstraße 17, 38106 Braunschweig, Germany.
| | - Jonny Proppe
- Institute of Physical and Theoretical Chemistry, Technische Universität Braunschweig, Gaußstraße 17, 38106 Braunschweig, Germany.
| |
Collapse
|
11
|
Yamaguchi S. Molecular field analysis for data-driven molecular design in asymmetric catalysis. Org Biomol Chem 2022; 20:6057-6071. [PMID: 35791843 DOI: 10.1039/d2ob00228k] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
This review highlights the recent advances (2019-present) in the use of MFA (molecular field analysis) for data-driven catalyst design, enabling to improve selectivities/reaction outcomes in asymmetric catalysis. Successful examples of MFA-based molecular design and how to design molecules by MFA are described, including how to generate and evaluate MFA-based regression models, and future challenges in MFA-based molecular design in molecular catalysis.
Collapse
Affiliation(s)
- Shigeru Yamaguchi
- RIKEN Center for Sustainable Resource Science, 2-1 Hirosawa, Wako, Saitama 351-0198, Japan.
| |
Collapse
|