1
|
Tâmega GS, Costa MO, de Araujo Pereira A, Barbosa Ferreira MA. Data Science Guiding Analysis of Organic Reaction Mechanism and Prediction. CHEM REC 2024; 24:e202400148. [PMID: 39499081 DOI: 10.1002/tcr.202400148] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2024] [Revised: 09/09/2024] [Indexed: 11/07/2024]
Abstract
Advancements in synthetic organic chemistry are closely related to understanding substrate and catalyst reactivities through detailed mechanistic studies. Traditional mechanistic investigations are labor-intensive and rely on experimental kinetic, thermodynamic, and spectroscopic data. Linear free energy relationships (LFERs), exemplified by Hammett relationships, have long facilitated reactivity prediction despite their inherent limitations when using experimental constants or incorporating comprehensive experimental data. Data-driven modeling, which integrates cheminformatics with machine learning, offers powerful tools for predicting and interpreting mechanisms and effectively handling complex reactivities through multiparameter strategies. This review explores selected examples of data-driven strategies for investigating organic reaction mechanisms. It highlights the evolution and application of computational descriptors for mechanistic inference.
Collapse
Affiliation(s)
- Giovanna Scalli Tâmega
- Department of Chemistry, Federal University of São Carlos, 13565-905, São Carlos, SP, Brazil
| | - Mateus Oliveira Costa
- Department of Chemistry, Federal University of São Carlos, 13565-905, São Carlos, SP, Brazil
| | - Ariel de Araujo Pereira
- Department of Chemistry, Federal University of São Carlos, 13565-905, São Carlos, SP, Brazil
| | | |
Collapse
|
2
|
Guo Q, Fu B, Tian Y, Xu S, Meng X. Recent progress in artificial intelligence and machine learning for novel diabetes mellitus medications development. Curr Med Res Opin 2024; 40:1483-1493. [PMID: 39083361 DOI: 10.1080/03007995.2024.2387187] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/04/2024] [Accepted: 07/29/2024] [Indexed: 08/02/2024]
Abstract
Diabetes mellitus, stemming from either insulin resistance or inadequate insulin secretion, represents a complex ailment that results in prolonged hyperglycemia and severe complications. Patients endure severe ramifications such as kidney disease, vision impairment, cardiovascular disorders, and susceptibility to infections, leading to significant physical suffering and imposing substantial socio-economic burdens. This condition has evolved into an increasingly severe health crisis. There is an urgent need to develop new treatments with improved efficacy and fewer adverse effects to meet clinical demands. However, novel drug development is costly, time-consuming, and often associated with side effects and suboptimal efficacy, making it a major challenge. Artificial Intelligence (AI) and Machine Learning (ML) have revolutionized drug development across its comprehensive lifecycle, spanning drug discovery, preclinical studies, clinical trials, and post-market surveillance. These technologies have significantly accelerated the identification of promising therapeutic candidates, optimized trial designs, and enhanced post-approval safety monitoring. Recent advances in AI, including data augmentation, interpretable AI, and integration of AI with traditional experimental methods, offer promising strategies for overcoming the challenges inherent in AI-based drug discovery. Despite these advancements, there exists a notable gap in comprehensive reviews detailing AI and ML applications throughout the entirety of developing medications for diabetes mellitus. This review aims to fill this gap by evaluating the impact and potential of AI and ML technologies at various stages of diabetes mellitus drug development. It does that by synthesizing current research findings and technological advances so as to effectively control diabetes mellitus and mitigate its far-reaching social and economic impacts. The integration of AI and ML promises to revolutionize diabetes mellitus treatment strategies, offering hope for improved patient outcomes and reduced healthcare burdens worldwide.
Collapse
Affiliation(s)
- Qi Guo
- School of Pharmacy, Heilongjiang University of Chinese Medicine, Harbin, P. R. China
| | - Bo Fu
- School of Pharmacy, Heilongjiang University of Chinese Medicine, Harbin, P. R. China
| | - Yuan Tian
- School of Pharmacy, Heilongjiang University of Chinese Medicine, Harbin, P. R. China
| | - Shujun Xu
- School of Pharmacy, Heilongjiang University of Chinese Medicine, Harbin, P. R. China
| | - Xin Meng
- School of Pharmacy, Heilongjiang University of Chinese Medicine, Harbin, P. R. China
| |
Collapse
|
3
|
Riaz IB, Khan MA, Haddad TC. Potential application of artificial intelligence in cancer therapy. Curr Opin Oncol 2024; 36:437-448. [PMID: 39007164 DOI: 10.1097/cco.0000000000001068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/16/2024]
Abstract
PURPOSE OF REVIEW This review underscores the critical role and challenges associated with the widespread adoption of artificial intelligence in cancer care to enhance disease management, streamline clinical processes, optimize data retrieval of health information, and generate and synthesize evidence. RECENT FINDINGS Advancements in artificial intelligence models and the development of digital biomarkers and diagnostics are applicable across the cancer continuum from early detection to survivorship care. Additionally, generative artificial intelligence has promised to streamline clinical documentation and patient communications, generate structured data for clinical trial matching, automate cancer registries, and facilitate advanced clinical decision support. Widespread adoption of artificial intelligence has been slow because of concerns about data diversity and data shift, model reliability and algorithm bias, legal oversight, and high information technology and infrastructure costs. SUMMARY Artificial intelligence models have significant potential to transform cancer care. Efforts are underway to deploy artificial intelligence models in the cancer practice, evaluate their clinical impact, and enhance their fairness and explainability. Standardized guidelines for the ethical integration of artificial intelligence models in cancer care pathways and clinical operations are needed. Clear governance and oversight will be necessary to gain trust in artificial intelligence-assisted cancer care by clinicians, scientists, and patients.
Collapse
Affiliation(s)
- Irbaz Bin Riaz
- Department of AI and Informatics, Mayo Clinic, Minnesota
- Division of Hematology and Oncology, Mayo Clinic, Phoenix, Arizona
| | | | - Tufia C Haddad
- Department of Oncology, Mayo Clinic, Rochester, Minnesota, USA
| |
Collapse
|
4
|
McDonald MA, Koscher BA, Canty RB, Jensen KF. Calibration-free reaction yield quantification by HPLC with a machine-learning model of extinction coefficients. Chem Sci 2024; 15:10092-10100. [PMID: 38966367 PMCID: PMC11220585 DOI: 10.1039/d4sc01881h] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2024] [Accepted: 05/19/2024] [Indexed: 07/06/2024] Open
Abstract
Reaction optimization and characterization depend on reliable measures of reaction yield, often measured by high-performance liquid chromatography (HPLC). Peak areas in HPLC chromatograms are correlated to analyte concentrations by way of calibration standards, typically pure samples of known concentration. Preparing the pure material required for calibration runs can be tedious for low-yielding reactions and technically challenging at small reaction scales. Herein, we present a method to quantify the yield of reactions by HPLC without needing to isolate the product(s) by combining a machine learning model for molar extinction coefficient estimation, and both UV-vis absorption and mass spectra. We demonstrate the method for a variety of reactions important in medicinal and process chemistry, including amide couplings, palladium catalyzed cross-couplings, nucleophilic aromatic substitutions, aminations, and heterocycle syntheses. The reactions were all performed using an automated synthesis and isolation platform. Calibration-free methods such as the presented approach are necessary for such automated platforms to be able to discover, characterize, and optimize reactions automatically.
Collapse
Affiliation(s)
- Matthew A McDonald
- Department of Chemical Engineering, Massachusetts Institute of Technology Cambridge Massachusetts 02139 USA
| | - Brent A Koscher
- Department of Chemical Engineering, Massachusetts Institute of Technology Cambridge Massachusetts 02139 USA
| | - Richard B Canty
- Department of Chemical Engineering, Massachusetts Institute of Technology Cambridge Massachusetts 02139 USA
| | - Klavs F Jensen
- Department of Chemical Engineering, Massachusetts Institute of Technology Cambridge Massachusetts 02139 USA
| |
Collapse
|
5
|
Cardoso Rial R. AI in analytical chemistry: Advancements, challenges, and future directions. Talanta 2024; 274:125949. [PMID: 38569367 DOI: 10.1016/j.talanta.2024.125949] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2023] [Revised: 03/09/2024] [Accepted: 03/17/2024] [Indexed: 04/05/2024]
Abstract
This article explores the influence and applications of Artificial Intelligence (AI) in analytical chemistry, highlighting its potential to revolutionize the analysis of complex data sets and the development of innovative analytical methods. Additionally, it discusses the role of AI in interpreting large-scale data and optimizing experimental processes. AI has been fundamental in managing heterogeneous data and in advanced analysis of complex spectra in areas such as spectroscopy and chromatography. The article also examines the historical development of AI in chemistry, its current challenges, including the interpretation of AI models and the integration of large volumes of data. Finally, it forecasts future trends and the potential impact of AI on analytical chemistry, emphasizing the need for ethical and secure approaches in the use of AI.
Collapse
Affiliation(s)
- Rafael Cardoso Rial
- Federal Institute of Mato Grosso do Sul, 79750-000, Nova Andradina, MS, Brazil.
| |
Collapse
|
6
|
Xerxa E, Bajorath J. Data-oriented protein kinase drug discovery. Eur J Med Chem 2024; 271:116413. [PMID: 38636127 DOI: 10.1016/j.ejmech.2024.116413] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2024] [Revised: 04/06/2024] [Accepted: 04/11/2024] [Indexed: 04/20/2024]
Abstract
The continued growth of data from biological screening and medicinal chemistry provides opportunities for data-driven experimental design and decision making in early-phase drug discovery. Approaches adopted from data science help to integrate internal and public domain data and extract knowledge from historical in-house data. Protein kinase (PK) drug discovery is an exemplary area where large amounts of data are accumulating, providing a valuable knowledge base for discovery projects. Herein, the evolution of PK drug discovery and development of small molecular PK inhibitors (PKIs) is reviewed, highlighting milestone developments in the field and discussing exemplary studies providing a basis for increasing data orientation of PK discovery efforts.
Collapse
Affiliation(s)
- Elena Xerxa
- Department of Life Science Informatics and Data Science, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Lamarr Institute for Machine Learning and Artificial Intelligence, Rheinische Friedrich-Wilhelms-Universität, Friedrich-Hirzebruch-Allee 5/6, D-53115, Bonn, Germany
| | - Jürgen Bajorath
- Department of Life Science Informatics and Data Science, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Lamarr Institute for Machine Learning and Artificial Intelligence, Rheinische Friedrich-Wilhelms-Universität, Friedrich-Hirzebruch-Allee 5/6, D-53115, Bonn, Germany.
| |
Collapse
|
7
|
Wagner F, Sagmeister P, Jusner CE, Tampone TG, Manee V, Buono FG, Williams JD, Kappe CO. A Slug Flow Platform with Multiple Process Analytics Facilitates Flexible Reaction Optimization. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2024; 11:e2308034. [PMID: 38273711 PMCID: PMC10987115 DOI: 10.1002/advs.202308034] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/30/2023] [Revised: 12/21/2023] [Indexed: 01/27/2024]
Abstract
Flow processing offers many opportunities to optimize reactions in a rapid and automated manner, yet often requires relatively large quantities of input materials. To combat this, the use of a flexible slug flow reactor, equipped with two analytical instruments, for low-volume optimization experiments are reported. A Buchwald-Hartwig amination toward the drug olanzapine, with 6 independent optimizable variables, is optimized using three different automated approaches: self-optimization, design of experiments, and kinetic modeling. These approaches are complementary and provide differing information on the reaction: pareto optimal operating points, response surface models, and mechanistic models, respectively. The results are achieved using <10% of the material that would be required for standard flow operation. Finally, a chemometric model is built utilizing automated data handling and three subsequent validation experiments demonstrate good agreement between the slug flow reactor and a standard (larger scale) flow reactor.
Collapse
Affiliation(s)
- Florian Wagner
- Center for Continuous Flow Synthesis and Processing (CC FLOW)Research Center Pharmaceutical Engineering GmbH (RCPE)Inffeldgasse 13Graz8010Austria
- Institute of ChemistryUniversity of GrazNAWI Graz, Heinrichstrasse 28Graz8010Austria
| | - Peter Sagmeister
- Center for Continuous Flow Synthesis and Processing (CC FLOW)Research Center Pharmaceutical Engineering GmbH (RCPE)Inffeldgasse 13Graz8010Austria
- Institute of ChemistryUniversity of GrazNAWI Graz, Heinrichstrasse 28Graz8010Austria
| | - Clemens E. Jusner
- Center for Continuous Flow Synthesis and Processing (CC FLOW)Research Center Pharmaceutical Engineering GmbH (RCPE)Inffeldgasse 13Graz8010Austria
- Institute of ChemistryUniversity of GrazNAWI Graz, Heinrichstrasse 28Graz8010Austria
| | - Thomas G. Tampone
- Boehringer Ingelheim Pharmaceuticals, Inc900 Ridgebury RoadRidgefieldCT06877USA
| | - Vidhyadhar Manee
- Boehringer Ingelheim Pharmaceuticals, Inc900 Ridgebury RoadRidgefieldCT06877USA
| | - Frederic G. Buono
- Boehringer Ingelheim Pharmaceuticals, Inc900 Ridgebury RoadRidgefieldCT06877USA
| | - Jason D. Williams
- Center for Continuous Flow Synthesis and Processing (CC FLOW)Research Center Pharmaceutical Engineering GmbH (RCPE)Inffeldgasse 13Graz8010Austria
- Institute of ChemistryUniversity of GrazNAWI Graz, Heinrichstrasse 28Graz8010Austria
| | - C. Oliver Kappe
- Center for Continuous Flow Synthesis and Processing (CC FLOW)Research Center Pharmaceutical Engineering GmbH (RCPE)Inffeldgasse 13Graz8010Austria
- Institute of ChemistryUniversity of GrazNAWI Graz, Heinrichstrasse 28Graz8010Austria
| |
Collapse
|
8
|
Jiang S, McClure J, Mao H, Chen J, Liu Y, Zhang Y. Integrating Machine Learning and Color Chemistry: Developing a High-School Curriculum toward Real-World Problem-Solving. JOURNAL OF CHEMICAL EDUCATION 2024; 101:675-681. [PMID: 38939529 PMCID: PMC11210371 DOI: 10.1021/acs.jchemed.3c00589] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/29/2024]
Abstract
Artificial intelligence (AI) is rapidly transforming our world, making it imperative to educate the next generation about both the potential benefits and challenges associated with AI. This study presents a cross-disciplinary curriculum that connects AI and chemistry disciplines in the high school classroom. Particularly, we leverage machine learning (ML), an important and simple application of AI to instruct students to build an ML-based virtual pH meter for high-precision pH read-outs. We used a "codeless" and free ML neural network building software - Orange, along with a simple chemical topic of pH to show the connection between AI and chemistry for high-schoolers who might have rudimentary backgrounds in both disciplines. The goal of this curriculum is to promote student interest and drive in the analytical chemistry domain and offer insights into how the interconnection between chemistry and ML can benefit high-school students in science learning. The activity involves students using pH strips to measure the pH of various solutions with local relevancy and then building an ML neural network model to predict the pH value based on color changes of pH strips. The integrated curriculum increased student interest in chemistry and ML and demonstrated the relevance of science to their daily lives and global issues. This approach is transformative in developing a broad spectrum of integration topics between chemistry and ML and understanding their global impacts.
Collapse
Affiliation(s)
- Shiyan Jiang
- Department of Teacher Education and Learning Sciences, North Carolina State University, 2310 Stinson Drive, Raleigh, NC, 27695, USA
| | - Jeanne McClure
- Department of Teacher Education and Learning Sciences, North Carolina State University, 2310 Stinson Drive, Raleigh, NC, 27695, USA
| | - Hongjing Mao
- Molecular Analytics and Photonics (MAP) Lab, Program of Polymer and Color Chemistry, Department of Textile Engineering, Chemistry and Science, North Carolina State University, 1020 Main Campus Drive, Raleigh, NC, 27606, USA
| | - Jiahui Chen
- Molecular Analytics and Photonics (MAP) Lab, Program of Polymer and Color Chemistry, Department of Textile Engineering, Chemistry and Science, North Carolina State University, 1020 Main Campus Drive, Raleigh, NC, 27606, USA
| | - Yunshu Liu
- Molecular Analytics and Photonics (MAP) Lab, Program of Polymer and Color Chemistry, Department of Textile Engineering, Chemistry and Science, North Carolina State University, 1020 Main Campus Drive, Raleigh, NC, 27606, USA
| | - Yang Zhang
- Molecular Analytics and Photonics (MAP) Lab, Program of Polymer and Color Chemistry, Department of Textile Engineering, Chemistry and Science, North Carolina State University, 1020 Main Campus Drive, Raleigh, NC, 27606, USA
| |
Collapse
|
9
|
Leonov AI, Hammer AJS, Lach S, Mehr SHM, Caramelli D, Angelone D, Khan A, O'Sullivan S, Craven M, Wilbraham L, Cronin L. An integrated self-optimizing programmable chemical synthesis and reaction engine. Nat Commun 2024; 15:1240. [PMID: 38336880 PMCID: PMC10858227 DOI: 10.1038/s41467-024-45444-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Accepted: 01/22/2024] [Indexed: 02/12/2024] Open
Abstract
Robotic platforms for chemistry are developing rapidly but most systems are not currently able to adapt to changing circumstances in real-time. We present a dynamically programmable system capable of making, optimizing, and discovering new molecules which utilizes seven sensors that continuously monitor the reaction. By developing a dynamic programming language, we demonstrate the 10-fold scale-up of a highly exothermic oxidation reaction, end point detection, as well as detecting critical hardware failures. We also show how the use of in-line spectroscopy such as HPLC, Raman, and NMR can be used for closed-loop optimization of reactions, exemplified using Van Leusen oxazole synthesis, a four-component Ugi condensation and manganese-catalysed epoxidation reactions, as well as two previously unreported reactions, discovered from a selected chemical space, providing up to 50% yield improvement over 25-50 iterations. Finally, we demonstrate an experimental pipeline to explore a trifluoromethylations reaction space, that discovers new molecules.
Collapse
Affiliation(s)
- Artem I Leonov
- School of Chemistry, The University of Glasgow, University Avenue, Glasgow, G12 8QQ, UK
| | - Alexander J S Hammer
- School of Chemistry, The University of Glasgow, University Avenue, Glasgow, G12 8QQ, UK
| | - Slawomir Lach
- School of Chemistry, The University of Glasgow, University Avenue, Glasgow, G12 8QQ, UK
| | - S Hessam M Mehr
- School of Chemistry, The University of Glasgow, University Avenue, Glasgow, G12 8QQ, UK
| | - Dario Caramelli
- School of Chemistry, The University of Glasgow, University Avenue, Glasgow, G12 8QQ, UK
| | - Davide Angelone
- School of Chemistry, The University of Glasgow, University Avenue, Glasgow, G12 8QQ, UK
| | - Aamir Khan
- School of Chemistry, The University of Glasgow, University Avenue, Glasgow, G12 8QQ, UK
| | - Steven O'Sullivan
- School of Chemistry, The University of Glasgow, University Avenue, Glasgow, G12 8QQ, UK
| | - Matthew Craven
- School of Chemistry, The University of Glasgow, University Avenue, Glasgow, G12 8QQ, UK
| | - Liam Wilbraham
- School of Chemistry, The University of Glasgow, University Avenue, Glasgow, G12 8QQ, UK
| | - Leroy Cronin
- School of Chemistry, The University of Glasgow, University Avenue, Glasgow, G12 8QQ, UK.
| |
Collapse
|
10
|
Lennon G, Dingwall P. Enabling High Throughput Kinetic Experimentation by Using Flow as a Differential Kinetic Technique. Angew Chem Int Ed Engl 2024; 63:e202318146. [PMID: 38078481 PMCID: PMC10952970 DOI: 10.1002/anie.202318146] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2023] [Indexed: 12/23/2023]
Abstract
Kinetic data is most commonly collected through the generation of time-series data under either batch or flow conditions. Existing methods to generate kinetic data in flow collect integral data (concentration over time) only. Here, we report a method for the rapid and direct collection of differential kinetic data (direct measurement of rate) in flow by performing a series of instantaneous rate measurements on sequential small-scale reactions. This technique decouples the time required to generate a full kinetic profile from the time required for a reaction to reach completion, enabling high throughput kinetic experimentation. In addition, comparison of kinetic profiles constructed at different residence times allows the robustness, or stability, of homogeneously catalysed reactions to be interrogated. This approach makes use of a segmented flow platform which was shown to quantitatively reproduce batch kinetic data. The proline mediated aldol reaction was chosen as a model reaction to perform a high throughput kinetic screen of 216 kinetic profiles in 90 hours, one every 25 minutes, which would have taken an estimated continuous 3500 hours in batch, an almost 40-fold increase in experimental throughput matched by a corresponding reduction in material consumption.
Collapse
Affiliation(s)
- Gavin Lennon
- School of Chemistry and Chemical EngineeringQueen's University BelfastDavid Keir Building, Stranmillis RoadBelfastBT9 5AGUK
| | - Paul Dingwall
- School of Chemistry and Chemical EngineeringQueen's University BelfastDavid Keir Building, Stranmillis RoadBelfastBT9 5AGUK
| |
Collapse
|
11
|
Moor SR, Howard JR, Herrera BT, McVeigh MS, Marini F, Keatinge-Clay AT, Anslyn EV. Determination of Enantiomeric Excess and Diastereomeric Excess via Optical Methods. Application to α-methyl-β-hydroxy-carboxylic acids. Org Chem Front 2023; 10:1386-1392. [PMID: 37636898 PMCID: PMC10456989 DOI: 10.1039/d2qo01444k] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/29/2023]
Abstract
Characterization of chiral molecules in solution is paramount for measuring reaction success. However, techniques to distinguish between chiral molecules containing more than one stereocenter through the use of optical techniques remains a challenge. Herein, we report a techique using a series of circular dichroism spectra to train multivariate regression models that are capable of predicting the complete speciation of 3-hydroxy-2-methylbutanoic acid stereoisomers. From this, it is possible to rapidly and accurately determine the enantiomeric excess and diastereomeric excess of the solution without the need for chiral chromatography.
Collapse
Affiliation(s)
- Sarah R Moor
- Department of Chemistry, The University of Texas at Austin, Austin, Texas 78712, USA
| | - James R Howard
- Department of Chemistry, The University of Texas at Austin, Austin, Texas 78712, USA
| | - Brenden T Herrera
- Department of Chemistry, The University of Texas at Austin, Austin, Texas 78712, USA
| | - Matthew S McVeigh
- Department of Chemistry, The University of Texas at Austin, Austin, Texas 78712, USA
| | - Federico Marini
- Department of Chemistry, University of Rome "La Sapienza", P.le Aldo Moro 5, Rome I-00185, Italy
| | - Adrian T Keatinge-Clay
- Department of Molecular Biosciences, The University of Texas at Austin, Austin, TX 78712, USA
| | - Eric V Anslyn
- Department of Chemistry, The University of Texas at Austin, Austin, Texas 78712, USA
| |
Collapse
|
12
|
Haas C, Lübbesmeyer M, Jin EH, McDonald MA, Koscher BA, Guimond N, Di Rocco L, Kayser H, Leweke S, Niedenführ S, Nicholls R, Greeves E, Barber DM, Hillenbrand J, Volpin G, Jensen KF. Open-Source Chromatographic Data Analysis for Reaction Optimization and Screening. ACS CENTRAL SCIENCE 2023; 9:307-317. [PMID: 36844498 PMCID: PMC9951288 DOI: 10.1021/acscentsci.2c01042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/05/2022] [Indexed: 06/18/2023]
Abstract
Automation and digitalization solutions in the field of small molecule synthesis face new challenges for chemical reaction analysis, especially in the field of high-performance liquid chromatography (HPLC). Chromatographic data remains locked in vendors' hardware and software components, limiting their potential in automated workflows and data science applications. In this work, we present an open-source Python project called MOCCA for the analysis of HPLC-DAD (photodiode array detector) raw data. MOCCA provides a comprehensive set of data analysis features, including an automated peak deconvolution routine of known signals, even if overlapped with signals of unexpected impurities or side products. We highlight the broad applicability of MOCCA in four studies: (i) a simulation study to validate MOCCA's data analysis features; (ii) a reaction kinetics study on a Knoevenagel condensation reaction demonstrating MOCCA's peak deconvolution feature; (iii) a closed-loop optimization study for the alkylation of 2-pyridone without human control during data analysis; (iv) a well plate screening of categorical reaction parameters for a novel palladium-catalyzed cyanation of aryl halides employing O-protected cyanohydrins. By publishing MOCCA as a Python package with this work, we envision an open-source community project for chromatographic data analysis with the potential of further advancing its scope and capabilities.
Collapse
Affiliation(s)
- Christian
P. Haas
- Department
of Chemical Engineering, Massachusetts Institute
of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
- Research
and Development, Small Molecules Technologies, Bayer AG, Crop Science Division, Industriepark Höchst, 65926 Frankfurt am Main, Germany
| | - Maximilian Lübbesmeyer
- Department
of Chemical Engineering, Massachusetts Institute
of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
- Research
and Development, Small Molecules Technologies, Bayer AG, Crop Science Division, Industriepark Höchst, 65926 Frankfurt am Main, Germany
| | - Edward H. Jin
- Department
of Chemical Engineering, Massachusetts Institute
of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
| | - Matthew A. McDonald
- Department
of Chemical Engineering, Massachusetts Institute
of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
| | - Brent A. Koscher
- Department
of Chemical Engineering, Massachusetts Institute
of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
| | - Nicolas Guimond
- Research
and Development, Small Molecules Technologies, Bayer AG, Crop Science Division, Alfred-Nobel-Straße 50, 40789 Monheim am Rhein, Germany
| | - Laura Di Rocco
- Chemical
& Pharmaceutical Development, Bayer
AG, Pharmaceuticals Division, Müllerstraße 178, 13353 Berlin, Germany
| | - Henning Kayser
- Research
and Development, Small Molecules Technologies, Bayer AG, Crop Science Division, Alfred-Nobel-Straße 50, 40789 Monheim am Rhein, Germany
| | - Samuel Leweke
- Applied
Mathematics, Bayer AG, Enabling Functions
Division, Kaiser-Wilhelm-Allee
1, 51368 Leverkusen, Germany
| | - Sebastian Niedenführ
- Research
and Development, Computational Life Science, Bayer AG, Crop Science Division, Alfred-Nobel-Straße 50, 40789 Monheim am Rhein, Germany
| | - Rachel Nicholls
- Research
and Development, Computational Life Science, Bayer AG, Crop Science Division, Alfred-Nobel-Straße 50, 40789 Monheim am Rhein, Germany
| | - Emily Greeves
- Research
and Development, Small Molecules Technologies, Bayer AG, Crop Science Division, Industriepark Höchst, 65926 Frankfurt am Main, Germany
| | - David M. Barber
- Research
and Development, Weed Control Chemistry, Bayer AG, Crop Science Division, Industriepark Höchst, 65926 Frankfurt am Main, Germany
| | - Julius Hillenbrand
- Chemical
& Pharmaceutical Development, Bayer
AG, Pharmaceuticals Division, Friedrich-Ebert-Straße 475, 42117 Wuppertal, Germany
| | - Giulio Volpin
- Research
and Development, Small Molecules Technologies, Bayer AG, Crop Science Division, Industriepark Höchst, 65926 Frankfurt am Main, Germany
| | - Klavs F. Jensen
- Department
of Chemical Engineering, Massachusetts Institute
of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
| |
Collapse
|
13
|
Lin MH, Wolf JB, Sletten ET, Cambié D, Danglad-Flores J, Seeberger PH. Enabling Technologies in Carbohydrate Chemistry: Automated Glycan Assembly, Flow Chemistry and Data Science. Chembiochem 2023; 24:e202200607. [PMID: 36382494 DOI: 10.1002/cbic.202200607] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2022] [Revised: 11/15/2022] [Indexed: 11/17/2022]
Abstract
The synthesis of defined oligosaccharides is a complex task. Several enabling technologies have been introduced in the last two decades to facilitate synthetic access to these valuable biomolecules. In this concept, we describe the technological solutions that have advanced glycochemistry using automated glycan assembly, flow chemistry and data science as examples. We highlight how the synergies between these different technologies can further advance the field, with progress toward the realization of a self-driving lab for glycan synthesis.
Collapse
Affiliation(s)
- Mei-Huei Lin
- Department of Biomolecular Systems, Max-Planck Institute of Colloids and Interfaces, Am Mühlenberg 1, 14476, Potsdam, Germany.,Department of Chemistry and Biochemistry, Freie Universität Berlin, Arnimallee 22, 14195, Berlin, Germany
| | - Jakob B Wolf
- Department of Biomolecular Systems, Max-Planck Institute of Colloids and Interfaces, Am Mühlenberg 1, 14476, Potsdam, Germany.,Department of Chemistry and Biochemistry, Freie Universität Berlin, Arnimallee 22, 14195, Berlin, Germany
| | - Eric T Sletten
- Department of Biomolecular Systems, Max-Planck Institute of Colloids and Interfaces, Am Mühlenberg 1, 14476, Potsdam, Germany
| | - Dario Cambié
- Department of Biomolecular Systems, Max-Planck Institute of Colloids and Interfaces, Am Mühlenberg 1, 14476, Potsdam, Germany
| | - José Danglad-Flores
- Department of Biomolecular Systems, Max-Planck Institute of Colloids and Interfaces, Am Mühlenberg 1, 14476, Potsdam, Germany
| | - Peter H Seeberger
- Department of Biomolecular Systems, Max-Planck Institute of Colloids and Interfaces, Am Mühlenberg 1, 14476, Potsdam, Germany.,Department of Chemistry and Biochemistry, Freie Universität Berlin, Arnimallee 22, 14195, Berlin, Germany
| |
Collapse
|
14
|
Singh S, Sunoj RB. Molecular Machine Learning for Chemical Catalysis: Prospects and Challenges. Acc Chem Res 2023; 56:402-412. [PMID: 36715248 DOI: 10.1021/acs.accounts.2c00801] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
ConspectusIn the domain of reaction development, one aims to obtain higher efficacies as measured in terms of yield and/or selectivities. During the empirical cycles, an admixture of outcomes from low to high yields/selectivities is expected. While it is not easy to identify all of the factors that might impact the reaction efficiency, complex and nonlinear dependence on the nature of reactants, catalysts, solvents, etc. is quite likely. Developmental stages of newer reactions would typically offer a few hundreds of samples with variations in participating molecules and/or reaction conditions. These "observations" and their "output" can be harnessed as valuable labeled data for developing molecular machine learning (ML) models. Once a robust ML model is built for a specific reaction under development, it can predict the reaction outcome for any new choice of substrates/catalyst in a few seconds/minutes and thus can expedite the identification of promising candidates for experimental validation. Recent years have witnessed impressive applications of ML in the molecular world, most of them aimed at predicting important chemical or biological properties. We believe that an integration of effective ML workflows can be made richly beneficial to reaction discovery.As with any new technology, direct adaptation of ML as used in well-developed domains, such as natural language processing (NLP) and image recognition, is unlikely to succeed in reaction discovery. Some of the challenges stem from ineffective featurization of the molecular space, unavailability of quality data and its distribution, in making the right choice of ML model and its technically robust deployment. It shall be noted that there is no universal ML model suitable for an inherently high-dimensional problem such as chemical reactions. Given these backgrounds, rendering ML tools conducive for reactions is an exciting as well as challenging endeavor at the same time. With the increased availability of efficient ML algorithms, we focused on tapping their potential for small-data reaction discovery (a few hundreds to thousands of samples).In this Account, we describe both feature engineering and feature learning approaches for molecular ML as applied to diverse reactions of high contemporary interest. Among these, catalytic asymmetric hydrogenation of imines/alkenes, β-C(sp3)-H bond functionalization, and relay Heck reaction employed a feature engineering approach using the quantum-chemically derived physical organic descriptors as the molecular features─all designed to predict the enantioselectivity. The selection of molecular features to customize it for a reaction of interest is described, along with emphasizing the chemical insights that could be gathered through the use of such features. Feature learning methods for predicting the yield of Buchwald-Hartwig cross-coupling, deoxyfluorination of alcohols, and enantioselectivity of N,S-acetal formation are found to offer excellent predictions. We propose a transfer learning protocol, wherein an ML model such as a language model is trained on a large number of molecules (105-106) and fine-tuned on a focused library of target task reactions, as an effective alternative for small-data reaction discovery (102-103 reactions). The exploitation of deep neural network latent space as a method for generative tasks to identify useful substrates for a reaction is demonstrated as a promising strategy.
Collapse
Affiliation(s)
- Sukriti Singh
- Department of Chemistry, Indian Institute of Technology Bombay, Mumbai 400076, India
| | - Raghavan B Sunoj
- Department of Chemistry, Indian Institute of Technology Bombay, Mumbai 400076, India.,Centre for Machine Intelligence and Data Science, Indian Institute of Technology Bombay, Mumbai 400076, India
| |
Collapse
|
15
|
Organic reaction mechanism classification using machine learning. Nature 2023; 613:689-695. [PMID: 36697863 DOI: 10.1038/s41586-022-05639-4] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2022] [Accepted: 12/08/2022] [Indexed: 01/26/2023]
Abstract
A mechanistic understanding of catalytic organic reactions is crucial for the design of new catalysts, modes of reactivity and the development of greener and more sustainable chemical processes1-13. Kinetic analysis lies at the core of mechanistic elucidation by facilitating direct testing of mechanistic hypotheses from experimental data. Traditionally, kinetic analysis has relied on the use of initial rates14, logarithmic plots and, more recently, visual kinetic methods15-18, in combination with mathematical rate law derivations. However, the derivation of rate laws and their interpretation require numerous mathematical approximations and, as a result, they are prone to human error and are limited to reaction networks with only a few steps operating under steady state. Here we show that a deep neural network model can be trained to analyse ordinary kinetic data and automatically elucidate the corresponding mechanism class, without any additional user input. The model identifies a wide variety of classes of mechanism with outstanding accuracy, including mechanisms out of steady state such as those involving catalyst activation and deactivation steps, and performs excellently even when the kinetic data contain substantial error or only a few time points. Our results demonstrate that artificial-intelligence-guided mechanism classification is a powerful new tool that can streamline and automate mechanistic elucidation. We are making this model freely available to the community and we anticipate that this work will lead to further advances in the development of fully automated organic reaction discovery and development.
Collapse
|
16
|
Kuntz D, Wilson AK. Machine learning, artificial intelligence, and chemistry: how smart algorithms are reshaping simulation and the laboratory. PURE APPL CHEM 2022. [DOI: 10.1515/pac-2022-0202] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Abstract
Machine learning and artificial intelligence are increasingly gaining in prominence through image analysis, language processing, and automation, to name a few applications. Machine learning is also making profound changes in chemistry. From revisiting decades-old analytical techniques for the purpose of creating better calibration curves, to assisting and accelerating traditional in silico simulations, to automating entire scientific workflows, to being used as an approach to deduce underlying physics of unexplained chemical phenomena, machine learning and artificial intelligence are reshaping chemistry, accelerating scientific discovery, and yielding new insights. This review provides an overview of machine learning and artificial intelligence from a chemist’s perspective and focuses on a number of examples of the use of these approaches in computational chemistry and in the laboratory.
Collapse
Affiliation(s)
- David Kuntz
- Department of Chemistry , University of North Texas , Denton , TX 76201 , USA
| | - Angela K. Wilson
- Department of Chemistry , Michigan State University , East Lansing , MI 48824 , USA
| |
Collapse
|
17
|
Seifrid M, Pollice R, Aguilar-Granda A, Morgan Chan Z, Hotta K, Ser CT, Vestfrid J, Wu TC, Aspuru-Guzik A. Autonomous Chemical Experiments: Challenges and Perspectives on Establishing a Self-Driving Lab. Acc Chem Res 2022; 55:2454-2466. [PMID: 35948428 PMCID: PMC9454899 DOI: 10.1021/acs.accounts.2c00220] [Citation(s) in RCA: 39] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2022] [Indexed: 01/19/2023]
Abstract
We must accelerate the pace at which we make technological advancements to address climate change and disease risks worldwide. This swifter pace of discovery requires faster research and development cycles enabled by better integration between hypothesis generation, design, experimentation, and data analysis. Typical research cycles take months to years. However, data-driven automated laboratories, or self-driving laboratories, can significantly accelerate molecular and materials discovery. Recently, substantial advancements have been made in the areas of machine learning and optimization algorithms that have allowed researchers to extract valuable knowledge from multidimensional data sets. Machine learning models can be trained on large data sets from the literature or databases, but their performance can often be hampered by a lack of negative results or metadata. In contrast, data generated by self-driving laboratories can be information-rich, containing precise details of the experimental conditions and metadata. Consequently, much larger amounts of high-quality data are gathered in self-driving laboratories. When placed in open repositories, this data can be used by the research community to reproduce experiments, for more in-depth analysis, or as the basis for further investigation. Accordingly, high-quality open data sets will increase the accessibility and reproducibility of science, which is sorely needed.In this Account, we describe our efforts to build a self-driving lab for the development of a new class of materials: organic semiconductor lasers (OSLs). Since they have only recently been demonstrated, little is known about the molecular and material design rules for thin-film, electrically-pumped OSL devices as compared to other technologies such as organic light-emitting diodes or organic photovoltaics. To realize high-performing OSL materials, we are developing a flexible system for automated synthesis via iterative Suzuki-Miyaura cross-coupling reactions. This automated synthesis platform is directly coupled to the analysis and purification capabilities. Subsequently, the molecules of interest can be transferred to an optical characterization setup. We are currently limited to optical measurements of the OSL molecules in solution. However, material properties are ultimately most important in the solid state (e.g., as a thin-film device). To that end and for a different scientific goal, we are developing a self-driving lab for inorganic thin-film materials focused on the oxygen evolution reaction.While the future of self-driving laboratories is very promising, numerous challenges still need to be overcome. These challenges can be split into cognition and motor function. Generally, the cognitive challenges are related to optimization with constraints or unexpected outcomes for which general algorithmic solutions have yet to be developed. A more practical challenge that could be resolved in the near future is that of software control and integration because few instrument manufacturers design their products with self-driving laboratories in mind. Challenges in motor function are largely related to handling heterogeneous systems, such as dispensing solids or performing extractions. As a result, it is critical to understand that adapting experimental procedures that were designed for human experimenters is not as simple as transferring those same actions to an automated system, and there may be more efficient ways to achieve the same goal in an automated fashion. Accordingly, for self-driving laboratories, we need to carefully rethink the translation of manual experimental protocols.
Collapse
Affiliation(s)
- Martin Seifrid
- Department
of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
| | - Robert Pollice
- Department
of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
| | | | - Zamyla Morgan Chan
- Department
of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
- Acceleration
Consortium, University of Toronto, Toronto, Ontario M5S 3H6, Canada
| | - Kazuhiro Hotta
- Department
of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
- Science
& Innovation Center, Mitsubishi Chemical
Corporation, 1000 Kamoshidacho, Aoba, Yokohama, Kanagawa 227-8502, Japan
| | - Cher Tian Ser
- Department
of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
| | - Jenya Vestfrid
- Department
of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
| | - Tony C. Wu
- Department
of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
| | - Alán Aspuru-Guzik
- Department
of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
- Department
of Computer Science, University of Toronto, Toronto, Ontario M5S 3H6, Canada
- Department
of Chemical Engineering & Applied Chemistry, University of Toronto, Toronto, Ontario M5S 3E5, Canada
- Department
of Materials Science, University of Toronto, Toronto, Ontario M5S 3E4, Canada
- Vector
Institute for Artificial Intelligence, Toronto, Ontario M5S 1M1, Canada
- Lebovic
Fellow, Canadian Institute for Advanced
Research, Toronto, Ontario M5S 1M1, Canada
| |
Collapse
|
18
|
Lustosa DM, Milo A. Mechanistic Inference from Statistical Models at Different Data-Size Regimes. ACS Catal 2022. [DOI: 10.1021/acscatal.2c01741] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Danilo M. Lustosa
- Department of Chemistry, Ben-Gurion University of the Negev, Beer Sheva 84105, Israel
| | - Anat Milo
- Department of Chemistry, Ben-Gurion University of the Negev, Beer Sheva 84105, Israel
| |
Collapse
|
19
|
Yano J, Gaffney KJ, Gregoire J, Hung L, Ourmazd A, Schrier J, Sethian JA, Toma FM. The case for data science in experimental chemistry: examples and recommendations. Nat Rev Chem 2022; 6:357-370. [PMID: 37117931 DOI: 10.1038/s41570-022-00382-w] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/17/2022] [Indexed: 12/31/2022]
Abstract
The physical sciences community is increasingly taking advantage of the possibilities offered by modern data science to solve problems in experimental chemistry and potentially to change the way we design, conduct and understand results from experiments. Successfully exploiting these opportunities involves considerable challenges. In this Expert Recommendation, we focus on experimental co-design and its importance to experimental chemistry. We provide examples of how data science is changing the way we conduct experiments, and we outline opportunities for further integration of data science and experimental chemistry to advance these fields. Our recommendations include establishing stronger links between chemists and data scientists; developing chemistry-specific data science methods; integrating algorithms, software and hardware to 'co-design' chemistry experiments from inception; and combining diverse and disparate data sources into a data network for chemistry research.
Collapse
|
20
|
Ullah A, Dral PO. Predicting the future of excitation energy transfer in light-harvesting complex with artificial intelligence-based quantum dynamics. Nat Commun 2022; 13:1930. [PMID: 35411054 PMCID: PMC9001686 DOI: 10.1038/s41467-022-29621-w] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2021] [Accepted: 03/16/2022] [Indexed: 01/20/2023] Open
Abstract
Exploring excitation energy transfer (EET) in light-harvesting complexes (LHCs) is essential for understanding the natural processes and design of highly-efficient photovoltaic devices. LHCs are open systems, where quantum effects may play a crucial role for almost perfect utilization of solar energy. Simulation of energy transfer with inclusion of quantum effects can be done within the framework of dissipative quantum dynamics (QD), which are computationally expensive. Thus, artificial intelligence (AI) offers itself as a tool for reducing the computational cost. Here we suggest AI-QD approach using AI to directly predict QD as a function of time and other parameters such as temperature, reorganization energy, etc., completely circumventing the need of recursive step-wise dynamics propagation in contrast to the traditional QD and alternative, recursive AI-based QD approaches. Our trajectory-learning AI-QD approach is able to predict the correct asymptotic behavior of QD at infinite time. We demonstrate AI-QD on seven-sites Fenna-Matthews-Olson (FMO) complex.
Collapse
Affiliation(s)
- Arif Ullah
- State Key Laboratory of Physical Chemistry of Solid Surfaces, Fujian Provincial Key Laboratory of Theoretical and Computational Chemistry, Department of Chemistry, and College of Chemistry and Chemical Engineering, Xiamen University, Xiamen, 361005, Fujian, China.
| | - Pavlo O Dral
- State Key Laboratory of Physical Chemistry of Solid Surfaces, Fujian Provincial Key Laboratory of Theoretical and Computational Chemistry, Department of Chemistry, and College of Chemistry and Chemical Engineering, Xiamen University, Xiamen, 361005, Fujian, China.
| |
Collapse
|
21
|
Sagmeister P, Ort FF, Jusner CE, Hebrault D, Tampone T, Buono FG, Williams JD, Kappe CO. Autonomous Multi-Step and Multi-Objective Optimization Facilitated by Real-Time Process Analytics. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2022; 9:e2105547. [PMID: 35106974 PMCID: PMC8981902 DOI: 10.1002/advs.202105547] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/01/2021] [Revised: 01/12/2022] [Indexed: 05/04/2023]
Abstract
Autonomous flow reactors are becoming increasingly utilized in the synthesis of organic compounds, yet the complexity of the chemical reactions and analytical methods remains limited. The development of a modular platform which uses rapid flow NMR and FTIR measurements, combined with chemometric modeling, is presented for efficient and timely analysis of reaction outcomes. This platform is tested with a four variable single-step reaction (nucleophilic aromatic substitution), to determine the most effective optimization methodology. The self-optimization approach with minimal background knowledge proves to provide the optimal reaction parameters within the shortest operational time. The chosen approach is then applied to a seven variable two-step optimization problem (imine formation and cyclization), for the synthesis of the active pharmaceutical ingredient edaravone. Despite the exponentially increased complexity of this optimization problem, the platform achieves excellent results in a relatively small number of iterations, leading to >95% solution yield of the intermediate and up to 5.42 kg L-1 h-1 space-time yield for this pharmaceutically relevant product.
Collapse
Affiliation(s)
- Peter Sagmeister
- Institute of ChemistryUniversity of GrazNAWI Graz, Heinrichstrasse 28Graz8010Austria
- Center for Continuous Flow Synthesis and Processing (CCFLOW)Research Center Pharmaceutical Engineering GmbH (RCPE)Inffeldgasse 13Graz8010Austria
| | - Florian F. Ort
- Institute of ChemistryUniversity of GrazNAWI Graz, Heinrichstrasse 28Graz8010Austria
| | - Clemens E. Jusner
- Institute of ChemistryUniversity of GrazNAWI Graz, Heinrichstrasse 28Graz8010Austria
- Center for Continuous Flow Synthesis and Processing (CCFLOW)Research Center Pharmaceutical Engineering GmbH (RCPE)Inffeldgasse 13Graz8010Austria
| | - Dominique Hebrault
- Chemical Development USBoehringer Ingelheim Pharmaceuticals, Inc.900 Ridgebury RoadRidgefieldConnecticut06877USA
| | - Thomas Tampone
- Chemical Development USBoehringer Ingelheim Pharmaceuticals, Inc.900 Ridgebury RoadRidgefieldConnecticut06877USA
| | - Frederic G. Buono
- Chemical Development USBoehringer Ingelheim Pharmaceuticals, Inc.900 Ridgebury RoadRidgefieldConnecticut06877USA
| | - Jason D. Williams
- Institute of ChemistryUniversity of GrazNAWI Graz, Heinrichstrasse 28Graz8010Austria
- Center for Continuous Flow Synthesis and Processing (CCFLOW)Research Center Pharmaceutical Engineering GmbH (RCPE)Inffeldgasse 13Graz8010Austria
| | - C. Oliver Kappe
- Institute of ChemistryUniversity of GrazNAWI Graz, Heinrichstrasse 28Graz8010Austria
- Center for Continuous Flow Synthesis and Processing (CCFLOW)Research Center Pharmaceutical Engineering GmbH (RCPE)Inffeldgasse 13Graz8010Austria
| |
Collapse
|
22
|
Ball-Jones NR, Cobo AA, Armstrong BM, Wigman B, Fettinger JC, Hein JE, Franz AK. Ligand-Accelerated Catalysis in Scandium(III)-Catalyzed Asymmetric Spiroannulation Reactions. ACS Catal 2022. [DOI: 10.1021/acscatal.1c05768] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Nicolas R. Ball-Jones
- Department of Chemistry, One Shields Ave, University of California, Davis, California 95616, United States
| | - Angel A. Cobo
- Department of Chemistry, One Shields Ave, University of California, Davis, California 95616, United States
| | - Brittany M. Armstrong
- Department of Chemistry, One Shields Ave, University of California, Davis, California 95616, United States
| | - Benjamin Wigman
- Department of Chemistry, One Shields Ave, University of California, Davis, California 95616, United States
| | - James C. Fettinger
- Department of Chemistry, One Shields Ave, University of California, Davis, California 95616, United States
| | - Jason E. Hein
- Department of Chemistry, University of British Columbia, 2036 Main Mall, Vancouver, British Columbia V6T 1Z1, Canada
| | - Annaliese K. Franz
- Department of Chemistry, One Shields Ave, University of California, Davis, California 95616, United States
| |
Collapse
|
23
|
Chaikittisilp W, Yamauchi Y, Ariga K. Material Evolution with Nanotechnology, Nanoarchitectonics, and Materials Informatics: What will be the Next Paradigm Shift in Nanoporous Materials? ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2022; 34:e2107212. [PMID: 34637159 DOI: 10.1002/adma.202107212] [Citation(s) in RCA: 54] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/10/2021] [Revised: 10/05/2021] [Indexed: 05/27/2023]
Abstract
Materials science and chemistry have played a central and significant role in advancing society. With the shift toward sustainable living, it is anticipated that the development of functional materials will continue to be vital for sustaining life on our planet. In the recent decades, rapid progress has been made in materials science and chemistry owing to the advances in experimental, analytical, and computational methods, thereby producing several novel and useful materials. However, most problems in material development are highly complex. Here, the best strategy for the development of functional materials via the implementation of three key concepts is discussed: nanotechnology as a game changer, nanoarchitectonics as an integrator, and materials informatics as a super-accelerator. Discussions from conceptual viewpoints and example recent developments, chiefly focused on nanoporous materials, are presented. It is anticipated that coupling these three strategies together will open advanced routes for the swift design and exploratory search of functional materials truly useful for solving real-world problems. These novel strategies will result in the evolution of nanoporous functional materials.
Collapse
Affiliation(s)
- Watcharop Chaikittisilp
- JST-ERATO Yamauchi Materials Space-Tectonics Project, National Institute for Materials Science (NIMS), 1-1 Namiki, Tsukuba, Ibaraki, 305-0044, Japan
- Research and Services Division of Materials Data and Integrated System (MaDIS), National Institute for Materials Science (NIMS), 1-1 Namiki, Tsukuba, Ibaraki, 305-0044, Japan
| | - Yusuke Yamauchi
- JST-ERATO Yamauchi Materials Space-Tectonics Project, National Institute for Materials Science (NIMS), 1-1 Namiki, Tsukuba, Ibaraki, 305-0044, Japan
- International Center for Materials Nanoarchitectonics (WPI-MANA), National Institute for Materials Science (NIMS), 1-1 Namiki, Tsukuba, Ibaraki, 305-0044, Japan
- Australian Institute for Bioengineering and Nanotechnology (AIBN) and School of Chemical Engineering, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Katsuhiko Ariga
- JST-ERATO Yamauchi Materials Space-Tectonics Project, National Institute for Materials Science (NIMS), 1-1 Namiki, Tsukuba, Ibaraki, 305-0044, Japan
- International Center for Materials Nanoarchitectonics (WPI-MANA), National Institute for Materials Science (NIMS), 1-1 Namiki, Tsukuba, Ibaraki, 305-0044, Japan
- Graduate School of Frontier Sciences, The University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa, Chiba, 277-8561, Japan
| |
Collapse
|
24
|
Chen K, Tian H, Li B, Rangarajan S. A chemistry‐inspired neural network kinetic model for oxidative coupling of methane from high‐throughput data. AIChE J 2022. [DOI: 10.1002/aic.17584] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Affiliation(s)
- Kexin Chen
- Department of Chemical and Biomolecular Engineering Lehigh University Bethlehem Pennsylvania USA
| | - Huijie Tian
- Department of Chemical and Biomolecular Engineering Lehigh University Bethlehem Pennsylvania USA
| | - Bowen Li
- Department of Chemical and Biomolecular Engineering Lehigh University Bethlehem Pennsylvania USA
| | - Srinivas Rangarajan
- Department of Chemical and Biomolecular Engineering Lehigh University Bethlehem Pennsylvania USA
| |
Collapse
|
25
|
Hickey BL, Chen J, Zou Y, Gill AD, Zhong W, Millar JG, Hooley RJ. Enantioselective sensing of insect pheromones in water. Chem Commun (Camb) 2021; 57:13341-13344. [PMID: 34817473 DOI: 10.1039/d1cc05540b] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
An arrayed combination of water-soluble deep cavitands and cationic dyes has been shown to optically sense insect pheromones at micromolar concentration in water. Machine learning approaches were used to optimize the most effective array components, which allows differentiation between small structural differences in targets, including between different diastereomers, even though the pheromones have no innate chromophore. When combined with chiral additives, enantiodiscrimination is possible, dependent on the size and shape of the pheromone.
Collapse
Affiliation(s)
- Briana L Hickey
- Department of Chemistry, University of California-Riverside, Riverside, CA 92521, USA.
| | - Junyi Chen
- Environmental Toxicology Graduate Program, University of California-Riverside, Riverside, CA 92521, USA
| | - Yunfan Zou
- Department of Entomology, University of California-Riverside, Riverside, CA 92521, USA
| | - Adam D Gill
- Department of Biochemistry, University of California-Riverside, Riverside, CA 92521, USA
| | - Wenwan Zhong
- Department of Chemistry, University of California-Riverside, Riverside, CA 92521, USA. .,Environmental Toxicology Graduate Program, University of California-Riverside, Riverside, CA 92521, USA
| | - Jocelyn G Millar
- Department of Chemistry, University of California-Riverside, Riverside, CA 92521, USA. .,Department of Entomology, University of California-Riverside, Riverside, CA 92521, USA
| | - Richard J Hooley
- Department of Chemistry, University of California-Riverside, Riverside, CA 92521, USA. .,Department of Biochemistry, University of California-Riverside, Riverside, CA 92521, USA
| |
Collapse
|
26
|
Williams W, Zeng L, Gensch T, Sigman MS, Doyle AG, Anslyn EV. The Evolution of Data-Driven Modeling in Organic Chemistry. ACS CENTRAL SCIENCE 2021; 7:1622-1637. [PMID: 34729406 PMCID: PMC8554870 DOI: 10.1021/acscentsci.1c00535] [Citation(s) in RCA: 58] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/03/2021] [Indexed: 05/14/2023]
Abstract
Organic chemistry is replete with complex relationships: for example, how a reactant's structure relates to the resulting product formed; how reaction conditions relate to yield; how a catalyst's structure relates to enantioselectivity. Questions like these are at the foundation of understanding reactivity and developing novel and improved reactions. An approach to probing these questions that is both longstanding and contemporary is data-driven modeling. Here, we provide a synopsis of the history of data-driven modeling in organic chemistry and the terms used to describe these endeavors. We include a timeline of the steps that led to its current state. The case studies included highlight how, as a community, we have advanced physical organic chemistry tools with the aid of computers and data to augment the intuition of expert chemists and to facilitate the prediction of structure-activity and structure-property relationships.
Collapse
Affiliation(s)
- Wendy
L. Williams
- Department
of Chemistry and Biochemistry, University
of California, Los Angeles, California 90095, United States
- Department
of Chemistry, Princeton University, Princeton, New Jersey 08544, United States
| | - Lingyu Zeng
- Department
of Chemistry, The University of Texas at
Austin, Austin, Texas 78712, United States
| | - Tobias Gensch
- Department
of Chemistry, TU Berlin, Straße des 17. Juni 135, Sekr. C2, 10623 Berlin, Germany
| | - Matthew S. Sigman
- Department
of Chemistry, University of Utah, Salt Lake City, Utah 84112, United States
| | - Abigail G. Doyle
- Department
of Chemistry and Biochemistry, University
of California, Los Angeles, California 90095, United States
- Department
of Chemistry, Princeton University, Princeton, New Jersey 08544, United States
| | - Eric V. Anslyn
- Department
of Chemistry, The University of Texas at
Austin, Austin, Texas 78712, United States
| |
Collapse
|
27
|
Puthongkham P, Wirojsaengthong S, Suea-Ngam A. Machine learning and chemometrics for electrochemical sensors: moving forward to the future of analytical chemistry. Analyst 2021; 146:6351-6364. [PMID: 34585185 DOI: 10.1039/d1an01148k] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Electrochemical sensors and biosensors have been successfully used in a wide range of applications, but systematic optimization and nonlinear relationships have been compromised for electrode fabrication and data analysis. Machine learning and experimental designs are chemometric tools that have been proved to be useful in method development and data analysis. This minireview summarizes recent applications of machine learning and experimental designs in electroanalytical chemistry. First, experimental designs, e.g., full factorial, central composite, and Box-Behnken are discussed as systematic approaches to optimize electrode fabrication to consider the effects from individual variables and their interactions. Then, the principles of machine learning algorithms, including linear and logistic regressions, neural network, and support vector machine, are introduced. These machine learning models have been implemented to extract complex relationships between chemical structures and their electrochemical properties and to analyze complicated electrochemical data to improve calibration and analyte classification, such as in electronic tongues. Lastly, the future of machine learning and experimental designs in electrochemical sensors is outlined. These chemometric strategies will accelerate the development and enhance the performance of electrochemical devices for point-of-care diagnostics and commercialization.
Collapse
Affiliation(s)
- Pumidech Puthongkham
- Department of Chemistry, Faculty of Science, Chulalongkorn University, Bangkok 10330, Thailand. .,Electrochemistry and Optical Spectroscopy Center of Excellence (EOSCE), Chulalongkorn University, Bangkok 10330, Thailand.,Center of Excellence in Responsive Wearable Materials, Chulalongkorn University, Bangkok 10330, Thailand
| | - Supacha Wirojsaengthong
- Department of Chemistry, Faculty of Science, Chulalongkorn University, Bangkok 10330, Thailand.
| | - Akkapol Suea-Ngam
- Department of Materials, Department of Bioengineering, and Institute of Biomedical Engineering, Imperial College London, London, SW7 2AZ, UK
| |
Collapse
|
28
|
Bornemann‐Pfeiffer M, Wolf J, Meyer K, Kern S, Angelone D, Leonov A, Cronin L, Emmerling F. Standardisierung und Kontrolle von Grignard‐Reaktionen mittels Online‐NMR in einer universellen chemischen Syntheseplattform. Angew Chem Int Ed Engl 2021. [DOI: 10.1002/ange.202106323] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Affiliation(s)
- Martin Bornemann‐Pfeiffer
- Bundesanstalt für Materialforschung und -prüfung Richard-Willstätter-Straße 11 12489 Berlin Deutschland
- Chair of Chemical and Process Engineering Technische Universität Berlin Marchstr. 23 10587 Berlin Germany
| | - Jakob Wolf
- Bundesanstalt für Materialforschung und -prüfung Richard-Willstätter-Straße 11 12489 Berlin Deutschland
| | - Klas Meyer
- Bundesanstalt für Materialforschung und -prüfung Richard-Willstätter-Straße 11 12489 Berlin Deutschland
| | - Simon Kern
- S-PACT GmbH Burtscheiderstr. 1 52064 Aachen Deutschland
| | - Davide Angelone
- School of Chemistry University of Glasgow Glasgow G12 8QQ UK
| | - Artem Leonov
- School of Chemistry University of Glasgow Glasgow G12 8QQ UK
| | - Leroy Cronin
- School of Chemistry University of Glasgow Glasgow G12 8QQ UK
| | - Franziska Emmerling
- Bundesanstalt für Materialforschung und -prüfung Richard-Willstätter-Straße 11 12489 Berlin Deutschland
| |
Collapse
|
29
|
Bornemann‐Pfeiffer M, Wolf J, Meyer K, Kern S, Angelone D, Leonov A, Cronin L, Emmerling F. Standardization and Control of Grignard Reactions in a Universal Chemical Synthesis Machine using online NMR. Angew Chem Int Ed Engl 2021; 60:23202-23206. [PMID: 34278673 PMCID: PMC8597166 DOI: 10.1002/anie.202106323] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2021] [Indexed: 11/17/2022]
Abstract
A big problem with the chemistry literature is that it is not standardized with respect to precise operational parameters, and real time corrections are hard to make without expert knowledge. This lack of context means difficult reproducibility because many steps are ambiguous, and hence depend on tacit knowledge. Here we present the integration of online NMR into an automated chemical synthesis machine (CSM aka. "Chemputer" which is capable of small-molecule synthesis using a universal programming language) to allow automated analysis and adjustment of reactions on the fly. The system was validated and benchmarked by using Grignard reactions which were chosen due to their importance in synthesis. The system was monitored in real time using online-NMR, and spectra were measured continuously during the reactions. This shows that the synthesis being done in the Chemputer can be dynamically controlled in response to feedback optimizing the reaction conditions according to the user requirements.
Collapse
Affiliation(s)
- Martin Bornemann‐Pfeiffer
- Department 1: Analytical Chemistry, Reference MaterialsBundesanstalt für Materialforschung und -prüfungRichard-Willstätter-Straße 1112489BerlinGermany
- Chair of Chemical and Process EngineeringTechnische Universität BerlinMarchstr. 2310587BerlinGermany
| | - Jakob Wolf
- Department 1: Analytical Chemistry, Reference MaterialsBundesanstalt für Materialforschung und -prüfungRichard-Willstätter-Straße 1112489BerlinGermany
| | - Klas Meyer
- Department 1: Analytical Chemistry, Reference MaterialsBundesanstalt für Materialforschung und -prüfungRichard-Willstätter-Straße 1112489BerlinGermany
| | - Simon Kern
- S-PACT GmbHBurtscheiderstr. 152064AachenGermany
| | | | - Artem Leonov
- School of ChemistryUniversity of GlasgowGlasgowG12 8QQUK
| | - Leroy Cronin
- School of ChemistryUniversity of GlasgowGlasgowG12 8QQUK
| | - Franziska Emmerling
- Department 1: Analytical Chemistry, Reference MaterialsBundesanstalt für Materialforschung und -prüfungRichard-Willstätter-Straße 1112489BerlinGermany
| |
Collapse
|
30
|
Martínez RF, Cravotto G, Cintas P. Organic Sonochemistry: A Chemist's Timely Perspective on Mechanisms and Reactivity. J Org Chem 2021; 86:13833-13856. [PMID: 34156841 PMCID: PMC8562878 DOI: 10.1021/acs.joc.1c00805] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2021] [Indexed: 01/17/2023]
Abstract
Sonochemistry, the use of sound waves, usually within the ultrasonic range (>20 kHz), to boost or alter chemical properties and reactivity constitutes a long-standing and sustainable technique that has, however, received less attention than other activation protocols despite affordable setups. Even if unnecessary to underline the impact of ultrasound-based strategies in a broad range of chemical and biological applications, there is considerable misunderstanding and pitfalls regarding the interpretation of cavitational effects and the actual role played by the acoustic field. In this Perspective, with an eye on mechanisms in particular, we discuss the potentiality of sonochemistry in synthetic organic chemistry through selected examples of past and recent developments. Such examples illustrate specific controlling effects and working rules. Looking back at the past while looking forward to advancing the field, some essentials of sonochemical activation will be distilled.
Collapse
Affiliation(s)
- R. Fernando Martínez
- Department
of Organic and Inorganic Chemistry, Faculty of Sciences, and IACYS-Green
Chemistry and Sustainable Development Unit, University of Extremadura, 06006 Badajoz, Spain
| | - Giancarlo Cravotto
- Dipartimento
di Scienza e Tecnologia del Farmaco, Universita
degli Studi di Torino, via P. Giuria 9, Torino 10125, Italy
| | - Pedro Cintas
- Department
of Organic and Inorganic Chemistry, Faculty of Sciences, and IACYS-Green
Chemistry and Sustainable Development Unit, University of Extremadura, 06006 Badajoz, Spain
| |
Collapse
|
31
|
Kuremoto T, Sadatsune R, Yasukawa T, Yamashita Y, Kobayashi S. Machine‐Assisted Preparation of a Chiral Diamine Ligand Library and In Silico Screening Using Ab Initio Structural Parameters for Heterogeneous Chiral Catalysts. Adv Synth Catal 2021. [DOI: 10.1002/adsc.202100798] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Affiliation(s)
- Tatsuya Kuremoto
- Department of Chemistry, School of Science The University of Tokyo, Hongo, Bunkyo-ku Tokyo 113-0033 Japan
| | - Ren Sadatsune
- Department of Chemistry, School of Science The University of Tokyo, Hongo, Bunkyo-ku Tokyo 113-0033 Japan
| | - Tomohiro Yasukawa
- Department of Chemistry, School of Science The University of Tokyo, Hongo, Bunkyo-ku Tokyo 113-0033 Japan
| | - Yasuhiro Yamashita
- Department of Chemistry, School of Science The University of Tokyo, Hongo, Bunkyo-ku Tokyo 113-0033 Japan
| | - Shū Kobayashi
- Department of Chemistry, School of Science The University of Tokyo, Hongo, Bunkyo-ku Tokyo 113-0033 Japan
| |
Collapse
|
32
|
Baudis S, Behl M. High-Throughput and Combinatorial Approaches for the Development of Multifunctional Polymers. Macromol Rapid Commun 2021; 43:e2100400. [PMID: 34460146 DOI: 10.1002/marc.202100400] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2021] [Revised: 08/18/2021] [Indexed: 01/22/2023]
Abstract
High-throughput (HT) development of new multifunctional polymers is accomplished by the combination of different HT tools established in polymer sciences in the last decade. Important advances are robotic/HT synthesis of polymer libraries, the HT characterization of polymers, and the application of spatially resolved polymer library formats, explicitly microarray and gradient libraries. HT polymer synthesis enables the generation of material libraries with combinatorial design motifs. Polymer composition, molecular weight, macromolecular architecture, etc. may be varied in a systematic, fine-graded manner to obtain libraries with high chemical diversity and sufficient compositional resolution as model systems for the screening of these materials for the functions aimed. HT characterization allows a fast assessment of complementary properties, which are employed to decipher quantitative structure-properties relationships. Moreover, these methods facilitate the HT determination of important surface parameters by spatially resolved characterization methods, including time-of-flight secondary ion mass spectrometry and X-ray photoelectron spectroscopy. Here current methods for the high-throughput robotic synthesis of multifunctional polymers as well as their characterization are presented and advantages as well as present limitations are discussed.
Collapse
Affiliation(s)
- Stefan Baudis
- Institute of Active Polymers, Helmholtz-Zentrum Hereon, 14513, Teltow, Germany
| | - Marc Behl
- Institute of Active Polymers, Helmholtz-Zentrum Hereon, 14513, Teltow, Germany
| |
Collapse
|
33
|
Chen J, Gill AD, Hickey BL, Gao Z, Cui X, Hooley RJ, Zhong W. Machine Learning Aids Classification and Discrimination of Noncanonical DNA Folding Motifs by an Arrayed Host:Guest Sensing System. J Am Chem Soc 2021; 143:12791-12799. [PMID: 34346209 DOI: 10.1021/jacs.1c06031] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
An arrayed host:guest fluorescence sensor system can discriminate among and classify multiple different noncanonical DNA structures by exploiting selective molecular recognition. The sensor is highly selective and can discriminate between folds as similar as native G-quadruplexes and those with bulges or vacancies. The host and guest can form heteroternary complexes with DNA strands, with the host acting as mediator between the DNA and dye, modulating the emission. By applying machine learning algorithms to the sensing data, prediction of the folding state of unknown DNA strands is possible with high fidelity.
Collapse
|
34
|
Crawford JM, Kingston C, Toste FD, Sigman MS. Data Science Meets Physical Organic Chemistry. Acc Chem Res 2021; 54:10.1021/acs.accounts.1c00285. [PMID: 34351757 PMCID: PMC9078128 DOI: 10.1021/acs.accounts.1c00285] [Citation(s) in RCA: 45] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
ConspectusAt the heart of synthetic chemistry is the holy grail of predictable catalyst design. In particular, researchers involved in reaction development in asymmetric catalysis have pursued a variety of strategies toward this goal. This is driven by both the pragmatic need to achieve high selectivities and the inability to readily identify why a certain catalyst is effective for a given reaction. While empiricism and intuition have dominated the field of asymmetric catalysis since its inception, enantioselectivity offers a mechanistically rich platform to interrogate catalyst-structure response patterns that explain the performance of a particular catalyst or substrate.In the early stages of an asymmetric reaction development campaign, the overarching mechanism of the reaction, catalyst speciation, the turnover limiting step, and many other details are unknown or posited based on related reactions. Considering the unclear details leading to a successful reaction, initial enantioselectivity data are often used to intuitively guide the ultimate direction of optimization. However, if the conditions of the Curtin-Hammett principle are satisfied, then measured enantioselectivity can be directly connected to the ensemble of diastereomeric transition states (TSs) that lead to the enantiomeric products, and the associated free energy difference between competing TSs (ΔΔG⧧ = -RT ln[(S)/(R)], where (S) and (R) represent the concentrations of the enantiomeric products). We, and others, speculated that this important piece of information can be leveraged to guide reaction optimization in a quantitative way.Although traditional linear free energy relationships (LFERs), such as Hammett plots, have been used to illuminate important mechanistic features, we sought to develop data science derived tools to expand the power of LFERs in order to describe complex reactions frequently encountered in modern asymmetric catalysis. Specifically, we investigated whether enantioselectivity data from a reaction can be quantitatively connected to the attributes of reaction components, such as catalyst and substrate structural features, to harness data for asymmetric catalyst design.In this context, we developed a workflow to relate computationally derived features of reaction components to enantioselectivity using data science tools. The mathematical representation of molecules can incorporate many aspects of a transformation, such as molecular features from substrate, product, catalyst, and proposed transition states. Statistical models relating these features to reaction outputs can be used for various tasks, such as performance prediction of untested molecules. Perhaps most importantly, statistical models can guide the generation of mechanistic hypotheses that are embedded within complex patterns of reaction responses. Overall, merging traditional physical organic experiments with statistical modeling techniques creates a feedback loop that enables both evaluation of multiple mechanistic hypotheses and future catalyst design. In this Account, we highlight the evolution and application of this approach in the context of a collaborative program based on chiral phosphoric acid catalysts (CPAs) in asymmetric catalysis.
Collapse
Affiliation(s)
- Jennifer M Crawford
- Department of Chemistry, University of Utah, 315 S. 1400 E., Salt Lake City, Utah 84112, United States
| | - Cian Kingston
- Department of Chemistry, University of Utah, 315 S. 1400 E., Salt Lake City, Utah 84112, United States
| | - F Dean Toste
- Department of Chemistry, University of California, Berkeley, California 94720, United States
| | - Matthew S Sigman
- Department of Chemistry, University of Utah, 315 S. 1400 E., Salt Lake City, Utah 84112, United States
| |
Collapse
|
35
|
Bordawekar S, Diwan M, Nere NK. Positioning for a sustainable future—Role of chemical engineers in transforming pharmaceutical process development. AIChE J 2021. [DOI: 10.1002/aic.17364] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Affiliation(s)
| | - Moiz Diwan
- Process R&D, AbbVie North Chicago Illinois USA
| | | |
Collapse
|