1
|
Pham TT, Guo Z, Li B, Lapkin AA, Yan N. Synthesis of Pyrrole-2-Carboxylic Acid from Cellulose- and Chitin-Based Feedstocks Discovered by the Automated Route Search. CHEMSUSCHEM 2024; 17:e202300538. [PMID: 37792551 DOI: 10.1002/cssc.202300538] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/10/2023] [Revised: 10/02/2023] [Accepted: 10/04/2023] [Indexed: 10/06/2023]
Abstract
The shift towards sustainable feedstocks for platform chemicals requires new routes to access functional molecules that contain heteroatoms, but there are limited bio-derived feedstocks that lead to heteroatoms in platform chemicals. Combining renewable molecules of different origins could be a solution to optimize the use of atoms from renewable sources. However, the lack of retrosynthetic tools makes it challenging to examine the extensive reaction networks of various platform molecules focusing on multiple bio-based feedstocks. In this study, a protocol was developed to identify potential transformation pathways that allow for the use of feedstocks from different origins. By analyzing existing knowledge on chemical reactions in large databases, several promising synthetic routes were shortlisted, with the reaction of D-glucosamine and pyruvic acid being the most interesting to make pyrrole-2-carboxylic acid (PCA). The optimized synthetic conditions resulted in 50 % yield of PCA, with insights gained from temperature variant NMR studies. The use of substrates obtained from two different bio-feedstock bases, namely cellulose and chitin, allowed for the establishment of a PCA-based chemical space.
Collapse
Affiliation(s)
- Thuy Trang Pham
- Department of Chemical and Biomolecular Engineering, National University of Singapore, 4 Engineering Drive 4, 117585, Singapore City, Singapore
| | - Zhen Guo
- Cambridge Centre for Advanced Research and Education in Singapore (CARES Ltd), 1 CREATE Way, #05-05 Create Tower, 138602, Singapore City, Singapore
- Chemical Data Intelligence (CDI) Pte Ltd, Robinson Road #02-00, 068898, Singapore City, Singapore
| | - Bing Li
- Department of Chemical and Biomolecular Engineering, National University of Singapore, 4 Engineering Drive 4, 117585, Singapore City, Singapore
| | - Alexei A Lapkin
- Cambridge Centre for Advanced Research and Education in Singapore (CARES Ltd), 1 CREATE Way, #05-05 Create Tower, 138602, Singapore City, Singapore
- Department of Chemical Engineering and Biotechnology, University of Cambridge, Cambridge, CB3 0AS, UK
| | - Ning Yan
- Department of Chemical and Biomolecular Engineering, National University of Singapore, 4 Engineering Drive 4, 117585, Singapore City, Singapore
| |
Collapse
|
2
|
Strieth-Kalthoff F, Sandfort F, Kühnemund M, Schäfer FR, Kuchen H, Glorius F. Machine Learning for Chemical Reactivity: The Importance of Failed Experiments. Angew Chem Int Ed Engl 2022; 61:e202204647. [PMID: 35512117 DOI: 10.1002/anie.202204647] [Citation(s) in RCA: 62] [Impact Index Per Article: 20.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2022] [Indexed: 12/27/2022]
Abstract
Assessing the outcomes of chemical reactions in a quantitative fashion has been a cornerstone across all synthetic disciplines. Classically approached through empirical optimization, data-driven modelling bears an enormous potential to streamline this process. However, such predictive models require significant quantities of high-quality data, the availability of which is limited: Main reasons for this include experimental errors and, importantly, human biases regarding experiment selection and result reporting. In a series of case studies, we investigate the impact of these biases for drawing general conclusions from chemical reaction data, revealing the utmost importance of "negative" examples. Eventually, case studies into data expansion approaches showcase directions to circumvent these limitations-and demonstrate perspectives towards a long-term data quality enhancement in chemistry.
Collapse
Affiliation(s)
- Felix Strieth-Kalthoff
- Westfälische Wilhelms-Universität Münster, Organisch-Chemisches Institut, Corrensstr. 40, 48149, Münster, Germany
| | - Frederik Sandfort
- Westfälische Wilhelms-Universität Münster, Organisch-Chemisches Institut, Corrensstr. 40, 48149, Münster, Germany
| | - Marius Kühnemund
- Westfälische Wilhelms-Universität Münster, Department for Information Systems, Leonardo-Campus 3, 48149, Münster, Germany
| | - Felix R Schäfer
- Westfälische Wilhelms-Universität Münster, Organisch-Chemisches Institut, Corrensstr. 40, 48149, Münster, Germany
| | - Herbert Kuchen
- Westfälische Wilhelms-Universität Münster, Department for Information Systems, Leonardo-Campus 3, 48149, Münster, Germany
| | - Frank Glorius
- Westfälische Wilhelms-Universität Münster, Organisch-Chemisches Institut, Corrensstr. 40, 48149, Münster, Germany
| |
Collapse
|
3
|
Strieth‐Kalthoff F, Sandfort F, Kühnemund M, Schäfer FR, Kuchen H, Glorius F. Maschinelles Lernen zur Vorhersage chemischer Reaktivität: Die Bedeutung “gescheiterter” Experimente. Angew Chem Int Ed Engl 2022. [DOI: 10.1002/ange.202204647] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Felix Strieth‐Kalthoff
- Westfälische Wilhelms-Universität Münster Organisch-Chemisches Institut Corrensstr. 40 48149 Münster Deutschland
| | - Frederik Sandfort
- Westfälische Wilhelms-Universität Münster Organisch-Chemisches Institut Corrensstr. 40 48149 Münster Deutschland
| | - Marius Kühnemund
- Westfälische Wilhelms-Universität Münster Department for Information Systems Leonardo-Campus 3 48149 Münster Deutschland
| | - Felix R. Schäfer
- Westfälische Wilhelms-Universität Münster Organisch-Chemisches Institut Corrensstr. 40 48149 Münster Deutschland
| | - Herbert Kuchen
- Westfälische Wilhelms-Universität Münster Department for Information Systems Leonardo-Campus 3 48149 Münster Deutschland
| | - Frank Glorius
- Westfälische Wilhelms-Universität Münster Organisch-Chemisches Institut Corrensstr. 40 48149 Münster Deutschland
| |
Collapse
|
4
|
Yang Q, Liu Y, Cheng J, Li Y, Liu S, Duan Y, Zhang L, Luo S. An Ensemble Structure and Physiochemical (SPOC) Descriptor for Machine-Learning Prediction of Chemical Reaction and Molecular Properties. Chemphyschem 2022; 23:e202200255. [PMID: 35478429 DOI: 10.1002/cphc.202200255] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2022] [Indexed: 11/08/2022]
Abstract
Feature representations, or descriptors, are machines' chemical language that largely shapes the prediction capability, generalizability and interpretability of machine learning models. To develop a generally applicable descriptor is highly warranted for chemists to deal with conventional prediction tasks in the context of sparsely distributed and small datasets. Inspired by the chemist's vision on molecules, we presented herein an ensemble descriptor, SPOC, curated on the principles of physical organic chemistry that integrates Structure and Physicochemical property (SPOC) of a molecule. SPOC could be readily constructed by combining molecular fingerprints, representing the structure of a given molecule, and molecular physicochemical properties extracted from RDKit or Mordred molecular descriptors. The applicability of SPOC was fully surveyed in a range of well-structured chemical databases with machine learning tasks varying from regression to classifications.
Collapse
Affiliation(s)
- Qi Yang
- Tsinghua University, CBMS, Department of Chemistry, CHINA
| | - Yidi Liu
- Tsinghua University, CBMS, Department of Chemistry, CHINA
| | - Junjie Cheng
- Tsinghua University, CBMS, Department of Chemistry, CHINA
| | - Yao Li
- Tsinghua University, CBMS, Department of Chemistry, CHINA
| | - Siyuan Liu
- Tsinghua University, CBMS, Department of Chemistry, CHINA
| | - Yingdong Duan
- Tsinghua University, CBMS, Department of Chemistry, CHINA
| | - Long Zhang
- Tsinghua University, CBMS, Department of Chemistry, CHINA
| | - Sanzhong Luo
- Tsinghua University, Department of Chemistry, Tsinghua University, 100084, Beijing, CHINA
| |
Collapse
|
5
|
Abstract
The use of flow reactors in biocatalysis has increased significantly in recent years. Chemists have begun to design flow systems that even allow new biocatalytic reactions to take place. This concept article will focus on the design of flow systems that have allowed enzymes to go beyond their limits in batch. The case is made for moving towards fully continuous systems. With flow chemistry increasingly seen as an enabling technology for automated synthesis, and with advancements in AI-assisted enzyme design, there is a real possibility to fully automate the development and implementation of a continuous biocatalytic processes. This will lead to significantly improved enzyme processes for synthesis.
Collapse
Affiliation(s)
- Sebastian C. Cosgrove
- Lennard-Jones LaboratorySchool of Chemical and Physical SciencesKeele UniversityKeeleStaffordshireST5 5BGUnited Kingdom
| | | |
Collapse
|
6
|
Genheden S, Mårdh A, Lahti G, Engkvist O, Olsson S, Kogej T. Prediction of the chemical context for Buchwald‐Hartwig coupling reactions. Mol Inform 2022; 41:e2100294. [PMID: 35122702 PMCID: PMC9540548 DOI: 10.1002/minf.202100294] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2021] [Accepted: 02/05/2022] [Indexed: 11/10/2022]
Abstract
We present machine learning models for predicting the chemical context for Buchwald‐Hartwig coupling reactions, i. e., what chemicals to add to the reactants to give a productive reaction. Using reaction data from in‐house electronic lab notebooks, we train two models: one based on single‐label data and one based on multi‐label data. Both models show excellent top‐3 accuracy of approximately 90 %, which suggests strong predictivity. Furthermore, there seems to be an advantage of including multi‐label data because the multi‐label model shows higher accuracy and better sensitivity for the individual contexts than the single‐label model. Although the models are performant, we also show that such models need to be re‐trained periodically as there is a strong temporal characteristic to the usage of different contexts. Therefore, a model trained on historical data will decrease in usefulness with time as newer and better contexts emerge and replace older ones. We hypothesize that such significant transitions in the context‐usage will likely affect any model predicting chemical contexts trained on historical data. Consequently, training context prediction models warrants careful planning of what data is used for training and how often the model needs to be re‐trained.
Collapse
|
7
|
Szymkuć S, Badowski T, Grzybowski BA. Is Organic Chemistry Really Growing Exponentially? Angew Chem Int Ed Engl 2021. [DOI: 10.1002/ange.202111540] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Affiliation(s)
- Sara Szymkuć
- Institute of Organic Chemistry Polish Academy of Sciences Ul. Kasprzaka 44/52 01-224 Warsaw Poland
- Allchemy, Inc. Highland IN USA
| | - Tomasz Badowski
- Institute of Organic Chemistry Polish Academy of Sciences Ul. Kasprzaka 44/52 01-224 Warsaw Poland
- Allchemy, Inc. Highland IN USA
| | - Bartosz A. Grzybowski
- Institute of Organic Chemistry Polish Academy of Sciences Ul. Kasprzaka 44/52 01-224 Warsaw Poland
- Allchemy, Inc. Highland IN USA
- IBS Center for Soft and Living Matter and Department of Chemistry UNIST 50, UNIST-gil, Eonyang-eup, Ulju-gun Ulsan South Korea
| |
Collapse
|
8
|
Szymkuć S, Badowski T, Grzybowski BA. Is Organic Chemistry Really Growing Exponentially? Angew Chem Int Ed Engl 2021; 60:26226-26232. [PMID: 34558168 DOI: 10.1002/anie.202111540] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2021] [Indexed: 11/05/2022]
Abstract
In terms of molecules and specific reaction examples, organic chemistry features an impressive, exponential growth. However, new reaction classes/types that fuel this growth are being discovered at a much slower and only linear (or even sublinear) rate. The proportion of newly discovered reaction types to all reactions being performed keeps decreasing, suggesting that synthetic chemistry becomes more reliant on reusing the well-known methods. The newly discovered chemistries are more complex than decades ago and allow for the rapid construction of complex scaffolds in fewer numbers of steps. We study these and other trends in the function of time, reaction-type popularity and complexity based on the algorithm that extracts generalized reaction class templates. These analyses are useful in the context of computer-assisted synthesis, machine learning (to estimate the numbers of models with sufficient reaction statistics), and identifying erroneous entries in reaction databases.
Collapse
Affiliation(s)
- Sara Szymkuć
- Institute of Organic Chemistry, Polish Academy of Sciences, Ul. Kasprzaka 44/52, 01-224, Warsaw, Poland.,Allchemy, Inc., Highland, IN, USA
| | - Tomasz Badowski
- Institute of Organic Chemistry, Polish Academy of Sciences, Ul. Kasprzaka 44/52, 01-224, Warsaw, Poland.,Allchemy, Inc., Highland, IN, USA
| | - Bartosz A Grzybowski
- Institute of Organic Chemistry, Polish Academy of Sciences, Ul. Kasprzaka 44/52, 01-224, Warsaw, Poland.,Allchemy, Inc., Highland, IN, USA.,IBS Center for Soft and Living Matter and Department of Chemistry, UNIST, 50, UNIST-gil, Eonyang-eup, Ulju-gun, Ulsan, South Korea
| |
Collapse
|
9
|
Moskal M, Beker W, Szymkuć S, Grzybowski BA. Scaffold‐Directed Face Selectivity Machine‐Learned from Vectors of Non‐covalent Interactions. Angew Chem Int Ed Engl 2021. [DOI: 10.1002/ange.202101986] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Affiliation(s)
- Martyna Moskal
- Institute of Organic Chemistry Polish Academy of Sciences Ul. Kasprzaka 44/52 01-224 Warsaw Poland
- Allchemy, Inc. Highland IN USA
| | - Wiktor Beker
- Institute of Organic Chemistry Polish Academy of Sciences Ul. Kasprzaka 44/52 01-224 Warsaw Poland
- Allchemy, Inc. Highland IN USA
| | - Sara Szymkuć
- Institute of Organic Chemistry Polish Academy of Sciences Ul. Kasprzaka 44/52 01-224 Warsaw Poland
- Allchemy, Inc. Highland IN USA
| | - Bartosz A. Grzybowski
- Institute of Organic Chemistry Polish Academy of Sciences Ul. Kasprzaka 44/52 01-224 Warsaw Poland
- Allchemy, Inc. Highland IN USA
- IBS Center for Soft and Living Matter and Department of Chemistry UNIST 50, UNIST-gil, Eonyang-eup, Ulju-gun Ulsan South Korea
| |
Collapse
|
10
|
Moskal M, Beker W, Szymkuć S, Grzybowski BA. Scaffold-Directed Face Selectivity Machine-Learned from Vectors of Non-covalent Interactions. Angew Chem Int Ed Engl 2021; 60:15230-15235. [PMID: 33876554 DOI: 10.1002/anie.202101986] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2021] [Revised: 03/29/2021] [Indexed: 11/06/2022]
Abstract
This work describes a method to vectorize and Machine-Learn, ML, non-covalent interactions responsible for scaffold-directed reactions important in synthetic chemistry. Models trained on this representation predict correct face of approach in ca. 90 % of Michael additions or Diels-Alder cycloadditions. These accuracies are significantly higher than those based on traditional ML descriptors, energetic calculations, or intuition of experienced synthetic chemists. Our results also emphasize the importance of ML models being provided with relevant mechanistic knowledge; without such knowledge, these models cannot easily "transfer-learn" and extrapolate to previously unseen reaction mechanisms.
Collapse
Affiliation(s)
- Martyna Moskal
- Institute of Organic Chemistry, Polish Academy of Sciences, Ul. Kasprzaka 44/52, 01-224, Warsaw, Poland.,Allchemy, Inc., Highland, IN, USA
| | - Wiktor Beker
- Institute of Organic Chemistry, Polish Academy of Sciences, Ul. Kasprzaka 44/52, 01-224, Warsaw, Poland.,Allchemy, Inc., Highland, IN, USA
| | - Sara Szymkuć
- Institute of Organic Chemistry, Polish Academy of Sciences, Ul. Kasprzaka 44/52, 01-224, Warsaw, Poland.,Allchemy, Inc., Highland, IN, USA
| | - Bartosz A Grzybowski
- Institute of Organic Chemistry, Polish Academy of Sciences, Ul. Kasprzaka 44/52, 01-224, Warsaw, Poland.,Allchemy, Inc., Highland, IN, USA.,IBS Center for Soft and Living Matter and Department of Chemistry, UNIST, 50, UNIST-gil, Eonyang-eup, Ulju-gun, Ulsan, South Korea
| |
Collapse
|
11
|
Chen Y, Tian B, Cheng Z, Li X, Huang M, Sun Y, Liu S, Cheng X, Li S, Ding M. Electro-Descriptors for the Performance Prediction of Electro-Organic Synthesis. Angew Chem Int Ed Engl 2021; 60:4199-4207. [PMID: 33180375 DOI: 10.1002/anie.202014072] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2020] [Indexed: 12/20/2022]
Abstract
Electrochemical organic synthesis has attracted increasing attentions as a sustainable and versatile synthetic platform. Quantitative assessment of the electro-organic reactions, including reaction thermodynamics, electro-kinetics, and coupled chemical processes, can lead to effective analytical tool to guide their future design. Herein, we demonstrate that electrochemical parameters such as onset potential, Tafel slope, and effective voltage can be utilized as electro-descriptors for the evaluation of reaction conditions and prediction of reactivities (yields). An "electro-descriptor-diagram" is generated, where reactive and non-reactive conditions/substances show distinct boundary. Successful predictions of reaction outcomes have been demonstrated using electro-descriptor diagram, or from machine learning algorithms with experimentally-derived electro-descriptors. This method represents a promising tool for data-acquisition, reaction prediction, mechanistic investigation, and high-throughput screening for general organic electro-synthesis.
Collapse
Affiliation(s)
- Yuxuan Chen
- Key Laboratory of Mesoscopic Chemistry, School of Chemistry and Chemical Engineering, Nanjing University, Nanjing, 210023, China
| | - Bailin Tian
- Key Laboratory of Mesoscopic Chemistry, School of Chemistry and Chemical Engineering, Nanjing University, Nanjing, 210023, China
| | - Zheng Cheng
- Key Laboratory of Mesoscopic Chemistry, School of Chemistry and Chemical Engineering, Nanjing University, Nanjing, 210023, China.,Institute of Theoretical and Computational Chemistry, School of Chemistry and Chemical Engineering, Nanjing University, Nanjing, 210023, China
| | - Xiaoshan Li
- Key Laboratory of Mesoscopic Chemistry, School of Chemistry and Chemical Engineering, Nanjing University, Nanjing, 210023, China
| | - Min Huang
- Key Laboratory of Mesoscopic Chemistry, School of Chemistry and Chemical Engineering, Nanjing University, Nanjing, 210023, China
| | - Yuxia Sun
- Key Laboratory of Mesoscopic Chemistry, School of Chemistry and Chemical Engineering, Nanjing University, Nanjing, 210023, China
| | - Shuai Liu
- Jiangsu Key Laboratory of Advanced Organic Materials, School of Chemistry and Chemical Engineering, Nanjing University, Nanjing, 210023, China
| | - Xu Cheng
- Jiangsu Key Laboratory of Advanced Organic Materials, School of Chemistry and Chemical Engineering, Nanjing University, Nanjing, 210023, China
| | - Shuhua Li
- Key Laboratory of Mesoscopic Chemistry, School of Chemistry and Chemical Engineering, Nanjing University, Nanjing, 210023, China.,Institute of Theoretical and Computational Chemistry, School of Chemistry and Chemical Engineering, Nanjing University, Nanjing, 210023, China
| | - Mengning Ding
- Key Laboratory of Mesoscopic Chemistry, School of Chemistry and Chemical Engineering, Nanjing University, Nanjing, 210023, China
| |
Collapse
|
12
|
Puleo TR, Sujansky SJ, Wright SE, Bandar JS. Organic Superbases in Recent Synthetic Methodology Research. Chemistry 2021; 27:4216-4229. [DOI: 10.1002/chem.202003580] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2020] [Indexed: 12/23/2022]
Affiliation(s)
- Thomas R. Puleo
- Department of Chemistry Colorado State University Fort Collins Colorado 80523 USA
| | - Stephen J. Sujansky
- Department of Chemistry Colorado State University Fort Collins Colorado 80523 USA
| | - Shawn E. Wright
- Department of Chemistry Colorado State University Fort Collins Colorado 80523 USA
| | - Jeffrey S. Bandar
- Department of Chemistry Colorado State University Fort Collins Colorado 80523 USA
| |
Collapse
|
13
|
Chen Y, Tian B, Cheng Z, Li X, Huang M, Sun Y, Liu S, Cheng X, Li S, Ding M. Electro‐Descriptors for the Performance Prediction of Electro‐Organic Synthesis. Angew Chem Int Ed Engl 2020. [DOI: 10.1002/ange.202014072] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Affiliation(s)
- Yuxuan Chen
- Key Laboratory of Mesoscopic Chemistry School of Chemistry and Chemical Engineering Nanjing University Nanjing 210023 China
| | - Bailin Tian
- Key Laboratory of Mesoscopic Chemistry School of Chemistry and Chemical Engineering Nanjing University Nanjing 210023 China
| | - Zheng Cheng
- Key Laboratory of Mesoscopic Chemistry School of Chemistry and Chemical Engineering Nanjing University Nanjing 210023 China
- Institute of Theoretical and Computational Chemistry School of Chemistry and Chemical Engineering Nanjing University Nanjing 210023 China
| | - Xiaoshan Li
- Key Laboratory of Mesoscopic Chemistry School of Chemistry and Chemical Engineering Nanjing University Nanjing 210023 China
| | - Min Huang
- Key Laboratory of Mesoscopic Chemistry School of Chemistry and Chemical Engineering Nanjing University Nanjing 210023 China
| | - Yuxia Sun
- Key Laboratory of Mesoscopic Chemistry School of Chemistry and Chemical Engineering Nanjing University Nanjing 210023 China
| | - Shuai Liu
- Jiangsu Key Laboratory of Advanced Organic Materials School of Chemistry and Chemical Engineering Nanjing University Nanjing 210023 China
| | - Xu Cheng
- Jiangsu Key Laboratory of Advanced Organic Materials School of Chemistry and Chemical Engineering Nanjing University Nanjing 210023 China
| | - Shuhua Li
- Key Laboratory of Mesoscopic Chemistry School of Chemistry and Chemical Engineering Nanjing University Nanjing 210023 China
- Institute of Theoretical and Computational Chemistry School of Chemistry and Chemical Engineering Nanjing University Nanjing 210023 China
| | - Mengning Ding
- Key Laboratory of Mesoscopic Chemistry School of Chemistry and Chemical Engineering Nanjing University Nanjing 210023 China
| |
Collapse
|
14
|
Gasteiger J. Chemistry in Times of Artificial Intelligence. Chemphyschem 2020; 21:2233-2242. [PMID: 32808729 PMCID: PMC7702165 DOI: 10.1002/cphc.202000518] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2020] [Revised: 08/14/2020] [Indexed: 11/09/2022]
Abstract
Chemists have to a large extent gained their knowledge by doing experiments and thus gather data. By putting various data together and then analyzing them, chemists have fostered their understanding of chemistry. Since the 1960s, computer methods have been developed to perform this process from data to information to knowledge. Simultaneously, methods were developed for assisting chemists in solving their fundamental questions such as the prediction of chemical, physical, or biological properties, the design of organic syntheses, and the elucidation of the structure of molecules. This eventually led to a discipline of its own: chemoinformatics. Chemoinformatics has found important applications in the fields of drug discovery, analytical chemistry, organic chemistry, agrichemical research, food science, regulatory science, material science, and process control. From its inception, chemoinformatics has utilized methods from artificial intelligence, an approach that has recently gained more momentum.
Collapse
Affiliation(s)
- Johann Gasteiger
- Computer-Chemie-Centrum and Institute of Organic ChemistryUniversity of Erlangen-NurembergNaegelsbachstrasse 2591052ErlangenGermany
| |
Collapse
|
15
|
Coley CW, Eyke NS, Jensen KF. Autonomous Discovery in the Chemical Sciences Part I: Progress. Angew Chem Int Ed Engl 2020; 59:22858-22893. [DOI: 10.1002/anie.201909987] [Citation(s) in RCA: 100] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2019] [Indexed: 01/05/2023]
Affiliation(s)
- Connor W. Coley
- Department of Chemical Engineering Massachusetts Institute of Technology Cambridge MA 02139 USA
| | - Natalie S. Eyke
- Department of Chemical Engineering Massachusetts Institute of Technology Cambridge MA 02139 USA
| | - Klavs F. Jensen
- Department of Chemical Engineering Massachusetts Institute of Technology Cambridge MA 02139 USA
| |
Collapse
|
16
|
Coley CW, Eyke NS, Jensen KF. Autonome Entdeckung in den chemischen Wissenschaften, Teil I: Fortschritt. Angew Chem Int Ed Engl 2020. [DOI: 10.1002/ange.201909987] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Affiliation(s)
- Connor W. Coley
- Department of Chemical Engineering Massachusetts Institute of Technology Cambridge MA 02139 USA
| | - Natalie S. Eyke
- Department of Chemical Engineering Massachusetts Institute of Technology Cambridge MA 02139 USA
| | - Klavs F. Jensen
- Department of Chemical Engineering Massachusetts Institute of Technology Cambridge MA 02139 USA
| |
Collapse
|
17
|
Li X, Zhang S, Xu L, Hong X. Predicting Regioselectivity in Radical C−H Functionalization of Heterocycles through Machine Learning. Angew Chem Int Ed Engl 2020. [DOI: 10.1002/ange.202000959] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Affiliation(s)
- Xin Li
- Department of Chemistry Zhejiang University 38 Zheda Road Hangzhou 310027 China
| | - Shuo‐Qing Zhang
- Department of Chemistry Zhejiang University 38 Zheda Road Hangzhou 310027 China
| | - Li‐Cheng Xu
- Department of Chemistry Zhejiang University 38 Zheda Road Hangzhou 310027 China
| | - Xin Hong
- Department of Chemistry Zhejiang University 38 Zheda Road Hangzhou 310027 China
| |
Collapse
|
18
|
Li X, Zhang S, Xu L, Hong X. Predicting Regioselectivity in Radical C−H Functionalization of Heterocycles through Machine Learning. Angew Chem Int Ed Engl 2020; 59:13253-13259. [DOI: 10.1002/anie.202000959] [Citation(s) in RCA: 40] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2020] [Revised: 03/30/2020] [Indexed: 12/13/2022]
Affiliation(s)
- Xin Li
- Department of Chemistry Zhejiang University 38 Zheda Road Hangzhou 310027 China
| | - Shuo‐Qing Zhang
- Department of Chemistry Zhejiang University 38 Zheda Road Hangzhou 310027 China
| | - Li‐Cheng Xu
- Department of Chemistry Zhejiang University 38 Zheda Road Hangzhou 310027 China
| | - Xin Hong
- Department of Chemistry Zhejiang University 38 Zheda Road Hangzhou 310027 China
| |
Collapse
|
19
|
Badowski T, Gajewska EP, Molga K, Grzybowski BA. Synergy Between Expert and Machine‐Learning Approaches Allows for Improved Retrosynthetic Planning. Angew Chem Int Ed Engl 2019. [DOI: 10.1002/ange.201912083] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Affiliation(s)
- Tomasz Badowski
- Institute of Organic Chemistry Polish Academy of Sciences Ul. Kasprzaka 44/52 01-224 Warsaw Poland
| | - Ewa P. Gajewska
- Institute of Organic Chemistry Polish Academy of Sciences Ul. Kasprzaka 44/52 01-224 Warsaw Poland
| | - Karol Molga
- Institute of Organic Chemistry Polish Academy of Sciences Ul. Kasprzaka 44/52 01-224 Warsaw Poland
| | - Bartosz A. Grzybowski
- Institute of Organic Chemistry Polish Academy of Sciences Ul. Kasprzaka 44/52 01-224 Warsaw Poland
- IBS Center for Soft and Living Matter and Department of Chemistry UNIST 50, UNIST-gil, Eonyang-eup, Ulju-gun Ulsan South Korea
| |
Collapse
|
20
|
Badowski T, Gajewska EP, Molga K, Grzybowski BA. Synergy Between Expert and Machine‐Learning Approaches Allows for Improved Retrosynthetic Planning. Angew Chem Int Ed Engl 2019; 59:725-730. [DOI: 10.1002/anie.201912083] [Citation(s) in RCA: 42] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2019] [Indexed: 12/27/2022]
Affiliation(s)
- Tomasz Badowski
- Institute of Organic Chemistry Polish Academy of Sciences Ul. Kasprzaka 44/52 01-224 Warsaw Poland
| | - Ewa P. Gajewska
- Institute of Organic Chemistry Polish Academy of Sciences Ul. Kasprzaka 44/52 01-224 Warsaw Poland
| | - Karol Molga
- Institute of Organic Chemistry Polish Academy of Sciences Ul. Kasprzaka 44/52 01-224 Warsaw Poland
| | - Bartosz A. Grzybowski
- Institute of Organic Chemistry Polish Academy of Sciences Ul. Kasprzaka 44/52 01-224 Warsaw Poland
- IBS Center for Soft and Living Matter and Department of Chemistry UNIST 50, UNIST-gil, Eonyang-eup, Ulju-gun Ulsan South Korea
| |
Collapse
|
21
|
Isbrandt ES, Sullivan RJ, Newman SG. High Throughput Strategies for the Discovery and Optimization of Catalytic Reactions. Angew Chem Int Ed Engl 2019; 58:7180-7191. [DOI: 10.1002/anie.201812534] [Citation(s) in RCA: 55] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2018] [Indexed: 12/29/2022]
Affiliation(s)
- Eric S. Isbrandt
- Centre for Catalysis Research and InnovationDepartment of Chemistry and Biomolecular SciencesUniversity of Ottawa 10 Marie-Curie Ottawa Ontario K1N 6N5 Canada
| | - Ryan J. Sullivan
- Centre for Catalysis Research and InnovationDepartment of Chemistry and Biomolecular SciencesUniversity of Ottawa 10 Marie-Curie Ottawa Ontario K1N 6N5 Canada
| | - Stephen G. Newman
- Centre for Catalysis Research and InnovationDepartment of Chemistry and Biomolecular SciencesUniversity of Ottawa 10 Marie-Curie Ottawa Ontario K1N 6N5 Canada
| |
Collapse
|
22
|
Schneider G, Clark DE. Automated De Novo Drug Design: Are We Nearly There Yet? Angew Chem Int Ed Engl 2019; 58:10792-10803. [PMID: 30730601 DOI: 10.1002/anie.201814681] [Citation(s) in RCA: 83] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2019] [Indexed: 11/09/2022]
Abstract
Medicinal chemistry and, in particular, drug design have often been perceived as more of an art than a science. The many unknowns of human disease and the sheer complexity of chemical space render decision making in medicinal chemistry exceptionally demanding. Computational models can assist the medicinal chemist in this endeavour. Provided here is an overview of recent examples of automated de novo molecular design, a discussion of the concepts and computational approaches involved, and the daring prediction of some of the possibilities and limitations of drug design using machine intelligence.
Collapse
Affiliation(s)
- Gisbert Schneider
- ETH Zurich, Department of Chemistry and Applied Biosciences, RETHINK, Vladimir-Prelog-Weg 4, 8093, Zurich, Switzerland
| | - David E Clark
- Charles River, 6-9 Spire Green Centre, Harlow, Essex, CM19 5TR, UK
| |
Collapse
|
23
|
Schneider G, Clark DE. Automated De Novo Drug Design: Are We Nearly There Yet? Angew Chem Int Ed Engl 2019. [DOI: 10.1002/ange.201814681] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Gisbert Schneider
- ETH ZurichDepartment of Chemistry and Applied Biosciences, RETHINK Vladimir-Prelog-Weg 4 8093 Zurich Switzerland
| | - David E. Clark
- Charles River 6–9 Spire Green Centre Harlow Essex CM19 5TR UK
| |
Collapse
|
24
|
Roszak R, Bajczyk MD, Gajewska EP, Hołyst R, Grzybowski BA. Propagation of Oscillating Chemical Signals through Reaction Networks. Angew Chem Int Ed Engl 2019; 58:4520-4525. [PMID: 30397988 DOI: 10.1002/anie.201808821] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2018] [Indexed: 12/20/2022]
Abstract
Akin to electronic systems that can tune to and process signals of select frequencies, systems/networks of chemical reactions also "propagate" time-varying concentration inputs in a frequency-dependent manner. Whereas signals of low frequencies are transmitted, higher frequency inputs are dampened and converted into steady-concentration outputs. Such behavior is observed in both idealized reaction chains as well as realistic signaling cascades, in the latter case explaining the experimentally observed responses of such cascades to input calcium oscillations. These and other results are supported by numerical simulations within the freely available Kinetix web application we developed to study chemical systems of arbitrary architectures, reaction kinetics, and boundary conditions.
Collapse
Affiliation(s)
- Rafał Roszak
- Institute of Organic Chemistry, Polish Academy of Sciences, Ul. Kasprzaka 44/52, Warsaw, 02-224, Poland
| | - Michał D Bajczyk
- Institute of Organic Chemistry, Polish Academy of Sciences, Ul. Kasprzaka 44/52, Warsaw, 02-224, Poland
| | - Ewa P Gajewska
- Institute of Organic Chemistry, Polish Academy of Sciences, Ul. Kasprzaka 44/52, Warsaw, 02-224, Poland
| | - Robert Hołyst
- Institute of Physical Chemistry, Polish Academy of Sciences, Ul. Kasprzaka 44/52, Warsaw, 02-224, Poland
| | - Bartosz A Grzybowski
- Institute of Organic Chemistry, Polish Academy of Sciences, Ul. Kasprzaka 44/52, Warsaw, 02-224, Poland.,IBS Center for Soft and Living Matter and Department of Chemistry, UNIST, 50, UNIST-gil, Eonyang-eup, Ulju-gun, Ulsan, South Korea
| |
Collapse
|
25
|
Isbrandt ES, Sullivan RJ, Newman SG. Hochdurchsatzstrategien zur Entdeckung und Optimierung katalytischer Reaktionen. Angew Chem Int Ed Engl 2019. [DOI: 10.1002/ange.201812534] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Affiliation(s)
- Eric S. Isbrandt
- Centre for Catalysis Research and InnovationDepartment of Chemistry and Biomolecular SciencesUniversity of Ottawa 10 Marie-Curie Ottawa Ontario K1N 6N5 Kanada
| | - Ryan J. Sullivan
- Centre for Catalysis Research and InnovationDepartment of Chemistry and Biomolecular SciencesUniversity of Ottawa 10 Marie-Curie Ottawa Ontario K1N 6N5 Kanada
| | - Stephen G. Newman
- Centre for Catalysis Research and InnovationDepartment of Chemistry and Biomolecular SciencesUniversity of Ottawa 10 Marie-Curie Ottawa Ontario K1N 6N5 Kanada
| |
Collapse
|
26
|
Beker W, Gajewska EP, Badowski T, Grzybowski BA. Prediction of Major Regio‐, Site‐, and Diastereoisomers in Diels–Alder Reactions by Using Machine‐Learning: The Importance of Physically Meaningful Descriptors. Angew Chem Int Ed Engl 2018. [DOI: 10.1002/ange.201806920] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Affiliation(s)
- Wiktor Beker
- Institute of Organic Chemistry Polish Academy of Sciences ul. Kasprzaka 44/52 01-224 Warsaw Poland
| | - Ewa P. Gajewska
- Institute of Organic Chemistry Polish Academy of Sciences ul. Kasprzaka 44/52 01-224 Warsaw Poland
| | - Tomasz Badowski
- Institute of Organic Chemistry Polish Academy of Sciences ul. Kasprzaka 44/52 01-224 Warsaw Poland
| | - Bartosz A. Grzybowski
- Institute of Organic Chemistry Polish Academy of Sciences ul. Kasprzaka 44/52 01-224 Warsaw Poland
- Center for Soft and Living Matter and Department of Chemistry UNIST 50, UNIST-gil, Eonyang-eup, Ulju-gun Ulsan South Korea
| |
Collapse
|
27
|
Beker W, Gajewska EP, Badowski T, Grzybowski BA. Prediction of Major Regio‐, Site‐, and Diastereoisomers in Diels–Alder Reactions by Using Machine‐Learning: The Importance of Physically Meaningful Descriptors. Angew Chem Int Ed Engl 2018; 58:4515-4519. [DOI: 10.1002/anie.201806920] [Citation(s) in RCA: 73] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2018] [Revised: 08/22/2018] [Indexed: 01/15/2023]
Affiliation(s)
- Wiktor Beker
- Institute of Organic Chemistry Polish Academy of Sciences ul. Kasprzaka 44/52 01-224 Warsaw Poland
| | - Ewa P. Gajewska
- Institute of Organic Chemistry Polish Academy of Sciences ul. Kasprzaka 44/52 01-224 Warsaw Poland
| | - Tomasz Badowski
- Institute of Organic Chemistry Polish Academy of Sciences ul. Kasprzaka 44/52 01-224 Warsaw Poland
| | - Bartosz A. Grzybowski
- Institute of Organic Chemistry Polish Academy of Sciences ul. Kasprzaka 44/52 01-224 Warsaw Poland
- Center for Soft and Living Matter and Department of Chemistry UNIST 50, UNIST-gil, Eonyang-eup, Ulju-gun Ulsan South Korea
| |
Collapse
|
28
|
Roszak R, Bajczyk MD, Gajewska EP, Hołyst R, Grzybowski BA. Propagation of Oscillating Chemical Signals through Reaction Networks. Angew Chem Int Ed Engl 2018. [DOI: 10.1002/ange.201808821] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Affiliation(s)
- Rafał Roszak
- Institute of Organic Chemistry Polish Academy of Sciences Ul. Kasprzaka 44/52 Warsaw 02-224 Poland
| | - Michał D. Bajczyk
- Institute of Organic Chemistry Polish Academy of Sciences Ul. Kasprzaka 44/52 Warsaw 02-224 Poland
| | - Ewa P. Gajewska
- Institute of Organic Chemistry Polish Academy of Sciences Ul. Kasprzaka 44/52 Warsaw 02-224 Poland
| | - Robert Hołyst
- Institute of Physical Chemistry Polish Academy of Sciences Ul. Kasprzaka 44/52 Warsaw 02-224 Poland
| | - Bartosz A. Grzybowski
- Institute of Organic Chemistry Polish Academy of Sciences Ul. Kasprzaka 44/52 Warsaw 02-224 Poland
- IBS Center for Soft and Living Matter and Department of Chemistry UNIST 50, UNIST-gil, Eonyang-eup, Ulju-gun Ulsan South Korea
| |
Collapse
|
29
|
Choi S, Kim Y, Kim JW, Kim Z, Kim WY. Feasibility of Activation Energy Prediction of Gas-Phase Reactions by Machine Learning. Chemistry 2018; 24:12354-12358. [PMID: 29473970 DOI: 10.1002/chem.201800345] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2018] [Indexed: 11/09/2022]
Abstract
Machine learning based on big data has emerged as a powerful solution in various chemical problems. We investigated the feasibility of machine learning models for the prediction of activation energies of gas-phase reactions. Six different models with three different types, including the artificial neural network, the support vector regression, and the tree boosting methods, were tested. We used the structural and thermodynamic properties of molecules and their differences as input features without resorting to specific reaction types so as to maintain the most general input form for broad applicability. The tree boosting method showed the best performance among others in terms of the coefficient of determination, mean absolute error, and root mean square error, the values of which were 0.89, 1.95, and 4.49 kcal mol-1 , respectively. Computation time for the prediction of activation energies for 2541 test reactions was about one second on a single computing node without using accelerators.
Collapse
Affiliation(s)
- Sunghwan Choi
- Department of Chemistry, KAIST, 291, Daehak-Ro, Yuseong-gu, Daejeon, 34141, Republic of Korea.,National Institute of Supercomputing and Network, Korea Institute of Science and Technology Information, 245 Daehak-Ro, Yuseong-gu, Daejeon, 34141, Republic of Korea
| | - Yeonjoon Kim
- Department of Chemistry, KAIST, 291, Daehak-Ro, Yuseong-gu, Daejeon, 34141, Republic of Korea
| | - Jin Woo Kim
- Department of Chemistry, KAIST, 291, Daehak-Ro, Yuseong-gu, Daejeon, 34141, Republic of Korea
| | - Zeehyo Kim
- Department of Chemistry, KAIST, 291, Daehak-Ro, Yuseong-gu, Daejeon, 34141, Republic of Korea
| | - Woo Youn Kim
- Department of Chemistry, KAIST, 291, Daehak-Ro, Yuseong-gu, Daejeon, 34141, Republic of Korea.,KI for Artificial Intelligence, KAIST, 291 Daehak-ro, Yuseong-gu, Daejeon, Republic of Korea
| |
Collapse
|
30
|
Trobe M, Burke MD. The Molecular Industrial Revolution: Automated Synthesis of Small Molecules. Angew Chem Int Ed Engl 2018; 57:4192-4214. [PMID: 29513400 PMCID: PMC5912692 DOI: 10.1002/anie.201710482] [Citation(s) in RCA: 109] [Impact Index Per Article: 15.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2017] [Revised: 12/05/2017] [Indexed: 11/10/2022]
Abstract
Today we are poised for a transition from the highly customized crafting of specific molecular targets by hand to the increasingly general and automated assembly of different types of molecules with the push of a button. Creating machines that are capable of making many different types of small molecules on demand, akin to that which has been achieved on the macroscale with 3D printers, is challenging. Yet important progress is being made toward this objective with two complementary approaches: 1) Automation of customized synthesis routes to different targets by machines that enable the use of many reactions and starting materials, and 2) automation of generalized platforms that make many different targets using common coupling chemistry and building blocks. Continued progress in these directions has the potential to shift the bottleneck in molecular innovation from synthesis to imagination, and thereby help drive a new industrial revolution on the molecular scale.
Collapse
Affiliation(s)
- Melanie Trobe
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Martin D. Burke
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
- Department of Biochemistry, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA and Carle-Illinois College of Medicine, University of Illinois at Urbana-Champaign, Champaign, IL 61820, USA
| |
Collapse
|
31
|
Trobe M, Burke MD. Die molekulare industrielle Revolution: zur automatisierten Synthese organischer Verbindungen. Angew Chem Int Ed Engl 2018. [DOI: 10.1002/ange.201710482] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Affiliation(s)
- Melanie Trobe
- Department of Chemistry University of Illinois Urbana-Champaign 600 S. Mathews, 454 RAL Urbana-Champaign IL 61801 USA
| | - Martin D. Burke
- Department of Chemistry University of Illinois Urbana-Champaign 600 S. Mathews, 454 RAL Urbana-Champaign IL 61801 USA
| |
Collapse
|
32
|
Rabe KS, Müller J, Skoupi M, Niemeyer CM. Cascades in Compartments: En Route to Machine-Assisted Biotechnology. Angew Chem Int Ed Engl 2017; 56:13574-13589. [DOI: 10.1002/anie.201703806] [Citation(s) in RCA: 115] [Impact Index Per Article: 14.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2017] [Indexed: 11/05/2022]
Affiliation(s)
- Kersten S. Rabe
- Chair of Chemical Biology; Karlsruher Institut für Technologie, KIT, Institut für Biologsiche Grenzflächen 1, IBG-I; Herrmann-von-Helmholtz Platz 1, Campus Nord Eggenstein-Leopoldshafen 76344 Germany
| | - Joachim Müller
- Chair of Chemical Biology; Karlsruher Institut für Technologie, KIT, Institut für Biologsiche Grenzflächen 1, IBG-I; Herrmann-von-Helmholtz Platz 1, Campus Nord Eggenstein-Leopoldshafen 76344 Germany
| | - Marc Skoupi
- Chair of Chemical Biology; Karlsruher Institut für Technologie, KIT, Institut für Biologsiche Grenzflächen 1, IBG-I; Herrmann-von-Helmholtz Platz 1, Campus Nord Eggenstein-Leopoldshafen 76344 Germany
| | - Christof M. Niemeyer
- Chair of Chemical Biology; Karlsruher Institut für Technologie, KIT, Institut für Biologsiche Grenzflächen 1, IBG-I; Herrmann-von-Helmholtz Platz 1, Campus Nord Eggenstein-Leopoldshafen 76344 Germany
| |
Collapse
|
33
|
Rabe KS, Müller J, Skoupi M, Niemeyer CM. Kaskaden in Kompartimenten: auf dem Weg zu maschinengestützter Biotechnologie. Angew Chem Int Ed Engl 2017. [DOI: 10.1002/ange.201703806] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Affiliation(s)
- Kersten S. Rabe
- Chair of Chemical Biology; Karlsruher Institut für Technologie, KIT, Institut für Biologische Grenzflächen 1, IBG-I; Herrmann-von-Helmholtz Platz 1, Campus Nord Eggenstein-Leopoldshafen 76344 Deutschland
| | - Joachim Müller
- Chair of Chemical Biology; Karlsruher Institut für Technologie, KIT, Institut für Biologische Grenzflächen 1, IBG-I; Herrmann-von-Helmholtz Platz 1, Campus Nord Eggenstein-Leopoldshafen 76344 Deutschland
| | - Marc Skoupi
- Chair of Chemical Biology; Karlsruher Institut für Technologie, KIT, Institut für Biologische Grenzflächen 1, IBG-I; Herrmann-von-Helmholtz Platz 1, Campus Nord Eggenstein-Leopoldshafen 76344 Deutschland
| | - Christof M. Niemeyer
- Chair of Chemical Biology; Karlsruher Institut für Technologie, KIT, Institut für Biologische Grenzflächen 1, IBG-I; Herrmann-von-Helmholtz Platz 1, Campus Nord Eggenstein-Leopoldshafen 76344 Deutschland
| |
Collapse
|
34
|
Segler MHS, Waller MP. Neural-Symbolic Machine Learning for Retrosynthesis and Reaction Prediction. Chemistry 2017; 23:5966-5971. [PMID: 28134452 DOI: 10.1002/chem.201605499] [Citation(s) in RCA: 245] [Impact Index Per Article: 30.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2016] [Indexed: 12/24/2022]
Abstract
Reaction prediction and retrosynthesis are the cornerstones of organic chemistry. Rule-based expert systems have been the most widespread approach to computationally solve these two related challenges to date. However, reaction rules often fail because they ignore the molecular context, which leads to reactivity conflicts. Herein, we report that deep neural networks can learn to resolve reactivity conflicts and to prioritize the most suitable transformation rules. We show that by training our model on 3.5 million reactions taken from the collective published knowledge of the entire discipline of chemistry, our model exhibits a top10-accuracy of 95 % in retrosynthesis and 97 % for reaction prediction on a validation set of almost 1 million reactions.
Collapse
Affiliation(s)
- Marwin H S Segler
- Organisch-Chemisches Institut and Center for Multiscale Theory and Computation, Westfälische Wilhelms-Universität Münster, Corrensstr. 40, 48149, Münster, Germany
| | - Mark P Waller
- Organisch-Chemisches Institut and Center for Multiscale Theory and Computation, Westfälische Wilhelms-Universität Münster, Corrensstr. 40, 48149, Münster, Germany
- Department of Physics and International Center for Quantum and Molecular Structures, Shanghai University, Shangda Road 99, 200444, Shanghai, China
| |
Collapse
|