1
|
Park NH, Manica M, Born J, Hedrick JL, Erdmann T, Zubarev DY, Adell-Mill N, Arrechea PL. Artificial intelligence driven design of catalysts and materials for ring opening polymerization using a domain-specific language. Nat Commun 2023; 14:3686. [PMID: 37344485 PMCID: PMC10284867 DOI: 10.1038/s41467-023-39396-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2022] [Accepted: 06/12/2023] [Indexed: 06/23/2023] Open
Abstract
Advances in machine learning (ML) and automated experimentation are poised to vastly accelerate research in polymer science. Data representation is a critical aspect for enabling ML integration in research workflows, yet many data models impose significant rigidity making it difficult to accommodate a broad array of experiment and data types found in polymer science. This inflexibility presents a significant barrier for researchers to leverage their historical data in ML development. Here we show that a domain specific language, termed Chemical Markdown Language (CMDL), provides flexible, extensible, and consistent representation of disparate experiment types and polymer structures. CMDL enables seamless use of historical experimental data to fine-tune regression transformer (RT) models for generative molecular design tasks. We demonstrate the utility of this approach through the generation and the experimental validation of catalysts and polymers in the context of ring-opening polymerization-although we provide examples of how CMDL can be more broadly applied to other polymer classes. Critically, we show how the CMDL tuned model preserves key functional groups within the polymer structure, allowing for experimental validation. These results reveal the versatility of CMDL and how it facilitates translation of historical data into meaningful predictive and generative models to produce experimentally actionable output.
Collapse
Affiliation(s)
| | - Matteo Manica
- IBM Research-Zurich, Säumerstrasse 4, Rüschlikon, 8803, Switzerland
| | - Jannis Born
- IBM Research-Zurich, Säumerstrasse 4, Rüschlikon, 8803, Switzerland
- Department of Biosystems Science and Engineering, ETH Zurich, Mattenstrasse 26, 4058, Basel, Switzerland
| | - James L Hedrick
- IBM Research-Almaden, 650 Harry Rd., San Jose, CA, 95120, USA
| | - Tim Erdmann
- IBM Research-Almaden, 650 Harry Rd., San Jose, CA, 95120, USA
| | | | - Nil Adell-Mill
- IBM Research-Zurich, Säumerstrasse 4, Rüschlikon, 8803, Switzerland
- Arctoris, 120E Olympic Avenue, Abingdon, OX14 4SA, Oxfordshire, UK
| | | |
Collapse
|
2
|
Hedrick JL, Piunova V, Park NH, Erdmann T, Arrechea PL. Simple and Efficient Synthesis of Functionalized Cyclic Carbonate Monomers Using Carbon Dioxide. ACS Macro Lett 2022; 11:368-375. [PMID: 35575375 DOI: 10.1021/acsmacrolett.2c00060] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Aliphatic polycarbonates represent an important class of materials with diverse applications ranging from battery electrolytes, polyurethane intermediates, and materials for biomedical applications. These materials can be produced via the ring-opening polymerization (ROP) of six- to eight-membered cyclic carbonates derived from precursor 1,3- and 1,5-diols. These diols can contain a range of functional groups depending on the desired thermal, mechanical, and solution properties. Generally, the ring closure to form the cyclic carbonate requires the use of undesirable and hazardous reagents. Advances in synthetic methodologies and catalysis have enabled the use of carbon dioxide (CO2) to perform these transformations with a high conversion of diol to cyclic carbonate, yet modest isolated yields due to oligomerization side reactions. In this Letter, we evaluate a series of bases in the presence of p-toluenesulfonyl chloride and the appropriate diol to better understand their effect on the yield and presence of oligomer byproducts during cyclic carbonate formation from CO2. From this study, N,N-tetramethylethylenediamine (TMEDA) was identified as an optimal base, facilitating the preparation of a diverse array of both six- and eight-membered carbonates from CO2 within 10 to 15 min. The robust conditions for both, the preparation of the diol precursor, and the TMEDA-mediated carbonate synthesis enabled readily telescoping the two-step reaction sequence, greatly simplifying the process of monomer preparation.
Collapse
Affiliation(s)
- James L. Hedrick
- IBM Almaden Research Center, 650 Harry Road, San Jose, California 95120, United States
| | - Victoria Piunova
- IBM Almaden Research Center, 650 Harry Road, San Jose, California 95120, United States
| | - Nathaniel H. Park
- IBM Almaden Research Center, 650 Harry Road, San Jose, California 95120, United States
| | - Tim Erdmann
- IBM Almaden Research Center, 650 Harry Road, San Jose, California 95120, United States
| | - Pedro L. Arrechea
- IBM Almaden Research Center, 650 Harry Road, San Jose, California 95120, United States
| |
Collapse
|
3
|
Wang C, Zhang X, Zhao W, Liu X, Wang Q, Sun J. Synthesis of Aliphatic Hyperbranched Polycarbonates via Organo-Catalyzed “A1+B2”-Ring-Opening Polymerization. Macromolecules 2022. [DOI: 10.1021/acs.macromol.1c02174] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Affiliation(s)
- Chengliang Wang
- Key Laboratory of Rubber-Plastics, Ministry of Education, School of Polymer Science and Engineering, Qingdao University of Science and Technology, Zhengzhou Rd. 53, Qingdao CN-266042, China
| | - Xu Zhang
- Key Laboratory of Rubber-Plastics, Ministry of Education, School of Polymer Science and Engineering, Qingdao University of Science and Technology, Zhengzhou Rd. 53, Qingdao CN-266042, China
| | - Wei Zhao
- Key Laboratory of Rubber-Plastics, Ministry of Education, School of Polymer Science and Engineering, Qingdao University of Science and Technology, Zhengzhou Rd. 53, Qingdao CN-266042, China
| | - Xin Liu
- Key Laboratory of Rubber-Plastics, Ministry of Education, School of Polymer Science and Engineering, Qingdao University of Science and Technology, Zhengzhou Rd. 53, Qingdao CN-266042, China
| | - Qingfu Wang
- Key Laboratory of Rubber-Plastics, Ministry of Education, School of Polymer Science and Engineering, Qingdao University of Science and Technology, Zhengzhou Rd. 53, Qingdao CN-266042, China
| | - Jingjiang Sun
- Key Laboratory of Rubber-Plastics, Ministry of Education, School of Polymer Science and Engineering, Qingdao University of Science and Technology, Zhengzhou Rd. 53, Qingdao CN-266042, China
| |
Collapse
|
4
|
Cencer MM, Moore JS, Assary RS. Machine learning for polymeric materials: an introduction. POLYM INT 2021. [DOI: 10.1002/pi.6345] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Affiliation(s)
- Morgan M Cencer
- Department of Chemistry University of Illinois at Urbana‐Champaign Urbana IL USA
- Materials Science Division Argonne National Laboratory Lemont IL USA
- Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana‐Champaign Urbana IL USA
| | - Jeffrey S Moore
- Department of Chemistry University of Illinois at Urbana‐Champaign Urbana IL USA
- Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana‐Champaign Urbana IL USA
| | - Rajeev S Assary
- Materials Science Division Argonne National Laboratory Lemont IL USA
| |
Collapse
|
5
|
Weber JM, Guo Z, Zhang C, Schweidtmann AM, Lapkin AA. Chemical data intelligence for sustainable chemistry. Chem Soc Rev 2021; 50:12013-12036. [PMID: 34520507 DOI: 10.1039/d1cs00477h] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
This study highlights new opportunities for optimal reaction route selection from large chemical databases brought about by the rapid digitalisation of chemical data. The chemical industry requires a transformation towards more sustainable practices, eliminating its dependencies on fossil fuels and limiting its impact on the environment. However, identifying more sustainable process alternatives is, at present, a cumbersome, manual, iterative process, based on chemical intuition and modelling. We give a perspective on methods for automated discovery and assessment of competitive sustainable reaction routes based on renewable or waste feedstocks. Three key areas of transition are outlined and reviewed based on their state-of-the-art as well as bottlenecks: (i) data, (ii) evaluation metrics, and (iii) decision-making. We elucidate their synergies and interfaces since only together these areas can bring about the most benefit. The field of chemical data intelligence offers the opportunity to identify the inherently more sustainable reaction pathways and to identify opportunities for a circular chemical economy. Our review shows that at present the field of data brings about most bottlenecks, such as data completion and data linkage, but also offers the principal opportunity for advancement.
Collapse
Affiliation(s)
- Jana M Weber
- Department of Chemical Engineering and Biotechnology, University of Cambridge, West Cambridge Site, Philippa Fawcett Drive, Cambridge CB3 0AS, UK. .,Chemical Data Intelligence (CDI) Pte Ltd, Robinson Road, #02-00, 068898, Singapore
| | - Zhen Guo
- Chemical Data Intelligence (CDI) Pte Ltd, Robinson Road, #02-00, 068898, Singapore.,Cambridge Centre for Advanced Research and Education in Singapore, CARES Ltd. 1 CREATE Way, CREATE Tower #05-05, 138602, Singapore
| | - Chonghuan Zhang
- Department of Chemical Engineering and Biotechnology, University of Cambridge, West Cambridge Site, Philippa Fawcett Drive, Cambridge CB3 0AS, UK.
| | - Artur M Schweidtmann
- Department of Chemical Engineering, Delft University of Technology, Van der Maasweg 9, Delft 2629 HZ, The Netherlands
| | - Alexei A Lapkin
- Department of Chemical Engineering and Biotechnology, University of Cambridge, West Cambridge Site, Philippa Fawcett Drive, Cambridge CB3 0AS, UK. .,Chemical Data Intelligence (CDI) Pte Ltd, Robinson Road, #02-00, 068898, Singapore.,Cambridge Centre for Advanced Research and Education in Singapore, CARES Ltd. 1 CREATE Way, CREATE Tower #05-05, 138602, Singapore
| |
Collapse
|
6
|
Reis M, Gusev F, Taylor NG, Chung SH, Verber MD, Lee YZ, Isayev O, Leibfarth FA. Machine-Learning-Guided Discovery of 19F MRI Agents Enabled by Automated Copolymer Synthesis. J Am Chem Soc 2021; 143:17677-17689. [PMID: 34637304 PMCID: PMC10833148 DOI: 10.1021/jacs.1c08181] [Citation(s) in RCA: 49] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Modern polymer science suffers from the curse of multidimensionality. The large chemical space imposed by including combinations of monomers into a statistical copolymer overwhelms polymer synthesis and characterization technology and limits the ability to systematically study structure-property relationships. To tackle this challenge in the context of 19F magnetic resonance imaging (MRI) agents, we pursued a computer-guided materials discovery approach that combines synergistic innovations in automated flow synthesis and machine learning (ML) method development. A software-controlled, continuous polymer synthesis platform was developed to enable iterative experimental-computational cycles that resulted in the synthesis of 397 unique copolymer compositions within a six-variable compositional space. The nonintuitive design criteria identified by ML, which were accomplished by exploring <0.9% of the overall compositional space, lead to the identification of >10 copolymer compositions that outperformed state-of-the-art materials.
Collapse
Affiliation(s)
- Marcus Reis
- Department of Chemistry, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| | - Filipp Gusev
- Department of Chemistry, Mellon College of Science, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, United States
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, United States
| | - Nicholas G Taylor
- Department of Chemistry, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| | - Sang Hun Chung
- Department of Biomedical Engineering, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| | - Matthew D Verber
- Department of Chemistry, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| | - Yueh Z Lee
- Department of Radiology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| | - Olexandr Isayev
- Department of Chemistry, Mellon College of Science, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, United States
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, United States
| | - Frank A Leibfarth
- Department of Chemistry, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| |
Collapse
|
7
|
Sattari K, Xie Y, Lin J. Data-driven algorithms for inverse design of polymers. SOFT MATTER 2021; 17:7607-7622. [PMID: 34397078 DOI: 10.1039/d1sm00725d] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
The ever-increasing demand for novel polymers with superior properties requires a deeper understanding and exploration of the chemical space. Recently, data-driven approaches to explore the chemical space for polymer design have emerged. Among them, inverse design strategies for designing polymers with specific properties have evolved to be a significant materials informatics platform by learning hidden knowledge from materials data as well as smartly navigating the chemical space in an optimized way. In this review, we first summarize the progress in the representation of polymers, a prerequisite step for the inverse design of polymers. Then, we systematically introduce three data-driven strategies implemented for the inverse design of polymers, i.e., high-throughput virtual screening, global optimization, and generative models. Finally, we discuss the challenges and opportunities of the data-driven strategies as well as optimization algorithms employed in the inverse design of polymers.
Collapse
Affiliation(s)
- Kianoosh Sattari
- Department of Mechanical and Aerospace Engineering, University of Missouri, Columbia, MO 65211, USA.
| | | | | |
Collapse
|