1
|
Sheridan RJ, Zauscher S, Brinson LC. BOTTS: broadband optimized time-temperature superposition for vastly accelerated viscoelastic data acquisition. SOFT MATTER 2024; 20:7811-7820. [PMID: 39258432 DOI: 10.1039/d4sm00798k] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/12/2024]
Abstract
Modern materials design strategies take advantage of the increasing amount of materials property data available and increasingly complex algorithms to take advantage of those data. However, viscoelastic materials resist this trend towards increased data rates due to their inherent time-dependent properties. Therefore, viscoelasticity measurements present a roadblock for data collection in an important aspect of material design. For thermorheologically simple (TRS) materials, time-temperature superposition (TTS) made relaxation spectrum measurements faster relative to, for example, very long creep experiments. However, TTS itself currently faces a speed limit originating in the common logarithmic discrete frequency sweep (DFS) mode of operation. In DFS, the measurement time is proportional (by a factor much greater than one) to the lowest frequency of measurement. This state of affairs has not improved for TTS for half a century or more. We utilize recent work in experimental rheometry on windowed chirps to collect three decades of complex modulus data simultaneously, resulting in a ∼500% increase in data collection. In BOTTS, we superpose several isothermal chirp responses to produce a master curve in a fraction of time required by the traditional DFS-TTS technique. The chirp responses have good, albeit nontrivial, signal-to-noise properties. We use linear error propagation and a noise-weighted least squares approach to automatically incorporate all the data into a reliable shifting method. Using model thermoset polymers, we show that DFS-TTS and BOTTS results are comparable, and therefore BOTTS data represent a first step towards a faster method for master curve generation from unmodified rheological measurement instruments.
Collapse
Affiliation(s)
- Richard J Sheridan
- Thomas Lord Department of Mechanical Engineering and Materials Science, Duke University, Durham, North Carolina, USA.
| | - Stefan Zauscher
- Thomas Lord Department of Mechanical Engineering and Materials Science, Duke University, Durham, North Carolina, USA.
| | - L Catherine Brinson
- Thomas Lord Department of Mechanical Engineering and Materials Science, Duke University, Durham, North Carolina, USA.
| |
Collapse
|
2
|
Yu M, Jia Q, Wang Q, Luo ZH, Yan F, Zhou YN. Data science-centric design, discovery, and evaluation of novel synthetically accessible polyimides with desired dielectric constants. Chem Sci 2024:d4sc05000b. [PMID: 39416299 PMCID: PMC11474456 DOI: 10.1039/d4sc05000b] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2024] [Accepted: 10/01/2024] [Indexed: 10/19/2024] Open
Abstract
Rapidly advancing computer technology has demonstrated great potential in recent years to assist in the generation and discovery of promising molecular structures. Herein, we present a data science-centric "Design-Discovery-Evaluation" scheme for exploring novel polyimides (PIs) with desired dielectric constants (ε). A virtual library of over 100 000 synthetically accessible PIs is created by extending existing PIs. Within the framework of quantitative structure-property relationship (QSPR), a model sufficient to predict ε at multiple frequencies is developed with an R 2 of 0.9768, allowing further high-throughput screening of the prior structures with desired ε. Furthermore, the structural feature representation method of atomic adjacent group (AAG) is introduced, using which the reliability of high-throughput screening results is evaluated. This workflow identifies 9 novel PIs (ε >5 at 103 Hz and glass transition temperatures between 250 °C and 350 °C) with potential applications in high-temperature capacitive energy storage, and confirms these promising findings by high-fidelity molecular dynamics (MD) simulations.
Collapse
Affiliation(s)
- Mengxian Yu
- School of Chemical Engineering and Material Science, Tianjin University of Science and Technology Tianjin 300457 P. R. China
| | - Qingzhu Jia
- School of Chemical Engineering and Material Science, Tianjin University of Science and Technology Tianjin 300457 P. R. China
| | - Qiang Wang
- School of Chemical Engineering and Material Science, Tianjin University of Science and Technology Tianjin 300457 P. R. China
| | - Zheng-Hong Luo
- Department of Chemical Engineering, School of Chemistry and Chemical Engineering, State Key Laboratory of Metal Matrix Composites, Shanghai Jiao Tong University Shanghai 200240 P. R. China
| | - Fangyou Yan
- School of Chemical Engineering and Material Science, Tianjin University of Science and Technology Tianjin 300457 P. R. China
| | - Yin-Ning Zhou
- Department of Chemical Engineering, School of Chemistry and Chemical Engineering, State Key Laboratory of Metal Matrix Composites, Shanghai Jiao Tong University Shanghai 200240 P. R. China
| |
Collapse
|
3
|
Gormley AJ. Machine learning in drug delivery. J Control Release 2024; 373:23-30. [PMID: 38909704 PMCID: PMC11384327 DOI: 10.1016/j.jconrel.2024.06.045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2024] [Revised: 06/17/2024] [Accepted: 06/19/2024] [Indexed: 06/25/2024]
Abstract
For decades, drug delivery scientists have been performing trial-and-error experimentation to manually sample parameter spaces and optimize release profiles through rational design. To enable this approach, scientists spend much of their career learning nuanced drug-material interactions that drive system behavior. In relatively simple systems, rational design criteria allow us to fine tune release profiles and enable efficacious therapies. However, as materials and drugs become increasingly sophisticated and their interactions have non-linear and compounding effects, the field is suffering the Curse of Dimensionality which prevents us from comprehending complex structure-function relationships. In the past, we have embraced this complexity by implementing high-throughput screens to increase the probability of finding ideal compositions. However, this brute force method was inefficient and led many to abandon these fishing expeditions. Fortunately, methods in data science including artificial intelligence / machine learning (AI/ML) are providing ideal analytical tools to model this complex data and ascertain quantitative structure-function relationships. In this Oration, I speak to the potential value of data science in drug delivery with particular focus on polymeric delivery systems. Here, I do not suggest that AI/ML will simply replace mechanistic understanding of complex systems. Rather, I propose that AI/ML should be yet another useful tool in the lab to navigate complex parameter spaces. The recent hype around AI/ML is breathtaking and potentially over inflated, but the value of these methods is poised to revolutionize how we perform science. Therefore, I encourage readers to consider adopting these skills and applying data science methods to their own problems. If done successfully, I believe we will all realize a paradigm shift in our approach to drug delivery.
Collapse
Affiliation(s)
- Adam J Gormley
- Associate Professor, Department of Biomedical Engineering, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, United States.
| |
Collapse
|
4
|
Rebello NJ, Arora A, Mochigase H, Lin TS, Shi J, Audus DJ, Muckley ES, Osmani A, Olsen BD. The Block Copolymer Phase Behavior Database. J Chem Inf Model 2024; 64:6464-6476. [PMID: 39126359 DOI: 10.1021/acs.jcim.4c00242] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/12/2024]
Abstract
The Block Copolymer Database (BCDB) is a platform that allows users to search, submit, visualize, benchmark, and download experimental phase measurements and their associated characterization information for di- and multiblock copolymers. To the best of our knowledge, there is no widely accepted data model for publishing experimental and simulation data on block copolymer self-assembly. This proposed data schema with traceable information can accommodate any number of blocks and at the time of publication contains over 5400 block copolymer total melt phase measurements mined from the literature and manually curated and simulation data points of the phase diagram generated from self-consistent field theory that can rapidly be augmented. This database can be accessed via the Community Resource for Innovation in Polymer Technology (CRIPT) web application and the Materials Data Facility. The chemical structure of the polymer is encoded in BigSMILES, an extension of the Simplified Molecular-Input Line-Entry System (SMILES) into the macromolecular domain, and the user can search repeat units and functional groups using the SMARTS search syntax (SMILES Arbitrary Target Specification). The user can also query characterization and phase information using Structured Query Language (SQL) and download custom sets of block copolymer data to train machine learning models. Finally, a protocol is presented in which GPT-4, an AI-powered large language model, can be used to rapidly screen and identify block copolymer papers from the literature using only the abstract text and determine whether they have BCDB data, allowing the database to grow as the number of published papers on the World Wide Web increases. The F1 score for this model is 0.74. This platform is an important step in making polymer data more accessible to the broader community.
Collapse
Affiliation(s)
- Nathan J Rebello
- Department of Chemical Engineering, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
| | - Akash Arora
- Department of Chemical Engineering, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
| | - Hidenobu Mochigase
- Department of Chemical Engineering, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
| | - Tzyy-Shyang Lin
- Department of Chemical Engineering, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
| | - Jiale Shi
- Department of Chemical Engineering, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
| | - Debra J Audus
- Materials Science and Engineering Division, National Institute of Standards and Technology, 100 Bureau Drive, Gaithersburg, Maryland 20899, United States
| | - Eric S Muckley
- Citrine Informatics, Redwood City, California 94063-2483, United States
| | - Ardiana Osmani
- Department of Chemical Engineering, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
| | - Bradley D Olsen
- Department of Chemical Engineering, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
| |
Collapse
|
5
|
Alessandri R, Li CH, Keating S, Mohanty KT, Peng A, Lutkenhaus JL, Rowan SJ, Tabor DP, de Pablo JJ. Structural, Ionic, and Electronic Properties of Solid-State Phthalimide-Containing Polymers for All-Organic Batteries. JACS AU 2024; 4:2300-2311. [PMID: 38938799 PMCID: PMC11200234 DOI: 10.1021/jacsau.4c00276] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/27/2024] [Revised: 05/13/2024] [Accepted: 05/17/2024] [Indexed: 06/29/2024]
Abstract
Redox-active polymers serving as the active materials in solid-state electrodes offer a promising path toward realizing all-organic batteries. While both cathodic and anodic redox-active polymers are needed, the diversity of the available anodic materials is limited. Here, we predict solid-state structural, ionic, and electronic properties of anodic, phthalimide-containing polymers using a multiscale approach that combines atomistic molecular dynamics, electronic structure calculations, and machine learning surrogate models. Importantly, by combining information from each of these scales, we are able to bridge the gap between bottom-up molecular characteristics and macroscopic properties such as apparent diffusion coefficients of electron transport (D app). We investigate the impact of different polymer backbones and of two critical factors during battery operation: state of charge and polymer swelling. Our findings reveal that the state of charge significantly influences solid-state packing and the thermophysical properties of the polymers, which, in turn, affect ionic and electronic transport. A combination of molecular-level properties (such as the reorganization energy) and condensed-phase properties (such as effective electron hopping distances) determine the predicted ranking of electron transport capabilities of the polymers. We predict D app for the phthalimide-based polymers and for a reference nitroxide radical-based polymer, finding a 3 orders of magnitude increase in D app (≈10-6 cm2 s-1) with respect to the reference. This study underscores the promise of phthalimide-containing polymers as highly capable redox-active polymers for anodic materials in all-organic batteries, due to their exceptional predicted electron transport capabilities.
Collapse
Affiliation(s)
- Riccardo Alessandri
- Pritzker
School of Molecular Engineering, University
of Chicago, Chicago, Illinois 60637, United States
| | - Cheng-Han Li
- Department
of Chemistry, Texas A&M University, College Station, Texas 77842, United States
| | - Sheila Keating
- Department
of Chemistry, University of Chicago, Chicago, Illinois 60637, United States
| | - Khirabdhi T. Mohanty
- Artie
McFerrin Department of Chemical Engineering, Texas A&M University, College
Station, Texas 77843, United States
| | - Aaron Peng
- Pritzker
School of Molecular Engineering, University
of Chicago, Chicago, Illinois 60637, United States
| | - Jodie L. Lutkenhaus
- Artie
McFerrin Department of Chemical Engineering and Department of Materials
Science & Engineering, Texas A&M
University, College Station, Texas 77843, United States
| | - Stuart J. Rowan
- Pritzker
School of Molecular Engineering, University
of Chicago, Chicago, Illinois 60637, United States
- Department
of Chemistry, University of Chicago, Chicago, Illinois 60637, United States
| | - Daniel P. Tabor
- Department
of Chemistry, Texas A&M University, College Station, Texas 77842, United States
| | - Juan J. de Pablo
- Pritzker
School of Molecular Engineering, University
of Chicago, Chicago, Illinois 60637, United States
| |
Collapse
|
6
|
Shi J, Walsh D, Zou W, Rebello NJ, Deagen ME, Fransen KA, Gao X, Olsen BD, Audus DJ. Calculating Pairwise Similarity of Polymer Ensembles via Earth Mover's Distance. ACS POLYMERS AU 2024; 4:66-76. [PMID: 38371731 PMCID: PMC10870752 DOI: 10.1021/acspolymersau.3c00029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Revised: 11/28/2023] [Accepted: 11/29/2023] [Indexed: 02/20/2024]
Abstract
Synthetic polymers, in contrast to small molecules and deterministic biomacromolecules, are typically ensembles composed of polymer chains with varying numbers, lengths, sequences, chemistry, and topologies. While numerous approaches exist for measuring pairwise similarity among small molecules and sequence-defined biomacromolecules, accurately determining the pairwise similarity between two polymer ensembles remains challenging. This work proposes the earth mover's distance (EMD) metric to calculate the pairwise similarity score between two polymer ensembles. EMD offers a greater resolution of chemical differences between polymer ensembles than the averaging method and provides a quantitative numeric value representing the pairwise similarity between polymer ensembles in alignment with chemical intuition. The EMD approach for assessing polymer similarity enhances the development of accurate chemical search algorithms within polymer databases and can improve machine learning techniques for polymer design, optimization, and property prediction.
Collapse
Affiliation(s)
- Jiale Shi
- Department
of Chemical Engineering, Massachusetts Institute
of Technology, Cambridge, Massachusetts 02139, United States
| | - Dylan Walsh
- Department
of Chemical Engineering, Massachusetts Institute
of Technology, Cambridge, Massachusetts 02139, United States
| | - Weizhong Zou
- Department
of Chemical Engineering, Massachusetts Institute
of Technology, Cambridge, Massachusetts 02139, United States
| | - Nathan J. Rebello
- Department
of Chemical Engineering, Massachusetts Institute
of Technology, Cambridge, Massachusetts 02139, United States
| | - Michael E. Deagen
- Department
of Chemical Engineering, Massachusetts Institute
of Technology, Cambridge, Massachusetts 02139, United States
| | - Katharina A. Fransen
- Department
of Chemical Engineering, Massachusetts Institute
of Technology, Cambridge, Massachusetts 02139, United States
| | - Xian Gao
- Department
of Chemical and Biomolecular Engineering, University of Notre Dame, Notre
Dame, Indiana 46556, United States
| | - Bradley D. Olsen
- Department
of Chemical Engineering, Massachusetts Institute
of Technology, Cambridge, Massachusetts 02139, United States
| | - Debra J. Audus
- Materials
Science and Engineering Division, National
Institute of Standards and Technology, Gaithersburg, Maryland 20899, United States
| |
Collapse
|
7
|
Day EC, Chittari SS, Bogen MP, Knight AS. Navigating the Expansive Landscapes of Soft Materials: A User Guide for High-Throughput Workflows. ACS POLYMERS AU 2023; 3:406-427. [PMID: 38107416 PMCID: PMC10722570 DOI: 10.1021/acspolymersau.3c00025] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 11/02/2023] [Accepted: 11/07/2023] [Indexed: 12/19/2023]
Abstract
Synthetic polymers are highly customizable with tailored structures and functionality, yet this versatility generates challenges in the design of advanced materials due to the size and complexity of the design space. Thus, exploration and optimization of polymer properties using combinatorial libraries has become increasingly common, which requires careful selection of synthetic strategies, characterization techniques, and rapid processing workflows to obtain fundamental principles from these large data sets. Herein, we provide guidelines for strategic design of macromolecule libraries and workflows to efficiently navigate these high-dimensional design spaces. We describe synthetic methods for multiple library sizes and structures as well as characterization methods to rapidly generate data sets, including tools that can be adapted from biological workflows. We further highlight relevant insights from statistics and machine learning to aid in data featurization, representation, and analysis. This Perspective acts as a "user guide" for researchers interested in leveraging high-throughput screening toward the design of multifunctional polymers and predictive modeling of structure-property relationships in soft materials.
Collapse
Affiliation(s)
| | | | - Matthew P. Bogen
- Department of Chemistry, The University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| | - Abigail S. Knight
- Department of Chemistry, The University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| |
Collapse
|
8
|
Ting JM, Tamayo-Mendoza T, Petersen SR, Van Reet J, Ahmed UA, Snell NJ, Fisher JD, Stern M, Oviedo F. Frontiers in nonviral delivery of small molecule and genetic drugs, driven by polymer chemistry and machine learning for materials informatics. Chem Commun (Camb) 2023; 59:14197-14209. [PMID: 37955165 DOI: 10.1039/d3cc04705a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2023]
Abstract
Materials informatics (MI) has immense potential to accelerate the pace of innovation and new product development in biotechnology. Close collaborations between skilled physical and life scientists with data scientists are being established in pursuit of leveraging MI tools in automation and artificial intelligence (AI) to predict material properties in vitro and in vivo. However, the scarcity of large, standardized, and labeled materials data for connecting structure-function relationships represents one of the largest hurdles to overcome. In this Highlight, focus is brought to emerging developments in polymer-based therapeutic delivery platforms, where teams generate large experimental datasets around specific therapeutics and successfully establish a design-to-deployment cycle of specialized nanocarriers. Three select collaborations demonstrate how custom-built polymers protect and deliver small molecules, nucleic acids, and proteins, representing ideal use-cases for machine learning to understand how molecular-level interactions impact drug stabilization and release. We conclude with our perspectives on how MI innovations in automation efficiencies and digitalization of data-coupled with fundamental insight and creativity from the polymer science community-can accelerate translation of more gene therapies into lifesaving medicines.
Collapse
|
9
|
Liu A, Lee M, Venkatesh R, Bonsu JA, Volkovinsky R, Meredith JC, Reichmanis E, Grover MA. Conjugated Polymer Process Ontology and Experimental Data Repository for Organic Field-Effect Transistors. CHEMISTRY OF MATERIALS : A PUBLICATION OF THE AMERICAN CHEMICAL SOCIETY 2023; 35:8816-8826. [PMID: 38027538 PMCID: PMC10653076 DOI: 10.1021/acs.chemmater.3c01842] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/25/2023] [Revised: 09/30/2023] [Accepted: 10/02/2023] [Indexed: 12/01/2023]
Abstract
Polymer-based semiconductors and organic electronics encapsulate a significant research thrust for informatics-driven materials development. However, device measurements are described by a complex array of design and parameter choices, many of which are sparsely reported. For example, the mobility of a polymer-based organic field-effect transistor (OFET) may vary by several orders of magnitude for a given polymer as a plethora of parameters related to solution processing, interface design/surface treatment, thin-film deposition, postprocessing, and measurement settings have a profound effect on the value of the final measurement. Incomplete contextual, experimental details hamper the availability of reusable data applicable for data-driven optimization, modeling (e.g., machine learning), and analysis of new organic devices. To curate organic device databases that contain reproducible and findable, accessible, interoperable, and reusable (FAIR) experimental data records, data ontologies that fully describe sample provenance and process history are required. However, standards for generating such process ontologies are not widely adopted for experimental materials domains. In this work, we design and implement an object-relational database for storing experimental records of OFETs. A data structure is generated by drawing on an international standard for batch process control (ISA-88) to facilitate the design. We then mobilize these representative data records, curated from the literature and laboratory experiments, to enable data-driven learning of process-structure-property relationships. The work presented herein opens the door for the broader adoption of data management practices and design standards for both the organic electronics and the wider materials community.
Collapse
Affiliation(s)
- Aaron
L. Liu
- School
of Chemical & Biomolecular Engineering, Georgia Institute of Technology, 311 Ferst Drive, Atlanta, Georgia 30332, United States
| | - Myeongyeon Lee
- Department
of Chemical & Biomolecular Engineering, Lehigh University, 124 East Morton Street, Bethlehem, Pennsylvania 18015, United States
| | - Rahul Venkatesh
- School
of Chemical & Biomolecular Engineering, Georgia Institute of Technology, 311 Ferst Drive, Atlanta, Georgia 30332, United States
| | - Jessica A. Bonsu
- School
of Chemical & Biomolecular Engineering, Georgia Institute of Technology, 311 Ferst Drive, Atlanta, Georgia 30332, United States
| | - Ron Volkovinsky
- School
of Chemical & Biomolecular Engineering, Georgia Institute of Technology, 311 Ferst Drive, Atlanta, Georgia 30332, United States
| | - J. Carson Meredith
- School
of Chemical & Biomolecular Engineering, Georgia Institute of Technology, 311 Ferst Drive, Atlanta, Georgia 30332, United States
| | - Elsa Reichmanis
- Department
of Chemical & Biomolecular Engineering, Lehigh University, 124 East Morton Street, Bethlehem, Pennsylvania 18015, United States
| | - Martha A. Grover
- School
of Chemical & Biomolecular Engineering, Georgia Institute of Technology, 311 Ferst Drive, Atlanta, Georgia 30332, United States
| |
Collapse
|
10
|
Rebello NJ, Lin TS, Nazeer H, Olsen BD. BigSMARTS: A Topologically Aware Query Language and Substructure Search Algorithm for Polymer Chemical Structures. J Chem Inf Model 2023; 63:6555-6568. [PMID: 37874026 DOI: 10.1021/acs.jcim.3c00978] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]
Abstract
Molecular search is important in chemistry, biology, and informatics for identifying molecular structures within large data sets, improving knowledge discovery and innovation, and making chemical data FAIR (findable, accessible, interoperable, reusable). Search algorithms for polymers are significantly less developed than those for small molecules because polymer search relies on searching by polymer name, which can be challenging because polymer naming is overly broad (i.e., polyethylene), complicated for complex chemical structures, and often does not correspond to official IUPAC conventions. Chemical structure search in polymers is limited to substructures, such as monomers, without awareness of connectivity or topology. This work introduces a novel query language and graph traversal search algorithm for polymers that provides the first search method able to fully capture all of the chemical structures present in polymers. The BigSMARTS query language, an extension of the small-molecule SMARTS language, allows users to write queries that localize monomer and functional group searches to different parts of the polymer, like the middle block of a triblock, the side chain of a graft, and the backbone of a repeat unit. The substructure search algorithm is based on the traversal of graph representations of the generating functions for the stochastic graphs of polymers. Operationally, the algorithm first identifies cycles representing the monomers and then the end groups and finally performs a depth-first search to match entire subgraphs. To validate the algorithm, hundreds of queries were searched against hundreds of target chemistries and topologies from the literature, with approximately 440,000 query-target pairs. This tool provides a detailed algorithm that can be implemented in search engines to provide search results with full matching of the monomer connectivity and polymer topology.
Collapse
Affiliation(s)
- Nathan J Rebello
- Department of Chemical Engineering, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
| | - Tzyy-Shyang Lin
- Department of Chemical Engineering, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
| | - Heeba Nazeer
- Department of Computer Science, Wellesley College, 106 Central Street, Wellesley, Massachusetts 02481, United States
| | - Bradley D Olsen
- Department of Chemical Engineering, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
| |
Collapse
|
11
|
Umar AK, Limpikirati PK, Luckanagul JA. From Linear to Nets: Multiconfiguration Polymer Structure Generation with PolyFlin. J Chem Inf Model 2023; 63:6717-6726. [PMID: 37851376 DOI: 10.1021/acs.jcim.3c01221] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2023]
Abstract
Molecular modeling and simulations are essential tools in polymer science and engineering, enabling researchers to predict and understand the properties of macromolecules, including their structure, dynamics, thermodynamics, and overall material characteristics. However, one of the key challenges in polymer simulation and modeling lies in the initial topology design, as existing programs often lack the capability to generate all types of polymer forms. In this study, we present PolyFlin, a powerful Python module that addresses this limitation by allowing the generation of a wide range of polymer structures, from simple homopolymers to complex copolymers, including grafts, cyclic, star, dendrimers, and nets. PolyFlin offers a versatile and efficient tool for exploring and creating diverse polymer architectures, facilitating advancements in various fields that require precise polymer modeling and simulation.
Collapse
Affiliation(s)
- Abd Kakhar Umar
- Department of Pharmaceutics and Industrial Pharmacy, Faculty of Pharmaceutical Sciences, Chulalongkorn University, Bangkok 10330, Thailand
- Department of Pharmaceutics and Pharmaceutical Technology, Faculty of Pharmacy, Universitas Padjadjaran, Sumedang 45363, Indonesia
- Medical Informatics Laboratory, ETFLIN, Palu 94225, Indonesia
| | - Patanachai K Limpikirati
- Department of Food and Pharmaceutical Chemistry, Faculty of Pharmaceutical Sciences, Chulalongkorn University, Bangkok 10330, Thailand
- Metabolomics for Life Sciences Research Unit, Chulalongkorn University, Bangkok 10330, Thailand
| | - Jittima Amie Luckanagul
- Department of Pharmaceutics and Industrial Pharmacy, Faculty of Pharmaceutical Sciences, Chulalongkorn University, Bangkok 10330, Thailand
| |
Collapse
|
12
|
Kim S, Schroeder CM, Jackson NE. Open Macromolecular Genome: Generative Design of Synthetically Accessible Polymers. ACS POLYMERS AU 2023; 3:318-330. [PMID: 37576712 PMCID: PMC10416319 DOI: 10.1021/acspolymersau.3c00003] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/25/2023] [Revised: 03/13/2023] [Accepted: 03/14/2023] [Indexed: 03/31/2023]
Abstract
A grand challenge in polymer science lies in the predictive design of new polymeric materials with targeted functionality. However, de novo design of functional polymers is challenging due to the vast chemical space and an incomplete understanding of structure-property relations. Recent advances in deep generative modeling have facilitated the efficient exploration of molecular design space, but data sparsity in polymer science is a major obstacle hindering progress. In this work, we introduce a vast polymer database known as the Open Macromolecular Genome (OMG), which contains synthesizable polymer chemistries compatible with known polymerization reactions and commercially available reactants selected for synthetic feasibility. The OMG is used in concert with a synthetically aware generative model known as Molecule Chef to identify property-optimized constitutional repeating units, constituent reactants, and reaction pathways of polymers, thereby advancing polymer design into the realm of synthetic relevance. As a proof-of-principle demonstration, we show that polymers with targeted octanol-water solubilities are readily generated together with monomer reactant building blocks and associated polymerization reactions. Suggested reactants are further integrated with Reaxys polymerization data to provide hypothetical reaction conditions (e.g., temperature, catalysts, and solvents). Broadly, the OMG is a polymer design approach capable of enabling data-intensive generative models for synthetic polymer design. Overall, this work represents a significant advance, enabling the property targeted design of synthetic polymers subject to practical synthetic constraints.
Collapse
Affiliation(s)
- Seonghwan Kim
- Department
of Materials Science and Engineering, University
of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Charles M. Schroeder
- Department
of Chemistry, University of Illinois at
Urbana-Champaign, Urbana, Illinois 61801, United States
- Department
of Materials Science and Engineering, University
of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
- Beckman
Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
- Department
of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Nicholas E. Jackson
- Department
of Chemistry, University of Illinois at
Urbana-Champaign, Urbana, Illinois 61801, United States
- Beckman
Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| |
Collapse
|
13
|
Park NH, Manica M, Born J, Hedrick JL, Erdmann T, Zubarev DY, Adell-Mill N, Arrechea PL. Artificial intelligence driven design of catalysts and materials for ring opening polymerization using a domain-specific language. Nat Commun 2023; 14:3686. [PMID: 37344485 PMCID: PMC10284867 DOI: 10.1038/s41467-023-39396-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2022] [Accepted: 06/12/2023] [Indexed: 06/23/2023] Open
Abstract
Advances in machine learning (ML) and automated experimentation are poised to vastly accelerate research in polymer science. Data representation is a critical aspect for enabling ML integration in research workflows, yet many data models impose significant rigidity making it difficult to accommodate a broad array of experiment and data types found in polymer science. This inflexibility presents a significant barrier for researchers to leverage their historical data in ML development. Here we show that a domain specific language, termed Chemical Markdown Language (CMDL), provides flexible, extensible, and consistent representation of disparate experiment types and polymer structures. CMDL enables seamless use of historical experimental data to fine-tune regression transformer (RT) models for generative molecular design tasks. We demonstrate the utility of this approach through the generation and the experimental validation of catalysts and polymers in the context of ring-opening polymerization-although we provide examples of how CMDL can be more broadly applied to other polymer classes. Critically, we show how the CMDL tuned model preserves key functional groups within the polymer structure, allowing for experimental validation. These results reveal the versatility of CMDL and how it facilitates translation of historical data into meaningful predictive and generative models to produce experimentally actionable output.
Collapse
Affiliation(s)
| | - Matteo Manica
- IBM Research-Zurich, Säumerstrasse 4, Rüschlikon, 8803, Switzerland
| | - Jannis Born
- IBM Research-Zurich, Säumerstrasse 4, Rüschlikon, 8803, Switzerland
- Department of Biosystems Science and Engineering, ETH Zurich, Mattenstrasse 26, 4058, Basel, Switzerland
| | - James L Hedrick
- IBM Research-Almaden, 650 Harry Rd., San Jose, CA, 95120, USA
| | - Tim Erdmann
- IBM Research-Almaden, 650 Harry Rd., San Jose, CA, 95120, USA
| | | | - Nil Adell-Mill
- IBM Research-Zurich, Säumerstrasse 4, Rüschlikon, 8803, Switzerland
- Arctoris, 120E Olympic Avenue, Abingdon, OX14 4SA, Oxfordshire, UK
| | | |
Collapse
|
14
|
Ma B, Finan NJ, Jany D, Deagen ME, Schadler LS, Brinson LC. Machine-Learning-Assisted Understanding of Polymer Nanocomposites Composition-Property Relationship: A Case Study of NanoMine Database. Macromolecules 2023; 56:3945-3953. [PMID: 37333841 PMCID: PMC10275499 DOI: 10.1021/acs.macromol.2c02249] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2022] [Revised: 01/27/2023] [Indexed: 06/20/2023]
Abstract
The NanoMine database, one of two nodes in the MaterialsMine database, is a new materials data resource that collects annotated data on polymer nanocomposites (PNCs). This work showcases the potential of NanoMine and other materials data resources to assist fundamental materials understanding and therefore rational materials design. This specific case study is built around studying the relationship between the change in the glass transition temperature Tg (ΔTg) and key descriptors of the nanofillers and the polymer matrix in PNCs. We sifted through data from over 2000 experimental samples curated into NanoMine, trained a decision tree classifier to predict the sign of PNC ΔTg, and built a multiple power regression metamodel to predict ΔTg. The successful model used key descriptors including composition, nanoparticle volume fraction, and interfacial surface energy. The results demonstrate the power of using aggregated materials data to gain insight and predictive capability. Further analysis points to the importance of additional analysis of parameters from processing methodologies and continuously adding curated data sets to increase the sample pool size.
Collapse
Affiliation(s)
- Boran Ma
- Department
of Mechanical Engineering and Materials Science, Duke University, Durham, North Carolina 27708, United States
| | - Nicholas J. Finan
- Department
of Mechanical Engineering and Materials Science, Duke University, Durham, North Carolina 27708, United States
| | - David Jany
- Department
of Mechanical Engineering and Materials Science, Duke University, Durham, North Carolina 27708, United States
| | - Michael E. Deagen
- Department
of Mechanical Engineering and Materials Science, Duke University, Durham, North Carolina 27708, United States
| | - Linda S. Schadler
- Department
of Department of Mechanical Engineering, College of Engineering and
Mathematical Sciences, University of Vermont, Burlington, Vermont 05405, United States
| | - L. Catherine Brinson
- Department
of Mechanical Engineering and Materials Science, Duke University, Durham, North Carolina 27708, United States
| |
Collapse
|
15
|
Yan T, Balzer AH, Herbert KM, Epps TH, Korley LTJ. Circularity in polymers: addressing performance and sustainability challenges using dynamic covalent chemistries. Chem Sci 2023; 14:5243-5265. [PMID: 37234906 PMCID: PMC10208058 DOI: 10.1039/d3sc00551h] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2023] [Accepted: 04/14/2023] [Indexed: 05/28/2023] Open
Abstract
The circularity of current and future polymeric materials is a major focus of fundamental and applied research, as undesirable end-of-life outcomes and waste accumulation are global problems that impact our society. The recycling or repurposing of thermoplastics and thermosets is an attractive solution to these issues, yet both options are encumbered by poor property retention upon reuse, along with heterogeneities in common waste streams that limit property optimization. Dynamic covalent chemistry, when applied to polymeric materials, enables the targeted design of reversible bonds that can be tailored to specific reprocessing conditions to help address conventional recycling challenges. In this review, we highlight the key features of several dynamic covalent chemistries that can promote closed-loop recyclability and we discuss recent synthetic progress towards incorporating these chemistries into new polymers and existing commodity plastics. Next, we outline how dynamic covalent bonds and polymer network structure influence thermomechanical properties related to application and recyclability, with a focus on predictive physical models that describe network rearrangement. Finally, we examine the potential economic and environmental impacts of dynamic covalent polymeric materials in closed-loop processing using elements derived from techno-economic analysis and life-cycle assessment, including minimum selling prices and greenhouse gas emissions. Throughout each section, we discuss interdisciplinary obstacles that hinder the widespread adoption of dynamic polymers and present opportunities and new directions toward the realization of circularity in polymeric materials.
Collapse
Affiliation(s)
- Tianwei Yan
- Department of Chemical & Biomolecular Engineering, University of Delaware Newark 19716 Delaware USA
- Center for Plastics Innovation (CPI), University of Delaware Newark 19716 Delaware USA
| | - Alex H Balzer
- Department of Chemical & Biomolecular Engineering, University of Delaware Newark 19716 Delaware USA
- Center for Plastics Innovation (CPI), University of Delaware Newark 19716 Delaware USA
| | - Katie M Herbert
- Center for Plastics Innovation (CPI), University of Delaware Newark 19716 Delaware USA
| | - Thomas H Epps
- Department of Chemical & Biomolecular Engineering, University of Delaware Newark 19716 Delaware USA
- Center for Plastics Innovation (CPI), University of Delaware Newark 19716 Delaware USA
- Department of Materials Science and Engineering, University of Delaware Newark 19716 Delaware USA
- Center for Research in Soft matter and Polymers (CRiSP), University of Delaware Newark 19716 Delaware USA
| | - LaShanda T J Korley
- Department of Chemical & Biomolecular Engineering, University of Delaware Newark 19716 Delaware USA
- Center for Plastics Innovation (CPI), University of Delaware Newark 19716 Delaware USA
- Department of Materials Science and Engineering, University of Delaware Newark 19716 Delaware USA
- Center for Research in Soft matter and Polymers (CRiSP), University of Delaware Newark 19716 Delaware USA
| |
Collapse
|