1
|
Qiu H, Liu L, Qiu X, Dai X, Ji X, Sun ZY. PolyNC: a natural and chemical language model for the prediction of unified polymer properties. Chem Sci 2024; 15:534-544. [PMID: 38179518 PMCID: PMC10763023 DOI: 10.1039/d3sc05079c] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2023] [Accepted: 12/04/2023] [Indexed: 01/06/2024] Open
Abstract
Language models exhibit a profound aptitude for addressing multimodal and multidomain challenges, a competency that eludes the majority of off-the-shelf machine learning models. Consequently, language models hold great potential for comprehending the intricate interplay between material compositions and diverse properties, thereby accelerating material design, particularly in the realm of polymers. While past limitations in polymer data hindered the use of data-intensive language models, the growing availability of standardized polymer data and effective data augmentation techniques now opens doors to previously uncharted territories. Here, we present a revolutionary model to enable rapid and precise prediction of Polymer properties via the power of Natural language and Chemical language (PolyNC). To showcase the efficacy of PolyNC, we have meticulously curated a labeled prompt-structure-property corpus encompassing 22 970 polymer data points on a series of essential polymer properties. Through the use of natural language prompts, PolyNC gains a comprehensive understanding of polymer properties, while employing chemical language (SMILES) to describe polymer structures. In a unified text-to-text manner, PolyNC consistently demonstrates exceptional performance on both regression tasks (such as property prediction) and the classification task (polymer classification). Simultaneous and interactive multitask learning enables PolyNC to holistically grasp the structure-property relationships of polymers. Through a combination of experiments and characterizations, the generalization ability of PolyNC has been demonstrated, with attention analysis further indicating that PolyNC effectively learns structural information about polymers from multimodal inputs. This work provides compelling evidence of the potential for deploying end-to-end language models in polymer research, representing a significant advancement in the AI community's dedicated pursuit of advancing polymer science.
Collapse
Affiliation(s)
- Haoke Qiu
- State Key Laboratory of Polymer Physics and Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences Changchun 130022 China
- School of Applied Chemistry and Engineering, University of Science and Technology of China Hefei 230026 China
| | - Lunyang Liu
- State Key Laboratory of Polymer Physics and Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences Changchun 130022 China
| | - Xuepeng Qiu
- School of Applied Chemistry and Engineering, University of Science and Technology of China Hefei 230026 China
- CAS Key Laboratory of High-Performance Synthetic Rubber and its Composite Materials, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences Changchun 130022 China
| | - Xuemin Dai
- CAS Key Laboratory of High-Performance Synthetic Rubber and its Composite Materials, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences Changchun 130022 China
| | - Xiangling Ji
- State Key Laboratory of Polymer Physics and Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences Changchun 130022 China
- School of Applied Chemistry and Engineering, University of Science and Technology of China Hefei 230026 China
| | - Zhao-Yan Sun
- State Key Laboratory of Polymer Physics and Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences Changchun 130022 China
- School of Applied Chemistry and Engineering, University of Science and Technology of China Hefei 230026 China
| |
Collapse
|
2
|
Shi J, Albreiki F, Yamil J Colón, Srivastava S, Whitmer JK. Transfer Learning Facilitates the Prediction of Polymer-Surface Adhesion Strength. J Chem Theory Comput 2023; 19:4631-4640. [PMID: 37068204 DOI: 10.1021/acs.jctc.2c01314] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/19/2023]
Abstract
Machine learning (ML) accelerates the exploration of material properties and their links to the structure of the underlying molecules. In previous work [Shi et al. ACS Applied Materials & Interfaces 2022, 14, 37161-37169.], ML models were applied to predict the adhesive free energy of polymer-surface interactions with high accuracy from the knowledge of the sequence data, demonstrating successes in inverse-design of polymer sequence for known surface compositions. While the method was shown to be successful in designing polymers for a known surface, extensive data sets were needed for each specific surface in order to train the surrogate models. Ideally, one should be able to infer information about similar surfaces without having to regenerate a full complement of adhesion data for each new case. In the current work, we demonstrate a transfer learning (TL) technique using a deep neural network to improve the accuracy of ML models trained on small data sets by pretraining on a larger database from a related system and fine-tuning the weights of all layers with a small amount of additional data. The shared knowledge from the pretrained model facilitates the prediction accuracy significantly on small data sets. We also explore the limits of database size on accuracy and the optimal tuning of network architecture and parameters for our learning tasks. While applied to a relatively simple coarse-grained (CG) polymer model, the general lessons of this study apply to detailed modeling studies and the broader problems of inverse materials design.
Collapse
Affiliation(s)
- Jiale Shi
- Department of Chemical and Biomolecular Engineering, University of Notre Dame, Notre Dame, Indiana 46556, United States
| | - Fahed Albreiki
- Department of Chemical and Biomolecular Engineering, University of California, Los Angeles, Los Angeles, California 90095, United States
| | - Yamil J Colón
- Department of Chemical and Biomolecular Engineering, University of Notre Dame, Notre Dame, Indiana 46556, United States
| | - Samanvaya Srivastava
- Department of Chemical and Biomolecular Engineering, University of California, Los Angeles, Los Angeles, California 90095, United States
- California NanoSystems Institute, Center for Biological Physics, University of California, Los Angeles, Los Angeles, California 90095, United States
- Institute for Carbon Management, University of California, Los Angeles, Los Angeles, California 90095, United States
- Center for Biological Physics, University of California, Los Angeles, Los Angeles, California 90095, United States
| | - Jonathan K Whitmer
- Department of Chemical and Biomolecular Engineering, University of Notre Dame, Notre Dame, Indiana 46556, United States
- Department of Chemistry and Biochemistry, University of Notre Dame, Notre Dame, Indiana 46556, United States
| |
Collapse
|
3
|
Park NH, Manica M, Born J, Hedrick JL, Erdmann T, Zubarev DY, Adell-Mill N, Arrechea PL. Artificial intelligence driven design of catalysts and materials for ring opening polymerization using a domain-specific language. Nat Commun 2023; 14:3686. [PMID: 37344485 PMCID: PMC10284867 DOI: 10.1038/s41467-023-39396-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2022] [Accepted: 06/12/2023] [Indexed: 06/23/2023] Open
Abstract
Advances in machine learning (ML) and automated experimentation are poised to vastly accelerate research in polymer science. Data representation is a critical aspect for enabling ML integration in research workflows, yet many data models impose significant rigidity making it difficult to accommodate a broad array of experiment and data types found in polymer science. This inflexibility presents a significant barrier for researchers to leverage their historical data in ML development. Here we show that a domain specific language, termed Chemical Markdown Language (CMDL), provides flexible, extensible, and consistent representation of disparate experiment types and polymer structures. CMDL enables seamless use of historical experimental data to fine-tune regression transformer (RT) models for generative molecular design tasks. We demonstrate the utility of this approach through the generation and the experimental validation of catalysts and polymers in the context of ring-opening polymerization-although we provide examples of how CMDL can be more broadly applied to other polymer classes. Critically, we show how the CMDL tuned model preserves key functional groups within the polymer structure, allowing for experimental validation. These results reveal the versatility of CMDL and how it facilitates translation of historical data into meaningful predictive and generative models to produce experimentally actionable output.
Collapse
Affiliation(s)
| | - Matteo Manica
- IBM Research-Zurich, Säumerstrasse 4, Rüschlikon, 8803, Switzerland
| | - Jannis Born
- IBM Research-Zurich, Säumerstrasse 4, Rüschlikon, 8803, Switzerland
- Department of Biosystems Science and Engineering, ETH Zurich, Mattenstrasse 26, 4058, Basel, Switzerland
| | - James L Hedrick
- IBM Research-Almaden, 650 Harry Rd., San Jose, CA, 95120, USA
| | - Tim Erdmann
- IBM Research-Almaden, 650 Harry Rd., San Jose, CA, 95120, USA
| | | | - Nil Adell-Mill
- IBM Research-Zurich, Säumerstrasse 4, Rüschlikon, 8803, Switzerland
- Arctoris, 120E Olympic Avenue, Abingdon, OX14 4SA, Oxfordshire, UK
| | | |
Collapse
|
4
|
Li G, Zhu Y, Guo Y, Mabuchi T, Li D, Huang S, Wang S, Sun H, Tokumasu T. Deep Learning to Reveal the Distribution and Diffusion of Water Molecules in Fuel Cell Catalyst Layers. ACS APPLIED MATERIALS & INTERFACES 2023; 15:5099-5108. [PMID: 36652634 DOI: 10.1021/acsami.2c17198] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
Water management in the catalyst layers (CLs) of proton-exchange membrane fuel cells is crucial for its commercialization and popularization. However, the high experimental or computational cost in obtaining water distribution and diffusion remains a bottleneck in the existing experimental methods and simulation algorithms, and further mechanistic exploration at the nanoscale is necessary. Herein, we integrate, for the first time, molecular dynamics simulation with our customized analysis framework based on a multiattribute point cloud dataset and an advanced deep learning network. This was achieved through our workflow that generates simulated transport data of water molecules in the CLs as the training and test dataset. Deep learning framework models the multibody solid-liquid system of CLs on a molecular scale and completes the mapping from the Pt/C substrate structure and Nafion aggregates to the density distribution and diffusion coefficient of water molecules. The prediction results are comprehensively analyzed and error evaluated, which reveals the highly anisotropic interaction landscape between 50,000 pairs of interacting nanoparticles and explains the structure and water transport property relationship in the hydrated Nafion film on the molecular scale. Compared to the conventional methods, the proposed deep learning framework shows computational cost efficiency, accuracy, and good visual display. Further, it has a generality potential to model macro- and microscopic mass transport in different components of fuel cells. Our framework is expected to make real-time predictions of the distribution and diffusion of water molecules in CLs as well as establish statistical significance in the structural optimization and design of CLs and other components of fuel cells.
Collapse
Affiliation(s)
- Gaoyang Li
- Institute of Fluid Science, Tohoku University, 2-1-1 Katahira, Aoba-ku, Sendai980-8577, Japan
| | - Yonghong Zhu
- School of Chemical Engineering, Northwest University, Xi'an710069Shaanxi, China
| | - Yuting Guo
- Institute of Fluid Science, Tohoku University, 2-1-1 Katahira, Aoba-ku, Sendai980-8577, Japan
| | - Takuya Mabuchi
- Institute of Fluid Science, Tohoku University, 2-1-1 Katahira, Aoba-ku, Sendai980-8577, Japan
- Frontier Research Institute for Interdisciplinary Sciences, Tohoku University, 2-1-1 Katahira, Aoba-ku, Sendai, Miyagi980-8577, Japan
| | - Dong Li
- School of Chemical Engineering, Northwest University, Xi'an710069Shaanxi, China
| | - Shengfeng Huang
- Institute of Fluid Science, Tohoku University, 2-1-1 Katahira, Aoba-ku, Sendai980-8577, Japan
| | - Sirui Wang
- Graduate School of Engineering, Chiba University, 1-33, Yayoi-cho, Inage-ku, Chiba263-8522, Japan
| | - Haiyi Sun
- Institute of Fluid Science, Tohoku University, 2-1-1 Katahira, Aoba-ku, Sendai980-8577, Japan
| | - Takashi Tokumasu
- Institute of Fluid Science, Tohoku University, 2-1-1 Katahira, Aoba-ku, Sendai980-8577, Japan
| |
Collapse
|
5
|
Martin TB, Audus DJ. Emerging Trends in Machine Learning: A Polymer Perspective. ACS POLYMERS AU 2023. [DOI: 10.1021/acspolymersau.2c00053] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
Affiliation(s)
- Tyler B. Martin
- National Institute of Standards and Technology, Gaithersburg, Maryland20899, United States
| | - Debra J. Audus
- National Institute of Standards and Technology, Gaithersburg, Maryland20899, United States
| |
Collapse
|
6
|
Ma D, Xing Y, Zhang L. Reducing interfacial thermal resistance by interlayer. JOURNAL OF PHYSICS. CONDENSED MATTER : AN INSTITUTE OF PHYSICS JOURNAL 2022; 35:053001. [PMID: 36541482 DOI: 10.1088/1361-648x/aca50a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/18/2022] [Accepted: 11/22/2022] [Indexed: 06/17/2023]
Abstract
Heat dissipation is crucial important for the performance and lifetime for highly integrated electronics, Li-ion battery-based devices and so on, which lies in the decrease of interfacial thermal resistance (ITR). To achieve this goal, introducing interlayer is the most widely used strategy in industry, which has attracted tremendous attention from researchers. In this review, we focus on bonding effect and bridging effect to illustrate how introduced interlayer decreases ITR. The behind mechanisms and theoretical understanding of these two effects are clearly illustrated. Simulative and experimental studies toward utilizing these two effects to decrease ITR of real materials and practical systems are reviewed. Specifically, the mechanisms and design rules for the newly emerged graded interlayers are discussed. The optimization of interlayers by machine learning algorithms are reviewed. Based on present researches, challenges and possible future directions about this topic are discussed.
Collapse
Affiliation(s)
- Dengke Ma
- Phonon Engineering Research Center of Jiangsu Province, Center for Quantum Transport and Thermal Energy Science, Institute of Physics Frontiers and Interdisciplinary Sciences, School of Physics and Technology, Nanjing Normal University, Nanjing 210023, People's Republic of China
| | - Yuheng Xing
- Department of Physics, Yancheng Institute of Technology, Yancheng 224051, People's Republic of China
| | - Lifa Zhang
- Phonon Engineering Research Center of Jiangsu Province, Center for Quantum Transport and Thermal Energy Science, Institute of Physics Frontiers and Interdisciplinary Sciences, School of Physics and Technology, Nanjing Normal University, Nanjing 210023, People's Republic of China
| |
Collapse
|
7
|
Liu Y, Zhou Y, Xu Y. State-of-the-Art, Opportunities, and Challenges in Bottom-up Synthesis of Polymers with High Thermal Conductivity. Polym Chem 2022. [DOI: 10.1039/d2py00272h] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
In contrast to metals, polymers are predominantly thermal and electrical insulators. With their unparalleled advantages such as light weight, turning polymer insulators into heat conductors with metal-like thermal conductivity is...
Collapse
|