1
|
Wang K, Gupta V, Lee CS, Mao Y, Kilic MNT, Li Y, Huang Z, Liao WK, Choudhary A, Agrawal A. XElemNet: towards explainable AI for deep neural networks in materials science. Sci Rep 2024; 14:25178. [PMID: 39448747 PMCID: PMC11502843 DOI: 10.1038/s41598-024-76535-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2024] [Accepted: 10/15/2024] [Indexed: 10/26/2024] Open
Abstract
Recent progress in deep learning has significantly impacted materials science, leading to accelerated material discovery and innovation. ElemNet, a deep neural network model that predicts formation energy from elemental compositions, exemplifies the application of deep learning techniques in this field. However, the "black-box" nature of deep learning models often raises concerns about their interpretability and reliability. In this study, we propose XElemNet to explore the interpretability of ElemNet by applying a series of explainable artificial intelligence (XAI) techniques, focusing on post-hoc analysis and model transparency. The experiments with artificial binary datasets reveal ElemNet's effectiveness in predicting convex hulls of element-pair systems across periodic table groups, indicating its capability to effectively discern elemental interactions in most cases. Additionally, feature importance analysis within ElemNet highlights alignment with chemical properties of elements such as reactivity and electronegativity. XElemNet provides insights into the strengths and limitations of ElemNet and offers a potential pathway for explaining other deep learning models in materials science.
Collapse
Affiliation(s)
- Kewei Wang
- Electrical and Computer Engineering, Northwestern University, Evanston, 60201, USA
| | - Vishu Gupta
- Electrical and Computer Engineering, Northwestern University, Evanston, 60201, USA
| | - Claire Songhyun Lee
- Electrical and Computer Engineering, Northwestern University, Evanston, 60201, USA
| | - Yuwei Mao
- Electrical and Computer Engineering, Northwestern University, Evanston, 60201, USA
| | | | - Youjia Li
- Electrical and Computer Engineering, Northwestern University, Evanston, 60201, USA
| | - Zanhua Huang
- Electrical and Computer Engineering, Northwestern University, Evanston, 60201, USA
| | - Wei-Keng Liao
- Electrical and Computer Engineering, Northwestern University, Evanston, 60201, USA
| | - Alok Choudhary
- Electrical and Computer Engineering, Northwestern University, Evanston, 60201, USA
| | - Ankit Agrawal
- Electrical and Computer Engineering, Northwestern University, Evanston, 60201, USA.
| |
Collapse
|
2
|
Day AL, Wahl CB, Gupta V, Dos Reis R, Liao WK, Mirkin CA, Dravid VP, Choudhary A, Agrawal A. Machine Learning-Enabled Image Classification for Automated Electron Microscopy. MICROSCOPY AND MICROANALYSIS : THE OFFICIAL JOURNAL OF MICROSCOPY SOCIETY OF AMERICA, MICROBEAM ANALYSIS SOCIETY, MICROSCOPICAL SOCIETY OF CANADA 2024; 30:456-465. [PMID: 38758983 DOI: 10.1093/mam/ozae042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/09/2023] [Revised: 02/19/2024] [Accepted: 04/29/2024] [Indexed: 05/19/2024]
Abstract
Traditionally, materials discovery has been driven more by evidence and intuition than by systematic design. However, the advent of "big data" and an exponential increase in computational power have reshaped the landscape. Today, we use simulations, artificial intelligence (AI), and machine learning (ML) to predict materials characteristics, which dramatically accelerates the discovery of novel materials. For instance, combinatorial megalibraries, where millions of distinct nanoparticles are created on a single chip, have spurred the need for automated characterization tools. This paper presents an ML model specifically developed to perform real-time binary classification of grayscale high-angle annular dark-field images of nanoparticles sourced from these megalibraries. Given the high costs associated with downstream processing errors, a primary requirement for our model was to minimize false positives while maintaining efficacy on unseen images. We elaborate on the computational challenges and our solutions, including managing memory constraints, optimizing training time, and utilizing Neural Architecture Search tools. The final model outperformed our expectations, achieving over 95% precision and a weighted F-score of more than 90% on our test data set. This paper discusses the development, challenges, and successful outcomes of this significant advancement in the application of AI and ML to materials discovery.
Collapse
Affiliation(s)
- Alexandra L Day
- Department of Electrical and Computer Engineering, McCormick School of Engineering, Northwestern University, Technological Institute, 2145 Sheridan Road, Room L359, Evanston, IL 60208, USA
| | - Carolin B Wahl
- Department of Materials Science and Engineering, McCormick School of Engineering, Northwestern University, Cook Hall, 2220 Campus Drive, Room 2036, Evanston, IL 60208, USA
- International Institute for Nanotechnology, Northwestern University, Technological Institute, 2145 Sheridan Road, Room K111, Evanston, IL 60208, USA
| | - Vishu Gupta
- Department of Electrical and Computer Engineering, McCormick School of Engineering, Northwestern University, Technological Institute, 2145 Sheridan Road, Room L359, Evanston, IL 60208, USA
| | - Roberto Dos Reis
- Department of Materials Science and Engineering, McCormick School of Engineering, Northwestern University, Cook Hall, 2220 Campus Drive, Room 2036, Evanston, IL 60208, USA
- International Institute for Nanotechnology, Northwestern University, Technological Institute, 2145 Sheridan Road, Room K111, Evanston, IL 60208, USA
- The NUANCE Center, Northwestern University, Technological Institute, 2145 Sheridan Road, Room A173, Evanston, IL 60208, USA
| | - Wei-Keng Liao
- Department of Electrical and Computer Engineering, McCormick School of Engineering, Northwestern University, Technological Institute, 2145 Sheridan Road, Room L359, Evanston, IL 60208, USA
| | - Chad A Mirkin
- Department of Materials Science and Engineering, McCormick School of Engineering, Northwestern University, Cook Hall, 2220 Campus Drive, Room 2036, Evanston, IL 60208, USA
- International Institute for Nanotechnology, Northwestern University, Technological Institute, 2145 Sheridan Road, Room K111, Evanston, IL 60208, USA
- Department of Chemistry, Northwestern University, Technological Institute, 2145 Sheridan Road, Room K148, Evanston, IL 60208, USA
| | - Vinayak P Dravid
- Department of Materials Science and Engineering, McCormick School of Engineering, Northwestern University, Cook Hall, 2220 Campus Drive, Room 2036, Evanston, IL 60208, USA
- International Institute for Nanotechnology, Northwestern University, Technological Institute, 2145 Sheridan Road, Room K111, Evanston, IL 60208, USA
- The NUANCE Center, Northwestern University, Technological Institute, 2145 Sheridan Road, Room A173, Evanston, IL 60208, USA
| | - Alok Choudhary
- Department of Electrical and Computer Engineering, McCormick School of Engineering, Northwestern University, Technological Institute, 2145 Sheridan Road, Room L359, Evanston, IL 60208, USA
| | - Ankit Agrawal
- Department of Electrical and Computer Engineering, McCormick School of Engineering, Northwestern University, Technological Institute, 2145 Sheridan Road, Room L359, Evanston, IL 60208, USA
| |
Collapse
|
3
|
Chen X, Lu S, Chen Q, Zhou Q, Wang J. From bulk effective mass to 2D carrier mobility accurate prediction via adversarial transfer learning. Nat Commun 2024; 15:5391. [PMID: 38918387 PMCID: PMC11199574 DOI: 10.1038/s41467-024-49686-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Accepted: 06/10/2024] [Indexed: 06/27/2024] Open
Abstract
Data scarcity is one of the critical bottlenecks to utilizing machine learning in material discovery. Transfer learning can use existing big data to assist property prediction on small data sets, but the premise is that there must be a strong correlation between large and small data sets. To extend its applicability in scenarios with different properties and materials, here we develop a hybrid framework combining adversarial transfer learning and expert knowledge, which enables the direct prediction of carrier mobility of two-dimensional (2D) materials using the knowledge learned from bulk effective mass. Specifically, adversarial training ensures that only common knowledge between bulk and 2D materials is extracted while expert knowledge is incorporated to further improve the prediction accuracy and generalizability. Successfully, 2D carrier mobilities are predicted with the accuracy over 90% from only crystal structure, and 21 2D semiconductors with carrier mobilities far exceeding silicon and suitable bandgap are successfully screened out. This work enables transfer learning in simultaneous cross-property and cross-material scenarios, providing an effective tool to predict intricate material properties with limited data.
Collapse
Affiliation(s)
- Xinyu Chen
- Key Laboratory of Quantum Materials and Devices of Ministry of Education, School of Physics, Southeast University, Nanjing, China
| | - Shuaihua Lu
- Key Laboratory of Quantum Materials and Devices of Ministry of Education, School of Physics, Southeast University, Nanjing, China
| | - Qian Chen
- Key Laboratory of Quantum Materials and Devices of Ministry of Education, School of Physics, Southeast University, Nanjing, China
| | - Qionghua Zhou
- Key Laboratory of Quantum Materials and Devices of Ministry of Education, School of Physics, Southeast University, Nanjing, China.
- Suzhou Laboratory, Suzhou, China.
| | - Jinlan Wang
- Key Laboratory of Quantum Materials and Devices of Ministry of Education, School of Physics, Southeast University, Nanjing, China.
- Suzhou Laboratory, Suzhou, China.
| |
Collapse
|
4
|
Li P, Dong L, Li C, Li Y, Zhao J, Peng B, Wang W, Zhou S, Liu W. Machine Learning to Promote Efficient Screening of Low-Contact Electrode for 2D Semiconductor Transistor Under Limited Data. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2024; 36:e2312887. [PMID: 38606800 DOI: 10.1002/adma.202312887] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Revised: 03/09/2024] [Indexed: 04/13/2024]
Abstract
Low-barrier and high-injection electrodes are crucial for high-performance (HP) 2D semiconductor devices. Conventional trial-and-error methodologies for electrode material screening are impractical because of their low efficiency and arbitrary specificity. Although machine learning has emerged as a promising alternative to tackle this problem, its practical application in semiconductor devices is hindered by its substantial data requirements. In this paper, a comprehensive scheme combining an autoencoding regularized adversarial neural network and a feature-adaptive variational active learning algorithm for screening low-contact electrode materials for 2D semiconductor transistors with limited data is proposed. The proposed scheme exhibits exceptional performance by training with only 15% of the total data points, where the mean square errors are 0.17 and 0.27 eV for the vertical and lateral Schottky barrier, respectively, and 2.88% for tunneling probability. Further, it exhibits an optimal predictive performance for 100 randomly sampled training datasets, reveals the underlying physical insight based on the identified features, and realizes continual improvement by employing detailed density-of-states descriptors. Finally, the empirical evaluations of the transport characteristics are conducted and verified by constructing MOSFET devices. These findings demonstrate the considerable potential of machine-learning techniques for screening high-efficiency electrode materials and constructing HP 2D semiconductor devices.
Collapse
Affiliation(s)
- Penghui Li
- Shaanxi Province Key Laboratory of Thin Films Technology and Optical Test, Xi'an Technological University, Xi'an, 710032, China
- School of Opto-electronical Engineering, Xi'an Technological University, Xi'an, 710032, China
| | - Linpeng Dong
- Shaanxi Province Key Laboratory of Thin Films Technology and Optical Test, Xi'an Technological University, Xi'an, 710032, China
- School of Opto-electronical Engineering, Xi'an Technological University, Xi'an, 710032, China
| | - Chong Li
- Xi'an Xiangteng Microelectronics Technology Co., Ltd, Xi'an, 710075, China
| | - Yan Li
- Shaanxi Province Key Laboratory of Thin Films Technology and Optical Test, Xi'an Technological University, Xi'an, 710032, China
- School of Opto-electronical Engineering, Xi'an Technological University, Xi'an, 710032, China
| | - Jie Zhao
- Shaanxi Province Key Laboratory of Thin Films Technology and Optical Test, Xi'an Technological University, Xi'an, 710032, China
- School of Opto-electronical Engineering, Xi'an Technological University, Xi'an, 710032, China
| | - Bo Peng
- Key Laboratory of Wide Band-Gap Semiconductor Materials and Devices, School of Microelectronics, Xidian University, Xi'an, 710071, China
| | - Wei Wang
- School of Opto-electronical Engineering, Xi'an Technological University, Xi'an, 710032, China
| | - Shun Zhou
- Shaanxi Province Key Laboratory of Thin Films Technology and Optical Test, Xi'an Technological University, Xi'an, 710032, China
- School of Opto-electronical Engineering, Xi'an Technological University, Xi'an, 710032, China
| | - Weiguo Liu
- Shaanxi Province Key Laboratory of Thin Films Technology and Optical Test, Xi'an Technological University, Xi'an, 710032, China
- School of Opto-electronical Engineering, Xi'an Technological University, Xi'an, 710032, China
| |
Collapse
|
5
|
Korolev V, Mitrofanov A. Coarse-Grained Crystal Graph Neural Networks for Reticular Materials Design. J Chem Inf Model 2024; 64:1919-1931. [PMID: 38456446 DOI: 10.1021/acs.jcim.3c02083] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/09/2024]
Abstract
Reticular materials, including metal-organic frameworks and covalent organic frameworks, combine the relative ease of synthesis and an impressive range of applications in various fields from gas storage to biomedicine. Diverse properties arise from the variation of building units─metal centers and organic linkers─in almost infinite chemical space. Such variation substantially complicates the experimental design and promotes the use of computational methods. In particular, the most successful artificial intelligence algorithms for predicting the properties of reticular materials are atomic-level graph neural networks, which optionally incorporate domain knowledge. Nonetheless, the data-driven inverse design involving these models suffers from the incorporation of irrelevant and redundant features such as a full atomistic graph and network topology. In this study, we propose a new way of representing materials, aiming to overcome the limitations of existing methods; the message passing is performed on a coarse-grained crystal graph that comprises molecular building units. To highlight the merits of our approach, we assessed the predictive performance and energy efficiency of neural networks built on different materials representations, including composition-based and crystal-structure-aware models. Coarse-grained crystal graph neural networks showed decent accuracy at low computational costs, making them a valuable alternative to omnipresent atomic-level algorithms. Moreover, the presented models can be successfully integrated into an inverse materials design pipeline as estimators of the objective function. Overall, the coarse-grained crystal graph framework is aimed at challenging the prevailing atom-centric perspective on reticular materials design.
Collapse
Affiliation(s)
- Vadim Korolev
- Department of Chemistry, Lomonosov Moscow State University, Moscow 119991, Russia
- MSU Institute for Artificial Intelligence, Lomonosov Moscow State University, Moscow 119192, Russia
| | - Artem Mitrofanov
- Department of Chemistry, Lomonosov Moscow State University, Moscow 119991, Russia
- MSU Institute for Artificial Intelligence, Lomonosov Moscow State University, Moscow 119192, Russia
| |
Collapse
|
6
|
Wu Q, Kang L, Lin Z. A Machine Learning Study on High Thermal Conductivity Assisted to Discover Chalcogenides with Balanced Infrared Nonlinear Optical Performance. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2024; 36:e2309675. [PMID: 37929600 DOI: 10.1002/adma.202309675] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/19/2023] [Revised: 10/24/2023] [Indexed: 11/07/2023]
Abstract
Exploration of novel nonlinear optical (NLO) chalcogenides with high laser-induced damage thresholds (LIDT) is critical for mid-infrared (mid-IR) solid-state laser applications. High lattice thermal conductivity (κL ) is crucial to increasing LIDT yet often neglected in the search for NLO crystals due to lack of accurate κL data. A machine learning (ML) approach to predict κL for over 6000 chalcogenides is hereby proposed. Combining ML-generated κL data and first-principles calculation, a high-throughput screening route is initiated, and ten new potential mid-IR NLO chalcogenides with optimal bandgap, NLO coefficients, and thermal conductivity are discovered, in which Li2 SiS3 and AlZnGaS4 are highlighted. Big-data analysis on structural chemistry proves that the chalcogenides having dense and simple lattice structures with low anisotropy, light atoms, and strong covalent bonds are likely to possess higher κL . The four-coordinated motifs in which central cations show the bond valence sum of +2 to +3 and are from IIIA, IVA, VA, and IIB groups, such as those in diamond-like defect-chalcopyrite chalcogenides, are preferred to fulfill the desired structural chemistry conditions for balanced NLO and thermal properties. This work provides not only an efficient strategy but also interpretable research directions in the search for NLO crystals with high thermal conductivity.
Collapse
Affiliation(s)
- Qingchen Wu
- Functional Crystals Lab, Key Laboratory of Functional Crystals and Laser Technology, Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Beijing, 100190, China
- Center of Materials Science and Optoelectronics Engineering, University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Lei Kang
- Functional Crystals Lab, Key Laboratory of Functional Crystals and Laser Technology, Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Beijing, 100190, China
| | - Zheshuai Lin
- Functional Crystals Lab, Key Laboratory of Functional Crystals and Laser Technology, Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Beijing, 100190, China
- Center of Materials Science and Optoelectronics Engineering, University of Chinese Academy of Sciences, Beijing, 100049, China
| |
Collapse
|
7
|
Wang Z, Chen A, Tao K, Han Y, Li J. MatGPT: A Vane of Materials Informatics from Past, Present, to Future. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2024; 36:e2306733. [PMID: 37813548 DOI: 10.1002/adma.202306733] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Revised: 09/05/2023] [Indexed: 10/17/2023]
Abstract
Combining materials science, artificial intelligence (AI), physical chemistry, and other disciplines, materials informatics is continuously accelerating the vigorous development of new materials. The emergence of "GPT (Generative Pre-trained Transformer) AI" shows that the scientific research field has entered the era of intelligent civilization with "data" as the basic factor and "algorithm + computing power" as the core productivity. The continuous innovation of AI will impact the cognitive laws and scientific methods, and reconstruct the knowledge and wisdom system. This leads to think more about materials informatics. Here, a comprehensive discussion of AI models and materials infrastructures is provided, and the advances in the discovery and design of new materials are reviewed. With the rise of new research paradigms triggered by "AI for Science", the vane of materials informatics: "MatGPT", is proposed and the technical path planning from the aspects of data, descriptors, generative models, pretraining models, directed design models, collaborative training, experimental robots, as well as the efforts and preparations needed to develop a new generation of materials informatics, is carried out. Finally, the challenges and constraints faced by materials informatics are discussed, in order to achieve a more digital, intelligent, and automated construction of materials informatics with the joint efforts of more interdisciplinary scientists.
Collapse
Affiliation(s)
- Zhilong Wang
- National Key Laboratory of Science and Technology on Micro/Nano Fabrication, Shanghai Jiao Tong University, Shanghai, 200240, China
- Key Laboratory of Thin Film and Microfabrication of Ministry of Education, Department of Micro/Nano Electronics, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - An Chen
- National Key Laboratory of Science and Technology on Micro/Nano Fabrication, Shanghai Jiao Tong University, Shanghai, 200240, China
- Key Laboratory of Thin Film and Microfabrication of Ministry of Education, Department of Micro/Nano Electronics, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Kehao Tao
- National Key Laboratory of Science and Technology on Micro/Nano Fabrication, Shanghai Jiao Tong University, Shanghai, 200240, China
- Key Laboratory of Thin Film and Microfabrication of Ministry of Education, Department of Micro/Nano Electronics, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Yanqiang Han
- National Key Laboratory of Science and Technology on Micro/Nano Fabrication, Shanghai Jiao Tong University, Shanghai, 200240, China
- Key Laboratory of Thin Film and Microfabrication of Ministry of Education, Department of Micro/Nano Electronics, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Jinjin Li
- National Key Laboratory of Science and Technology on Micro/Nano Fabrication, Shanghai Jiao Tong University, Shanghai, 200240, China
- Key Laboratory of Thin Film and Microfabrication of Ministry of Education, Department of Micro/Nano Electronics, Shanghai Jiao Tong University, Shanghai, 200240, China
| |
Collapse
|
8
|
Wang K, Xu L, Shao W, Jin H, Wang Q, Ma M. A Multiple-Fidelity Method for Accurate Simulation of MoS 2 Properties Using JAX-ReaxFF and Neural Network Potentials. J Phys Chem Lett 2024; 15:371-379. [PMID: 38175525 DOI: 10.1021/acs.jpclett.3c03080] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2024]
Abstract
Reactive force field (ReaxFF) is a commonly used force field for modeling chemical reactions at the atomic level. Recently, JAX-ReaxFF, combined with automatic differentiation, has been used to efficiently parametrize ReaxFF. However, its analytical formula may lead to inaccurate predictions. While neural network-based potentials (NNPs) trained on density functional theory-labeled data offer a more accurate method, it requires a large amount of training data to be trained from scratch. To overcome these issues, we present a multiple-fidelity method that combines JAX-ReaxFF and NNP and apply the method on MoS2, a promising two-dimensional semiconductor for flexible electronics. By incorporating implicit prior physical information, ReaxFF can serve as a cost-effective way to generate pretraining data, facilitating more accurate simulations of MoS2. Moreover, in the Mo-S-H system, the pretraining strategy can reduce root-mean-square errors of energy by 20%. This approach can be extended to a wide variety of material systems, accelerating their computational research.
Collapse
Affiliation(s)
- Kehan Wang
- State Key Laboratory of Tribology in Advanced Equipment (SKLT), Department of Mechanical Engineering, Tsinghua University, Beijing 100084, China
- Center for Nano and Micro Mechanics, Tsinghua University, Beijing 100084, China
| | - Longkun Xu
- Samsung Research China - Beijing (SRC-B), Beijing 100102, China
| | - Wei Shao
- Samsung Research China - Beijing (SRC-B), Beijing 100102, China
| | - Haishun Jin
- Samsung Research China - Beijing (SRC-B), Beijing 100102, China
| | - Qiang Wang
- Samsung Research China - Beijing (SRC-B), Beijing 100102, China
| | - Ming Ma
- State Key Laboratory of Tribology in Advanced Equipment (SKLT), Department of Mechanical Engineering, Tsinghua University, Beijing 100084, China
- Center for Nano and Micro Mechanics, Tsinghua University, Beijing 100084, China
- Institute of Superlubricity Technology, Research Institute of Tsinghua University in Shenzhen, Shenzhen 518063, Guangdong, China
| |
Collapse
|
9
|
Gong S, Yan K, Xie T, Shao-Horn Y, Gomez-Bombarelli R, Ji S, Grossman JC. Examining graph neural networks for crystal structures: Limitations and opportunities for capturing periodicity. SCIENCE ADVANCES 2023; 9:eadi3245. [PMID: 37948518 PMCID: PMC10637739 DOI: 10.1126/sciadv.adi3245] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/18/2023] [Accepted: 10/13/2023] [Indexed: 11/12/2023]
Abstract
Graph neural networks (GNNs) have recently been used to learn the representations of crystal structures through an end-to-end data-driven approach. However, a systematic top-down approach to evaluate and understand the limitations of GNNs in accurately capturing crystal structures has yet to be established. In this study, we introduce an approach using human-designed descriptors as a compendium of human knowledge to investigate the extent to which GNNs can comprehend crystal structures. Our findings reveal that current state-of-the-art GNNs fall short in accurately capturing the periodicity of crystal structures. We analyze this failure by exploring three aspects: local expressive power, long-range information processing, and readout function. To address these identified limitations, we propose a straightforward and general solution: the hybridization of descriptors with GNNs, which directly supplements the missing information to GNNs. The hybridization enhances the predictive accuracy of GNNs for specific material properties, most notably phonon internal energy and heat capacity, which heavily rely on the periodicity of materials.
Collapse
Affiliation(s)
- Sheng Gong
- Department of Materials Science and Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Keqiang Yan
- Computer Science and Engineering, Texas A&M University, College Station, TX 77843, USA
| | - Tian Xie
- Microsoft Research, Cambridge CB1 2FB, UK
| | - Yang Shao-Horn
- Department of Materials Science and Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- Department of Mechanical Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Rafael Gomez-Bombarelli
- Department of Materials Science and Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Shuiwang Ji
- Computer Science and Engineering, Texas A&M University, College Station, TX 77843, USA
| | - Jeffrey C. Grossman
- Department of Materials Science and Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| |
Collapse
|
10
|
Ho B, Zhao J, Liu J, Tang L, Guan Z, Li X, Li M, Howard E, Wheeler R, Bae J. SEMPro: A Data-Driven Pipeline To Learn Structure-Property Insights from Scanning Electron Microscopy Images. ACS MATERIALS LETTERS 2023; 5:3117-3125. [PMID: 37969140 PMCID: PMC10630981 DOI: 10.1021/acsmaterialslett.3c00909] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/10/2023] [Revised: 10/12/2023] [Accepted: 10/16/2023] [Indexed: 11/17/2023]
Abstract
Analyzing hydrogel microstructure through scanning electron microscopy (SEM) images is crucial in understanding hydrogel properties. However, the analysis of SEM images in hydrogel research heavily relies on the intuition of individual researchers and is constrained by the limited size of the dataset. To address this, we propose SEMPro, a data-driven solution using web-scraping and deep learning (DL) to compile and analyze the structure-property relationships of hydrogels through SEM images. It accurately predicts the elastic modulus from SEM images within the same order of magnitude and displays a learned extraction of modulus-relevant features in SEM images as seen through the nontrivial activation mapping and transfer learning. By employing Explainable AI through activation map exposure, SEMPro validates the model predictions. SEMPro represents a closed-loop data collection and analysis pipeline, providing critical insights into hydrogels and soft materials. This innovative approach has the potential to revolutionize hydrogel research, offering high-dimensional insights for further advancements.
Collapse
Affiliation(s)
- Brandon Ho
- Department
of Electrical and Computer Engineering, University of California San Diego, La Jolla, California 92093, United States
| | - Jiayu Zhao
- Department
of NanoEngineering, University of California
San Diego, La Jolla, California 92093, United States
| | - Joseph Liu
- Department
of NanoEngineering, University of California
San Diego, La Jolla, California 92093, United States
| | - Lisa Tang
- Department
of NanoEngineering, University of California
San Diego, La Jolla, California 92093, United States
- Chemical
Engineering Program, University of California
San Diego, La Jolla, California 92093, United States
| | - Zhecun Guan
- Department
of NanoEngineering, University of California
San Diego, La Jolla, California 92093, United States
| | - Xiao Li
- Material
Science and Engineering Program, University
of California San Diego, La Jolla, California 92093, United States
| | - Minghao Li
- Material
Science and Engineering Program, University
of California San Diego, La Jolla, California 92093, United States
| | - Elizabeth Howard
- Department
of NanoEngineering, University of California
San Diego, La Jolla, California 92093, United States
| | - Rebecca Wheeler
- Department
of NanoEngineering, University of California
San Diego, La Jolla, California 92093, United States
- Chemical
Engineering Program, University of California
San Diego, La Jolla, California 92093, United States
| | - Jinhye Bae
- Department
of NanoEngineering, University of California
San Diego, La Jolla, California 92093, United States
- Chemical
Engineering Program, University of California
San Diego, La Jolla, California 92093, United States
- Material
Science and Engineering Program, University
of California San Diego, La Jolla, California 92093, United States
| |
Collapse
|
11
|
Duan M, Wang Y, Zhao D, Liu H, Zhang G, Li K, Zhang H, Huang L, Zhang R, Zhou F. Orchestrating information across tissues via a novel multitask GAT framework to improve quantitative gene regulation relation modeling for survival analysis. Brief Bioinform 2023; 24:bbad238. [PMID: 37427963 DOI: 10.1093/bib/bbad238] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2023] [Revised: 05/29/2023] [Accepted: 06/08/2023] [Indexed: 07/11/2023] Open
Abstract
Survival analysis is critical to cancer prognosis estimation. High-throughput technologies facilitate the increase in the dimension of genic features, but the number of clinical samples in cohorts is relatively small due to various reasons, including difficulties in participant recruitment and high data-generation costs. Transcriptome is one of the most abundantly available OMIC (referring to the high-throughput data, including genomic, transcriptomic, proteomic and epigenomic) data types. This study introduced a multitask graph attention network (GAT) framework DQSurv for the survival analysis task. We first used a large dataset of healthy tissue samples to pretrain the GAT-based HealthModel for the quantitative measurement of the gene regulatory relations. The multitask survival analysis framework DQSurv used the idea of transfer learning to initiate the GAT model with the pretrained HealthModel and further fine-tuned this model using two tasks i.e. the main task of survival analysis and the auxiliary task of gene expression prediction. This refined GAT was denoted as DiseaseModel. We fused the original transcriptomic features with the difference vector between the latent features encoded by the HealthModel and DiseaseModel for the final task of survival analysis. The proposed DQSurv model stably outperformed the existing models for the survival analysis of 10 benchmark cancer types and an independent dataset. The ablation study also supported the necessity of the main modules. We released the codes and the pretrained HealthModel to facilitate the feature encodings and survival analysis of transcriptome-based future studies, especially on small datasets. The model and the code are available at http://www.healthinformaticslab.org/supp/.
Collapse
Affiliation(s)
- Meiyu Duan
- College of Computer Science and Technology, Jilin University, Changchun, Jilin, China, 130012
| | - Yueying Wang
- College of Computer Science and Technology, Jilin University, Changchun, Jilin, China, 130012
| | - Dong Zhao
- School of Biology and Engineering, and Engineering Research Center of Medical Biotechnology, Guizhou Medical University, Guiyang, Guizhou 550025, China
| | - Hongmei Liu
- School of Biology and Engineering, and Engineering Research Center of Medical Biotechnology, Guizhou Medical University, Guiyang, Guizhou 550025, China
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, Jilin, China, 130012
| | - Gongyou Zhang
- School of Biology and Engineering, and Engineering Research Center of Medical Biotechnology, Guizhou Medical University, Guiyang, Guizhou 550025, China
| | - Kewei Li
- College of Computer Science and Technology, Jilin University, Changchun, Jilin, China, 130012
| | - Haotian Zhang
- College of Computer Science and Technology, Jilin University, Changchun, Jilin, China, 130012
| | - Lan Huang
- College of Computer Science and Technology, Jilin University, Changchun, Jilin, China, 130012
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, Jilin, China, 130012
| | - Ruochi Zhang
- School of Artificial Intelligence, Jilin University, Changchun, China, 130012
| | - Fengfeng Zhou
- College of Computer Science and Technology, Jilin University, Changchun, Jilin, China, 130012
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, Jilin, China, 130012
| |
Collapse
|
12
|
Liu Y, Yang Z, Zou X, Ma S, Liu D, Avdeev M, Shi S. Data quantity governance for machine learning in materials science. Natl Sci Rev 2023; 10:nwad125. [PMID: 37323811 PMCID: PMC10265966 DOI: 10.1093/nsr/nwad125] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Revised: 04/14/2023] [Accepted: 04/26/2023] [Indexed: 06/17/2023] Open
Abstract
Data-driven machine learning (ML) is widely employed in the analysis of materials structure-activity relationships, performance optimization and materials design due to its superior ability to reveal latent data patterns and make accurate prediction. However, because of the laborious process of materials data acquisition, ML models encounter the issue of the mismatch between a high dimension of feature space and a small sample size (for traditional ML models) or the mismatch between model parameters and sample size (for deep-learning models), usually resulting in terrible performance. Here, we review the efforts for tackling this issue via feature reduction, sample augmentation and specific ML approaches, and show that the balance between the number of samples and features or model parameters should attract great attention during data quantity governance. Following this, we propose a synergistic data quantity governance flow with the incorporation of materials domain knowledge. After summarizing the approaches to incorporating materials domain knowledge into the process of ML, we provide examples of incorporating domain knowledge into governance schemes to demonstrate the advantages of the approach and applications. The work paves the way for obtaining the required high-quality data to accelerate materials design and discovery based on ML.
Collapse
Affiliation(s)
- Yue Liu
- School of Computer Engineering and Science, Shanghai University, Shanghai200444, China
- Shanghai Engineering Research Center of Intelligent Computing System, Shanghai200444, China
| | - Zhengwei Yang
- School of Computer Engineering and Science, Shanghai University, Shanghai200444, China
| | - Xinxin Zou
- School of Computer Engineering and Science, Shanghai University, Shanghai200444, China
| | - Shuchang Ma
- School of Computer Engineering and Science, Shanghai University, Shanghai200444, China
| | - Dahui Liu
- School of Computer Engineering and Science, Shanghai University, Shanghai200444, China
| | - Maxim Avdeev
- Australian Nuclear Science and Technology Organisation, Sydney 2232, Australia
- School of Chemistry, The University of Sydney, Sydney 2006, Australia
| | - Siqi Shi
- State Key Laboratory of Advanced Special Steel, School of Materials Science and Engineering, Shanghai University, Shanghai200444, China
- Materials Genome Institute, Shanghai University, Shanghai200444, China
| |
Collapse
|
13
|
Gupta V, Peltekian A, Liao WK, Choudhary A, Agrawal A. Improving deep learning model performance under parametric constraints for materials informatics applications. Sci Rep 2023; 13:9128. [PMID: 37277456 DOI: 10.1038/s41598-023-36336-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2023] [Accepted: 06/01/2023] [Indexed: 06/07/2023] Open
Abstract
Modern machine learning (ML) and deep learning (DL) techniques using high-dimensional data representations have helped accelerate the materials discovery process by efficiently detecting hidden patterns in existing datasets and linking input representations to output properties for a better understanding of the scientific phenomenon. While a deep neural network comprised of fully connected layers has been widely used for materials property prediction, simply creating a deeper model with a large number of layers often faces with vanishing gradient problem, causing a degradation in the performance, thereby limiting usage. In this paper, we study and propose architectural principles to address the question of improving the performance of model training and inference under fixed parametric constraints. Here, we present a general deep-learning framework based on branched residual learning (BRNet) with fully connected layers that can work with any numerical vector-based representation as input to build accurate models to predict materials properties. We perform model training for materials properties using numerical vectors representing different composition-based attributes of the respective materials and compare the performance of the proposed models against traditional ML and existing DL architectures. We find that the proposed models are significantly more accurate than the ML/DL models for all data sizes by using different composition-based attributes as input. Further, branched learning requires fewer parameters and results in faster model training due to better convergence during the training phase than existing neural networks, thereby efficiently building accurate models for predicting materials properties.
Collapse
Affiliation(s)
- Vishu Gupta
- Department of Electrical and Computer Engineering, Northwestern University, Evanston, USA
| | - Alec Peltekian
- Department of Electrical and Computer Engineering, Northwestern University, Evanston, USA
| | - Wei-Keng Liao
- Department of Electrical and Computer Engineering, Northwestern University, Evanston, USA
| | - Alok Choudhary
- Department of Electrical and Computer Engineering, Northwestern University, Evanston, USA
| | - Ankit Agrawal
- Department of Electrical and Computer Engineering, Northwestern University, Evanston, USA.
| |
Collapse
|
14
|
Takahashi K, Takahashi L. Toward the Golden Age of Materials Informatics: Perspective and Opportunities. J Phys Chem Lett 2023; 14:4726-4733. [PMID: 37172318 DOI: 10.1021/acs.jpclett.3c00648] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/14/2023]
Abstract
Materials informatics is reaching the transition point and is evolving from its early stages of adoption and development and moving toward its golden age. Here, the transformation of the early stage of materials informatics toward the next level of materials informatics is explored. In particular, it has become crucial to be able to manipulate materials synthesis data, materials properties data, and materials characterization data. Through the use of ontology, material design and understanding can be carried out simultaneously in a whitebox manner. Here, a perspective on the ultimate goal of materials informatics along with potential key components is discussed.
Collapse
Affiliation(s)
- Keisuke Takahashi
- Department of Chemistry, Hokkaido University, North 10, West 8, Sapporo 060-0810, Japan
| | - Lauren Takahashi
- Department of Chemistry, Hokkaido University, North 10, West 8, Sapporo 060-0810, Japan
| |
Collapse
|
15
|
Gupta V, Choudhary K, Mao Y, Wang K, Tavazza F, Campbell C, Liao WK, Choudhary A, Agrawal A. MPpredictor: An Artificial Intelligence-Driven Web Tool for Composition-Based Material Property Prediction. J Chem Inf Model 2023; 63:1865-1871. [PMID: 36972592 PMCID: PMC10091406 DOI: 10.1021/acs.jcim.3c00307] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/29/2023]
Abstract
The applications of artificial intelligence, machine learning, and deep learning techniques in the field of materials science are becoming increasingly common due to their promising abilities to extract and utilize data-driven information from available data and accelerate materials discovery and design for future applications. In an attempt to assist with this process, we deploy predictive models for multiple material properties, given the composition of the material. The deep learning models described here are built using a cross-property deep transfer learning technique, which leverages source models trained on large data sets to build target models on small data sets with different properties. We deploy these models in an online software tool that takes a number of material compositions as input, performs preprocessing to generate composition-based attributes for each material, and feeds them into the predictive models to obtain up to 41 different material property values. The material property predictor is available online at http://ai.eecs.northwestern.edu/MPpredictor.
Collapse
Affiliation(s)
- Vishu Gupta
- ECE Department, Northwestern University, Evanston, Illinois 60208, United States
| | - Kamal Choudhary
- Materials Measurement Laboratory, National Institute of Standards and Technology, Gaithersburg, Maryland 20899, United States
- Theiss Research, La Jolla, California 92037, United States
- DeepMaterials LLC, Silver Spring, Maryland 20906, United States
| | - Yuwei Mao
- ECE Department, Northwestern University, Evanston, Illinois 60208, United States
| | - Kewei Wang
- ECE Department, Northwestern University, Evanston, Illinois 60208, United States
| | - Francesca Tavazza
- Materials Measurement Laboratory, National Institute of Standards and Technology, Gaithersburg, Maryland 20899, United States
| | - Carelyn Campbell
- Materials Measurement Laboratory, National Institute of Standards and Technology, Gaithersburg, Maryland 20899, United States
| | - Wei-Keng Liao
- ECE Department, Northwestern University, Evanston, Illinois 60208, United States
| | - Alok Choudhary
- ECE Department, Northwestern University, Evanston, Illinois 60208, United States
| | - Ankit Agrawal
- ECE Department, Northwestern University, Evanston, Illinois 60208, United States
| |
Collapse
|
16
|
Duarte JC, da Rocha RD, Borges I. Which molecular properties determine the impact sensitivity of an explosive? A machine learning quantitative investigation of nitroaromatic explosives. Phys Chem Chem Phys 2023; 25:6877-6890. [PMID: 36799468 DOI: 10.1039/d2cp05339j] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/16/2023]
Abstract
We decomposed density functional theory charge densities of 53 nitroaromatic molecules into atom-centered electric multipoles using the distributed multipole analysis that provides a detailed picture of the molecular electronic structure. Three electric multipoles, (the charge of the nitro groups), (the total dipole, i.e., polarization, of the nitro groups), (the total electron delocalization of the C ring atoms), and the number of explosophore groups (#NO2) were selected as features for a comprehensive machine learning (ML) investigation. The target property was the impact sensitivity h50 (cm) values quantified by drop-weight measurements, with a large h50 (e.g., 150 cm) indicating that an explosive is insensitive and vice versa. After a preliminary screening of 42 ML algorithms, four were selected based on the lowest root mean square errors: Extra Trees, Random Forests, Gradient Boosting, and AdaBoost. Compared to experimental data, the predicted h50 values of molecules having very different sensitivities for the four algorithms have differences in the range 19-28%. The most important properties for predicting h50 are the electron delocalization in the ring atoms and the polarization of the nitro groups with averaged weights of 39% and 35%, followed by the charge (16%) and number (10%) of nitro groups. A significant result is how the contribution of these properties to h50 depends on their actual sensitivities: for the most sensitive explosives (h50 up to ∼50 cm), the four properties contribute to reducing h50, and for intermediate ones (∼50 cm ≲ h50 ≲ 100 cm) #NO2 and contribute to increasing it and the other two properties to reducing it. For highly insensitive explosives (h50 ≳ 200 cm), all four properties essentially contribute to increasing it. These results furnish a consistent molecular basis of the sensitivities of known explosives that also can be used for developing safer new ones.
Collapse
Affiliation(s)
- Julio Cesar Duarte
- Departamento de Engenharia de Computação, Instituto Militar de Engenharia, Instituto Militar de Engenharia, Praça General Tibúrcio, 80, Urca, Rio de Janeiro (RJ), 22290-270, Brazil. .,Programa de Pós-Graduação em Engenharia de Defesa, Militar de Engenharia, Instituto Militar de Engenharia, Praça General Tibúrcio, 80, Urca, Rio de Janeiro (RJ), 22290-270, Brazil
| | - Romulo Dias da Rocha
- Programa de Pós-Graduação em Engenharia de Defesa, Militar de Engenharia, Instituto Militar de Engenharia, Praça General Tibúrcio, 80, Urca, Rio de Janeiro (RJ), 22290-270, Brazil
| | - Itamar Borges
- Departamento de Engenharia de Computação, Instituto Militar de Engenharia, Instituto Militar de Engenharia, Praça General Tibúrcio, 80, Urca, Rio de Janeiro (RJ), 22290-270, Brazil. .,Departamento de Química, Militar de Engenharia, Instituto Militar de Engenharia, Praça General Tibúrcio, 80, Urca, Rio de Janeiro (RJ), 22290-270, Brazil
| |
Collapse
|
17
|
Harfouche AL, Nakhle F, Harfouche AH, Sardella OG, Dart E, Jacobson D. A primer on artificial intelligence in plant digital phenomics: embarking on the data to insights journey. TRENDS IN PLANT SCIENCE 2023; 28:154-184. [PMID: 36167648 DOI: 10.1016/j.tplants.2022.08.021] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Revised: 08/22/2022] [Accepted: 08/25/2022] [Indexed: 06/16/2023]
Abstract
Artificial intelligence (AI) has emerged as a fundamental component of global agricultural research that is poised to impact on many aspects of plant science. In digital phenomics, AI is capable of learning intricate structure and patterns in large datasets. We provide a perspective and primer on AI applications to phenome research. We propose a novel human-centric explainable AI (X-AI) system architecture consisting of data architecture, technology infrastructure, and AI architecture design. We clarify the difference between post hoc models and 'interpretable by design' models. We include guidance for effectively using an interpretable by design model in phenomic analysis. We also provide directions to sources of tools and resources for making data analytics increasingly accessible. This primer is accompanied by an interactive online tutorial.
Collapse
Affiliation(s)
- Antoine L Harfouche
- Department for Innovation in Biological, Agro-Food, and Forest Systems, University of Tuscia, Viterbo, VT 01100, Italy.
| | - Farid Nakhle
- Department for Innovation in Biological, Agro-Food, and Forest Systems, University of Tuscia, Viterbo, VT 01100, Italy
| | - Antoine H Harfouche
- Unité de Formation et de Recherche en Sciences Économiques, Gestion, Mathématiques, et Informatique, Université Paris Nanterre, 92001 Nanterre, France
| | - Orlando G Sardella
- Department for Innovation in Biological, Agro-Food, and Forest Systems, University of Tuscia, Viterbo, VT 01100, Italy
| | - Eli Dart
- Energy Sciences Network (ESnet), Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Daniel Jacobson
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA
| |
Collapse
|
18
|
Lansford JL, Barnes BC, Rice BM, Jensen KF. Building Chemical Property Models for Energetic Materials from Small Datasets Using a Transfer Learning Approach. J Chem Inf Model 2022; 62:5397-5410. [PMID: 36240441 DOI: 10.1021/acs.jcim.2c00841] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
For many experimentally measured chemical properties that cannot be directly computed from first-principles, the existing physics-based models do not extrapolate well to out-of-sample molecules, and experimental datasets themselves are too small for traditional machine learning (ML) approaches. To overcome these limitations, we apply a transfer learning approach, whereby we simultaneously train a multi-target regression model on a small number of molecules with experimentally measured values and a large number of molecules with related computed properties. We demonstrate this methodology on predicting the experimentally measured impact sensitivity of energetic crystals, finding that both characteristics of the computed dataset and model architecture are important to prediction accuracy of the small experimental dataset. Our directed-message passing neural network (D-MPNN) ML model using transfer learning outperforms direct-ML and physics-based models on a diverse test set, and the new methods described here are widely applicable to modeling many other structure-property relationships.
Collapse
Affiliation(s)
- Joshua L Lansford
- U.S. Army Combat Capabilities Development Command (DEVCOM) Army Research Laboratory, Aberdeen Proving Ground, Maryland 21005, United States.,Department of Chemical Engineering, MIT, Cambridge, Massachusetts 02139, United States
| | - Brian C Barnes
- U.S. Army Combat Capabilities Development Command (DEVCOM) Army Research Laboratory, Aberdeen Proving Ground, Maryland 21005, United States
| | - Betsy M Rice
- U.S. Army Combat Capabilities Development Command (DEVCOM) Army Research Laboratory, Aberdeen Proving Ground, Maryland 21005, United States
| | - Klavs F Jensen
- Department of Chemical Engineering, MIT, Cambridge, Massachusetts 02139, United States
| |
Collapse
|
19
|
Ren Y, Chakraborty T, Doijad S, Falgenhauer L, Falgenhauer J, Goesmann A, Schwengers O, Heider D. Deep Transfer Learning Enables Robust Prediction of Antimicrobial Resistance for Novel Antibiotics. Antibiotics (Basel) 2022; 11:1611. [PMID: 36421255 PMCID: PMC9686617 DOI: 10.3390/antibiotics11111611] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2022] [Revised: 11/08/2022] [Accepted: 11/10/2022] [Indexed: 09/08/2024] Open
Abstract
Antimicrobial resistance (AMR) has become one of the serious global health problems, threatening the effective treatment of a growing number of infections. Machine learning and deep learning show great potential in rapid and accurate AMR predictions. However, a large number of samples for the training of these models is essential. In particular, for novel antibiotics, limited training samples and data imbalance hinder the models' generalization performance and overall accuracy. We propose a deep transfer learning model that can improve model performance for AMR prediction on small, imbalanced datasets. As our approach relies on transfer learning and secondary mutations, it is also applicable to novel antibiotics and emerging resistances in the future and enables quick diagnostics and personalized treatments.
Collapse
Affiliation(s)
- Yunxiao Ren
- Department of Data Science in Biomedicine, Faculty of Mathematics and Computer Science, Philipps-University of Marburg, 35032 Marburg, Germany
- Center for Synthetic Microbiology (SYNMIKRO), Philipps-University of Marburg, 35032 Marburg, Germany
| | - Trinad Chakraborty
- Institute of Medical Microbiology, Justus Liebig University Giessen, 35392 Giessen, Germany
- German Center for Infection Research, Partner Site Giessen-Marburg-Langen, 35392 Giessen, Germany
| | - Swapnil Doijad
- Institute of Medical Microbiology, Justus Liebig University Giessen, 35392 Giessen, Germany
- German Center for Infection Research, Partner Site Giessen-Marburg-Langen, 35392 Giessen, Germany
| | - Linda Falgenhauer
- German Center for Infection Research, Partner Site Giessen-Marburg-Langen, 35392 Giessen, Germany
- Institute of Hygiene and Environmental Medicine, Justus Liebig University Giessen, 35392 Giessen, Germany
- Hessisches Universitäres Kompetenzzentrum Krankenhaushygiene, 35392 Giessen, Germany
| | - Jane Falgenhauer
- Institute of Medical Microbiology, Justus Liebig University Giessen, 35392 Giessen, Germany
- German Center for Infection Research, Partner Site Giessen-Marburg-Langen, 35392 Giessen, Germany
| | - Alexander Goesmann
- German Center for Infection Research, Partner Site Giessen-Marburg-Langen, 35392 Giessen, Germany
- Department of Bioinformatics and Systems Biology, Justus Liebig University Giessen, 35392 Giessen, Germany
| | - Oliver Schwengers
- German Center for Infection Research, Partner Site Giessen-Marburg-Langen, 35392 Giessen, Germany
- Department of Bioinformatics and Systems Biology, Justus Liebig University Giessen, 35392 Giessen, Germany
| | - Dominik Heider
- Department of Data Science in Biomedicine, Faculty of Mathematics and Computer Science, Philipps-University of Marburg, 35032 Marburg, Germany
- Center for Synthetic Microbiology (SYNMIKRO), Philipps-University of Marburg, 35032 Marburg, Germany
| |
Collapse
|
20
|
Mao Y, Yang Z, Jha D, Paul A, Liao WK, Choudhary A, Agrawal A. Generative Adversarial Networks and Mixture Density Networks-Based Inverse Modeling for Microstructural Materials Design. INTEGRATING MATERIALS AND MANUFACTURING INNOVATION 2022; 11:637-647. [PMID: 36530375 PMCID: PMC9744696 DOI: 10.1007/s40192-022-00285-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/07/2022] [Accepted: 10/25/2022] [Indexed: 06/17/2023]
Abstract
There are two broad modeling paradigms in scientific applications: forward and inverse. While forward modeling estimates the observations based on known causes, inverse modeling attempts to infer the causes given the observations. Inverse problems are usually more critical as well as difficult in scientific applications as they seek to explore the causes that cannot be directly observed. Inverse problems are used extensively in various scientific fields, such as geophysics, health care and materials science. Exploring the relationships from properties to microstructures is one of the inverse problems in material science. It is challenging to solve the microstructure discovery inverse problem, because it usually needs to learn a one-to-many nonlinear mapping. Given a target property, there are multiple different microstructures that exhibit the target property, and their discovery also requires significant computing time. Further, microstructure discovery becomes even more difficult because the dimension of properties (input) is much lower than that of microstructures (output). In this work, we propose a framework consisting of generative adversarial networks and mixture density networks for inverse modeling of structure-property linkages in materials, i.e., microstructure discovery for a given property. The results demonstrate that compared to baseline methods, the proposed framework can overcome the above-mentioned challenges and discover multiple promising solutions in an efficient manner.
Collapse
Affiliation(s)
- Yuwei Mao
- Department of Electrical and Computer Engineering, Northwestern University, Evanston, IL USA
| | - Zijiang Yang
- Department of Electrical and Computer Engineering, Northwestern University, Evanston, IL USA
| | - Dipendra Jha
- Department of Electrical and Computer Engineering, Northwestern University, Evanston, IL USA
| | - Arindam Paul
- Department of Electrical and Computer Engineering, Northwestern University, Evanston, IL USA
| | - Wei-keng Liao
- Department of Electrical and Computer Engineering, Northwestern University, Evanston, IL USA
| | - Alok Choudhary
- Department of Electrical and Computer Engineering, Northwestern University, Evanston, IL USA
| | - Ankit Agrawal
- Department of Electrical and Computer Engineering, Northwestern University, Evanston, IL USA
| |
Collapse
|
21
|
Shavalieva G, Papadokonstantakis S, Peters G. Prior Knowledge for Predictive Modeling: The Case of Acute Aquatic Toxicity. J Chem Inf Model 2022; 62:4018-4031. [PMID: 35998659 PMCID: PMC9472271 DOI: 10.1021/acs.jcim.1c01079] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2021] [Indexed: 11/30/2022]
Abstract
Early assessment of the potential impact of chemicals on health and the environment requires toxicological properties of the molecules. Predictive modeling is often used to estimate the property values in silico from pre-existing experimental data, which is often scarce and uncertain. One of the ways to advance the predictive modeling procedure might be the use of knowledge existing in the field. Scientific publications contain a vast amount of knowledge. However, the amount of manual work required to process the enormous volumes of information gathered in scientific articles might hinder its utilization. This work explores the opportunity of semiautomated knowledge extraction from scientific papers and investigates a few potential ways of its use for predictive modeling. The knowledge extraction and predictive modeling are applied to the field of acute aquatic toxicity. Acute aquatic toxicity is an important parameter of the safety assessment of chemicals. The extensive amount of diverse information existing in the field makes acute aquatic toxicity an attractive area for investigation of knowledge use for predictive modeling. The work demonstrates that the knowledge collection and classification procedure could be useful in hybrid modeling studies concerning the model and predictor selection, addressing data gaps, and evaluation of models' performance.
Collapse
Affiliation(s)
- Gulnara Shavalieva
- Department
of Space, Earth and Environment, Division of Energy Technology, Chalmers University of Technology, SE-412 96 Gothenburg, Sweden
| | - Stavros Papadokonstantakis
- Department
of Space, Earth and Environment, Division of Energy Technology, Chalmers University of Technology, SE-412 96 Gothenburg, Sweden
- Institute
of Chemical, Environmental and Bioscience Engineering, TU Wien, Getreidemarkt 9, 1060 Vienna, Austria
| | - Gregory Peters
- Department
of Technology Management and Economics, Chalmers University of Technology, SE-411 33 Gothenburg, Sweden
| |
Collapse
|
22
|
Jha D, Gupta V, Liao WK, Choudhary A, Agrawal A. Moving closer to experimental level materials property prediction using AI. Sci Rep 2022; 12:11953. [PMID: 35831344 PMCID: PMC9279333 DOI: 10.1038/s41598-022-15816-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2022] [Accepted: 06/29/2022] [Indexed: 11/25/2022] Open
Abstract
While experiments and DFT-computations have been the primary means for understanding the chemical and physical properties of crystalline materials, experiments are expensive and DFT-computations are time-consuming and have significant discrepancies against experiments. Currently, predictive modeling based on DFT-computations have provided a rapid screening method for materials candidates for further DFT-computations and experiments; however, such models inherit the large discrepancies from the DFT-based training data. Here, we demonstrate how AI can be leveraged together with DFT to compute materials properties more accurately than DFT itself by focusing on the critical materials science task of predicting “formation energy of a material given its structure and composition”. On an experimental hold-out test set containing 137 entries, AI can predict formation energy from materials structure and composition with a mean absolute error (MAE) of 0.064 eV/atom; comparing this against DFT-computations, we find that AI can significantly outperform DFT computations for the same task (discrepancies of \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$>0.076$$\end{document}>0.076 eV/atom) for the first time.
Collapse
Affiliation(s)
- Dipendra Jha
- Department of Electrical and Computer Engineering, Northwestern University, Evanston, IL, 60208, USA
| | - Vishu Gupta
- Department of Electrical and Computer Engineering, Northwestern University, Evanston, IL, 60208, USA
| | - Wei-Keng Liao
- Department of Electrical and Computer Engineering, Northwestern University, Evanston, IL, 60208, USA
| | - Alok Choudhary
- Department of Electrical and Computer Engineering, Northwestern University, Evanston, IL, 60208, USA
| | - Ankit Agrawal
- Department of Electrical and Computer Engineering, Northwestern University, Evanston, IL, 60208, USA.
| |
Collapse
|
23
|
Kolluru A, Shoghi N, Shuaibi M, Goyal S, Das A, Zitnick CL, Ulissi Z. Transfer learning using attentions across atomic systems with graph neural networks (TAAG). J Chem Phys 2022; 156:184702. [PMID: 35568535 DOI: 10.1063/5.0088019] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Recent advances in Graph Neural Networks (GNNs) have transformed the space of molecular and catalyst discovery. Despite the fact that the underlying physics across these domains remain the same, most prior work has focused on building domain-specific models either in small molecules or in materials. However, building large datasets across all domains is computationally expensive; therefore, the use of transfer learning (TL) to generalize to different domains is a promising but under-explored approach to this problem. To evaluate this hypothesis, we use a model that is pretrained on the Open Catalyst Dataset (OC20), and we study the model's behavior when fine-tuned for a set of different datasets and tasks. This includes MD17, the *CO adsorbate dataset, and OC20 across different tasks. Through extensive TL experiments, we demonstrate that the initial layers of GNNs learn a more basic representation that is consistent across domains, whereas the final layers learn more task-specific features. Moreover, these well-known strategies show significant improvement over the non-pretrained models for in-domain tasks with improvements of 53% and 17% for the *CO dataset and across the Open Catalyst Project (OCP) task, respectively. TL approaches result in up to 4× speedup in model training depending on the target data and task. However, these do not perform well for the MD17 dataset, resulting in worse performance than the non-pretrained model for few molecules. Based on these observations, we propose transfer learning using attentions across atomic systems with graph Neural Networks (TAAG), an attention-based approach that adapts to prioritize and transfer important features from the interaction layers of GNNs. The proposed method outperforms the best TL approach for out-of-domain datasets, such as MD17, and gives a mean improvement of 6% over a model trained from scratch.
Collapse
Affiliation(s)
- Adeesh Kolluru
- Department of Chemical Engineering, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, USA
| | - Nima Shoghi
- Meta AI Research, Menlo Park, California 94025, USA
| | - Muhammed Shuaibi
- Department of Chemical Engineering, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, USA
| | | | - Abhishek Das
- Meta AI Research, Menlo Park, California 94025, USA
| | | | - Zachary Ulissi
- Department of Chemical Engineering, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, USA
| |
Collapse
|