1
|
Jiang C, He H, Guo H, Zhang X, Han Q, Weng Y, Fu X, Zhu Y, Yan N, Tu X, Sun Y. Transfer learning guided discovery of efficient perovskite oxide for alkaline water oxidation. Nat Commun 2024; 15:6301. [PMID: 39060252 PMCID: PMC11282268 DOI: 10.1038/s41467-024-50605-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2024] [Accepted: 07/16/2024] [Indexed: 07/28/2024] Open
Abstract
Perovskite oxides show promise for the oxygen evolution reaction. However, numerical chemical compositions remain unexplored due to inefficient trial-and-error methods for material discovery. Here, we develop a transfer learning paradigm incorporating a pre-trained model, ensemble learning, and active learning, enabling the prediction of undiscovered perovskite oxides with enhanced generalizability for this reaction. Screening 16,050 compositions leads to the identification and synthesis of 36 new perovskite oxides, including 13 pure perovskite structures. Pr0.1Sr0.9Co0.5Fe0.5O3 and Pr0.1Sr0.9Co0.5Fe0.3Mn0.2O3 exhibit low overpotentials of 327 mV and 315 mV at 10 mA cm-2, respectively. Electrochemical measurements reveal coexistence of absorbate evolution and lattice oxygen mechanisms for O-O coupling in both materials. Pr0.1Sr0.9Co0.5Fe0.3Mn0.2O3 demonstrates enhanced OH- affinity compared to Pr0.1Sr0.9Co0.5Fe0.5O3, with the emergence of oxo-bridged Mn-Co conjugate facilitating charge redistribution and dynamic reversibility of Olattice/VO, thereby slowing down Co dissolution. This work paves the way for accelerated discovery and development of high-performance perovskite oxide electrocatalysts for this reaction.
Collapse
Affiliation(s)
- Chang Jiang
- College of Energy, Xiamen University, Xiamen, China
| | - Hongyuan He
- Department of Electrical Engineering and Electronics, University of Liverpool, Liverpool, UK
| | - Hongquan Guo
- College of Energy, Xiamen University, Xiamen, China
| | | | - Qingyang Han
- College of Energy, Xiamen University, Xiamen, China
| | - Yanhong Weng
- Shenzhen Key Laboratory of Energy Electrocatalytic Materials, Guangdong Research Center for Interfacial Engineering of Functional Materials, College of Materials Science and Engineering, Shenzhen University, Shenzhen, China
| | - Xianzhu Fu
- Shenzhen Key Laboratory of Energy Electrocatalytic Materials, Guangdong Research Center for Interfacial Engineering of Functional Materials, College of Materials Science and Engineering, Shenzhen University, Shenzhen, China
| | - Yinlong Zhu
- Institute for Frontier Science, Nanjing University of Aeronautics and Astronautics, Nanjing, China
| | - Ning Yan
- School of Physics and Technology, Wuhan University, Wuhan, China
| | - Xin Tu
- Department of Electrical Engineering and Electronics, University of Liverpool, Liverpool, UK.
| | - Yifei Sun
- College of Energy, Xiamen University, Xiamen, China.
- State Key Laboratory of Physical Chemistry of Solid Surface, Xiamen University, Xiamen, China.
- Shenzhen Research, Institute of Xiamen University, Shenzhen, China.
| |
Collapse
|
2
|
Zhai X, Chen M. Accelerated Design for Perovskite-Oxide-Based Photocatalysts Using Machine Learning Techniques. MATERIALS (BASEL, SWITZERLAND) 2024; 17:3026. [PMID: 38930399 PMCID: PMC11206125 DOI: 10.3390/ma17123026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/22/2024] [Revised: 06/05/2024] [Accepted: 06/08/2024] [Indexed: 06/28/2024]
Abstract
The rapid discovery of photocatalysts with desired performance among tens of thousands of potential perovskites represents a significant advancement. To expedite the design of perovskite-oxide-based photocatalysts, we developed a model of ABO3-type perovskites using machine learning methods based on atomic and experimental parameters. This model can be used to predict specific surface area (SSA), a key parameter closely associated with photocatalytic activity. The model construction involved several steps, including data collection, feature selection, model construction, web-service development, virtual screening and mechanism elucidation. Statistical analysis revealed that the support vector regression model achieved a correlation coefficient of 0.9462 for the training set and 0.8786 for the leave-one-out cross-validation. The potential perovskites with higher SSA than the highest SSA observed in the existing dataset were identified using the model and our computation platform. We also developed a webserver of the model, freely accessible to users. The methodologies outlined in this study not only facilitate the discovery of new perovskites but also enable exploration of the correlations between the perovskite properties and the physicochemical features. These findings provide valuable insights for further research and applications of perovskites using machine learning techniques.
Collapse
Affiliation(s)
- Xiuyun Zhai
- College of Intelligent Manufacturing, Hunan University of Science and Engineering, Yongzhou 425199, China
| | - Mingtong Chen
- Public Experimental Teaching Center, Panzhihua University, Panzhihua 617000, China
| |
Collapse
|
3
|
Zheng F, Lu J, Zhu Z, Jiang H, Yan Y, He Y, Yuan S, Sun Q. Predicting Molecular Self-Assembly on Metal Surfaces Using Graph Neural Networks Based on Experimental Data Sets. ACS NANO 2023; 17:17545-17553. [PMID: 37611029 DOI: 10.1021/acsnano.3c06405] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/25/2023]
Abstract
The application of supramolecular chemistry on solid surfaces has received extensive attention in the past few decades. To date, combining experiments with quantum mechanical or molecular dynamic methods represents the key strategy to explore the molecular self-assembled structures, which is, however, often laborious. Recently, machine learning (ML) has become one of the most exciting tools in material research, allowing for both efficiency and accuracy in predicting molecular properties. In this work, we constructed a graph neural network to predict the self-assembly of functional polycyclic aromatic hydrocarbons (PAHs) on metal surfaces. Using scanning tunneling microscopy (STM), we characterized the self-assembled nanostructures of a homologous series of PAH molecules on different metal surfaces to construct an experimental data set for model training. Compared with traditional ML algorithms, our model exhibits better predictive performance. Finally, the generalization of the model is further verified by comparing the ML predictions and experimental results of different functionalized molecule. Our results demonstrate training experimental data sets to produce a predictive ML model of molecular self-assembly with generalization performance, which allows for the predictive design of nanostructures with functional molecules.
Collapse
Affiliation(s)
- Fengru Zheng
- Materials Genome Institute, Shanghai University, 200444 Shanghai, China
| | - Jiayi Lu
- Materials Genome Institute, Shanghai University, 200444 Shanghai, China
| | - Zhiwen Zhu
- Materials Genome Institute, Shanghai University, 200444 Shanghai, China
| | - Hao Jiang
- Materials Genome Institute, Shanghai University, 200444 Shanghai, China
| | - Yuyi Yan
- Materials Genome Institute, Shanghai University, 200444 Shanghai, China
| | - Yu He
- Materials Genome Institute, Shanghai University, 200444 Shanghai, China
| | - Shaoxuan Yuan
- Materials Genome Institute, Shanghai University, 200444 Shanghai, China
| | - Qiang Sun
- Materials Genome Institute, Shanghai University, 200444 Shanghai, China
| |
Collapse
|
4
|
Wang J, Xu P, Ji X, Li M, Lu W. Feature Selection in Machine Learning for Perovskite Materials Design and Discovery. MATERIALS (BASEL, SWITZERLAND) 2023; 16:3134. [PMID: 37109971 PMCID: PMC10146176 DOI: 10.3390/ma16083134] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/19/2023] [Revised: 04/11/2023] [Accepted: 04/13/2023] [Indexed: 06/19/2023]
Abstract
Perovskite materials have been one of the most important research objects in materials science due to their excellent photoelectric properties as well as correspondingly complex structures. Machine learning (ML) methods have been playing an important role in the design and discovery of perovskite materials, while feature selection as a dimensionality reduction method has occupied a crucial position in the ML workflow. In this review, we introduced the recent advances in the applications of feature selection in perovskite materials. First, the development tendency of publications about ML in perovskite materials was analyzed, and the ML workflow for materials was summarized. Then the commonly used feature selection methods were briefly introduced, and the applications of feature selection in inorganic perovskites, hybrid organic-inorganic perovskites (HOIPs), and double perovskites (DPs) were reviewed. Finally, we put forward some directions for the future development of feature selection in machine learning for perovskite material design.
Collapse
Affiliation(s)
- Junya Wang
- Department of Mathematics, College of Sciences, Shanghai University, Shanghai 200444, China
| | - Pengcheng Xu
- Materials Genome Institute, Shanghai University, Shanghai 200444, China
| | - Xiaobo Ji
- Department of Chemistry, College of Sciences, Shanghai University, Shanghai 200444, China
| | - Minjie Li
- Department of Chemistry, College of Sciences, Shanghai University, Shanghai 200444, China
| | - Wencong Lu
- Department of Chemistry, College of Sciences, Shanghai University, Shanghai 200444, China
- Zhejiang Laboratory, Hangzhou 311100, China
- Key Laboratory of Silicate Cultural Relics Conservation (Shanghai University), Ministry of Education, Shanghai 200444, China
| |
Collapse
|
5
|
Banerjee A, Gajewicz-Skretna A, Roy K. A machine learning q-RASPR approach for efficient predictions of the specific surface area of perovskites. Mol Inform 2023; 42:e2200261. [PMID: 36618002 DOI: 10.1002/minf.202200261] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Revised: 01/05/2023] [Accepted: 01/06/2023] [Indexed: 01/10/2023]
Abstract
In this study, the specific surface area of various perovskites was modeled using a novel quantitative read-across structure-property relationship (q-RASPR) approach, which clubs both Read-Across (RA) and quantitative structure-property relationship (QSPR) together. After optimization of the hyper-parameters, certain similarity-based error measures for each query compound were obtained. Clubbing some of these error-based measures with the previously selected features along with the Read-Across prediction function, a number of machine learning models were developed using Partial Least Squares (PLS), Ridge Regression (RR), Linear Support Vector Regression (LSVR), Random Forest (RF) regression, Gradient Boost (GBoost), Adaptive Boosting (Adaboost), Multiple Layer Perceptron (MLP) regression and k-Nearest Neighbor (kNN) regression. Based on the repeated cross-validation as well as external prediction quality and interpretability, the PLS model (nTraining = 38, nTest = 12, R T r a i n 2 ${{R}_{Train}^{2}}$ =0.737, Q L O O 2 = 0 . 637 , R T e s t 2 = 0 . 898 , Q F 1 T e s t 2 = 0 . 901 ) ${{Q}_{LOO}^{2}=0.637,\ {R}_{Test}^{2}=0.898,{\rm \ }\ {Q}_{F1\left(Test\right)}^{2}=0.901)}$ was selected as the best predictor which underscored the previously reported results. The finally selected model should efficiently predict specific surface areas of other perovskites for their use in photocatalysis. The new q-RASPR method also appears promising for the prediction of several other property endpoints of interest in materials science.
Collapse
Affiliation(s)
- Arkaprava Banerjee
- Drug Theoretics and Cheminformatics Laboratory, Department of Pharmaceutical Technology, Jadavpur University, Kolkata, 700 032, India
| | - Agnieszka Gajewicz-Skretna
- Laboratory of Environmental Chemoinformatics, Faculty of Chemistry, University of Gdansk, Gdansk, Wita Stwosza 63, 80-308 Gdansk, Poland
| | - Kunal Roy
- Drug Theoretics and Cheminformatics Laboratory, Department of Pharmaceutical Technology, Jadavpur University, Kolkata, 700 032, India
| |
Collapse
|
6
|
Xu P, Chang D, Lu T, Li L, Li M, Lu W. Search for ABO 3 Type Ferroelectric Perovskites with Targeted Multi-Properties by Machine Learning Strategies. J Chem Inf Model 2022; 62:5038-5049. [PMID: 34375112 DOI: 10.1021/acs.jcim.1c00566] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Ferroelectric perovskites are one of the most promising functional materials due to the pyroelectric and piezoelectric effect. In the practical applications of ferroelectric perovskites, it is often necessary to meet the requirements of multiple properties. In this work, a multiproperties machine learning strategy was proposed to accelerate the discovery and design of new ferroelectric ABO3-type perovskites. First, a classification model was constructed with data collected from publications to distinguish ferroelectric and nonferroelectric perovskites. The classification accuracies of LOOCV and the test set are 87.29% and 86.21%, respectively. Then, two machine learning strategies, Machine-Learning Workflow and SISSO, were used to construct the regression models to predict the specific surface area (SSA), band gap (Eg), Curie temperature (Tc), and dielectric loss (tan δ) of ABO3-type perovskites. The correlation coefficients of LOOCV in the optimal models for SSA, Eg, and Tc are 0.935, 0.891, and 0.971, respectively, while the correlation coefficient of the predicted and experimental values of the SISSO model for tan δ prediction could reach 0.913. On the basis of the models, 20 ABO3 ferroelectric perovskites with three different application prospects were screened out with the required properties, which could be explained by the patterns between the important descriptors and the properties by using SHAP. Furthermore, the constructed models were developed into web servers for the researchers to accelerate the rational design and discovery of ABO3 ferroelectric perovskites with desired multiple properties.
Collapse
Affiliation(s)
- Pengcheng Xu
- Materials Genome Institute, Shanghai University, and Shanghai Materials Genome Institute, Shanghai 200444, China
| | - Dongping Chang
- Materials Genome Institute, Shanghai University, and Shanghai Materials Genome Institute, Shanghai 200444, China
| | - Tian Lu
- Materials Genome Institute, Shanghai University, and Shanghai Materials Genome Institute, Shanghai 200444, China
| | - Long Li
- Department of Chemistry, College of Sciences, Shanghai University, Shanghai 200444, China
| | - Minjie Li
- Department of Chemistry, College of Sciences, Shanghai University, Shanghai 200444, China
| | - Wencong Lu
- Materials Genome Institute, Shanghai University, and Shanghai Materials Genome Institute, Shanghai 200444, China.,Department of Chemistry, College of Sciences, Shanghai University, Shanghai 200444, China
| |
Collapse
|
7
|
Wang X, Lu T, Zhou W, Ji X, Lu W, Yang J. Accelerated Discovery of Ternary Gold Alloy Materials with Low Resistivity via an Interpretable Machine Learning Strategy. Chem Asian J 2022; 17:e202200771. [DOI: 10.1002/asia.202200771] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2022] [Revised: 09/06/2022] [Indexed: 11/08/2022]
Affiliation(s)
| | - Tian Lu
- Shanghai University Shanghai Materials Genome Institute CHINA
| | - Wenyan Zhou
- Sino-Platinum Metals co.,Ltd Sino-Platinum Metals Co., Ltd, Kunming CHINA
| | - Xiaobo Ji
- Shanghai University Department of Chemistry, College of Sciences CHINA
| | - Wencong Lu
- Shanghai University Materials Genome Institute 99 Shangda Road, Shanghai Shanghai CHINA
| | - Jiong Yang
- Shanghai University Materials Genome Institute Shanghai University CHINA
| |
Collapse
|
8
|
Bai Q, Duan Y, Lian J, Wang X. Computation-accelerated discovery of the K2NiF4-type oxyhydrides combing density functional theory and machine learning approach. Front Chem 2022; 10:964953. [PMID: 36092671 PMCID: PMC9458981 DOI: 10.3389/fchem.2022.964953] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2022] [Accepted: 07/29/2022] [Indexed: 11/13/2022] Open
Abstract
The emerging K2NiF4-type oxyhydrides with unique hydride ions (H−) and O2- coexisting in the anion sublattice offer superior functionalities for numerous applications. However, the exploration and innovations of the oxyhydrides are challenged by their rarity as a limited number of compounds reported in experiments, owing to the stringent laboratory conditions. Herein, we employed a suite of computations involving ab initio methods, informatics and machine learning to investigate the stability relationship of the K2NiF4-type oxyhydrides. The comprehensive stability map of the oxyhydrides chemical space was constructed to identify 76 new compounds with good thermodynamic stabilities using the high-throughput computations. Based on the established database, we reveal geometric constraints and electronegativities of cationic elements as significant factors governing the oxyhydrides stabilities via informatics tools. Besides fixed stoichiometry compounds, mixed-cation oxyhydrides can provide promising properties due to the enhancement of compositional tunability. However, the exploration of the mixed compounds is hindered by their huge quantity and the rarity of stable oxyhydrides. Therefore, we propose a two-step machine learning workflow consisting of a simple transfer learning to discover 114 formable oxyhydrides from thousands of unknown mixed compositions. The predicted high H− conductivities of the representative oxyhydrides indicate their suitability as energy conversion materials. Our study provides an insight into the oxyhydrides chemistry which is applicable to other mixed-anion systems, and demonstrates an efficient computational paradigm for other materials design applications, which are challenged by the unavailable and highly unbalanced materials database.
Collapse
Affiliation(s)
- Qiang Bai
- *Correspondence: Qiang Bai, ; Xiaomin Wang,
| | | | | | | |
Collapse
|
9
|
Cai X, Liu F, Yu A, Qin J, Hatamvand M, Ahmed I, Luo J, Zhang Y, Zhang H, Zhan Y. Data-driven design of high-performance MASn xPb 1-xI 3 perovskite materials by machine learning and experimental realization. LIGHT, SCIENCE & APPLICATIONS 2022; 11:234. [PMID: 35882845 PMCID: PMC9325779 DOI: 10.1038/s41377-022-00924-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/15/2022] [Revised: 06/13/2022] [Accepted: 06/30/2022] [Indexed: 05/12/2023]
Abstract
The photovoltaic performance of perovskite solar cell is determined by multiple interrelated factors, such as perovskite compositions, electronic properties of each transport layer and fabrication parameters, which makes it rather challenging for optimization of device performances and discovery of underlying mechanisms. Here, we propose and realize a novel machine learning approach based on forward-reverse framework to establish the relationship between key parameters and photovoltaic performance in high-profile MASnxPb1-xI3 perovskite materials. The proposed method establishes the asymmetrically bowing relationship between band gap and Sn composition, which is precisely verified by our experiments. Based on the analysis of structural evolution and SHAP library, the rapid-change region and low-bandgap plateau region for small and large Sn composition are explained, respectively. By establishing the models for photovoltaic parameters of working photovoltaic devices, the deviation of short-circuit current and open-circuit voltage with band gap in defective-zone and low-bandgap-plateau regions from Shockley-Queisser theory is captured by our models, and the former is due to the deep-level traps formed by crystallographic distortion and the latter is due to the enhanced susceptibility by increased Sn4+ content. The more difficulty for hole extraction than electron is also concluded in the models and the prediction curve of power conversion efficiency is in a good agreement with Shockley-Queisser limit. With the help of search and optimization algorithms, an optimized Sn:Pb composition ratio near 0.6 is finally obtained for high-performance perovskite solar cells, then verified by our experiments. Our constructive method could also be applicable to other material optimization and efficient device development.
Collapse
Affiliation(s)
- Xia Cai
- School of Information Science and Technology, Fudan University, Shanghai, 200433, China
- College of Information, Mechanical and Electrical Engineering, Shanghai Normal University, Shanghai, 200234, China
- Center of Micro-Nano System, Fudan University, Shanghai, 200433, China
| | - Fengcai Liu
- School of Information Science and Technology, Fudan University, Shanghai, 200433, China
- Center of Micro-Nano System, Fudan University, Shanghai, 200433, China
| | - Anran Yu
- School of Information Science and Technology, Fudan University, Shanghai, 200433, China
- Center of Micro-Nano System, Fudan University, Shanghai, 200433, China
| | - Jiajun Qin
- Department of Physics, Chemistry and Biology, Linköping University, Linköping, SE-58183, Sweden
| | - Mohammad Hatamvand
- School of Information Science and Technology, Fudan University, Shanghai, 200433, China
- Center of Micro-Nano System, Fudan University, Shanghai, 200433, China
| | - Irfan Ahmed
- School of Information Science and Technology, Fudan University, Shanghai, 200433, China
- Center of Micro-Nano System, Fudan University, Shanghai, 200433, China
| | - Jiayan Luo
- School of Information Science and Technology, Fudan University, Shanghai, 200433, China
- Center of Micro-Nano System, Fudan University, Shanghai, 200433, China
| | - Yiming Zhang
- School of Information Science and Technology, Fudan University, Shanghai, 200433, China
- Key Laboratory of Micro and Nano Photonic Structures and Department of Optical Science and Engineering, Fudan University, Shanghai, 200433, China
| | - Hao Zhang
- School of Information Science and Technology, Fudan University, Shanghai, 200433, China.
- Key Laboratory of Micro and Nano Photonic Structures and Department of Optical Science and Engineering, Fudan University, Shanghai, 200433, China.
- Yiwu Research Institute of Fudan University, Chengbei Road, Yiwu City, Zhejiang, 322000, China.
| | - Yiqiang Zhan
- School of Information Science and Technology, Fudan University, Shanghai, 200433, China.
- Center of Micro-Nano System, Fudan University, Shanghai, 200433, China.
| |
Collapse
|
10
|
Xu P, Chen C, Chen S, Lu W, Qian Q, Zeng Y. Machine Learning-Assisted Design of Yttria-Stabilized Zirconia Thermal Barrier Coatings with High Bonding Strength. ACS OMEGA 2022; 7:21052-21061. [PMID: 35755382 PMCID: PMC9219529 DOI: 10.1021/acsomega.2c01839] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/26/2022] [Accepted: 05/30/2022] [Indexed: 06/15/2023]
Abstract
As a high-quality thermal barrier coating material, yttria-stabilized zirconia (YSZ) can effectively reduce the temperature of the collective materials to be used on the surface of gas turbine hot-end components. The bonding strength between YSZ and the substrate is also one of the most important factors for the applications. Herein, the Gaussian mixture model (GMM) and support vector regression (SVR) were used to construct a machine learning model between YSZ coating bonding strength and atmospheric plasma spraying (APS) process parameters. First, GMM was used to expand the original 8 data points to 400 with the R value of leave-one-out cross-validation improved from 0.690 to 0.990. Then, the specific effects of APS process parameters were explored through Shapley additive explanations and sensitivity analysis. Principal component analysis was used to explain the constructed model and obtain the optimized area with a high bonding strength. After experimental validation, the results showed that under the APS process parameters of a current of 617 A, a voltage of 65 V, a H2 flow of 3 L min-1, and a thickness of 200 μm, the bonding strength increased by more than 19% to 55.5 MPa compared with the original maximum value of 46.6 MPa, indicating that the constructed GMM-SVR model can accurately predict the bonding strength of YSZ coating.
Collapse
Affiliation(s)
- Pengcheng Xu
- Materials
Genome Institute, Shanghai University, Shanghai 200444, China
| | - Can Chen
- The
State Key Lab of High Performance Ceramics and Superfine Micro-structure,
Shanghai Institute of Ceramics, Chinese
Academy of Sciences, Shanghai 200050, China
| | - Shuizhou Chen
- School
of Computer Engineering and Science, Shanghai
University, Shanghai 200444, China
| | - Wencong Lu
- Materials
Genome Institute, Shanghai University, Shanghai 200444, China
- Department
of Chemistry, College of Sciences, Shanghai
University, Shanghai 200444, China
| | - Quan Qian
- School
of Computer Engineering and Science, Shanghai
University, Shanghai 200444, China
| | - Yi Zeng
- The
State Key Lab of High Performance Ceramics and Superfine Micro-structure,
Shanghai Institute of Ceramics, Chinese
Academy of Sciences, Shanghai 200050, China
| |
Collapse
|
11
|
Zhang S, Lu T, Xu P, Tao Q, Li M, Lu W. Predicting the Formability of Hybrid Organic-Inorganic Perovskites via an Interpretable Machine Learning Strategy. J Phys Chem Lett 2021; 12:7423-7430. [PMID: 34337946 DOI: 10.1021/acs.jpclett.1c01939] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
Predicting the formability of perovskite structure for hybrid organic-inorganic perovskites (HOIPs) is a prominent challenge in the search for the required materials from a huge search space. Here, we propose an interpretable strategy combining machine learning with a shapley additive explanations (SHAP) approach to accelerate the discovery of potential HOIPs. According to the prediction of the best classification model, top-198 nontoxic candidates with a probability of formability (Pf) of >0.99 are screened from 18560 virtual samples. The SHAP analysis reveals that the radius and lattice constant of the B site (rB and LCB) are positively related to formability, while the ionic radius of the A site (rA), the tolerant factor (t), and the first ionization energy of the B site (I1B) have negative relations. The significant finding is that stricter ranges of t (0.84-1.12) and improved tolerant factor τ (critical value of 6.20) do exist for HOIPs, which are different from inorganic perovskites, providing a simple and fast assessment in the design of materials with an HOIP structure.
Collapse
Affiliation(s)
- Shilin Zhang
- Department of Chemistry, College of Sciences, Shanghai University, Shanghai 200444, China
| | - Tian Lu
- Materials Genome Institute, Shanghai University, Shanghai 200444, China
| | - Pengcheng Xu
- Materials Genome Institute, Shanghai University, Shanghai 200444, China
| | - Qiuling Tao
- Department of Chemistry, College of Sciences, Shanghai University, Shanghai 200444, China
| | - Minjie Li
- Department of Chemistry, College of Sciences, Shanghai University, Shanghai 200444, China
| | - Wencong Lu
- Department of Chemistry, College of Sciences, Shanghai University, Shanghai 200444, China
- Materials Genome Institute, Shanghai University, Shanghai 200444, China
| |
Collapse
|
12
|
Xu P, Lu T, Ju L, Tian L, Li M, Lu W. Machine Learning Aided Design of Polymer with Targeted Band Gap Based on DFT Computation. J Phys Chem B 2021; 125:601-611. [DOI: 10.1021/acs.jpcb.0c08674] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Affiliation(s)
- Pengcheng Xu
- Materials Genome Institute, Shanghai University, and Shanghai Materials Genome Institute, Shanghai 200444, China
| | - Tian Lu
- Materials Genome Institute, Shanghai University, and Shanghai Materials Genome Institute, Shanghai 200444, China
| | - Lifei Ju
- Department of Chemistry, College of Sciences, Shanghai University, Shanghai 200444, China
| | - Lumin Tian
- Department of Chemistry, College of Sciences, Shanghai University, Shanghai 200444, China
| | - Minjie Li
- Department of Chemistry, College of Sciences, Shanghai University, Shanghai 200444, China
| | - Wencong Lu
- Materials Genome Institute, Shanghai University, and Shanghai Materials Genome Institute, Shanghai 200444, China
- Department of Chemistry, College of Sciences, Shanghai University, Shanghai 200444, China
| |
Collapse
|
13
|
Affiliation(s)
- Hanoch Senderowitz
- Department of Chemistry , Bar Ilan University , Ramat-Gan 5290002 , Israel
| | - Alexander Tropsha
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy , University of North Carolina , Chapel Hill , North Carolina 27599 , United States
| |
Collapse
|
14
|
Ma R, Liu Z, Zhang Q, Liu Z, Luo T. Evaluating Polymer Representations via Quantifying Structure-Property Relationships. J Chem Inf Model 2019; 59:3110-3119. [PMID: 31268306 DOI: 10.1021/acs.jcim.9b00358] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Machine learning techniques are being applied in quantifying structure-property relationships for a wide variety of materials, where the properly represented materials play key roles. Although algorithms for representation learning are extensively studied, their applications to domain-specific areas, such as polymers, are limited largely due to the lack of benchmark databases. In this work, we investigate different types of polymer representations, including Morgan fingerprint (MF), molecular embedding (ME), and molecular graph (MG), based on the benchmark database from a subset of the well-known web-based polymer databases, PolyInfo. We evaluate the quality of different polymer representations via quantifying the relationships between the representations and polymer properties, including density, melting temperature, and glass transition temperature. Different representation learning schemes for MEs, such as supervised learning, semisupervised learning, and transfer learning, are investigated. In supervised learning, only labeled molecules in our benchmark database are used for representation learning, in semisupervised learning, both labeled and unlabeled molecules in our benchmark database are used, and in transfer learning, molecules from an external database that is different from the benchmark database are used for representation learning. It is found that ME (with the R2 of 0.724 in the density case, 0.684 in the melting temperature case, and 0.865 in the glass transition temperature case) outperforms the other representations for structure-property relationship quantification in all cases studied, and MG (with the R2 of 0.260 in the density case, -0.149 in the melting temperature case, and 0.711 in the glass transition case) is shown to be much inferior to ME and MF (with the R2 of 0.562 in the density case, 0.645 in the melting temperature case, and 0.849 in the glass transition case), likely due to the relatively small volumes of training data available. For MEs, it is found that the similarities of substructure MEs under different learning schemes (e.g., SL, SSL, and TL) are differently estimated, thus leading to different performance scores in structure-property relation quantification. Combinations of MEs show little effect on predictive performance when comparing to the single MEs in the corresponding regression tasks, proving no information gain in mixing MEs.
Collapse
Affiliation(s)
- Ruimin Ma
- Department of Aerospace and Mechanical Engineering , University of Notre Dame , Notre Dame , Indiana 46556 , United States
| | - Zeyu Liu
- Department of Aerospace and Mechanical Engineering , University of Notre Dame , Notre Dame , Indiana 46556 , United States
| | - Quanwei Zhang
- Department of Aerospace and Mechanical Engineering , University of Notre Dame , Notre Dame , Indiana 46556 , United States
| | - Zhiyu Liu
- Department of Aerospace and Mechanical Engineering , University of Notre Dame , Notre Dame , Indiana 46556 , United States
| | - Tengfei Luo
- Department of Aerospace and Mechanical Engineering , University of Notre Dame , Notre Dame , Indiana 46556 , United States.,Department of Chemical and Biomolecular Engineering , University of Notre Dame , Notre Dame , Indiana 46556 , United States
| |
Collapse
|