1
|
Purcell TAR, Scheffler M, Ghiringhelli LM. Recent advances in the SISSO method and their implementation in the SISSO++ code. J Chem Phys 2023; 159:114110. [PMID: 37721326 DOI: 10.1063/5.0156620] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2023] [Accepted: 08/21/2023] [Indexed: 09/19/2023] Open
Abstract
Accurate and explainable artificial-intelligence (AI) models are promising tools for accelerating the discovery of new materials. Recently, symbolic regression has become an increasingly popular tool for explainable AI because it yields models that are relatively simple analytical descriptions of target properties. Due to its deterministic nature, the sure-independence screening and sparsifying operator (SISSO) method is a particularly promising approach for this application. Here, we describe the new advancements of the SISSO algorithm, as implemented into SISSO++, a C++ code with Python bindings. We introduce a new representation of the mathematical expressions found by SISSO. This is a first step toward introducing "grammar" rules into the feature creation step. Importantly, by introducing a controlled nonlinear optimization to the feature creation step, we expand the range of possible descriptors found by the methodology. Finally, we introduce refinements to the solver algorithms for both regression and classification, which drastically increase the reliability and efficiency of SISSO. For all these improvements to the basic SISSO algorithm, we not only illustrate their potential impact but also fully detail how they operate both mathematically and computationally.
Collapse
Affiliation(s)
- Thomas A R Purcell
- The NOMAD Laboratory at the FHI of the Max-Planck-Gesellschaft and IRIS-Adlershof of the Humboldt-Universität zu Berlin, Faradayweg 4-6, D-14195 Berlin, Germany
| | - Matthias Scheffler
- The NOMAD Laboratory at the FHI of the Max-Planck-Gesellschaft and IRIS-Adlershof of the Humboldt-Universität zu Berlin, Faradayweg 4-6, D-14195 Berlin, Germany
| | - Luca M Ghiringhelli
- The NOMAD Laboratory at the FHI of the Max-Planck-Gesellschaft and IRIS-Adlershof of the Humboldt-Universität zu Berlin, Faradayweg 4-6, D-14195 Berlin, Germany
- Physics Department and IRIS-Adlershof, Humboldt Universität zu Berlin, Zum Großen Windkanal 2, D-12489 Berlin, Germany
| |
Collapse
|
2
|
Foppa L, Purcell TAR, Levchenko SV, Scheffler M, Ghringhelli LM. Hierarchical Symbolic Regression for Identifying Key Physical Parameters Correlated with Bulk Properties of Perovskites. PHYSICAL REVIEW LETTERS 2022; 129:055301. [PMID: 35960572 DOI: 10.1103/physrevlett.129.055301] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/23/2022] [Revised: 04/27/2022] [Accepted: 06/10/2022] [Indexed: 06/15/2023]
Abstract
Symbolic regression identifies nonlinear, analytical expressions relating materials properties and key physical parameters. However, the pool of expressions grows rapidly with complexity, compromising its efficiency. We tackle this challenge hierarchically: identified expressions are used as inputs for further obtaining more complex expressions. Crucially, this framework can transfer knowledge among properties, as demonstrated using the sure-independence-screening-and-sparsifying-operator approach to identify expressions for lattice constant and cohesive energy, which are then used to model the bulk modulus of ABO_{3} perovskites.
Collapse
Affiliation(s)
- Lucas Foppa
- The NOMAD Laboratory at Fritz-Haber-Institut der Max-Planck-Gesellschaft, Faradayweg 4-6, D-14195 Berlin, Germany
- The NOMAD Laboratory at Humboldt-Universität zu Berlin, Zum Großen Windkanal 6, D-12489 Berlin, Germany
| | - Thomas A R Purcell
- The NOMAD Laboratory at Fritz-Haber-Institut der Max-Planck-Gesellschaft, Faradayweg 4-6, D-14195 Berlin, Germany
- The NOMAD Laboratory at Humboldt-Universität zu Berlin, Zum Großen Windkanal 6, D-12489 Berlin, Germany
| | - Sergey V Levchenko
- Skolkovo Institute of Science and Technology, Bolshoy Boulevard 30/1, 121205 Moscow, Russia
| | - Matthias Scheffler
- The NOMAD Laboratory at Fritz-Haber-Institut der Max-Planck-Gesellschaft, Faradayweg 4-6, D-14195 Berlin, Germany
- The NOMAD Laboratory at Humboldt-Universität zu Berlin, Zum Großen Windkanal 6, D-12489 Berlin, Germany
| | - Luca M Ghringhelli
- The NOMAD Laboratory at Fritz-Haber-Institut der Max-Planck-Gesellschaft, Faradayweg 4-6, D-14195 Berlin, Germany
- The NOMAD Laboratory at Humboldt-Universität zu Berlin, Zum Großen Windkanal 6, D-12489 Berlin, Germany
| |
Collapse
|
3
|
Zheng J, Sun X, Hu J, Wang S, Yao Z, Deng S, Pan X, Pan Z, Wang J. Symbolic Transformer Accelerating Machine Learning Screening of Hydrogen and Deuterium Evolution Reaction Catalysts in MA 2Z 4 Materials. ACS APPLIED MATERIALS & INTERFACES 2021; 13:50878-50891. [PMID: 34672634 DOI: 10.1021/acsami.1c13236] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Two-dimensional (2D) materials have been developed into various catalysts with high performance, but employing them for developing highly stable and active nonprecious hydrogen evolution reaction (HER) catalysts still encounters many challenges. To this end, the machine learning (ML) screening of HER catalysts is accelerated by using genetic programming (GP) of symbolic transformers for various typical 2D MA2Z4 materials. The values of the Gibbs free energy of hydrogen adsorption (ΔGH*) are accurately and rapidly predicted via extreme gradient boosting regression by using only simple GP-processed elemental features, with a low predictive root-mean-square error of 0.14 eV. With the analysis of ML and density functional theory (DFT) methods, it is found that various electronic structural properties of metal atoms and the p-band center of surface atoms play a crucial role in regulating the HER performance. Based on these findings, NbSi2N4 and VSi2N4 are discovered to be active catalysts with thermodynamical and dynamical stability as ΔGH* approaches to zero (-0.041 and 0.024 eV). In addition, DFT calculations reveal that these catalysts also exhibit good deuterium evolution reaction (DER) performance. Overall, a multistep workflow is developed through ML models combined with DFT calculations for efficiently screening the potential HER and DER catalysts from 2D materials with the same crystal prototype, which is believed to have significant contribution to catalyst design and fabrication.
Collapse
Affiliation(s)
- Jingnan Zheng
- Institute of Industrial Catalysis, State Key Laboratory Breeding Base of Green-Chemical Synthesis Technology, College of Chemical Engineering, Zhejiang University of Technology, Hangzhou 310032, P. R. China
| | | | | | - ShiBin Wang
- Institute of Industrial Catalysis, State Key Laboratory Breeding Base of Green-Chemical Synthesis Technology, College of Chemical Engineering, Zhejiang University of Technology, Hangzhou 310032, P. R. China
| | - Zihao Yao
- Institute of Industrial Catalysis, State Key Laboratory Breeding Base of Green-Chemical Synthesis Technology, College of Chemical Engineering, Zhejiang University of Technology, Hangzhou 310032, P. R. China
| | - Shengwei Deng
- Institute of Industrial Catalysis, State Key Laboratory Breeding Base of Green-Chemical Synthesis Technology, College of Chemical Engineering, Zhejiang University of Technology, Hangzhou 310032, P. R. China
| | | | | | - Jianguo Wang
- Institute of Industrial Catalysis, State Key Laboratory Breeding Base of Green-Chemical Synthesis Technology, College of Chemical Engineering, Zhejiang University of Technology, Hangzhou 310032, P. R. China
| |
Collapse
|
4
|
Yang K, Cao Y, Zhang Y, Fan S, Tang M, Aberg D, Sadigh B, Zhou F. Self-supervised learning and prediction of microstructure evolution with convolutional recurrent neural networks. PATTERNS (NEW YORK, N.Y.) 2021; 2:100243. [PMID: 34036288 PMCID: PMC8134942 DOI: 10.1016/j.patter.2021.100243] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/15/2020] [Revised: 03/02/2021] [Accepted: 03/30/2021] [Indexed: 12/18/2022]
Abstract
Microstructural evolution is a key aspect of understanding and exploiting the processing-structure-property relationship of materials. Modeling microstructure evolution usually relies on coarse-grained simulations with evolution principles described by partial differential equations (PDEs). Here we demonstrate that convolutional recurrent neural networks can learn the underlying physical rules and replace PDE-based simulations in the prediction of microstructure phenomena. Neural nets are trained by self-supervised learning with image sequences from simulations of several common processes, including plane-wave propagation, grain growth, spinodal decomposition, and dendritic crystal growth. The trained networks can accurately predict both short-term local dynamics and long-term statistical properties of microstructures assessed herein and are capable of extrapolating beyond the training datasets in spatiotemporal domains and configurational and parametric spaces. Such a data-driven approach offers significant advantages over PDE-based simulations in time-stepping efficiency and offers a useful alternative, especially when the material parameters or governing PDEs are not well determined.
Collapse
Affiliation(s)
- Kaiqi Yang
- Department of Materials Science and NanoEngineering, Rice University, Houston, TX 77005, USA
| | - Yifan Cao
- Department of Materials Science and NanoEngineering, Rice University, Houston, TX 77005, USA
| | - Youtian Zhang
- Department of Materials Science and NanoEngineering, Rice University, Houston, TX 77005, USA
| | - Shaoxun Fan
- Department of Materials Science and NanoEngineering, Rice University, Houston, TX 77005, USA
| | - Ming Tang
- Department of Materials Science and NanoEngineering, Rice University, Houston, TX 77005, USA
| | - Daniel Aberg
- Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, CA 94550, USA
| | - Babak Sadigh
- Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, CA 94550, USA
| | - Fei Zhou
- Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, CA 94550, USA
| |
Collapse
|
5
|
Kang P, Liu Z, Abou-Rachid H, Guo H. Machine-Learning Assisted Screening of Energetic Materials. J Phys Chem A 2020; 124:5341-5351. [PMID: 32511924 DOI: 10.1021/acs.jpca.0c02647] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
In this work, machine learning (ML), materials informatics (MI), and thermochemical data are combined to screen potential candidates of energetic materials. To directly characterize energetic performance, the heat of explosion ΔHe is used as the target property. The critical descriptors of cohesive energy, averaged over all constituent elements and the oxygen balance, are found by forward stepwise selection from a large number of possible descriptors. With them and a theoretically labeled ΔHe training data set, a satisfactory surrogate ML model is trained. The ML model is applied to large databases ICSD and PubChem to predict ΔHe. At the gross-level filtering by the ML model, 2732 molecular candidates based on carbon, hydrogen, nitrogen, and oxygen (CHNO) with high ΔHe values are predicted. Afterward, a fine-level thermochemical screening is carried out on the 2732 materials, resulting in 262 candidates with TNT equivalent power index Pe(TNT) greater than 1.5. Raising Pe(TNT) further to larger than 1.8, 29 potential candidates are found from the 2732 materials, all are new to the current reservoir of well-known energetic materials.
Collapse
Affiliation(s)
- Peng Kang
- Center for the Physics of Materials and Department of Physics, McGill University, Montreal, Quebec H3A 2T8, Canada.,Nanoacademic Technologies Inc., Suite 802, 666 Sherbrooke West, Montreal, Quebec H3A 1E7, Canada
| | - Zhongli Liu
- Center for the Physics of Materials and Department of Physics, McGill University, Montreal, Quebec H3A 2T8, Canada
| | - Hakima Abou-Rachid
- Defence Research and Development Canada, Valcartier, Quebec G3J 1X5, Canada
| | - Hong Guo
- Center for the Physics of Materials and Department of Physics, McGill University, Montreal, Quebec H3A 2T8, Canada.,Nanoacademic Technologies Inc., Suite 802, 666 Sherbrooke West, Montreal, Quebec H3A 1E7, Canada
| |
Collapse
|