1
|
Schrier J, Norquist AJ, Buonassisi T, Brgoch J. In Pursuit of the Exceptional: Research Directions for Machine Learning in Chemical and Materials Science. J Am Chem Soc 2023; 145:21699-21716. [PMID: 37754929 DOI: 10.1021/jacs.3c04783] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/28/2023]
Abstract
Exceptional molecules and materials with one or more extraordinary properties are both technologically valuable and fundamentally interesting, because they often involve new physical phenomena or new compositions that defy expectations. Historically, exceptionality has been achieved through serendipity, but recently, machine learning (ML) and automated experimentation have been widely proposed to accelerate target identification and synthesis planning. In this Perspective, we argue that the data-driven methods commonly used today are well-suited for optimization but not for the realization of new exceptional materials or molecules. Finding such outliers should be possible using ML, but only by shifting away from using traditional ML approaches that tweak the composition, crystal structure, or reaction pathway. We highlight case studies of high-Tc oxide superconductors and superhard materials to demonstrate the challenges of ML-guided discovery and discuss the limitations of automation for this task. We then provide six recommendations for the development of ML methods capable of exceptional materials discovery: (i) Avoid the tyranny of the middle and focus on extrema; (ii) When data are limited, qualitative predictions that provide direction are more valuable than interpolative accuracy; (iii) Sample what can be made and how to make it and defer optimization; (iv) Create room (and look) for the unexpected while pursuing your goal; (v) Try to fill-in-the-blanks of input and output space; (vi) Do not confuse human understanding with model interpretability. We conclude with a description of how these recommendations can be integrated into automated discovery workflows, which should enable the discovery of exceptional molecules and materials.
Collapse
Affiliation(s)
- Joshua Schrier
- Department of Chemistry, Fordham University, The Bronx, New York 10458, United States
| | - Alexander J Norquist
- Department of Chemistry, Haverford College, Haverford, Pennsylvania 19041, United States
| | - Tonio Buonassisi
- Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Jakoah Brgoch
- Department of Chemistry and Texas Center for Superconductivity, University of Houston, Houston, Texas 77204, United States
| |
Collapse
|
2
|
Mairpady A, Mourad AHI, Mozumder MS. Accelerated Discovery of the Polymer Blends for Cartilage Repair through Data-Mining Tools and Machine-Learning Algorithm. Polymers (Basel) 2022; 14:polym14091802. [PMID: 35566970 PMCID: PMC9104973 DOI: 10.3390/polym14091802] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2022] [Revised: 04/20/2022] [Accepted: 04/20/2022] [Indexed: 11/23/2022] Open
Abstract
In designing successful cartilage substitutes, the selection of scaffold materials plays a central role, among several other important factors. In an empirical approach, the selection of the most appropriate polymer(s) for cartilage repair is an expensive and time-consuming affair, as traditionally it requires numerous trials. Moreover, it is humanly impossible to go through the huge library of literature available on the potential polymer(s) and to correlate the physical, mechanical, and biological properties that might be suitable for cartilage tissue engineering. Hence, the objective of this study is to implement an inverse design approach to predict the best polymer(s)/blend(s) for cartilage repair by using a machine-learning algorithm (i.e., multinomial logistic regression (MNLR)). Initially, a systematic bibliometric analysis on cartilage repair has been performed by using the bibliometrix package in the R program. Then, the database was created by extracting the mechanical properties of the most frequently used polymers/blends from the PoLyInfo library by using data-mining tools. Then, an MNLR algorithm was run by using the mechanical properties of the polymers, which are similar to the cartilages, as the input and the polymer(s)/blends as the predicted output. The MNLR algorithm used in this study predicts polyethylene/polyethylene-graftpoly(maleic anhydride) blend as the best candidate for cartilage repair.
Collapse
Affiliation(s)
- Anusha Mairpady
- Chemical and Petroleum Engineering Department, UAE University, Al Ain P.O. Box 15551, United Arab Emirates;
| | - Abdel-Hamid I. Mourad
- Mechanical and Aerospace Engineering Department, UAE University, Al Ain P.O. Box 15551, United Arab Emirates;
- National Water and Energy Center, United Arab Emirates University, Al Ain P.O. Box 15551, United Arab Emirates
| | - Mohammad Sayem Mozumder
- Chemical and Petroleum Engineering Department, UAE University, Al Ain P.O. Box 15551, United Arab Emirates;
- Correspondence:
| |
Collapse
|
3
|
Reilly CE, Dillon RJ, Nayak A, Brogan S, Moot T, Brennaman MK, Lopez R, Meyer TJ, Alibabaei L. Dye-Sensitized Nonstoichiometric Strontium Titanate Core-Shell Photocathodes for Photoelectrosynthesis Applications. ACS APPLIED MATERIALS & INTERFACES 2021; 13:15261-15269. [PMID: 33745279 DOI: 10.1021/acsami.1c00933] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
A core-shell approach that utilizes a high-surface-area conducting core and an outer semiconductor shell is exploited here to prepare p-type dye-sensitized solar energy cells that operate with a minimal applied bias. Photocathodes were prepared by coating thin films of nanocrystalline indium tin oxide with a 0.8 nm Al2O3 seeding layer, followed by the chemical growth of nonstoichiometric strontium titanate. Films were annealed and sensitized with either a porphyrin chromophore or a chromophore-catalyst molecular assembly consisting of the porphyrin covalently tethered to the ruthenium complex. The sensitized photoelectrodes produced cathodic photocurrents of up to -315 μA/cm2 under simulated sunlight (AM1.5G, 100 mW/cm2) in aqueous media, pH 5. The photocurrent was increased by the addition of regenerative hole donors to the system, consistent with slow interfacial recombination kinetics, an important property of p-type dye-sensitized electrodes.
Collapse
Affiliation(s)
- Caroline E Reilly
- Department of Chemistry, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| | - Robert J Dillon
- Department of Chemistry, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| | - Animesh Nayak
- Department of Chemistry, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| | - Shane Brogan
- Department of Applied Physical Sciences, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| | - Taylor Moot
- Department of Chemistry, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| | - Matthew K Brennaman
- Department of Chemistry, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| | - Rene Lopez
- Department of Physics and Astronomy, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| | - Thomas J Meyer
- Department of Chemistry, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| | - Leila Alibabaei
- Department of Chemistry, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| |
Collapse
|
4
|
Luo S, Li T, Wang X, Faizan M, Zhang L. High‐throughput computational materials screening and discovery of optoelectronic semiconductors. WIRES COMPUTATIONAL MOLECULAR SCIENCE 2020. [DOI: 10.1002/wcms.1489] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Affiliation(s)
- Shulin Luo
- State Key Laboratory of Integrated Optoelectronics, Key Laboratory of Automobile Materials of MOE and School of Materials Science and Engineering Jilin University Changchun China
| | - Tianshu Li
- State Key Laboratory of Integrated Optoelectronics, Key Laboratory of Automobile Materials of MOE and School of Materials Science and Engineering Jilin University Changchun China
| | - Xinjiang Wang
- Department of Physics, State Key Laboratory of Superhard Materials Jilin University Changchun China
| | - Muhammad Faizan
- State Key Laboratory of Integrated Optoelectronics, Key Laboratory of Automobile Materials of MOE and School of Materials Science and Engineering Jilin University Changchun China
| | - Lijun Zhang
- State Key Laboratory of Integrated Optoelectronics, Key Laboratory of Automobile Materials of MOE and School of Materials Science and Engineering Jilin University Changchun China
| |
Collapse
|
5
|
Chen X, Chen D, Weng M, Jiang Y, Wei GW, Pan F. Topology-Based Machine Learning Strategy for Cluster Structure Prediction. J Phys Chem Lett 2020; 11:4392-4401. [PMID: 32320253 PMCID: PMC7351018 DOI: 10.1021/acs.jpclett.0c00974] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/10/2023]
Abstract
In cluster physics, the determination of the ground-state structure of medium-sized and large-sized clusters is a challenge due to the number of local minimal values on the potential energy surface growing exponentially with cluster size. Although machine learning approaches have had much success in materials sciences, their applications in clusters are often hindered by the geometric complexity clusters. Persistent homology provides a new topological strategy to simplify geometric complexity while retaining important chemical and physical information without having to "downgrade" the original data. We further propose persistent pairwise independence (PPI) to enhance the predictive power of persistent homology. We construct topology-based machine learning models to reveal hidden structure-energy relationships in lithium (Li) clusters. We integrate the topology-based machine learning models, a particle swarm optimization algorithm, and density functional theory calculations to accelerate the search of the globally stable structure of clusters.
Collapse
Affiliation(s)
- Xin Chen
- School of Advanced Materials, Shenzhen Graduate School, Peking University, Shenzhen 518055, People's Republic of China
| | - Dong Chen
- School of Advanced Materials, Shenzhen Graduate School, Peking University, Shenzhen 518055, People's Republic of China
| | - Mouyi Weng
- School of Advanced Materials, Shenzhen Graduate School, Peking University, Shenzhen 518055, People's Republic of China
| | - Yi Jiang
- School of Advanced Materials, Shenzhen Graduate School, Peking University, Shenzhen 518055, People's Republic of China
| | - Guo-Wei Wei
- Department of Mathematics, Michigan State University, East Lansing, Michigan 48824, United States
| | - Feng Pan
- School of Advanced Materials, Shenzhen Graduate School, Peking University, Shenzhen 518055, People's Republic of China
| |
Collapse
|
6
|
Friederich P, Fediai A, Kaiser S, Konrad M, Jung N, Wenzel W. Toward Design of Novel Materials for Organic Electronics. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2019; 31:e1808256. [PMID: 31012166 DOI: 10.1002/adma.201808256] [Citation(s) in RCA: 61] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/22/2018] [Indexed: 06/09/2023]
Abstract
Materials for organic electronics are presently used in prominent applications, such as displays in mobile devices, while being intensely researched for other purposes, such as organic photovoltaics, large-area devices, and thin-film transistors. Many of the challenges to improve and optimize these applications are material related and there is a nearly infinite chemical space that needs to be explored to identify the most suitable material candidates. Established experimental approaches struggle with the size and complexity of this chemical space. Herein, the development of simulation methods is addressed, with a particular emphasis on predictive multiscale protocols, to complement experimental research in the identification of novel materials and illustrate the potential of these methods with a few prominent recent applications. Finally, the potential of machine learning and methods based on artificial intelligence is discussed to further accelerate the search for new materials.
Collapse
Affiliation(s)
- Pascal Friederich
- Institute of Nanotechnology (INT), Karlsruhe Institute of Technology (KIT), Hermann-von-Helmholtz-Platz 1, 76344, Eggenstein-Leopoldshafen, Germany
- Department of Chemistry, University of Toronto, 80 St. George Street, M5S 3H6, Toronto, Ontario, Canada
| | - Artem Fediai
- Institute of Nanotechnology (INT), Karlsruhe Institute of Technology (KIT), Hermann-von-Helmholtz-Platz 1, 76344, Eggenstein-Leopoldshafen, Germany
| | - Simon Kaiser
- Institute of Nanotechnology (INT), Karlsruhe Institute of Technology (KIT), Hermann-von-Helmholtz-Platz 1, 76344, Eggenstein-Leopoldshafen, Germany
| | - Manuel Konrad
- Institute of Nanotechnology (INT), Karlsruhe Institute of Technology (KIT), Hermann-von-Helmholtz-Platz 1, 76344, Eggenstein-Leopoldshafen, Germany
| | - Nicole Jung
- Institute of Organic Chemistry (IOC), Karlsruhe Institute of Technology (KIT), Fritz-Haber-Weg 6, 76131, Karlsruhe, Germany
| | - Wolfgang Wenzel
- Institute of Nanotechnology (INT), Karlsruhe Institute of Technology (KIT), Hermann-von-Helmholtz-Platz 1, 76344, Eggenstein-Leopoldshafen, Germany
| |
Collapse
|
7
|
Butler KT, Davies DW, Cartwright H, Isayev O, Walsh A. Machine learning for molecular and materials science. Nature 2018; 559:547-555. [PMID: 30046072 DOI: 10.1038/s41586-018-0337-2] [Citation(s) in RCA: 1158] [Impact Index Per Article: 193.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2017] [Accepted: 05/09/2018] [Indexed: 02/06/2023]
Abstract
Here we summarize recent progress in machine learning for the chemical sciences. We outline machine-learning techniques that are suitable for addressing research questions in this domain, as well as future directions for the field. We envisage a future in which the design, synthesis, characterization and application of molecules and materials is accelerated by artificial intelligence.
Collapse
Affiliation(s)
- Keith T Butler
- ISIS Facility, Rutherford Appleton Laboratory, Harwell Campus, Harwell, UK
| | | | | | - Olexandr Isayev
- Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.
| | - Aron Walsh
- Department of Materials Science and Engineering, Yonsei University, Seoul, South Korea. .,Department of Materials, Imperial College London, London, UK.
| |
Collapse
|
8
|
Smith JS, Nebgen B, Lubbers N, Isayev O, Roitberg AE. Less is more: Sampling chemical space with active learning. J Chem Phys 2018; 148:241733. [DOI: 10.1063/1.5023802] [Citation(s) in RCA: 278] [Impact Index Per Article: 46.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Affiliation(s)
- Justin S. Smith
- Department of Chemistry, University of Florida, Gainesville, Florida 32611, USA
| | - Ben Nebgen
- Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA
| | - Nicholas Lubbers
- Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA
| | - Olexandr Isayev
- UNC Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, USA
| | - Adrian E. Roitberg
- Department of Chemistry, University of Florida, Gainesville, Florida 32611, USA
| |
Collapse
|
9
|
Venkatraman V, Raju R, Oikonomopoulos SP, Alsberg BK. The dye-sensitized solar cell database. J Cheminform 2018; 10:18. [PMID: 29616364 PMCID: PMC5882482 DOI: 10.1186/s13321-018-0272-0] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2017] [Accepted: 03/25/2018] [Indexed: 12/22/2022] Open
Abstract
BACKGROUND Dye-sensitized solar cells (DSSCs) have garnered a lot of attention in recent years. The solar energy to power conversion efficiency of a DSSC is influenced by various components of the cell such as the dye, electrolyte, electrodes and additives among others leading to varying experimental configurations. A large number of metal-based and metal-free dye sensitizers have now been reported and tools using such data to indicate new directions for design and development are on the rise. DESCRIPTION DSSCDB, the first of its kind dye-sensitized solar cell database, aims to provide users with up-to-date information from publications on the molecular structures of the dyes, experimental details and reported measurements (efficiencies and spectral properties) and thereby facilitate a comprehensive and critical evaluation of the data. Currently, the DSSCDB contains over 4000 experimental observations spanning multiple dye classes such as triphenylamines, carbazoles, coumarins, phenothiazines, ruthenium and porphyrins. CONCLUSION The DSSCDB offers a web-based, comprehensive source of property data for dye sensitized solar cells. Access to the database is available through the following URL: www.dyedb.com .
Collapse
Affiliation(s)
| | - Rajesh Raju
- Department of Chemistry, NTNU, Høgskoleringen, 7491, Trondheim, Norway
| | | | - Bjørn K Alsberg
- Department of Chemistry, NTNU, Høgskoleringen, 7491, Trondheim, Norway
| |
Collapse
|
10
|
Davies DW, Butler KT, Skelton JM, Xie C, Oganov AR, Walsh A. Computer-aided design of metal chalcohalide semiconductors: from chemical composition to crystal structure. Chem Sci 2017; 9:1022-1030. [PMID: 29675149 PMCID: PMC5883896 DOI: 10.1039/c7sc03961a] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2017] [Accepted: 12/04/2017] [Indexed: 11/21/2022] Open
Abstract
The standard paradigm in computational materials science is INPUT: Structure; OUTPUT: Properties, which has yielded many successes but is ill-suited for exploring large areas of chemical and configurational hyperspace.
The standard paradigm in computational materials science is INPUT: Structure; OUTPUT: Properties, which has yielded many successes but is ill-suited for exploring large areas of chemical and configurational hyperspace. We report a high-throughput screening procedure that uses compositional descriptors to search for new photoactive semiconducting compounds. We show how feeding high-ranking element combinations to structure prediction algorithms can constitute a pragmatic computer-aided materials design approach. Techniques based on structural analogy (data mining of known lattice types) and global searches (direct optimisation using evolutionary algorithms) are combined for translating between chemical composition and crystal structure. The properties of four novel chalcohalides (Sn5S4Cl2, Sn4SF6, Cd5S4Cl2 and Cd4SF6) are predicted, of which two are calculated to have bandgaps in the visible range of the electromagnetic spectrum.
Collapse
Affiliation(s)
- Daniel W Davies
- Centre for Sustainable Chemical Technologies , Department of Chemistry , University of Bath , Claverton Down , Bath BA2 7AY , UK .
| | - Keith T Butler
- Centre for Sustainable Chemical Technologies , Department of Chemistry , University of Bath , Claverton Down , Bath BA2 7AY , UK .
| | - Jonathan M Skelton
- Centre for Sustainable Chemical Technologies , Department of Chemistry , University of Bath , Claverton Down , Bath BA2 7AY , UK .
| | - Congwei Xie
- Science and Technology on Thermostructural Composite Materials Laboratory , International Center for Materials Discovery , School of Materials Science and Engineering , Northwestern Polytechnical University , Xian , Shaanxi 710072 , Peoples Republic of China
| | - Artem R Oganov
- International Center for Materials Discovery , School of Materials Science and Engineering , Northwestern Polytechnical University , Xian , Shaanxi 710072 , Peoples Republic of China.,Skolkovo Institute of Science and Technology , 3 Nobel Street , Moscow Region 143026 , Russia.,Moscow Institute of Physics and Technology , Dolgoprudny , Moscow Region 141700 , Russia
| | - Aron Walsh
- Department of Materials Science and Engineering , Yonsei University , Seoul 03722 , Korea . .,Department of Materials , Imperial College London , Exhibition Road , London SW7 2AZ , UK
| |
Collapse
|