1
|
Aldossary A, Campos-Gonzalez-Angulo JA, Pablo-García S, Leong SX, Rajaonson EM, Thiede L, Tom G, Wang A, Avagliano D, Aspuru-Guzik A. In Silico Chemical Experiments in the Age of AI: From Quantum Chemistry to Machine Learning and Back. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2024; 36:e2402369. [PMID: 38794859 DOI: 10.1002/adma.202402369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/15/2024] [Revised: 04/28/2024] [Indexed: 05/26/2024]
Abstract
Computational chemistry is an indispensable tool for understanding molecules and predicting chemical properties. However, traditional computational methods face significant challenges due to the difficulty of solving the Schrödinger equations and the increasing computational cost with the size of the molecular system. In response, there has been a surge of interest in leveraging artificial intelligence (AI) and machine learning (ML) techniques to in silico experiments. Integrating AI and ML into computational chemistry increases the scalability and speed of the exploration of chemical space. However, challenges remain, particularly regarding the reproducibility and transferability of ML models. This review highlights the evolution of ML in learning from, complementing, or replacing traditional computational chemistry for energy and property predictions. Starting from models trained entirely on numerical data, a journey set forth toward the ideal model incorporating or learning the physical laws of quantum mechanics. This paper also reviews existing computational methods and ML models and their intertwining, outlines a roadmap for future research, and identifies areas for improvement and innovation. Ultimately, the goal is to develop AI architectures capable of predicting accurate and transferable solutions to the Schrödinger equation, thereby revolutionizing in silico experiments within chemistry and materials science.
Collapse
Affiliation(s)
- Abdulrahman Aldossary
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
| | | | - Sergio Pablo-García
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- Department of Computer Science, University of Toronto, 40 St. George Street, Toronto, ON, M5S 2E4, Canada
| | - Shi Xuan Leong
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
| | - Ella Miray Rajaonson
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- Vector Institute for Artificial Intelligence, 661 University Ave. Suite 710, Toronto, ON, M5G 1M1, Canada
| | - Luca Thiede
- Department of Computer Science, University of Toronto, 40 St. George Street, Toronto, ON, M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, 661 University Ave. Suite 710, Toronto, ON, M5G 1M1, Canada
| | - Gary Tom
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- Vector Institute for Artificial Intelligence, 661 University Ave. Suite 710, Toronto, ON, M5G 1M1, Canada
| | - Andrew Wang
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
| | - Davide Avagliano
- Chimie ParisTech, PSL University, CNRS, Institute of Chemistry for Life and Health Sciences (iCLeHS UMR 8060), Paris, F-75005, France
| | - Alán Aspuru-Guzik
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- Department of Computer Science, University of Toronto, 40 St. George Street, Toronto, ON, M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, 661 University Ave. Suite 710, Toronto, ON, M5G 1M1, Canada
- Department of Materials Science & Engineering, University of Toronto, 184 College St., Toronto, ON, M5S 3E4, Canada
- Department of Chemical Engineering & Applied Chemistry, University of Toronto, 200 College St., Toronto, ON, M5S 3E5, Canada
- Lebovic Fellow, Canadian Institute for Advanced Research (CIFAR), 66118 University Ave., Toronto, M5G 1M1, Canada
- Acceleration Consortium, 80 St George St, Toronto, M5S 3H6, Canada
| |
Collapse
|
2
|
Xie T, Wan Y, Zhou Y, Huang W, Liu Y, Linghu Q, Wang S, Kit C, Grazian C, Zhang W, Hoex B. Creation of a structured solar cell material dataset and performance prediction using large language models. PATTERNS (NEW YORK, N.Y.) 2024; 5:100955. [PMID: 38800367 PMCID: PMC11117053 DOI: 10.1016/j.patter.2024.100955] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Revised: 02/26/2024] [Accepted: 02/27/2024] [Indexed: 05/29/2024]
Abstract
Materials scientists usually collect experimental data to summarize experiences and predict improved materials. However, a crucial issue is how to proficiently utilize unstructured data to update existing structured data, particularly in applied disciplines. This study introduces a new natural language processing (NLP) task called structured information inference (SII) to address this problem. We propose an end-to-end approach to summarize and organize the multi-layered device-level information from the literature into structured data. After comparing different methods, we fine-tuned LLaMA with an F1 score of 87.14% to update an existing perovskite solar cell dataset with articles published since its release, allowing its direct use in subsequent data analysis. Using structured information, we developed regression tasks to predict the electrical performance of solar cells. Our results demonstrate comparable performance to traditional machine-learning methods without feature selection and highlight the potential of large language models for scientific knowledge acquisition and material development.
Collapse
Affiliation(s)
- Tong Xie
- School of Photovoltaic and Renewable Energy Engineering, University of New South Wales, Kensington, NSW, Australia
- GreenDynamics Pty. Ltd, Kensington, NSW, Australia
| | - Yuwei Wan
- GreenDynamics Pty. Ltd, Kensington, NSW, Australia
- Department of Linguistics and Translation, City University of Hong Kong, Hong Kong, China
| | - Yufei Zhou
- Department of Linguistics and Translation, City University of Hong Kong, Hong Kong, China
| | - Wei Huang
- School of Computer Science and Engineering, University of New South Wales, Kensington, NSW, Australia
| | - Yixuan Liu
- GreenDynamics Pty. Ltd, Kensington, NSW, Australia
| | - Qingyuan Linghu
- GreenDynamics Pty. Ltd, Kensington, NSW, Australia
- School of Computer Science and Engineering, University of New South Wales, Kensington, NSW, Australia
| | - Shaozhou Wang
- School of Photovoltaic and Renewable Energy Engineering, University of New South Wales, Kensington, NSW, Australia
- GreenDynamics Pty. Ltd, Kensington, NSW, Australia
| | - Chunyu Kit
- Department of Linguistics and Translation, City University of Hong Kong, Hong Kong, China
| | - Clara Grazian
- School of Mathematics and Statistics, University of Sydney, Camperdown, NSW, Australia
- DARE ARC Training Centre in Data Analytics for Resources and Environments, South Eveleigh, NSW, Australia
| | - Wenjie Zhang
- School of Computer Science and Engineering, University of New South Wales, Kensington, NSW, Australia
| | - Bram Hoex
- School of Photovoltaic and Renewable Energy Engineering, University of New South Wales, Kensington, NSW, Australia
| |
Collapse
|
3
|
Ju CW, Shen Y, French EJ, Yi J, Bi H, Tian A, Lin Z. Accurate Electronic and Optical Properties of Organic Doublet Radicals Using Machine Learned Range-Separated Functionals. J Phys Chem A 2024. [PMID: 38382058 DOI: 10.1021/acs.jpca.3c07437] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/23/2024]
Abstract
Luminescent organic semiconducting doublet-spin radicals are unique and emergent optical materials because their fluorescent quantum yields (Φfl) are not compromised by the spin-flipping intersystem crossing (ISC) into a dark high-spin state. The multiconfigurational nature of these radicals challenges their electronic structure calculations in the framework of single-reference density functional theory (DFT) and introduces room for method improvement. In the present study, we extended our earlier development of ML-ωPBE [J. Phys. Chem. Lett., 2021, 12, 9516-9524], a range-separated hybrid (RSH) exchange-correlation (XC) functional constructed using the stacked ensemble machine learning (SEML) algorithm, from closed-shell organic semiconducting molecules to doublet-spin organic semiconducting radicals. We assessed its performance for a new test set of 64 doublet-spin radicals from five categories while placing all previously compiled 3926 closed-shell molecules in the new training set. Interestingly, ML-ωPBE agrees with the nonempirical OT-ωPBE functional regarding the prediction of the molecule-dependent range-separation parameter (ω), with a small mean absolute error (MAE) of 0.0197 a0-1, but saves the computational cost by 2.46 orders of magnitude. This result demonstrates an outstanding domain adaptation capacity of ML-ωPBE for diverse organic semiconducting species. To further assess the predictive power of ML-ωPBE in experimental observables, we also applied it to evaluate absorption and fluorescence energies (Eabs and Efl) using linear-response time-dependent DFT (TDDFT), and we compared its behavior with nine popular XC functionals. For most radicals, ML-ωPBE reproduces experimental measurements of Eabs and Efl with small MAEs of 0.299 and 0.254 eV, only marginally different from those of OT-ωPBE. Our work illustrates a successful extension of the SEML framework from closed-shell molecules to doublet-spin radicals and will open the venue for calculating optical properties for organic semiconductors using single-reference TDDFT.
Collapse
Affiliation(s)
- Cheng-Wei Ju
- Department of Chemistry, University of Massachusetts, Amherst, Massachusetts 01003, United States
- Pritzker School of Molecular Engineering, The University of Chicago, Chicago, Illinois 60637, United States
| | - Yili Shen
- Manning College of Information and Computer Sciences, University of Massachusetts, Amherst, Massachusetts 01003, United States
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, Indiana 46556, United States
| | - Ethan J French
- Department of Chemistry, University of Massachusetts, Amherst, Massachusetts 01003, United States
- Department of Mathematics and Statistics, University of Massachusetts, Amherst, Massachusetts 01003, United States
- Athinoula A. Martinos Center for Biomedical Imaging, Department of Radiology, Massachusetts General Hospital and Harvard Medical School, Charlestown, Massachusetts 02129, United States
| | - Jun Yi
- Department of Chemistry, University of Massachusetts, Amherst, Massachusetts 01003, United States
- Department of Chemistry, Wake Forest University, Winston-Salem, North Carolina 27109, United States
| | - Hongshan Bi
- Department of Chemistry, University of Massachusetts, Amherst, Massachusetts 01003, United States
| | - Aaron Tian
- Manning College of Information and Computer Sciences, University of Massachusetts, Amherst, Massachusetts 01003, United States
- Department of Mathematics and Statistics, University of Massachusetts, Amherst, Massachusetts 01003, United States
| | - Zhou Lin
- Department of Chemistry, University of Massachusetts, Amherst, Massachusetts 01003, United States
| |
Collapse
|
4
|
Mayo Yanes E, Chakraborty S, Gershoni-Poranne R. COMPAS-2: a dataset of cata-condensed hetero-polycyclic aromatic systems. Sci Data 2024; 11:97. [PMID: 38242917 PMCID: PMC10799083 DOI: 10.1038/s41597-024-02927-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Accepted: 01/05/2024] [Indexed: 01/21/2024] Open
Abstract
Polycyclic aromatic systems are highly important to numerous applications, in particular to organic electronics and optoelectronics. High-throughput screening and generative models that can help to identify new molecules to advance these technologies require large amounts of high-quality data, which is expensive to generate. In this report, we present the largest freely available dataset of geometries and properties of cata-condensed poly(hetero)cyclic aromatic molecules calculated to date. Our dataset contains ~500k molecules comprising 11 types of aromatic and antiaromatic building blocks calculated at the GFN1-xTB level and is representative of a highly diverse chemical space. We detail the structure enumeration process and the methods used to provide various electronic properties (including HOMO-LUMO gap, adiabatic ionization potential, and adiabatic electron affinity). Additionally, we benchmark against a ~50k dataset calculated at the CAM-B3LYP-D3BJ/def2-SVP level and develop a fitting scheme to correct the xTB values to higher accuracy. These new datasets represent the second installment in the COMputational database of Polycyclic Aromatic Systems (COMPAS) Project.
Collapse
Affiliation(s)
- Eduardo Mayo Yanes
- Schulich Faculty of Chemistry, Technion - Israel Institute of Technology, Haifa, 32000, Israel
| | - Sabyasachi Chakraborty
- Schulich Faculty of Chemistry, Technion - Israel Institute of Technology, Haifa, 32000, Israel
| | - Renana Gershoni-Poranne
- Schulich Faculty of Chemistry, Technion - Israel Institute of Technology, Haifa, 32000, Israel.
| |
Collapse
|
5
|
Ishfaq M, Mubashir T, Abdou SN, Tahir MH, Halawa MI, Ibrahim MM, Xie Y. Data mining and library generation to search electron-rich and electron-deficient building blocks for the designing of polymers for photoacoustic imaging. Heliyon 2023; 9:e21332. [PMID: 37964821 PMCID: PMC10641172 DOI: 10.1016/j.heliyon.2023.e21332] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Revised: 10/08/2023] [Accepted: 10/19/2023] [Indexed: 11/16/2023] Open
Abstract
Photoacoustic imaging is a good method for biological imaging, for this purpose, materials with strong near infrared (NIR) absorbance are required. In the present study, machine learning models are used to predict the light absorption behavior of polymers. Molecular descriptors are utilized to train a variety of machine learning models. Building blocks are searched from chemical databases, as well as new building blocks are designed using chemical library enumeration method. The Breaking Retrosynthetically Interesting Chemical Substructures (BRICS) method is employed for the creation of 10,000 novel polymers. These polymers are designed based on the input of searched and selected building blocks. To enhance the process, the optimal machine learning model is utilized to predict the UV/visible absorption maxima of the newly designed polymers. Concurrently, chemical similarity analysis is also performed on the selected polymers, and synthetic accessibility of selected polymers is calculated. In summary, the polymers are all easy to synthesize, increasing their potential for practical applications.
Collapse
Affiliation(s)
| | - Tayyaba Mubashir
- Institute of Chemistry, University of Sargodha, Sargodha, 40100, Pakistan
| | - Safaa N. Abdou
- Department of Chemistry, Khurmah University College, Taif University, P.O. Box 11099, Taif, 21944, Saudi Arabia
| | - Mudassir Hussain Tahir
- Research Faculty of Agriculture, Field Science Center for Northern Biosphere, Hokkaido University, Sapporo, Hokkaido, 060-8589, 060-0811, Japan
| | - Mohamed Ibrahim Halawa
- Department of Pharmaceutical Analytical Chemistry, Faculty of Pharmacy, Mansoura University, Mansoura, 35516, Mansoura, Egypt
- Guangdong Laboratory of Artificial Intelligence & Digital Economy (SZ), Shenzhen University, Shenzhen, 518060, China
| | - Mohamed M. Ibrahim
- Department of Chemistry, College of Science, Taif University, P.O. Box 11099, Taif, 21944, Saudi Arabia
| | - Yulin Xie
- Huanggang Normal University, Huanggang, 438000, China
| |
Collapse
|
6
|
Villot C, Huang T, Lao KU. Accurate prediction of global-density-dependent range-separation parameters based on machine learning. J Chem Phys 2023; 159:044103. [PMID: 37486048 DOI: 10.1063/5.0157340] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2023] [Accepted: 07/03/2023] [Indexed: 07/25/2023] Open
Abstract
In this work, we develop an accurate and efficient XGBoost machine learning model for predicting the global-density-dependent range-separation parameter, ωGDD, for long-range corrected functional (LRC)-ωPBE. This ωGDDML model has been built using a wide range of systems (11 466 complexes, ten different elements, and up to 139 heavy atoms) with fingerprints for the local atomic environment and histograms of distances for the long-range atomic correlation for mapping the quantum mechanical range-separation values. The promising performance on the testing set with 7046 complexes shows a mean absolute error of 0.001 117 a0-1 and only five systems (0.07%) with an absolute error larger than 0.01 a0-1, which indicates the good transferability of our ωGDDML model. In addition, the only required input to obtain ωGDDML is the Cartesian coordinates without electronic structure calculations, thereby enabling rapid predictions. LRC-ωPBE(ωGDDML) is used to predict polarizabilities for a series of oligomers, where polarizabilities are sensitive to the asymptotic density decay and are crucial in a variety of applications, including the calculations of dispersion corrections and refractive index, and surpasses the performance of all other popular density functionals except for the non-tuned LRC-ωPBE. Finally, LRC-ωPBE (ωGDDML) combined with (extended) symmetry-adapted perturbation theory is used in calculating noncovalent interactions to further show that the traditional ab initio system-specific tuning procedure can be bypassed. The present study not only provides an accurate and efficient way to determine the range-separation parameter for LRC-ωPBE but also shows the synergistic benefits of fusing the power of physically inspired density functional LRC-ωPBE and the data-driven ωGDDML model.
Collapse
Affiliation(s)
- Corentin Villot
- Department of Chemistry, Virginia Commonwealth University, Richmond, Virginia 23284, USA
| | - Tong Huang
- Department of Chemistry, Virginia Commonwealth University, Richmond, Virginia 23284, USA
| | - Ka Un Lao
- Department of Chemistry, Virginia Commonwealth University, Richmond, Virginia 23284, USA
| |
Collapse
|
7
|
Clyde A, Liu X, Brettin T, Yoo H, Partin A, Babuji Y, Blaiszik B, Mohd-Yusof J, Merzky A, Turilli M, Jha S, Ramanathan A, Stevens R. AI-accelerated protein-ligand docking for SARS-CoV-2 is 100-fold faster with no significant change in detection. Sci Rep 2023; 13:2105. [PMID: 36747041 PMCID: PMC9901402 DOI: 10.1038/s41598-023-28785-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Accepted: 01/24/2023] [Indexed: 02/08/2023] Open
Abstract
Protein-ligand docking is a computational method for identifying drug leads. The method is capable of narrowing a vast library of compounds down to a tractable size for downstream simulation or experimental testing and is widely used in drug discovery. While there has been progress in accelerating scoring of compounds with artificial intelligence, few works have bridged these successes back to the virtual screening community in terms of utility and forward-looking development. We demonstrate the power of high-speed ML models by scoring 1 billion molecules in under a day (50 k predictions per GPU seconds). We showcase a workflow for docking utilizing surrogate AI-based models as a pre-filter to a standard docking workflow. Our workflow is ten times faster at screening a library of compounds than the standard technique, with an error rate less than 0.01% of detecting the underlying best scoring 0.1% of compounds. Our analysis of the speedup explains that another order of magnitude speedup must come from model accuracy rather than computing speed. In order to drive another order of magnitude of acceleration, we share a benchmark dataset consisting of 200 million 3D complex structures and 2D structure scores across a consistent set of 13 million "in-stock" molecules over 15 receptors, or binding sites, across the SARS-CoV-2 proteome. We believe this is strong evidence for the community to begin focusing on improving the accuracy of surrogate models to improve the ability to screen massive compound libraries 100 × or even 1000 × faster than current techniques and reduce missing top hits. The technique outlined aims to be a fast drop-in replacement for docking for screening billion-scale molecular libraries.
Collapse
Affiliation(s)
- Austin Clyde
- Argonne National Laboratory, Data Science and Learning Division, Chicago, Lemont, 60439, USA.
- Department of Computer Science, University of Chicago, Chicago, 60637, USA.
| | - Xuefeng Liu
- Department of Computer Science, University of Chicago, Chicago, 60637, USA
| | - Thomas Brettin
- Department of Computer Science, University of Chicago, Chicago, 60637, USA
- Argonne National Laboratory, Computing, Environment, and Life Sciences Directorate, Lemont, 60439, USA
| | - Hyunseung Yoo
- Argonne National Laboratory, Data Science and Learning Division, Chicago, Lemont, 60439, USA
| | - Alexander Partin
- Argonne National Laboratory, Data Science and Learning Division, Chicago, Lemont, 60439, USA
| | - Yadu Babuji
- Department of Computer Science, University of Chicago, Chicago, 60637, USA
| | - Ben Blaiszik
- Argonne National Laboratory, Data Science and Learning Division, Chicago, Lemont, 60439, USA
- University of Chicago, Globus, Chicago, 60637, USA
| | - Jamaludin Mohd-Yusof
- Los Alamos National Laboratory, Computer, Computational, and Statistical Sciences, Los Alamos, 87545, USA
| | - Andre Merzky
- Department of Electrical and Computer Engineering, Rutgers University, Piscataway, 08854, USA
- Brookhaven National Laboratory, Computational Sciences Initiative, Upton, 11973, USA
| | - Matteo Turilli
- Department of Electrical and Computer Engineering, Rutgers University, Piscataway, 08854, USA
- Brookhaven National Laboratory, Computational Sciences Initiative, Upton, 11973, USA
| | - Shantenu Jha
- Department of Electrical and Computer Engineering, Rutgers University, Piscataway, 08854, USA
- Brookhaven National Laboratory, Computational Sciences Initiative, Upton, 11973, USA
| | - Arvind Ramanathan
- Argonne National Laboratory, Data Science and Learning Division, Chicago, Lemont, 60439, USA
| | - Rick Stevens
- Department of Computer Science, University of Chicago, Chicago, 60637, USA
- Argonne National Laboratory, Computing, Environment, and Life Sciences Directorate, Lemont, 60439, USA
| |
Collapse
|
8
|
Bhat V, Sornberger P, Pokuri BSS, Duke R, Ganapathysubramanian B, Risko C. Electronic, redox, and optical property prediction of organic π-conjugated molecules through a hierarchy of machine learning approaches. Chem Sci 2022; 14:203-213. [PMID: 36605753 PMCID: PMC9769113 DOI: 10.1039/d2sc04676h] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2022] [Accepted: 11/16/2022] [Indexed: 11/18/2022] Open
Abstract
Accelerating the development of π-conjugated molecules for applications such as energy generation and storage, catalysis, sensing, pharmaceuticals, and (semi)conducting technologies requires rapid and accurate evaluation of the electronic, redox, or optical properties. While high-throughput computational screening has proven to be a tremendous aid in this regard, machine learning (ML) and other data-driven methods can further enable orders of magnitude reduction in time while at the same time providing dramatic increases in the chemical space that is explored. However, the lack of benchmark datasets containing the electronic, redox, and optical properties that characterize the diverse, known chemical space of organic π-conjugated molecules limits ML model development. Here, we present a curated dataset containing 25k molecules with density functional theory (DFT) and time-dependent DFT (TDDFT) evaluated properties that include frontier molecular orbitals, ionization energies, relaxation energies, and low-lying optical excitation energies. Using the dataset, we train a hierarchy of ML models, ranging from classical models such as ridge regression to sophisticated graph neural networks, with molecular SMILES representation as input. We observe that graph neural networks augmented with contextual information allow for significantly better predictions across a wide array of properties. Our best-performing models also provide an uncertainty quantification for the predictions. To democratize access to the data and trained models, an interactive web platform has been developed and deployed.
Collapse
Affiliation(s)
- Vinayak Bhat
- Department of Chemistry and Center for Applied Energy Research, University of Kentucky Lexington Kentucky 40506 USA
| | - Parker Sornberger
- Department of Chemistry and Center for Applied Energy Research, University of Kentucky Lexington Kentucky 40506 USA
| | - Balaji Sesha Sarath Pokuri
- Department of Mechanical Engineering and Translational AI Center, Iowa State University Ames Iowa 50010 USA
| | - Rebekah Duke
- Department of Chemistry and Center for Applied Energy Research, University of Kentucky Lexington Kentucky 40506 USA
| | | | - Chad Risko
- Department of Chemistry and Center for Applied Energy Research, University of Kentucky Lexington Kentucky 40506 USA
| |
Collapse
|
9
|
Statistical analysis and visualization of data of non-fullerene small molecule acceptors from Harvard organic photovoltaic database. Structural similarity analysis with famous non-fullerene small molecule acceptors to search new building blocks. J Photochem Photobiol A Chem 2022. [DOI: 10.1016/j.jphotochem.2022.114501] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
|
10
|
Zhang G, Lin FR, Qi F, Heumüller T, Distler A, Egelhaaf HJ, Li N, Chow PCY, Brabec CJ, Jen AKY, Yip HL. Renewed Prospects for Organic Photovoltaics. Chem Rev 2022; 122:14180-14274. [PMID: 35929847 DOI: 10.1021/acs.chemrev.1c00955] [Citation(s) in RCA: 172] [Impact Index Per Article: 86.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Organic photovoltaics (OPVs) have progressed steadily through three stages of photoactive materials development: (i) use of poly(3-hexylthiophene) and fullerene-based acceptors (FAs) for optimizing bulk heterojunctions; (ii) development of new donors to better match with FAs; (iii) development of non-fullerene acceptors (NFAs). The development and application of NFAs with an A-D-A configuration (where A = acceptor and D = donor) has enabled devices to have efficient charge generation and small energy losses (Eloss < 0.6 eV), resulting in substantially higher power conversion efficiencies (PCEs) than FA-based devices. The discovery of Y6-type acceptors (Y6 = 2,2'-((2Z,2'Z)-((12,13-bis(2-ethylhexyl)-3,9-diundecyl-12,13-dihydro-[1,2,5]-thiadiazolo[3,4-e]-thieno[2″,3″:4',5']thieno-[2',3':4,5]pyrrolo-[3,2-g]thieno-[2',3':4,5]thieno-[3,2-b]indole-2,10-diyl)bis(methanylylidene))bis(5,6-difluoro-3-oxo-2,3-dihydro-1H-indene-2,1-diylidene))dimalononitrile) with an A-DA' D-A configuration has further propelled the PCEs to go beyond 15% due to smaller Eloss values (∼0.5 eV) and higher external quantum efficiencies. Subsequently, the PCEs of Y6-series single-junction devices have increased to >19% and may soon approach 20%. This review provides an update of recent progress of OPV in the following aspects: developments of novel NFAs and donors, understanding of the structure-property relationships and underlying mechanisms of state-of-the-art OPVs, and tasks underpinning the commercialization of OPVs, such as device stability, module development, potential applications, and high-throughput manufacturing. Finally, an outlook and prospects section summarizes the remaining challenges for the further development of OPV technology.
Collapse
Affiliation(s)
- Guichuan Zhang
- State Key Laboratory of Luminescent Materials and Devices, Institute of Polymer Optoelectronic Materials and Devices, School of Materials Science and Engineering, South China University of Technology, Guangzhou 510640, China.,School of Semiconductor Science and Technology, South China Normal University, Foshan 528225, China
| | - Francis R Lin
- Department of Materials Science and Engineering, City University of Hong Kong, Kowloon 999077, Hong Kong, China.,Department of Chemistry, City University of Hong Kong, Kowloon 999077, Hong Kong, China
| | - Feng Qi
- Department of Materials Science and Engineering, City University of Hong Kong, Kowloon 999077, Hong Kong, China.,Department of Chemistry, City University of Hong Kong, Kowloon 999077, Hong Kong, China
| | - Thomas Heumüller
- Institute of Materials for Electronics and Energy Technology (i-MEET), Department of Materials Science and Engineering, Friedrich-Alexander-Universität Erlangen-Nürnberg, Martensstrasse 7, 91058 Erlangen, Germany.,Helmholtz Institute Erlangen-Nürnberg (HI ERN), Immerwahrstrasse 2, 91058 Erlangen, Germany
| | - Andreas Distler
- Institute of Materials for Electronics and Energy Technology (i-MEET), Department of Materials Science and Engineering, Friedrich-Alexander-Universität Erlangen-Nürnberg, Martensstrasse 7, 91058 Erlangen, Germany
| | - Hans-Joachim Egelhaaf
- Institute of Materials for Electronics and Energy Technology (i-MEET), Department of Materials Science and Engineering, Friedrich-Alexander-Universität Erlangen-Nürnberg, Martensstrasse 7, 91058 Erlangen, Germany.,Helmholtz Institute Erlangen-Nürnberg (HI ERN), Immerwahrstrasse 2, 91058 Erlangen, Germany
| | - Ning Li
- State Key Laboratory of Luminescent Materials and Devices, Institute of Polymer Optoelectronic Materials and Devices, School of Materials Science and Engineering, South China University of Technology, Guangzhou 510640, China
| | - Philip C Y Chow
- Department of Mechanical Engineering, The University of Hong Kong, Pokfulam 999077, Hong Kong, China
| | - Christoph J Brabec
- Institute of Materials for Electronics and Energy Technology (i-MEET), Department of Materials Science and Engineering, Friedrich-Alexander-Universität Erlangen-Nürnberg, Martensstrasse 7, 91058 Erlangen, Germany.,Helmholtz Institute Erlangen-Nürnberg (HI ERN), Immerwahrstrasse 2, 91058 Erlangen, Germany
| | - Alex K-Y Jen
- Department of Materials Science and Engineering, City University of Hong Kong, Kowloon 999077, Hong Kong, China.,Department of Chemistry, City University of Hong Kong, Kowloon 999077, Hong Kong, China.,School of Energy and Environment, City University of Hong Kong, Kowloon 999077, Hong Kong, China.,Hong Kong Institute for Clean Energy, City University of Hong Kong, Kowloon 999077, Hong Kong, China
| | - Hin-Lap Yip
- State Key Laboratory of Luminescent Materials and Devices, Institute of Polymer Optoelectronic Materials and Devices, School of Materials Science and Engineering, South China University of Technology, Guangzhou 510640, China.,Department of Materials Science and Engineering, City University of Hong Kong, Kowloon 999077, Hong Kong, China.,School of Energy and Environment, City University of Hong Kong, Kowloon 999077, Hong Kong, China.,Hong Kong Institute for Clean Energy, City University of Hong Kong, Kowloon 999077, Hong Kong, China
| |
Collapse
|
11
|
When machine learning meets molecular synthesis. TRENDS IN CHEMISTRY 2022. [DOI: 10.1016/j.trechm.2022.07.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
12
|
Zhao ZW, Del Cueto M, Troisi A. Limitations of machine learning models when predicting compounds with completely new chemistries: possible improvements applied to the discovery of new non-fullerene acceptors. DIGITAL DISCOVERY 2022; 1:266-276. [PMID: 35769202 PMCID: PMC9189862 DOI: 10.1039/d2dd00004k] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/17/2022] [Accepted: 03/23/2022] [Indexed: 11/21/2022]
Abstract
We try to determine if machine learning (ML) methods, applied to the discovery of new materials on the basis of existing data sets, have the power to predict completely new classes of compounds (extrapolating) or perform well only when interpolating between known materials. We introduce the leave-one-group-out cross-validation, in which the ML model is trained to explicitly perform extrapolations of unseen chemical families. This approach can be used across materials science and chemistry problems to improve the added value of ML predictions, instead of using extrapolative ML models that were trained with a regular cross-validation. We consider as a case study the problem of the discovery of non-fullerene acceptors because novel classes of acceptors are naturally classified into distinct chemical families. We show that conventional ML methods are not useful in practice when attempting to predict the efficiency of a completely novel class of materials. The approach proposed in this work increases the accuracy of the predictions to enable at least the categorization of materials with a performance above and below the median value.
Collapse
Affiliation(s)
- Zhi-Wen Zhao
- Department of Chemistry, University of Liverpool Liverpool L69 3BX UK
- Institute of Functional Material Chemistry, Faculty of Chemistry, Northeast Normal University Changchun 130024 Jilin P. R. China
| | - Marcos Del Cueto
- Department of Chemistry, University of Liverpool Liverpool L69 3BX UK
| | - Alessandro Troisi
- Department of Chemistry, University of Liverpool Liverpool L69 3BX UK
| |
Collapse
|
13
|
Moore GJ, Bardagot O, Banerji N. Deep Transfer Learning: A Fast and Accurate Tool to Predict the Energy Levels of Donor Molecules for Organic Photovoltaics. ADVANCED THEORY AND SIMULATIONS 2022. [DOI: 10.1002/adts.202100511] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Gareth John Moore
- Department of Chemistry Biochemistry and Pharmaceutical Sciences University of Bern Freiestrasse 3 Bern 3012 Switzerland
| | - Olivier Bardagot
- Department of Chemistry Biochemistry and Pharmaceutical Sciences University of Bern Freiestrasse 3 Bern 3012 Switzerland
| | - Natalie Banerji
- Department of Chemistry Biochemistry and Pharmaceutical Sciences University of Bern Freiestrasse 3 Bern 3012 Switzerland
| |
Collapse
|
14
|
Omar ÖH, Nematiaram T, Troisi A, Padula D. Organic materials repurposing, a data set for theoretical predictions of new applications for existing compounds. Sci Data 2022; 9:54. [PMID: 35165288 PMCID: PMC8844419 DOI: 10.1038/s41597-022-01142-7] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2021] [Accepted: 12/21/2021] [Indexed: 01/28/2023] Open
Abstract
We present a data set of 48182 organic semiconductors, constituted of molecules that were prepared with a documented synthetic pathway and are stable in solid state. We based our search on the Cambridge Structural Database, from which we selected semiconductors with a computational funnel procedure. For each entry we provide a set of electronic properties relevant for organic materials research, and the electronic wavefunction for further calculations and/or analyses. This data set has low bias because it was not built from a set of materials designed for organic electronics, and thus it provides an excellent starting point in the search of new applications for known materials, with a great potential for novel physical insight. The data set contains molecules used as benchmarks in many fields of organic materials research, allowing to test the reliability of computational screenings for the desired application, "rediscovering" well-known molecules. This is demonstrated by a series of different applications in the field of organic materials, confirming the potential for the repurposing of known organic molecules.
Collapse
Affiliation(s)
- Ömer H Omar
- University of Liverpool, Department of Chemistry, Liverpool, L69 7ZD, UK
| | - Tahereh Nematiaram
- University of Liverpool, Department of Chemistry, Liverpool, L69 7ZD, UK
| | - Alessandro Troisi
- University of Liverpool, Department of Chemistry, Liverpool, L69 7ZD, UK.
| | - Daniele Padula
- Università di Siena, Dipartimento di Biotecnologie, Chimica e Farmacia, Siena, 53100, Italy.
| |
Collapse
|
15
|
Review on Y6-Based Semiconductor Materials and Their Future Development via Machine Learning. CRYSTALS 2022. [DOI: 10.3390/cryst12020168] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/04/2022]
Abstract
Non-fullerene acceptors are promising to achieve high efficiency in organic solar cells (OSCs). Y6-based acceptors, one group of new n-type semiconductors, have triggered tremendous attention when they reported a power-conversion efficiency (PCE) of 15.7% in 2019. After that, scientists are trying to improve the efficiency in different aspects including choosing new donors, tuning Y6 structures, and device engineering. In this review, we first summarize the properties of Y6 materials and the seven critical methods modifying the Y6 structure to improve the PCEs developed in the latest three years as well as the basic principles and parameters of OSCs. Finally, the authors would share perspectives on possibilities, necessities, challenges, and potential applications for designing multifunctional organic device with desired performances via machine learning.
Collapse
|
16
|
Gupta A, Chakraborty S, Ghosh D, Ramakrishnan R. Data-driven modeling of S 0 → S 1 excitation energy in the BODIPY chemical space: High-throughput computation, quantum machine learning, and inverse design. J Chem Phys 2021; 155:244102. [PMID: 34972385 DOI: 10.1063/5.0076787] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Derivatives of BODIPY are popular fluorophores due to their synthetic feasibility, structural rigidity, high quantum yield, and tunable spectroscopic properties. While the characteristic absorption maximum of BODIPY is at 2.5 eV, combinations of functional groups and substitution sites can shift the peak position by ±1 eV. Time-dependent long-range corrected hybrid density functional methods can model the lowest excitation energies offering a semi-quantitative precision of ±0.3 eV. Alas, the chemical space of BODIPYs stemming from combinatorial introduction of-even a few dozen-substituents is too large for brute-force high-throughput modeling. To navigate this vast space, we select 77 412 molecules and train a kernel-based quantum machine learning model providing <2% hold-out error. Further reuse of the results presented here to navigate the entire BODIPY universe comprising over 253 giga (253 × 109) molecules is demonstrated by inverse-designing candidates with desired target excitation energies.
Collapse
Affiliation(s)
- Amit Gupta
- Centre for Interdisciplinary Sciences, Tata Institute of Fundamental Research, Hyderabad 500107, India
| | - Sabyasachi Chakraborty
- Centre for Interdisciplinary Sciences, Tata Institute of Fundamental Research, Hyderabad 500107, India
| | - Debashree Ghosh
- Indian Association for the Cultivation of Science, Kolkata 700032, India
| | - Raghunathan Ramakrishnan
- Centre for Interdisciplinary Sciences, Tata Institute of Fundamental Research, Hyderabad 500107, India
| |
Collapse
|
17
|
Li Y, Xu Y, Yu Y. CRNNTL: Convolutional Recurrent Neural Network and Transfer Learning for QSAR Modeling in Organic Drug and Material Discovery. Molecules 2021; 26:molecules26237257. [PMID: 34885843 PMCID: PMC8658888 DOI: 10.3390/molecules26237257] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2021] [Revised: 11/25/2021] [Accepted: 11/26/2021] [Indexed: 11/16/2022] Open
Abstract
Molecular latent representations, derived from autoencoders (AEs), have been widely used for drug or material discovery over the past couple of years. In particular, a variety of machine learning methods based on latent representations have shown excellent performance on quantitative structure–activity relationship (QSAR) modeling. However, the sequence feature of them has not been considered in most cases. In addition, data scarcity is still the main obstacle for deep learning strategies, especially for bioactivity datasets. In this study, we propose the convolutional recurrent neural network and transfer learning (CRNNTL) method inspired by the applications of polyphonic sound detection and electrocardiogram classification. Our model takes advantage of both convolutional and recurrent neural networks for feature extraction, as well as the data augmentation method. According to QSAR modeling on 27 datasets, CRNNTL can outperform or compete with state-of-art methods in both drug and material properties. In addition, the performances on one isomers-based dataset indicate that its excellent performance results from the improved ability in global feature extraction when the ability of the local one is maintained. Then, the transfer learning results show that CRNNTL can overcome data scarcity when choosing relative source datasets. Finally, the high versatility of our model is shown by using different latent representations as inputs from other types of AEs.
Collapse
Affiliation(s)
- Yaqin Li
- West China Tianfu Hospital, Sichuan University, Chengdu 610041, China
- Correspondence: (Y.L.); (Y.Y.)
| | - Yongjin Xu
- Department of Chemistry and Molecular Biology, University of Gothenburg, Kemivägen 10, 41296 Gothenburg, Sweden;
| | - Yi Yu
- Department of Chemistry and Molecular Biology, University of Gothenburg, Kemivägen 10, 41296 Gothenburg, Sweden;
- Correspondence: (Y.L.); (Y.Y.)
| |
Collapse
|
18
|
Ju CW, French EJ, Geva N, Kohn AW, Lin Z. Stacked Ensemble Machine Learning for Range-Separation Parameters. J Phys Chem Lett 2021; 12:9516-9524. [PMID: 34559964 DOI: 10.1021/acs.jpclett.1c02506] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Density functional theory-based high-throughput materials and drug discovery has achieved tremendous success in recent decades, but its power on organic semiconducting molecules suffered catastrophically from the self-interaction error until the nonempirical but expensive optimally tuned range-separated hybrid (OT-RSH) functionals were developed. An OT-RSH transitions from a short-range (semi)local functional to a long-range Hartree-Fock exchange at a distance characterized by a molecule-specific range-separation parameter (ω). Herein, we propose a stacked ensemble machine learning model that provides an accelerated alternative of OT-RSH based on system-dependent structural and electronic configurations. We trained ML-ωPBE, the first functional in our series, using a database of 1970 molecules with sufficient structural and functional diversity, and assessed its accuracy and efficiency using another 1956 molecules. Compared with nonempirical OT-ωPBE, ML-ωPBE reaches a mean absolute error of 0.00504a0-1 for optimal ω's, reduces the computational cost by 2.66 orders of magnitude, and achieves comparable predictive power in optical properties.
Collapse
Affiliation(s)
- Cheng-Wei Ju
- Department of Chemistry, University of Massachusetts, Amherst, Massachusetts 01003, United States
- Pritzker School of Molecular Engineering, The University of Chicago, Chicago, Illinois 60637, United States
| | - Ethan J French
- Department of Chemistry, University of Massachusetts, Amherst, Massachusetts 01003, United States
| | - Nadav Geva
- Advanced Micro Devices Inc., Boxborough, Massachusetts 01719, United States
| | - Alexander W Kohn
- Blizzard Entertainment Inc., Irvine, California 92618, United States
| | - Zhou Lin
- Department of Chemistry, University of Massachusetts, Amherst, Massachusetts 01003, United States
| |
Collapse
|
19
|
Eibeck A, Nurkowski D, Menon A, Bai J, Wu J, Zhou L, Mosbach S, Akroyd J, Kraft M. Predicting Power Conversion Efficiency of Organic Photovoltaics: Models and Data Analysis. ACS OMEGA 2021; 6:23764-23775. [PMID: 34568656 PMCID: PMC8459373 DOI: 10.1021/acsomega.1c02156] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/23/2021] [Accepted: 07/16/2021] [Indexed: 06/13/2023]
Abstract
In this paper, the ability of three selected machine learning neural and baseline models in predicting the power conversion efficiency (PCE) of organic photovoltaics (OPVs) using molecular structure information as an input is assessed. The bidirectional long short-term memory (gFSI/BiLSTM), attentive fingerprints (attentive FP), and simple graph neural networks (simple GNN) as well as baseline support vector regression (SVR), random forests (RF), and high-dimensional model representation (HDMR) methods are trained to both the large and computational Harvard clean energy project database (CEPDB) and the much smaller experimental Harvard organic photovoltaic 15 dataset (HOPV15). It was found that the neural-based models generally performed better on the computational dataset with the attentive FP model reaching a state-of-the-art performance with the test set mean squared error of 0.071. The experimental dataset proved much harder to fit, with all of the models exhibiting a rather poor performance. Contrary to the computational dataset, the baseline models were found to perform better than the neural models. To improve the ability of machine learning models to predict PCEs for OPVs, either better computational results that correlate well with experiments or more experimental data at well-controlled conditions are likely required.
Collapse
Affiliation(s)
- Andreas Eibeck
- CARES,
Cambridge Centre for Advanced Research and Education in Singapore, 1 Create Way, CREATE Tower, #05-05, 138602 Singapore
| | - Daniel Nurkowski
- CMCL
Innovations, Sheraton House, Castle Park, Cambridge CB3 0AX, U.K.
| | - Angiras Menon
- Department
of Chemical Engineering and Biotechnology, University of Cambridge, Philippa Fawcett Drive, Cambridge CB3 0AS, U.K.
| | - Jiaru Bai
- Department
of Chemical Engineering and Biotechnology, University of Cambridge, Philippa Fawcett Drive, Cambridge CB3 0AS, U.K.
| | - Jinkui Wu
- School
of Chemical Engineering, Sichuan University, Chengdu, Sichuan 610065, China
| | - Li Zhou
- School
of Chemical Engineering, Sichuan University, Chengdu, Sichuan 610065, China
| | - Sebastian Mosbach
- CMCL
Innovations, Sheraton House, Castle Park, Cambridge CB3 0AX, U.K.
- Department
of Chemical Engineering and Biotechnology, University of Cambridge, Philippa Fawcett Drive, Cambridge CB3 0AS, U.K.
| | - Jethro Akroyd
- CMCL
Innovations, Sheraton House, Castle Park, Cambridge CB3 0AX, U.K.
- Department
of Chemical Engineering and Biotechnology, University of Cambridge, Philippa Fawcett Drive, Cambridge CB3 0AS, U.K.
| | - Markus Kraft
- CARES,
Cambridge Centre for Advanced Research and Education in Singapore, 1 Create Way, CREATE Tower, #05-05, 138602 Singapore
- Department
of Chemical Engineering and Biotechnology, University of Cambridge, Philippa Fawcett Drive, Cambridge CB3 0AS, U.K.
- School
of Chemical and Biomedical Engineering, Nanyang Technological University, 62 Nanyang Drive, 637459 Singapore
| |
Collapse
|
20
|
Abstract
Theoretical simulations of electronic excitations and associated processes in molecules are indispensable for fundamental research and technological innovations. However, such simulations are notoriously challenging to perform with quantum mechanical methods. Advances in machine learning open many new avenues for assisting molecular excited-state simulations. In this Review, we track such progress, assess the current state of the art and highlight the critical issues to solve in the future. We overview a broad range of machine learning applications in excited-state research, which include the prediction of molecular properties, improvements of quantum mechanical methods for the calculations of excited-state properties and the search for new materials. Machine learning approaches can help us understand hidden factors that influence photo-processes, leading to a better control of such processes and new rules for the design of materials for optoelectronic applications.
Collapse
|
21
|
Rodríguez-Martínez X, Pascual-San-José E, Campoy-Quiles M. Accelerating organic solar cell material's discovery: high-throughput screening and big data. ENERGY & ENVIRONMENTAL SCIENCE 2021; 14:3301-3322. [PMID: 34211582 PMCID: PMC8209551 DOI: 10.1039/d1ee00559f] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/22/2021] [Accepted: 04/20/2021] [Indexed: 05/27/2023]
Abstract
The discovery of novel high-performing materials such as non-fullerene acceptors and low band gap donor polymers underlines the steady increase of record efficiencies in organic solar cells witnessed during the past years. Nowadays, the resulting catalogue of organic photovoltaic materials is becoming unaffordably vast to be evaluated following classical experimentation methodologies: their requirements in terms of human workforce time and resources are prohibitively high, which slows momentum to the evolution of the organic photovoltaic technology. As a result, high-throughput experimental and computational methodologies are fostered to leverage their inherently high exploratory paces and accelerate novel materials discovery. In this review, we present some of the computational (pre)screening approaches performed prior to experimentation to select the most promising molecular candidates from the available materials libraries or, alternatively, generate molecules beyond human intuition. Then, we outline the main high-throuhgput experimental screening and characterization approaches with application in organic solar cells, namely those based on lateral parametric gradients (measuring-intensive) and on automated device prototyping (fabrication-intensive). In both cases, experimental datasets are generated at unbeatable paces, which notably enhance big data readiness. Herein, machine-learning algorithms find a rewarding application niche to retrieve quantitative structure-activity relationships and extract molecular design rationale, which are expected to keep the material's discovery pace up in organic photovoltaics.
Collapse
Affiliation(s)
| | | | - Mariano Campoy-Quiles
- Institut de Ciència de Materials de Barcelona, ICMAB-CSIC, Campus UAB 08193 Bellaterra Spain
| |
Collapse
|
22
|
Friederich P, Krenn M, Tamblyn I, Aspuru-Guzik A. Scientific intuition inspired by machine learning-generated hypotheses. MACHINE LEARNING-SCIENCE AND TECHNOLOGY 2021. [DOI: 10.1088/2632-2153/abda08] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
|
23
|
|
24
|
Munshi J, Chen W, Chien T, Balasubramanian G. Transfer Learned Designer Polymers For Organic Solar Cells. J Chem Inf Model 2021; 61:134-142. [PMID: 33410685 DOI: 10.1021/acs.jcim.0c01157] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Organic photovoltaic (OPV) materials have been examined extensively over the past two decades for solar cell applications because of the potential for device flexibility, low-temperature solution processability, and negligible environmental impact. However, discovery of new candidate OPV materials, especially polymer-based electron donors, that demonstrate notable power conversion efficiencies (PCEs), is nontrivial and time-intensive exercise given the extensive set of possible chemistries. Recent progress in machine learning accelerated materials discovery has facilitated to address this challenge, with molecular line representations, such as Simplified Molecular-Input Line-Entry Systems (SMILES), gaining popularity as molecular fingerprints describing the donor chemical structures. Here, we employ a transfer learning based recurrent neural (LSTM) model, which harnesses the SMILES molecular fingerprints as an input to generate novel designer chemistries for OPV devices. The generative model, perfected on a small focused OPV data set, predicts new polymer repeat units with potentially high PCE. Calculations of the similarity coefficient between the known and the generated polymers corroborate the accuracy of the model predictability as a function of the underlying chemical specificity. The data-enabled framework is sufficiently generic for use in accelerated machine learned materials discovery for various chemistries and applications, mining the hitherto available experimental and computational data.
Collapse
Affiliation(s)
- Joydeep Munshi
- Department of Mechanical Engineering & Mechanics, Lehigh University, Bethlehem, Pennsylvania 18015, United States
| | - Wei Chen
- Department of Mechanical Engineering, Northwestern University, Evanston, Illinois 60208, United States
| | - TeYu Chien
- Department of Physics & Astronomy, University of Wyoming, Laramie, Wyoming 82071, United States
| | - Ganesh Balasubramanian
- Department of Mechanical Engineering & Mechanics, Lehigh University, Bethlehem, Pennsylvania 18015, United States
| |
Collapse
|
25
|
Coley CW, Eyke NS, Jensen KF. Autonome Entdeckung in den chemischen Wissenschaften, Teil II: Ausblick. Angew Chem Int Ed Engl 2020. [DOI: 10.1002/ange.201909989] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Affiliation(s)
- Connor W. Coley
- Department of Chemical Engineering Massachusetts Institute of Technology Cambridge MA 02139 USA
| | - Natalie S. Eyke
- Department of Chemical Engineering Massachusetts Institute of Technology Cambridge MA 02139 USA
| | - Klavs F. Jensen
- Department of Chemical Engineering Massachusetts Institute of Technology Cambridge MA 02139 USA
| |
Collapse
|
26
|
Tsubaki M, Mizoguchi T. Quantum Deep Field: Data-Driven Wave Function, Electron Density Generation, and Atomization Energy Prediction and Extrapolation with Machine Learning. PHYSICAL REVIEW LETTERS 2020; 125:206401. [PMID: 33258648 DOI: 10.1103/physrevlett.125.206401] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/16/2020] [Accepted: 09/23/2020] [Indexed: 06/12/2023]
Abstract
Deep neural networks (DNNs) have been used to successfully predict molecular properties calculated based on the Kohn-Sham density functional theory (KS-DFT). Although this prediction is fast and accurate, we believe that a DNN model for KS-DFT must not only predict the properties but also provide the electron density of a molecule. This Letter presents the quantum deep field (QDF), which provides the electron density with an unsupervised but end-to-end physics-informed modeling by learning the atomization energy on a large-scale dataset. QDF performed well at atomization energy prediction, generated valid electron density, and demonstrated extrapolation.
Collapse
Affiliation(s)
- Masashi Tsubaki
- National Institute of Advanced Industrial Science and Technology, 2-3-26 Aomi, Koto-ku, Tokyo 135-0064, Japan
| | - Teruyasu Mizoguchi
- Institute of Industrial Science, University of Tokyo, 4-6-1 Komaba, Meguro-ku, Tokyo 153-8505, Japan
| |
Collapse
|
27
|
Wu J, Wang S, Zhou L, Ji X, Dai Y, Dang Y, Kraft M. Deep-Learning Architecture in QSPR Modeling for the Prediction of Energy Conversion Efficiency of Solar Cells. Ind Eng Chem Res 2020. [DOI: 10.1021/acs.iecr.0c03880] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Jinkui Wu
- School of Chemical Engineering, Sichuan University, Chengdu 610065, China
| | - Shihui Wang
- School of Chemical Engineering, Sichuan University, Chengdu 610065, China
| | - Li Zhou
- School of Chemical Engineering, Sichuan University, Chengdu 610065, China
| | - Xu Ji
- School of Chemical Engineering, Sichuan University, Chengdu 610065, China
| | - Yiyang Dai
- School of Chemical Engineering, Sichuan University, Chengdu 610065, China
| | - Yagu Dang
- School of Chemical Engineering, Sichuan University, Chengdu 610065, China
| | - Markus Kraft
- Cambridge Center for Advanced Research and Education in Singapore Ltd., 1 Create Way, 138602 Singapore
- Department of Chemical Engineering and Biotechnology, University of Cambridge, Philippa Fawcett Drive, Cambridge CB3 0AS, U.K
- School of Chemical and Biomedical Engineering, Nanyang Technological University, 62 Nanyang Drive, Singapore 637459, Singapore
| |
Collapse
|
28
|
Häse F, Roch LM, Friederich P, Aspuru-Guzik A. Designing and understanding light-harvesting devices with machine learning. Nat Commun 2020; 11:4587. [PMID: 32917886 PMCID: PMC7486390 DOI: 10.1038/s41467-020-17995-8] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2019] [Accepted: 07/16/2020] [Indexed: 01/27/2023] Open
Abstract
Understanding the fundamental processes of light-harvesting is crucial to the development of clean energy materials and devices. Biological organisms have evolved complex metabolic mechanisms to efficiently convert sunlight into chemical energy. Unraveling the secrets of this conversion has inspired the design of clean energy technologies, including solar cells and photocatalytic water splitting. Describing the emergence of macroscopic properties from microscopic processes poses the challenge to bridge length and time scales of several orders of magnitude. Machine learning experiences increased popularity as a tool to bridge the gap between multi-level theoretical models and Edisonian trial-and-error approaches. Machine learning offers opportunities to gain detailed scientific insights into the underlying principles governing light-harvesting phenomena and can accelerate the fabrication of light-harvesting devices.
Collapse
Affiliation(s)
- Florian Häse
- Department of Chemistry and Chemical Biology, Harvard University, 12 Oxford Street, Cambridge, 02138, MA, USA
- CIFAR AI Chair, Vector Institute for Artificial Intelligence, 661 University Avenue, Toronto, ON, M5S 1M1, Canada
- Department of Computer Science, University of Toronto, 214 College Street, Toronto, ON, M5S 3H6, Canada
- Chemical Physics Theory Group, Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
| | - Loïc M Roch
- CIFAR AI Chair, Vector Institute for Artificial Intelligence, 661 University Avenue, Toronto, ON, M5S 1M1, Canada
- Department of Computer Science, University of Toronto, 214 College Street, Toronto, ON, M5S 3H6, Canada
- Chemical Physics Theory Group, Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- ChemOS Sàrl, Lausanne, VD, 1006, Switzerland
| | - Pascal Friederich
- Department of Computer Science, University of Toronto, 214 College Street, Toronto, ON, M5S 3H6, Canada
- Chemical Physics Theory Group, Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- Institute of Nanotechnology, Karlsruhe Insititute of Technology, Hermann-von-Helmholtz-Platz 1, 76344, Eggenstein-Leopoldshafen, Germany
| | - Alán Aspuru-Guzik
- CIFAR AI Chair, Vector Institute for Artificial Intelligence, 661 University Avenue, Toronto, ON, M5S 1M1, Canada.
- Department of Computer Science, University of Toronto, 214 College Street, Toronto, ON, M5S 3H6, Canada.
- Chemical Physics Theory Group, Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada.
- Lebovic Fellow, Canadian Institute for Advanced Research (CIFAR), 661 University Avenue, Toronto, ON, M5S 1M1, Canada.
| |
Collapse
|
29
|
Luo S, Li T, Wang X, Faizan M, Zhang L. High‐throughput computational materials screening and discovery of optoelectronic semiconductors. WIRES COMPUTATIONAL MOLECULAR SCIENCE 2020. [DOI: 10.1002/wcms.1489] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Affiliation(s)
- Shulin Luo
- State Key Laboratory of Integrated Optoelectronics, Key Laboratory of Automobile Materials of MOE and School of Materials Science and Engineering Jilin University Changchun China
| | - Tianshu Li
- State Key Laboratory of Integrated Optoelectronics, Key Laboratory of Automobile Materials of MOE and School of Materials Science and Engineering Jilin University Changchun China
| | - Xinjiang Wang
- Department of Physics, State Key Laboratory of Superhard Materials Jilin University Changchun China
| | - Muhammad Faizan
- State Key Laboratory of Integrated Optoelectronics, Key Laboratory of Automobile Materials of MOE and School of Materials Science and Engineering Jilin University Changchun China
| | - Lijun Zhang
- State Key Laboratory of Integrated Optoelectronics, Key Laboratory of Automobile Materials of MOE and School of Materials Science and Engineering Jilin University Changchun China
| |
Collapse
|
30
|
Coley CW, Eyke NS, Jensen KF. Autonomous Discovery in the Chemical Sciences Part II: Outlook. Angew Chem Int Ed Engl 2020; 59:23414-23436. [PMID: 31553509 DOI: 10.1002/anie.201909989] [Citation(s) in RCA: 101] [Impact Index Per Article: 25.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2019] [Indexed: 01/19/2023]
Abstract
This two-part Review examines how automation has contributed to different aspects of discovery in the chemical sciences. In this second part, we reflect on a selection of exemplary studies. It is increasingly important to articulate what the role of automation and computation has been in the scientific process and how that has or has not accelerated discovery. One can argue that even the best automated systems have yet to "discover" despite being incredibly useful as laboratory assistants. We must carefully consider how they have been and can be applied to future problems of chemical discovery in order to effectively design and interact with future autonomous platforms. The majority of this Review defines a large set of open research directions, including improving our ability to work with complex data, build empirical models, automate both physical and computational experiments for validation, select experiments, and evaluate whether we are making progress towards the ultimate goal of autonomous discovery. Addressing these practical and methodological challenges will greatly advance the extent to which autonomous systems can make meaningful discoveries.
Collapse
Affiliation(s)
- Connor W Coley
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
| | - Natalie S Eyke
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
| | - Klavs F Jensen
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
| |
Collapse
|
31
|
Hoffmann R, Malrieu JP. Simulation vs. Understanding: A Tension, in Quantum Chemistry and Beyond. Part B. The March of Simulation, for Better or Worse. Angew Chem Int Ed Engl 2020; 59:13156-13178. [PMID: 31675462 DOI: 10.1002/anie.201910283] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2019] [Indexed: 11/08/2022]
Abstract
In the second part of this Essay, we leave philosophy, and begin by describing Roald's being trashed by simulation. This leads us to a general sketch of artificial intelligence (AI), Searle's Chinese room, and Strevens' account of what a go-playing program knows. Back to our terrain-we ask "Quantum Chemistry, † ca. 2020?" Then we move to examples of Big Data, machine learning and neural networks in action, first in chemistry and then affecting social matters, trivial to scary. We argue that moral decisions are hardly to be left to a computer. And that posited causes, even if recognized as provisional, represent a much deeper level of understanding than correlations. At this point, we try to pull the reader up, giving voice to the opposing view of an optimistic, limitless future. But we don't do justice to that view-how could we, older mammals on the way to extinction that we are? We try. But then we return to fuss, questioning the ascetic dimension of scientists, their romance with black boxes. And argue for a science of many tongues.
Collapse
Affiliation(s)
- Roald Hoffmann
- Dept. of Chemistry and Chemical Biology, Cornell University, Ithaca, NY, 14850, USA
| | - Jean-Paul Malrieu
- Laboratoire de Chimie et Physique Quantiques, Université de Toulouse 3, 118 route de Narbonne, 31062, Toulouse, France
| |
Collapse
|
32
|
Hoffmann R, Malrieu J. Simulation vs. Understanding: A Tension, in Quantum Chemistry and Beyond. Part B. The March of Simulation, for Better or Worse. Angew Chem Int Ed Engl 2020. [DOI: 10.1002/ange.201910283] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Affiliation(s)
- Roald Hoffmann
- Dept. of Chemistry and Chemical Biology Cornell University Ithaca NY 14850 USA
| | - Jean‐Paul Malrieu
- Laboratoire de Chimie et Physique Quantiques Université de Toulouse 3 118 route de Narbonne 31062 Toulouse France
| |
Collapse
|
33
|
Haghighatlari M, Vishwakarma G, Altarawy D, Subramanian R, Kota BU, Sonpal A, Setlur S, Hachmann J. ChemML
: A machine learning and informatics program package for the analysis, mining, and modeling of chemical and materials data. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2020. [DOI: 10.1002/wcms.1458] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Affiliation(s)
- Mojtaba Haghighatlari
- Department of Chemical and Biological Engineering University at Buffalo, The State University of New York Buffalo New York
| | - Gaurav Vishwakarma
- Department of Chemical and Biological Engineering University at Buffalo, The State University of New York Buffalo New York
| | - Doaa Altarawy
- The Molecular Sciences Software Institute, Virginia Tech Blacksburg Virginia
- Computer and Systems Engineering Department Alexandria University Alexandria Egypt
| | - Ramachandran Subramanian
- Department of Computer Science and Engineering University at Buffalo, The State University of New York Buffalo New York
- Center for Unified Biometrics and Sensors University at Buffalo, The State University of New York Buffalo New York
| | - Bhargava U. Kota
- Department of Computer Science and Engineering University at Buffalo, The State University of New York Buffalo New York
- Center for Unified Biometrics and Sensors University at Buffalo, The State University of New York Buffalo New York
| | - Aditya Sonpal
- Department of Chemical and Biological Engineering University at Buffalo, The State University of New York Buffalo New York
| | - Srirangaraj Setlur
- Department of Computer Science and Engineering University at Buffalo, The State University of New York Buffalo New York
- Center for Unified Biometrics and Sensors University at Buffalo, The State University of New York Buffalo New York
- Center of Excellence for Document Analysis and Recognition, University at Buffalo The State University of New York Buffalo New York
| | - Johannes Hachmann
- Department of Chemical and Biological Engineering University at Buffalo, The State University of New York Buffalo New York
- Computational and Data‐Enabled Science and Engineering Graduate Program University at Buffalo, The State University of New York Buffalo New York
- New York State Center of Excellence in Materials Informatics Buffalo New York
| |
Collapse
|
34
|
Fare C, Turcani L, Pyzer-Knapp EO. Powerful, transferable representations for molecules through intelligent task selection in deep multitask networks. Phys Chem Chem Phys 2020; 22:13041-13048. [DOI: 10.1039/d0cp02319a] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
Abstract
We develop and test a framework for selecting appropriate chemical datasets to create molecular representations tailored for specific tasks.
Collapse
Affiliation(s)
- Clyde Fare
- IBM Research UK
- Sci-Tech Daresbury
- Warrington
- UK
| | | | | |
Collapse
|
35
|
Beard EJ, Sivaraman G, Vázquez-Mayagoitia Á, Vishwanath V, Cole JM. Comparative dataset of experimental and computational attributes of UV/vis absorption spectra. Sci Data 2019; 6:307. [PMID: 31804487 PMCID: PMC6895184 DOI: 10.1038/s41597-019-0306-0] [Citation(s) in RCA: 51] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2019] [Accepted: 11/12/2019] [Indexed: 02/07/2023] Open
Abstract
The ability to auto-generate databases of optical properties holds great prospects in data-driven materials discovery for optoelectronic applications. We present a cognate set of experimental and computational data that describes key features of optical absorption spectra. This includes an auto-generated database of 18,309 records of experimentally determined UV/vis absorption maxima, λmax, and associated extinction coefficients, ϵ, where present. This database was produced using the text-mining toolkit, ChemDataExtractor, on 402,034 scientific documents. High-throughput electronic-structure calculations using fast (simplified Tamm-Dancoff approach) and traditional (time-dependent) density functional theory were executed to predict λmax and oscillation strengths, f (related to ϵ) for a subset of validated compounds. Paired quantities of these computational and experimental data show strong correlations in λmax, f and ϵ, laying the path for reliable in silico calculations of additional optical properties. The total dataset of 8,488 unique compounds and a subset of 5,380 compounds with experimental and computational data, are available in MongoDB, CSV and JSON formats. These can be queried using Python, R, Java, and MATLAB, for data-driven optoelectronic materials discovery. Measurement(s) | ultraviolet–visible spectrum • absorption wavelength • extinction coefficient | Technology Type(s) | digital curation |
Machine-accessible metadata file describing the reported data: 10.6084/m9.figshare.10304897
Collapse
Affiliation(s)
- Edward J Beard
- Cavendish Laboratory, Department of Physics, University of Cambridge, J. J. Thomson Avenue, Cambridge, CB3 0HE, UK.,ISIS Neutron and Muon Source, STFC Rutherford Appleton Laboratory, Harwell Science and Innovation Campus, Didcot, Oxfordshire, OX11 0QX, UK
| | - Ganesh Sivaraman
- Argonne National Laboratory, 9700 South Cass Avenue, Lemont, IL, 60439, USA
| | | | | | - Jacqueline M Cole
- Cavendish Laboratory, Department of Physics, University of Cambridge, J. J. Thomson Avenue, Cambridge, CB3 0HE, UK. .,ISIS Neutron and Muon Source, STFC Rutherford Appleton Laboratory, Harwell Science and Innovation Campus, Didcot, Oxfordshire, OX11 0QX, UK. .,Argonne National Laboratory, 9700 South Cass Avenue, Lemont, IL, 60439, USA. .,Department of Chemical Engineering and Biotechnology, University of Cambridge, West Cambridge Site, Philippa Fawcett Drive, Cambridge, CB3 0FS, UK.
| |
Collapse
|
36
|
Abreha BG, Agarwal S, Foster I, Blaiszik B, Lopez SA. Virtual Excited State Reference for the Discovery of Electronic Materials Database: An Open-Access Resource for Ground and Excited State Properties of Organic Molecules. J Phys Chem Lett 2019; 10:6835-6841. [PMID: 31642678 DOI: 10.1021/acs.jpclett.9b02577] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
This letter announces the Virtual Excited State Reference for the Discovery of Electronic Materials Database (VERDE materials DB), the first database to include downloadable excited-state structures (S0, S1, T1) and photophysical properties. VERDE materials DB is searchable, open-access via www.verdedb.org , and focused on light-responsive π-conjugated organic molecules with applications in green chemistry, organic solar cells, and organic redox flow batteries. It includes results of our active and past virtual screening studies; to date, more than 13 000 density functional theory (DFT) calculations have been performed on 1 500 molecules to obtain frontier molecular orbitals and photophysical properties, including excitation energies, dipole moments, and redox potentials. To improve community access, we have made VERDE materials DB available via an integration with the Materials Data Facility. We are leveraging VERDE materials DB to train machine learning algorithms to identify new materials and structure-property relationships between molecular ground- and excited-states. We present a case-study involving photoaffinity labels, including predictions of new diazirine-based photoaffinity labels anticipated to have high photostabilities.
Collapse
Affiliation(s)
- Biruk G Abreha
- Northeastern University , Boston , Massachusetts 02115 , United States
| | - Snigdha Agarwal
- Northeastern University , Boston , Massachusetts 02115 , United States
| | - Ian Foster
- Argonne National Laboratory , Lemont , Illinois 60439 , United States
- University of Chicago , Chicago , Illinois 60637 , United States
| | - Ben Blaiszik
- Argonne National Laboratory , Lemont , Illinois 60439 , United States
- University of Chicago , Chicago , Illinois 60637 , United States
| | - Steven A Lopez
- Northeastern University , Boston , Massachusetts 02115 , United States
| |
Collapse
|
37
|
Ryu S, Kwon Y, Kim WY. A Bayesian graph convolutional network for reliable prediction of molecular properties with uncertainty quantification. Chem Sci 2019; 10:8438-8446. [PMID: 31803423 PMCID: PMC6839511 DOI: 10.1039/c9sc01992h] [Citation(s) in RCA: 56] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2019] [Accepted: 07/21/2019] [Indexed: 01/14/2023] Open
Abstract
Deep neural networks have been increasingly used in various chemical fields. In the nature of a data-driven approach, their performance strongly depends on data used in training. Therefore, models developed in data-deficient situations can cause highly uncertain predictions, leading to vulnerable decision making. Here, we show that Bayesian inference enables more reliable prediction with quantitative uncertainty analysis. Decomposition of the predictive uncertainty into model- and data-driven uncertainties allows us to elucidate the source of errors for further improvements. For molecular applications, we devised a Bayesian graph convolutional network (GCN) and evaluated its performance for molecular property predictions. Our study on the classification problem of bio-activity and toxicity shows that the confidence of prediction can be quantified in terms of the predictive uncertainty, leading to more accurate virtual screening of drug candidates than standard GCNs. The result of log P prediction illustrates that data noise affects the data-driven uncertainty more significantly than the model-driven one. Based on this finding, we could identify artefacts that arose from quantum mechanical calculations in the Harvard Clean Energy Project dataset. Consequently, the Bayesian GCN is critical for molecular applications under data-deficient conditions.
Collapse
Affiliation(s)
- Seongok Ryu
- Department of Chemistry , KAIST , 291 Daehak-ro, Yuseong-gu , Daejeon 34141 , Republic of Korea .
| | - Yongchan Kwon
- Department of Statistics , Seoul National University , 1 Gwanak-ro, Gwanak-gu , Seoul 08826 , Republic of Korea
| | - Woo Youn Kim
- Department of Chemistry , KAIST , 291 Daehak-ro, Yuseong-gu , Daejeon 34141 , Republic of Korea .
- KI for Artificial Intelligence , KAIST , 291 Daehak-ro, Yuseong-gu , Daejeon 34141 , Republic of Korea
| |
Collapse
|
38
|
Afzal MAF, Sonpal A, Haghighatlari M, Schultz AJ, Hachmann J. A deep neural network model for packing density predictions and its application in the study of 1.5 million organic molecules. Chem Sci 2019; 10:8374-8383. [PMID: 31762970 PMCID: PMC6855195 DOI: 10.1039/c9sc02677k] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2019] [Accepted: 07/08/2019] [Indexed: 01/23/2023] Open
Abstract
Computational pipeline for the accelerated discovery of organic materials with high refractive index via high-throughput screening and machine learning.
The process of developing new compounds and materials is increasingly driven by computational modeling and simulation, which allow us to characterize candidates before pursuing them in the laboratory. One of the non-trivial properties of interest for organic materials is their packing in the bulk, which is highly dependent on their molecular structure. By controlling the latter, we can realize materials with a desired density (as well as other target properties). Molecular dynamics simulations are a popular and reasonably accurate way to compute the bulk density of molecules, however, since these calculations are computationally intensive, they are not a practically viable option for high-throughput screening studies that assess material candidates on a massive scale. In this work, we employ machine learning to develop a data-derived prediction model that is an alternative to physics-based simulations, and we utilize it for the hyperscreening of 1.5 million small organic molecules as well as to gain insights into the relationship between structural makeup and packing density. We also use this study to analyze the learning curve of the employed neural network approach and gain empirical data on the dependence of model performance and training data size, which will inform future investigations.
Collapse
Affiliation(s)
- Mohammad Atif Faiz Afzal
- Department of Chemical and Biological Engineering , University at Buffalo , The State University of New York , Buffalo , NY 14260 , USA . ;
| | - Aditya Sonpal
- Department of Chemical and Biological Engineering , University at Buffalo , The State University of New York , Buffalo , NY 14260 , USA . ;
| | - Mojtaba Haghighatlari
- Department of Chemical and Biological Engineering , University at Buffalo , The State University of New York , Buffalo , NY 14260 , USA . ;
| | - Andrew J Schultz
- Department of Chemical and Biological Engineering , University at Buffalo , The State University of New York , Buffalo , NY 14260 , USA . ;
| | - Johannes Hachmann
- Department of Chemical and Biological Engineering , University at Buffalo , The State University of New York , Buffalo , NY 14260 , USA . ; .,Computational and Data-Enabled Science and Engineering Graduate Program , University at Buffalo , The State University of New York , Buffalo , NY 14260 , USA.,New York State Center of Excellence in Materials Informatics , Buffalo , NY 14203 , USA
| |
Collapse
|
39
|
Kim H, Park JY, Choi S. Energy refinement and analysis of structures in the QM9 database via a highly accurate quantum chemical method. Sci Data 2019; 6:109. [PMID: 31270326 PMCID: PMC6610095 DOI: 10.1038/s41597-019-0121-7] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2019] [Accepted: 06/13/2019] [Indexed: 12/12/2022] Open
Abstract
A wide variety of data-driven approaches have been introduced in the field of quantum chemistry. To extend the applicable range and improve the prediction power of those approaches, highly accurate quantum chemical benchmarks that cover extremely large chemical spaces are required. Here, we report ~134 k quantum chemical calculations performed with G4MP2, the fourth generation of the G-n series in which second-order perturbation theory is employed. A single composite method calculation executes several low-level calculations to reproduce the results of high-level ab initio calculations with the aim of saving computational costs. Therefore, our database reports the results of the various methods (e.g., density functional theory, Hartree-Fock, Møller-Plesset perturbation theory, and coupled-cluster theory). Additionally, we examined the structure information of both the QM9 and the revised databases via chemical graph analysis. Our database can be applied to refine and improve the quality of data-driven quantum chemical prediction. Furthermore, we reported the raw outputs of all calculations performed in this work for other potential applications.
Collapse
Affiliation(s)
- Hyungjun Kim
- Department of Chemistry, Incheon National University, 119 Academy-ro, Yeonsu-gu, Incheon, 22012, Republic of Korea
| | - Ji Young Park
- Department of Chemistry, Korea Advanced Institute of Science and Technology (KAIST), 291 Daehak-ro, Yuseong-gu, Daejeon, 34141, Republic of Korea
| | - Sunghwan Choi
- National Institute of Supercomputing and Network, Korea Institute of Science and Technology Information, Daejeon, 34141, Republic of Korea.
| |
Collapse
|
40
|
St John PC, Phillips C, Kemper TW, Wilson AN, Guan Y, Crowley MF, Nimlos MR, Larsen RE. Message-passing neural networks for high-throughput polymer screening. J Chem Phys 2019; 150:234111. [PMID: 31228909 DOI: 10.1063/1.5099132] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Machine learning methods have shown promise in predicting molecular properties, and given sufficient training data, machine learning approaches can enable rapid high-throughput virtual screening of large libraries of compounds. Graph-based neural network architectures have emerged in recent years as the most successful approach for predictions based on molecular structure and have consistently achieved the best performance on benchmark quantum chemical datasets. However, these models have typically required optimized 3D structural information for the molecule to achieve the highest accuracy. These 3D geometries are costly to compute for high levels of theory, limiting the applicability and practicality of machine learning methods in high-throughput screening applications. In this study, we present a new database of candidate molecules for organic photovoltaic applications, comprising approximately 91 000 unique chemical structures. Compared to existing datasets, this dataset contains substantially larger molecules (up to 200 atoms) as well as extrapolated properties for long polymer chains. We show that message-passing neural networks trained with and without 3D structural information for these molecules achieve similar accuracy, comparable to state-of-the-art methods on existing benchmark datasets. These results therefore emphasize that for larger molecules with practical applications, near-optimal prediction results can be obtained without using optimized 3D geometry as an input. We further show that learned molecular representations can be leveraged to reduce the training data required to transfer predictions to a new density functional theory functional.
Collapse
Affiliation(s)
- Peter C St John
- Biosciences Center, National Renewable Energy Laboratory, Golden, Colorado 80401-3393, USA
| | - Caleb Phillips
- Computational Science Center, National Renewable Energy Laboratory, Golden, Colorado 80401-3393, USA
| | - Travis W Kemper
- Computational Science Center, National Renewable Energy Laboratory, Golden, Colorado 80401-3393, USA
| | - A Nolan Wilson
- National Biaoenergy Center, National Renewable Energy Laboratory, Golden, Colorado 80401-3393, USA
| | - Yanfei Guan
- Department of Chemistry, Colorado State University, Fort Collins, Colorado 80523-1872, USA
| | - Michael F Crowley
- Biosciences Center, National Renewable Energy Laboratory, Golden, Colorado 80401-3393, USA
| | - Mark R Nimlos
- National Biaoenergy Center, National Renewable Energy Laboratory, Golden, Colorado 80401-3393, USA
| | - Ross E Larsen
- Computational Science Center, National Renewable Energy Laboratory, Golden, Colorado 80401-3393, USA
| |
Collapse
|
41
|
Friederich P, Fediai A, Kaiser S, Konrad M, Jung N, Wenzel W. Toward Design of Novel Materials for Organic Electronics. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2019; 31:e1808256. [PMID: 31012166 DOI: 10.1002/adma.201808256] [Citation(s) in RCA: 61] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/22/2018] [Indexed: 06/09/2023]
Abstract
Materials for organic electronics are presently used in prominent applications, such as displays in mobile devices, while being intensely researched for other purposes, such as organic photovoltaics, large-area devices, and thin-film transistors. Many of the challenges to improve and optimize these applications are material related and there is a nearly infinite chemical space that needs to be explored to identify the most suitable material candidates. Established experimental approaches struggle with the size and complexity of this chemical space. Herein, the development of simulation methods is addressed, with a particular emphasis on predictive multiscale protocols, to complement experimental research in the identification of novel materials and illustrate the potential of these methods with a few prominent recent applications. Finally, the potential of machine learning and methods based on artificial intelligence is discussed to further accelerate the search for new materials.
Collapse
Affiliation(s)
- Pascal Friederich
- Institute of Nanotechnology (INT), Karlsruhe Institute of Technology (KIT), Hermann-von-Helmholtz-Platz 1, 76344, Eggenstein-Leopoldshafen, Germany
- Department of Chemistry, University of Toronto, 80 St. George Street, M5S 3H6, Toronto, Ontario, Canada
| | - Artem Fediai
- Institute of Nanotechnology (INT), Karlsruhe Institute of Technology (KIT), Hermann-von-Helmholtz-Platz 1, 76344, Eggenstein-Leopoldshafen, Germany
| | - Simon Kaiser
- Institute of Nanotechnology (INT), Karlsruhe Institute of Technology (KIT), Hermann-von-Helmholtz-Platz 1, 76344, Eggenstein-Leopoldshafen, Germany
| | - Manuel Konrad
- Institute of Nanotechnology (INT), Karlsruhe Institute of Technology (KIT), Hermann-von-Helmholtz-Platz 1, 76344, Eggenstein-Leopoldshafen, Germany
| | - Nicole Jung
- Institute of Organic Chemistry (IOC), Karlsruhe Institute of Technology (KIT), Fritz-Haber-Weg 6, 76131, Karlsruhe, Germany
| | - Wolfgang Wenzel
- Institute of Nanotechnology (INT), Karlsruhe Institute of Technology (KIT), Hermann-von-Helmholtz-Platz 1, 76344, Eggenstein-Leopoldshafen, Germany
| |
Collapse
|
42
|
Virtual Screening of Conjugated Polymers for Organic Photovoltaic Devices Using Support Vector Machines and Ensemble Learning. INT J POLYM SCI 2019. [DOI: 10.1155/2019/4538514] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Herein, we report virtual screening of potential semiconductor polymers for high-performance organic photovoltaic (OPV) devices using various machine learning algorithms. We particularly focus on support vector machine (SVM) and ensemble learning approaches. We found that the power conversion efficiencies of the device prepared with the polymer candidates can be predicted with their structure fingerprints as the only inputs. In other words, no preliminary knowledge about material properties was required. Additionally, the predictive performance could be further improved by “blending” the results of the SVM and random forest models. The resulting ensemble learning algorithm might open up a new opportunity for more precise, high-throughput virtual screening of conjugated polymers for OPV devices.
Collapse
|
43
|
Afzal MAF, Hachmann J. Benchmarking DFT approaches for the calculation of polarizability inputs for refractive index predictions in organic polymers. Phys Chem Chem Phys 2019; 21:4452-4460. [PMID: 30734777 DOI: 10.1039/c8cp05492d] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
In a previous study, we introduced a new computational protocol to accurately predict the index of refraction (RI) of organic polymers using a combination of first-principles and data modeling. This protocol is based on the Lorentz-Lorenz equation and involves the calculation of static polarizabilities and number densities of oligomer sequences, which are extrapolated to the polymer limit. We chose to compute the polarizabilities within the density functional theory (DFT) framework using the PBE0/def2-TZVP-D3 model chemistry. While this ad hoc choice proved remarkably successful, it is also relatively expensive from a computational perspective. It represents the bottleneck step in the overall RI modeling protocol, thus limiting its utility for virtual high-throughput screening studies, in which efficiency is essential. For polymers that exhibit late-onset extensivity, the employed linear extrapolation scheme can require demanding calculations on long-oligomer sequences, thus becoming another bottleneck. In the work presented here, we benchmark DFT model chemistries to identify approaches that optimize the balance between accuracy and efficiency for this application domain. We compare results for conjugated and non-conjugated polymers, augment our original extrapolation approach with a non-linear option, analyze how the polarizability errors propagate into the RI predictions, and offer guidance for method selection.
Collapse
Affiliation(s)
- Mohammad Atif Faiz Afzal
- Department of Chemical and Biological Engineering, University at Buffalo, The State University of New York, Buffalo, NY 14260, USA.
| | | |
Collapse
|
44
|
Afzal MAF, Cheng C, Hachmann J. Combining first-principles and data modeling for the accurate prediction of the refractive index of organic polymers. J Chem Phys 2018; 148:241712. [DOI: 10.1063/1.5007873] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Affiliation(s)
- Mohammad Atif Faiz Afzal
- Department of Chemical and Biological Engineering, University at Buffalo, The State University of New York, Buffalo, New York 14260, USA
| | - Chong Cheng
- Department of Chemical and Biological Engineering, University at Buffalo, The State University of New York, Buffalo, New York 14260, USA
| | - Johannes Hachmann
- Department of Chemical and Biological Engineering, University at Buffalo, The State University of New York, Buffalo, New York 14260, USA
- Computational and Data-Enabled Science and Engineering Graduate Program, University at Buffalo, The State University of New York, Buffalo, New York 14260, USA
- New York State Center of Excellence in Materials Informatics, Buffalo, New York 14203, USA
| |
Collapse
|
45
|
Guillén-López A, Delesma C, Amador-Bedolla C, Robles M, Muñiz J. Electronic structure and nonlinear optical properties of organic photovoltaic systems with potential applications on solar cell devices: a DFT approach. Theor Chem Acc 2018. [DOI: 10.1007/s00214-018-2267-3] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/14/2022]
|
46
|
Hachmann J, Afzal MAF, Haghighatlari M, Pal Y. Building and deploying a cyberinfrastructure for the data-driven design of chemical systems and the exploration of chemical space. MOLECULAR SIMULATION 2018. [DOI: 10.1080/08927022.2018.1471692] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Affiliation(s)
- Johannes Hachmann
- Department of Chemical and Biological Engineering, University at Buffalo, The State University of New York , Buffalo, NY, USA
- Computational and Data-Enabled Science and Engineering Graduate Program, University at Buffalo, The State University of New York , Buffalo, NY, USA
- New York State Center of Excellence in Materials Informatics , Buffalo, NY, USA
| | - Mohammad Atif Faiz Afzal
- Department of Chemical and Biological Engineering, University at Buffalo, The State University of New York , Buffalo, NY, USA
| | - Mojtaba Haghighatlari
- Department of Chemical and Biological Engineering, University at Buffalo, The State University of New York , Buffalo, NY, USA
| | - Yudhajit Pal
- Department of Chemical and Biological Engineering, University at Buffalo, The State University of New York , Buffalo, NY, USA
| |
Collapse
|
47
|
Nagasawa S, Al-Naamani E, Saeki A. Computer-Aided Screening of Conjugated Polymers for Organic Solar Cell: Classification by Random Forest. J Phys Chem Lett 2018; 9:2639-2646. [PMID: 29733216 DOI: 10.1021/acs.jpclett.8b00635] [Citation(s) in RCA: 62] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
Owing to the diverse chemical structures, organic photovoltaic (OPV) applications with a bulk heterojunction framework have greatly evolved over the last two decades, which has produced numerous organic semiconductors exhibiting improved power conversion efficiencies (PCEs). Despite the recent fast progress in materials informatics and data science, data-driven molecular design of OPV materials remains challenging. We report a screening of conjugated molecules for polymer-fullerene OPV applications by supervised learning methods (artificial neural network (ANN) and random forest (RF)). Approximately 1000 experimental parameters including PCE, molecular weight, and electronic properties are manually collected from the literature and subjected to machine learning with digitized chemical structures. Contrary to the low correlation coefficient in ANN, RF yields an acceptable accuracy, which is twice that of random classification. We demonstrate the application of RF screening for the design, synthesis, and characterization of a conjugated polymer, which facilitates a rapid development of optoelectronic materials.
Collapse
Affiliation(s)
- Shinji Nagasawa
- Department of Applied Chemistry, Graduate School of Engineering , Osaka University , 2-1 Yamadaoka , Suita, Osaka 565-0871 , Japan
| | - Eman Al-Naamani
- Department of Applied Chemistry, Graduate School of Engineering , Osaka University , 2-1 Yamadaoka , Suita, Osaka 565-0871 , Japan
| | - Akinori Saeki
- Department of Applied Chemistry, Graduate School of Engineering , Osaka University , 2-1 Yamadaoka , Suita, Osaka 565-0871 , Japan
- Precursory Research for Embryonic Science and Technology (PRESTO) , Japan Science and Technology Agency , 4-1-8 Honcho , Kawaguchi, Saitama 332-0012 , Japan
| |
Collapse
|
48
|
Ghahremanpour MM, van Maaren PJ, van der Spoel D. The Alexandria library, a quantum-chemical database of molecular properties for force field development. Sci Data 2018; 5:180062. [PMID: 29633987 PMCID: PMC5892371 DOI: 10.1038/sdata.2018.62] [Citation(s) in RCA: 37] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2017] [Accepted: 02/19/2018] [Indexed: 12/03/2022] Open
Abstract
Data quality as well as library size are crucial issues for force field development. In order to predict molecular properties in a large chemical space, the foundation to build force fields on needs to encompass a large variety of chemical compounds. The tabulated molecular physicochemical properties also need to be accurate. Due to the limited transparency in data used for development of existing force fields it is hard to establish data quality and reusability is low. This paper presents the Alexandria library as an open and freely accessible database of optimized molecular geometries, frequencies, electrostatic moments up to the hexadecupole, electrostatic potential, polarizabilities, and thermochemistry, obtained from quantum chemistry calculations for 2704 compounds. Values are tabulated and where available compared to experimental data. This library can assist systematic development and training of empirical force fields for a broad range of molecules.
Collapse
Affiliation(s)
- Mohammad M. Ghahremanpour
- Uppsala Centre for Computational Chemistry, Science for Life Laboratory, Department of Cell and Molecular Biology, Uppsala University, Husargatan 3, Box 596, SE-75124 Uppsala, Sweden
| | - Paul J. van Maaren
- Uppsala Centre for Computational Chemistry, Science for Life Laboratory, Department of Cell and Molecular Biology, Uppsala University, Husargatan 3, Box 596, SE-75124 Uppsala, Sweden
| | - David van der Spoel
- Uppsala Centre for Computational Chemistry, Science for Life Laboratory, Department of Cell and Molecular Biology, Uppsala University, Husargatan 3, Box 596, SE-75124 Uppsala, Sweden
| |
Collapse
|
49
|
Venkatraman V, Raju R, Oikonomopoulos SP, Alsberg BK. The dye-sensitized solar cell database. J Cheminform 2018; 10:18. [PMID: 29616364 PMCID: PMC5882482 DOI: 10.1186/s13321-018-0272-0] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2017] [Accepted: 03/25/2018] [Indexed: 12/22/2022] Open
Abstract
BACKGROUND Dye-sensitized solar cells (DSSCs) have garnered a lot of attention in recent years. The solar energy to power conversion efficiency of a DSSC is influenced by various components of the cell such as the dye, electrolyte, electrodes and additives among others leading to varying experimental configurations. A large number of metal-based and metal-free dye sensitizers have now been reported and tools using such data to indicate new directions for design and development are on the rise. DESCRIPTION DSSCDB, the first of its kind dye-sensitized solar cell database, aims to provide users with up-to-date information from publications on the molecular structures of the dyes, experimental details and reported measurements (efficiencies and spectral properties) and thereby facilitate a comprehensive and critical evaluation of the data. Currently, the DSSCDB contains over 4000 experimental observations spanning multiple dye classes such as triphenylamines, carbazoles, coumarins, phenothiazines, ruthenium and porphyrins. CONCLUSION The DSSCDB offers a web-based, comprehensive source of property data for dye sensitized solar cells. Access to the database is available through the following URL: www.dyedb.com .
Collapse
Affiliation(s)
| | - Rajesh Raju
- Department of Chemistry, NTNU, Høgskoleringen, 7491, Trondheim, Norway
| | | | - Bjørn K Alsberg
- Department of Chemistry, NTNU, Høgskoleringen, 7491, Trondheim, Norway
| |
Collapse
|
50
|
Theoretical investigation of the use of nanocages with an adsorbed halogen atom as anode materials in metal-ion batteries. J Mol Model 2018; 24:64. [PMID: 29468439 DOI: 10.1007/s00894-018-3604-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2017] [Accepted: 01/30/2018] [Indexed: 01/29/2023]
Abstract
The applicability of C44, B22N22, Ge44, and Al22P22 nanocages, as well as variants of those nanocages with an adsorbed halogen atom, as high-performance anode materials in Li-ion, Na-ion, and K-ion batteries was investigated theoretically via density functional theory. The results obtained indicate that, among the nanocages with no adsorbed halogen atom, Al22P22 would be the best candidate for a novel anode material for use in metal-ion batteries. Calculations also suggest that K-ion batteries which utilize these nanocages as anode materials would give better performance and would yield higher cell voltages than the corresponding Li-ion and Na-ion batteries with nanocage-based anodes. Also, the results for the nanocages with an adsorbed halogen atom imply that employing them as anode materials would lead to higher cell voltages and better metal-ion battery performance than if the nanocages with no adsorbed halogen atom were to be used as anode materials instead. Results further implied that nanocages with an adsorbed F atom would give higher cell voltages and better battery performance than nanocages with an adsorbed Cl or Br atom. We were ultimately able to conclude that a K-ion battery that utilized Al21P22 with an adsorbed F atom as its anode material would afford the best metal-ion battery performance; we therefore propose this as a novel highly efficient metal-ion battery. Graphical abstract The results of a theoretical investigation indicated that Al22P22 is a better candidate for a high-performance anode material in metal-ion batteries than Ge44 is. Calculations also showed that K-ion batteries with nanocage-based anodes would produce higher cell voltages and perform better than the equivalent Li-ion and Na-ion batteries with nanocage-based anodes, and that anodes based on nanocages with an adsorbed F atom would perform better than anodes based on nanocages with an adsorbed Cl or Br atom.
Collapse
|