1
|
Zhu Y, Peng J, Xu C, Lan Z. Unsupervised Machine Learning in the Analysis of Nonadiabatic Molecular Dynamics Simulation. J Phys Chem Lett 2024:9601-9619. [PMID: 39270134 DOI: 10.1021/acs.jpclett.4c01751] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/15/2024]
Abstract
The all-atomic full-dimensional-level simulations of nonadiabatic molecular dynamics (NAMD) in large realistic systems has received high research interest in recent years. However, such NAMD simulations normally generate an enormous amount of time-dependent high-dimensional data, leading to a significant challenge in result analyses. Based on unsupervised machine learning (ML) methods, considerable efforts were devoted to developing novel and easy-to-use analysis tools for the identification of photoinduced reaction channels and the comprehensive understanding of complicated molecular motions in NAMD simulations. Here, we tried to survey recent advances in this field, particularly to focus on how to use unsupervised ML methods to analyze the trajectory-based NAMD simulation results. Our purpose is to offer a comprehensive discussion on several essential components of this analysis protocol, including the selection of ML methods, the construction of molecular descriptors, the establishment of analytical frameworks, their advantages and limitations, and persistent challenges.
Collapse
Affiliation(s)
- Yifei Zhu
- MOE Key Laboratory of Environmental Theoretical Chemistry, SCNU Environmental Research Institute, Guangdong Provincial Key Laboratory of Chemical Pollution and Environmental Safety, School of Environment, South China Normal University, Guangzhou 510006, P. R. China
| | - Jiawei Peng
- MOE Key Laboratory of Environmental Theoretical Chemistry, SCNU Environmental Research Institute, Guangdong Provincial Key Laboratory of Chemical Pollution and Environmental Safety, School of Environment, South China Normal University, Guangzhou 510006, P. R. China
| | - Chao Xu
- MOE Key Laboratory of Environmental Theoretical Chemistry, SCNU Environmental Research Institute, Guangdong Provincial Key Laboratory of Chemical Pollution and Environmental Safety, School of Environment, South China Normal University, Guangzhou 510006, P. R. China
| | - Zhenggang Lan
- MOE Key Laboratory of Environmental Theoretical Chemistry, SCNU Environmental Research Institute, Guangdong Provincial Key Laboratory of Chemical Pollution and Environmental Safety, School of Environment, South China Normal University, Guangzhou 510006, P. R. China
| |
Collapse
|
2
|
Wu J, Wang R, Tan Y, Liu L, Chen Z, Zhang S, Lou X, Yun J. Hybrid machine learning model based predictions for properties of poly(2-hydroxyethyl methacrylate)-poly(vinyl alcohol) composite cryogels embedded with bacterial cellulose. J Chromatogr A 2024; 1727:464996. [PMID: 38763087 DOI: 10.1016/j.chroma.2024.464996] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2024] [Revised: 05/10/2024] [Accepted: 05/13/2024] [Indexed: 05/21/2024]
Abstract
Supermacroporous composite cryogels with enhanced adjustable functionality have received extensive interest in bioseparation, tissue engineering, and drug delivery. However, the variations in their components significantly impactfinal properties. This study presents a two-step hybrid machine learning approach for predicting the properties of innovative poly(2-hydroxyethyl methacrylate)-poly(vinyl alcohol) composite cryogels embedded with bacterial cellulose (pHEMA-PVA-BC) based on their compositions. By considering the ratios of HEMA (1.0-22.0 wt%), PVA (0.2-4.0 wt%), poly(ethylene glycol) diacrylate (1.0-4.5 wt%), BC (0.1-1.5 wt%), and water (68.0-96.0 wt%) as investigational variables, overlay sampling uniform design (OSUD) was employed to construct a high-quality dataset for model development. The random forest (RF) model was used to classify the preparation conditions. Then four models of artificial neural network, RF, gradient boosted regression trees (GBRT), and XGBoost were developed to predict the basic properties of the composite cryogels. The results showed that the RF model achieved an accurate three-class classification of preparation conditions. Among the four models, the GBRT model exhibited the best predictive performance of the basic properties, with the mean absolute percentage error of 16.04 %, 0.85 %, and 2.44 % for permeability, effective porosity, and height of theoretical plate (1.0 cm/min), respectively. Characterization results of the representative pHEMA-PVA-BC composite cryogel showed an effective porosity of 81.01 %, a permeability of 1.20 × 10-12 m2, and a range of height of theoretical plate between 0.40-0.49 cm at flow velocities of 0.5-3.0 cm/min. These indicate that the pHEMA-PVA-BC cryogel was an excellent material with supermacropores, low flow resistance and high mass transfer efficiency. Furthermore, the model output demonstrates that the alteration of the proportions of PVA (0.2-3.5 wt%) and BC (0.1-1.5 wt%) components in composite cryogels resulted in significant changes in the material basic properties. This work represents an attempt to efficiently design and prepare target composite cryogels using machine learning and providing valuable insights for the efficient development of polymers.
Collapse
Affiliation(s)
- Jiawei Wu
- State Key Laboratory Breeding Base of Green Chemistry Synthesis Technology, College of Chemical Engineering, Zhejiang University of Technology, Chaowang Road 18, Hangzhou 310032, PR China
| | - Ruobing Wang
- State Key Laboratory Breeding Base of Green Chemistry Synthesis Technology, College of Chemical Engineering, Zhejiang University of Technology, Chaowang Road 18, Hangzhou 310032, PR China
| | - Yan Tan
- State Key Laboratory Breeding Base of Green Chemistry Synthesis Technology, College of Chemical Engineering, Zhejiang University of Technology, Chaowang Road 18, Hangzhou 310032, PR China
| | - Lulu Liu
- State Key Laboratory Breeding Base of Green Chemistry Synthesis Technology, College of Chemical Engineering, Zhejiang University of Technology, Chaowang Road 18, Hangzhou 310032, PR China
| | - Zhihong Chen
- State Key Laboratory Breeding Base of Green Chemistry Synthesis Technology, College of Chemical Engineering, Zhejiang University of Technology, Chaowang Road 18, Hangzhou 310032, PR China
| | - Songhong Zhang
- State Key Laboratory Breeding Base of Green Chemistry Synthesis Technology, College of Chemical Engineering, Zhejiang University of Technology, Chaowang Road 18, Hangzhou 310032, PR China
| | - Xiaoling Lou
- State Key Laboratory Breeding Base of Green Chemistry Synthesis Technology, College of Chemical Engineering, Zhejiang University of Technology, Chaowang Road 18, Hangzhou 310032, PR China.
| | - Junxian Yun
- State Key Laboratory Breeding Base of Green Chemistry Synthesis Technology, College of Chemical Engineering, Zhejiang University of Technology, Chaowang Road 18, Hangzhou 310032, PR China.
| |
Collapse
|
3
|
Cencer MM, Moore JS, Assary RS. Machine learning for polymeric materials: an introduction. POLYM INT 2021. [DOI: 10.1002/pi.6345] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Affiliation(s)
- Morgan M Cencer
- Department of Chemistry University of Illinois at Urbana‐Champaign Urbana IL USA
- Materials Science Division Argonne National Laboratory Lemont IL USA
- Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana‐Champaign Urbana IL USA
| | - Jeffrey S Moore
- Department of Chemistry University of Illinois at Urbana‐Champaign Urbana IL USA
- Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana‐Champaign Urbana IL USA
| | - Rajeev S Assary
- Materials Science Division Argonne National Laboratory Lemont IL USA
| |
Collapse
|
4
|
Sahu H, Li H, Chen L, Rajan AC, Kim C, Stingelin N, Ramprasad R. An Informatics Approach for Designing Conducting Polymers. ACS APPLIED MATERIALS & INTERFACES 2021; 13:53314-53322. [PMID: 34038635 DOI: 10.1021/acsami.1c04017] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Doping conjugated polymers, which are potential candidates for the next generation of organic electronics, is an effective strategy for manipulating their electrical conductivity. However, selecting a suitable polymer-dopant combination is exceptionally challenging because of the vastness of the chemical, configurational, and morphological spaces one needs to search. In this work, high-performance surrogate models, trained on available experimentally measured data, are developed to predict the p-type electrical conductivity and are used to screen a large candidate hypothetical data set of more than 800 000 polymer-dopant combinations. Promising candidates are identified for synthesis and device fabrication. Additionally, new design guidelines are extracted that verify and extend knowledge on important molecular fragments that correlate to high conductivity. Conductivity prediction models are also deployed at www.polymergenome.org for broader open-access community use.
Collapse
Affiliation(s)
- Harikrishna Sahu
- Department of Materials Science and Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Hongmo Li
- Department of Materials Science and Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Lihua Chen
- Department of Materials Science and Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Arunkumar Chitteth Rajan
- Department of Materials Science and Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Chiho Kim
- Department of Materials Science and Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Natalie Stingelin
- Department of Materials Science and Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Rampi Ramprasad
- Department of Materials Science and Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| |
Collapse
|
5
|
Westermayr J, Marquetand P. Machine Learning for Electronically Excited States of Molecules. Chem Rev 2021; 121:9873-9926. [PMID: 33211478 PMCID: PMC8391943 DOI: 10.1021/acs.chemrev.0c00749] [Citation(s) in RCA: 167] [Impact Index Per Article: 55.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2020] [Indexed: 12/11/2022]
Abstract
Electronically excited states of molecules are at the heart of photochemistry, photophysics, as well as photobiology and also play a role in material science. Their theoretical description requires highly accurate quantum chemical calculations, which are computationally expensive. In this review, we focus on not only how machine learning is employed to speed up such excited-state simulations but also how this branch of artificial intelligence can be used to advance this exciting research field in all its aspects. Discussed applications of machine learning for excited states include excited-state dynamics simulations, static calculations of absorption spectra, as well as many others. In order to put these studies into context, we discuss the promises and pitfalls of the involved machine learning techniques. Since the latter are mostly based on quantum chemistry calculations, we also provide a short introduction into excited-state electronic structure methods and approaches for nonadiabatic dynamics simulations and describe tricks and problems when using them in machine learning for excited states of molecules.
Collapse
Affiliation(s)
- Julia Westermayr
- Institute
of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, Währinger Strasse 17, 1090 Vienna, Austria
| | - Philipp Marquetand
- Institute
of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, Währinger Strasse 17, 1090 Vienna, Austria
- Vienna
Research Platform on Accelerating Photoreaction Discovery, University of Vienna, Währinger Strasse 17, 1090 Vienna, Austria
- Data
Science @ Uni Vienna, University of Vienna, Währinger Strasse 29, 1090 Vienna, Austria
| |
Collapse
|
6
|
Abstract
Electronically excited states of molecules are at the heart of photochemistry, photophysics, as well as photobiology and also play a role in material science. Their theoretical description requires highly accurate quantum chemical calculations, which are computationally expensive. In this review, we focus on not only how machine learning is employed to speed up such excited-state simulations but also how this branch of artificial intelligence can be used to advance this exciting research field in all its aspects. Discussed applications of machine learning for excited states include excited-state dynamics simulations, static calculations of absorption spectra, as well as many others. In order to put these studies into context, we discuss the promises and pitfalls of the involved machine learning techniques. Since the latter are mostly based on quantum chemistry calculations, we also provide a short introduction into excited-state electronic structure methods and approaches for nonadiabatic dynamics simulations and describe tricks and problems when using them in machine learning for excited states of molecules.
Collapse
Affiliation(s)
- Julia Westermayr
- Institute of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, Währinger Strasse 17, 1090 Vienna, Austria
| | - Philipp Marquetand
- Institute of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, Währinger Strasse 17, 1090 Vienna, Austria
- Vienna Research Platform on Accelerating Photoreaction Discovery, University of Vienna, Währinger Strasse 17, 1090 Vienna, Austria
- Data Science @ Uni Vienna, University of Vienna, Währinger Strasse 29, 1090 Vienna, Austria
| |
Collapse
|
7
|
Zozoulenko I, Franco-Gonzalez JF, Gueskine V, Mehandzhiyski A, Modarresi M, Rolland N, Tybrandt K. Electronic, Optical, Morphological, Transport, and Electrochemical Properties of PEDOT: A Theoretical Perspective. Macromolecules 2021. [DOI: 10.1021/acs.macromol.1c00444] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Affiliation(s)
- Igor Zozoulenko
- Laboratory of Organic Electronics, ITN, Linköping University, 60174 Norrköping, Sweden
| | | | - Viktor Gueskine
- Laboratory of Organic Electronics, ITN, Linköping University, 60174 Norrköping, Sweden
| | | | - Mohsen Modarresi
- Department of Physics, Ferdowsi University of Mashhad, Mashhad, PO Box 91775-1436, Iran
| | - Nicolas Rolland
- Laboratory of Organic Electronics, ITN, Linköping University, 60174 Norrköping, Sweden
| | - Klas Tybrandt
- Laboratory of Organic Electronics, ITN, Linköping University, 60174 Norrköping, Sweden
| |
Collapse
|
8
|
Abstract
Theoretical simulations of electronic excitations and associated processes in molecules are indispensable for fundamental research and technological innovations. However, such simulations are notoriously challenging to perform with quantum mechanical methods. Advances in machine learning open many new avenues for assisting molecular excited-state simulations. In this Review, we track such progress, assess the current state of the art and highlight the critical issues to solve in the future. We overview a broad range of machine learning applications in excited-state research, which include the prediction of molecular properties, improvements of quantum mechanical methods for the calculations of excited-state properties and the search for new materials. Machine learning approaches can help us understand hidden factors that influence photo-processes, leading to a better control of such processes and new rules for the design of materials for optoelectronic applications.
Collapse
|
9
|
Coppola F, Cimino P, Raucci U, Chiariello MG, Petrone A, Rega N. Exploring the Franck-Condon region of a photoexcited charge transfer complex in solution to interpret femtosecond stimulated Raman spectroscopy: excited state electronic structure methods to unveil non-radiative pathways. Chem Sci 2021; 12:8058-8072. [PMID: 34194695 PMCID: PMC8208128 DOI: 10.1039/d1sc01238j] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2021] [Accepted: 04/27/2021] [Indexed: 01/12/2023] Open
Abstract
We present electronic structure methods to unveil the non-radiative pathways of photoinduced charge transfer (CT) reactions that play a main role in photophysics and light harvesting technologies. A prototypical π-stacked molecular complex consisting of an electron donor (1-chloronaphthalene, 1ClN) and an electron acceptor (tetracyanoethylene, TCNE) was investigated in dichloromethane solution for this purpose. The characterization of TCNE:π:1ClN in both its equilibrium ground and photoinduced low-lying CT electronic states was performed by using a reliable and accurate theoretical-computational methodology exploiting ab initio molecular dynamics simulations. The structural and vibrational time evolution of key vibrational modes is found to be in excellent agreement with femtosecond stimulated Raman spectroscopy experiments [R. A. Mathies et al., J. Phys. Chem. A, 2018, 122, 14, 3594], unveiling a correlation between vibrational fingerprints and electronic properties. The evaluation of nonadiabatic coupling matrix elements along generalized normal modes has made possible the interpretation on the molecular scale of the activation of nonradiative relaxation pathways towards the ground electronic state. In particular, two low frequency vibrational modes such as the out of plane bending and dimer breathing and the TCNE central C[double bond, length as m-dash]C stretching play a prominent role in relaxation phenomena from the electronic CT state to the ground state one.
Collapse
Affiliation(s)
- Federico Coppola
- Department of Chemical Sciences, University of Napoli Federico II, Complesso Universitario di M.S. Angelo via Cintia Napoli 80126 Italy
| | - Paola Cimino
- Department of Pharmaceutical Sciences, University of Salerno Salerno 84084 Italy
| | - Umberto Raucci
- Department of Chemical Sciences, University of Napoli Federico II, Complesso Universitario di M.S. Angelo via Cintia Napoli 80126 Italy
| | - Maria Gabriella Chiariello
- Department of Chemical Sciences, University of Napoli Federico II, Complesso Universitario di M.S. Angelo via Cintia Napoli 80126 Italy
| | - Alessio Petrone
- Department of Chemical Sciences, University of Napoli Federico II, Complesso Universitario di M.S. Angelo via Cintia Napoli 80126 Italy
| | - Nadia Rega
- Department of Chemical Sciences, University of Napoli Federico II, Complesso Universitario di M.S. Angelo via Cintia Napoli 80126 Italy
- Centro Interdipartimentale di Ricerca sui Biomateriali (CRIB) Piazzale Tecchio Napoli I-80125 Italy
| |
Collapse
|
10
|
Lee CK, Lu C, Yu Y, Sun Q, Hsieh CY, Zhang S, Liu Q, Shi L. Transfer learning with graph neural networks for optoelectronic properties of conjugated oligomers. J Chem Phys 2021; 154:024906. [PMID: 33445906 DOI: 10.1063/5.0037863] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Despite the remarkable progress of machine learning (ML) techniques in chemistry, modeling the optoelectronic properties of long conjugated oligomers and polymers with ML remains challenging due to the difficulty in obtaining sufficient training data. Here, we use transfer learning to address the data scarcity issue by pre-training graph neural networks using data from short oligomers. With only a few hundred training data, we are able to achieve an average error of about 0.1 eV for the excited-state energy of oligothiophenes against time-dependent density functional theory (TDDFT) calculations. We show that the success of our transfer learning approach relies on the relative locality of low-lying electronic excitations in long conjugated oligomers. Finally, we demonstrate the transferability of our approach by modeling the lowest-lying excited-state energies of poly(3-hexylthiophene) in its single-crystal and solution phases using the transfer learning models trained with the data of gas-phase oligothiophenes. The transfer learning predicted excited-state energy distributions agree quantitatively with TDDFT calculations and capture some important qualitative features observed in experimental absorption spectra.
Collapse
Affiliation(s)
| | - Chengqiang Lu
- Anhui Province Key Lab of Big Data Analysis and Application, University of Science and Technology of China, Hefei, Anhui 230026, China
| | - Yue Yu
- Chemistry and Chemical Biology, University of California, Merced, California 95343, USA
| | - Qiming Sun
- Tencent America, Palo Alto, California 94306, USA
| | | | | | - Qi Liu
- Anhui Province Key Lab of Big Data Analysis and Application, University of Science and Technology of China, Hefei, Anhui 230026, China
| | - Liang Shi
- Chemistry and Chemical Biology, University of California, Merced, California 95343, USA
| |
Collapse
|