1
|
Huang Y, Zhang H, Lin Z, Wei Y, Xi W. RevGraphVAMP: A protein molecular simulation analysis model combining graph convolutional neural networks and physical constraints. Methods 2024; 229:163-174. [PMID: 38972499 DOI: 10.1016/j.ymeth.2024.06.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2023] [Revised: 06/19/2024] [Accepted: 06/24/2024] [Indexed: 07/09/2024] Open
Abstract
Molecular dynamics simulation is a crucial research domain within the life sciences, focusing on comprehending the mechanisms of biomolecular interactions at atomic scales. Protein simulation, as a critical subfield, often utilizes MD for implementation, with trajectory data play a pivotal role in drug discovery. The advancement of high-performance computing and deep learning technology becomes popular and critical to predict protein properties from vast trajectory data, posing challenges regarding data features extraction from the complicated simulation data and dimensionality reduction. Simultaneously, it is essential to provide a meaningful explanation of the biological mechanism behind dimensionality. To tackle this challenge, we propose a new unsupervised model named RevGraphVAMP to intelligently analyze the simulation trajectory. This model is based on the variational approach for Markov processes (VAMP) and integrates graph convolutional neural networks and physical constraint optimization to enhance the learning performance. Additionally, we introduce attention mechanism to assess the importance of key interaction region, facilitating the interpretation of molecular mechanism. In comparison to other VAMPNets models, our model showcases competitive performance, improved accuracy in state transition prediction, as demonstrated through its application to two public datasets and the Shank3-Rap1 complex, which is associated with autism spectrum disorder. Moreover, it enhanced dimensionality reduction discrimination across different substates and provides interpretable results for protein structural characterization.
Collapse
Affiliation(s)
- Ying Huang
- Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
| | - Huiling Zhang
- Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China; College of Mathematics and Informatics, South China Agricultural University, Guangzhou, 510642, China
| | - Zhenli Lin
- Department of Ophthalmology, Shenzhen University General Hospital, Shenzhen 518055, China
| | - Yanjie Wei
- Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China; Faculty of Computer Science and Control Engineering, Shenzhen University of Advanced Technology, Shenzhen 518107, China.
| | - Wenhui Xi
- Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China; Faculty of Computer Science and Control Engineering, Shenzhen University of Advanced Technology, Shenzhen 518107, China.
| |
Collapse
|
2
|
Hwang W, Kwon S, Lee WB, Kim Y. Self-assembly prediction of architecture-controlled bottlebrush copolymers in solution using graph convolutional networks. SOFT MATTER 2024; 20:4905-4915. [PMID: 38867573 DOI: 10.1039/d4sm00453a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2024]
Abstract
The investigation of bottlebrush copolymer self-assembly in solution involves a comprehensive approach integrating simulation and experimental research, due to their unique physical characteristics. However, the intricate architecture of bottlebrush copolymers and the diverse solvent conditions introduce a wide range of parameter spaces. In this study, we investigated the solution self-assembly behavior of bottlebrush copolymers by combining dissipative particle dynamics (DPD) simulation results and machine learning (ML) including graph convolutional networks (GCNs). The architecture of bottlebrush copolymers is encoded by graphs including connectivity, side chain length, bead types, and interaction parameters of DPD simulation. Using GCN, we accurately predicted the single chain properties of bottlebrush copolymers with over 95% accuracy. Furthermore, phase behavior was precisely predicted using these single chain properties. Shapley additive explanations (SHAP) values of single chain properties to the various self-assembly morphologies were calculated to investigate the correlation between single chain properties and morphologies. In addition, we analyzed single chain properties and phase behavior as a function of DPD interaction parameters, extracting relevant physical properties for vesicle morphology formation. This work paves the way for tailored design in solution of self-assembled nanostructures of bottlebrush copolymers, offering a GCN framework for precise prediction of self-assembly morphologies under various chain architectures and solvent conditions.
Collapse
Affiliation(s)
- Wooseop Hwang
- Department of Materials Science and Engineering, Korea University, Seoul 02841, Republic of Korea.
| | - Sangwoo Kwon
- School of Chemical and Biological Engineering, Institute of Chemical Processes, Seoul National University, Seoul 08826, Republic of Korea.
| | - Won Bo Lee
- School of Chemical and Biological Engineering, Institute of Chemical Processes, Seoul National University, Seoul 08826, Republic of Korea.
| | - YongJoo Kim
- Department of Materials Science and Engineering, Korea University, Seoul 02841, Republic of Korea.
| |
Collapse
|
3
|
Yasuda I, Endo K, Arai N, Yasuoka K. In-layer inhomogeneity of molecular dynamics in quasi-liquid layers of ice. Commun Chem 2024; 7:117. [PMID: 38811834 PMCID: PMC11136980 DOI: 10.1038/s42004-024-01197-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Accepted: 05/02/2024] [Indexed: 05/31/2024] Open
Abstract
Quasi-liquid layers (QLLs) are present on the surface of ice and play a significant role in its distinctive chemical and physical properties. These layers exhibit considerable heterogeneity across different scales ranging from nanometers to millimeters. Although the formation of partially ice-like structures has been proposed, the molecular-level understanding of this heterogeneity remains unclear. Here, we examined the heterogeneity of molecular dynamics on QLLs based on molecular dynamics simulations and machine learning analysis of the simulation data. We demonstrated that the molecular dynamics of QLLs do not comprise a mixture of solid- and liquid water molecules. Rather, molecules having similar behaviors form dynamical domains that are associated with the dynamical heterogeneity of supercooled water. Nonetheless, molecules in the domains frequently switch their dynamical state. Furthermore, while there is no observable characteristic domain size, the long-range ordering strongly depends on the temperature and crystal face. Instead of a mixture of static solid- and liquid-like regions, our results indicate the presence of heterogeneous molecular dynamics in QLLs, which offers molecular-level insights into the surface properties of ice.
Collapse
Affiliation(s)
- Ikki Yasuda
- Department of Mechanical Engineering, Keio University, Yokohama, Japan
| | - Katsuhiro Endo
- Department of Mechanical Engineering, Keio University, Yokohama, Japan
| | - Noriyoshi Arai
- Department of Mechanical Engineering, Keio University, Yokohama, Japan
| | - Kenji Yasuoka
- Department of Mechanical Engineering, Keio University, Yokohama, Japan.
| |
Collapse
|
4
|
Zhou X, Zhou Y, Yu L, Qi L, Oh KS, Hu P, Lee SY, Chen C. Gel polymer electrolytes for rechargeable batteries toward wide-temperature applications. Chem Soc Rev 2024; 53:5291-5337. [PMID: 38634467 DOI: 10.1039/d3cs00551h] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/19/2024]
Abstract
Rechargeable batteries, typically represented by lithium-ion batteries, have taken a huge leap in energy density over the last two decades. However, they still face material/chemical challenges in ensuring safety and long service life at temperatures beyond the optimum range, primarily due to the chemical/electrochemical instabilities of conventional liquid electrolytes against aggressive electrode reactions and temperature variation. In this regard, a gel polymer electrolyte (GPE) with its liquid components immobilized and stabilized by a solid matrix, capable of retaining almost all the advantageous natures of the liquid electrolytes and circumventing the interfacial issues that exist in the all-solid-state electrolytes, is of great significance to realize rechargeable batteries with extended working temperature range. We begin this review with the main challenges faced in the development of GPEs, based on extensive literature research and our practical experience. Then, a significant section is dedicated to the requirements and design principles of GPEs for wide-temperature applications, with special attention paid to the feasibility, cost, and environmental impact. Next, the research progress of GPEs is thoroughly reviewed according to the strategies applied. In the end, we outline some prospects of GPEs related to innovations in material sciences, advanced characterizations, artificial intelligence, and environmental impact analysis, hoping to spark new research activities that ultimately bring us a step closer to realizing wide-temperature rechargeable batteries.
Collapse
Affiliation(s)
- Xiaoyan Zhou
- School of Resource and Environmental Sciences, Hubei Biomass-Resource Chemistry and Environmental Biotechnology Key Laboratory, Wuhan University, Wuhan 430079, P. R. China.
- School of Science, Hubei University of Technology, Wuhan 430070, P. R. China.
| | - Yifang Zhou
- School of Resource and Environmental Sciences, Hubei Biomass-Resource Chemistry and Environmental Biotechnology Key Laboratory, Wuhan University, Wuhan 430079, P. R. China.
| | - Le Yu
- School of Resource and Environmental Sciences, Hubei Biomass-Resource Chemistry and Environmental Biotechnology Key Laboratory, Wuhan University, Wuhan 430079, P. R. China.
| | - Luhe Qi
- School of Resource and Environmental Sciences, Hubei Biomass-Resource Chemistry and Environmental Biotechnology Key Laboratory, Wuhan University, Wuhan 430079, P. R. China.
| | - Kyeong-Seok Oh
- Department of Chemical and Biomolecular Engineering, Yonsei University, Seoul, Republic of Korea.
| | - Pei Hu
- School of Science, Hubei University of Technology, Wuhan 430070, P. R. China.
| | - Sang-Young Lee
- Department of Chemical and Biomolecular Engineering, Yonsei University, Seoul, Republic of Korea.
| | - Chaoji Chen
- School of Resource and Environmental Sciences, Hubei Biomass-Resource Chemistry and Environmental Biotechnology Key Laboratory, Wuhan University, Wuhan 430079, P. R. China.
| |
Collapse
|
5
|
Miao L, Jia W, Cao X, Jiao L. Computational chemistry for water-splitting electrocatalysis. Chem Soc Rev 2024; 53:2771-2807. [PMID: 38344774 DOI: 10.1039/d2cs01068b] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/19/2024]
Abstract
Electrocatalytic water splitting driven by renewable electricity has attracted great interest in recent years for producing hydrogen with high-purity. However, the practical applications of this technology are limited by the development of electrocatalysts with high activity, low cost, and long durability. In the search for new electrocatalysts, computational chemistry has made outstanding contributions by providing fundamental laws that govern the electron behavior and enabling predictions of electrocatalyst performance. This review delves into theoretical studies on electrochemical water-splitting processes. Firstly, we introduce the fundamentals of electrochemical water electrolysis and subsequently discuss the current advancements in computational methods and models for electrocatalytic water splitting. Additionally, a comprehensive overview of benchmark descriptors is provided to aid in understanding intrinsic catalytic performance for water-splitting electrocatalysts. Finally, we critically evaluate the remaining challenges within this field.
Collapse
Affiliation(s)
- Licheng Miao
- Key Laboratory of Advanced Energy Materials Chemistry (Ministry of Education), Collaborative Innovation Center of Chemical Science and Engineering (Tianjin), College of Chemistry, Nankai University, Tianjin 300071, China.
| | - Wenqi Jia
- Key Laboratory of Advanced Energy Materials Chemistry (Ministry of Education), Collaborative Innovation Center of Chemical Science and Engineering (Tianjin), College of Chemistry, Nankai University, Tianjin 300071, China.
| | - Xuejie Cao
- Key Laboratory of Advanced Energy Materials Chemistry (Ministry of Education), Collaborative Innovation Center of Chemical Science and Engineering (Tianjin), College of Chemistry, Nankai University, Tianjin 300071, China.
| | - Lifang Jiao
- Key Laboratory of Advanced Energy Materials Chemistry (Ministry of Education), Collaborative Innovation Center of Chemical Science and Engineering (Tianjin), College of Chemistry, Nankai University, Tianjin 300071, China.
| |
Collapse
|
6
|
Nicolle A, Deng S, Ihme M, Kuzhagaliyeva N, Ibrahim EA, Farooq A. Mixtures Recomposition by Neural Nets: A Multidisciplinary Overview. J Chem Inf Model 2024; 64:597-620. [PMID: 38284618 DOI: 10.1021/acs.jcim.3c01633] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2024]
Abstract
Artificial Neural Networks (ANNs) are transforming how we understand chemical mixtures, providing an expressive view of the chemical space and multiscale processes. Their hybridization with physical knowledge can bridge the gap between predictivity and understanding of the underlying processes. This overview explores recent progress in ANNs, particularly their potential in the 'recomposition' of chemical mixtures. Graph-based representations reveal patterns among mixture components, and deep learning models excel in capturing complexity and symmetries when compared to traditional Quantitative Structure-Property Relationship models. Key components, such as Hamiltonian networks and convolution operations, play a central role in representing multiscale mixtures. The integration of ANNs with Chemical Reaction Networks and Physics-Informed Neural Networks for inverse chemical kinetic problems is also examined. The combination of sensors with ANNs shows promise in optical and biomimetic applications. A common ground is identified in the context of statistical physics, where ANN-based methods iteratively adapt their models by blending their initial states with training data. The concept of mixture recomposition unveils a reciprocal inspiration between ANNs and reactive mixtures, highlighting learning behaviors influenced by the training environment.
Collapse
Affiliation(s)
- Andre Nicolle
- Aramco Fuel Research Center, Rueil-Malmaison 92852, France
| | - Sili Deng
- Massachusetts Institute of Technology, Cambridge 02139, Massachusetts, United States
| | - Matthias Ihme
- Stanford University, Stanford 94305, California, United States
| | | | - Emad Al Ibrahim
- King Abdullah University of Science and Technology, Thuwal 23955, Saudi Arabia
| | - Aamir Farooq
- King Abdullah University of Science and Technology, Thuwal 23955, Saudi Arabia
| |
Collapse
|
7
|
Ishiai S, Yasuda I, Endo K, Yasuoka K. Graph-Neural-Network-Based Unsupervised Learning of the Temporal Similarity of Structural Features Observed in Molecular Dynamics Simulations. J Chem Theory Comput 2024; 20:819-831. [PMID: 38190503 DOI: 10.1021/acs.jctc.3c00995] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2024]
Abstract
Classification of molecular structures is a crucial step in molecular dynamics (MD) simulations to detect various structures and phases within systems. Molecular structures, which are commonly identified using order parameters, were recently identified using machine learning (ML), that is, the ML models acquire structural features using labeled crystals or phases via supervised learning. However, these approaches may not identify unlabeled or unknown structures, such as the imperfect crystal structures observed in nonequilibrium systems and interfaces. In this study, we proposed the use of a novel unsupervised learning framework, denoted temporal self-supervised learning (TSSL), to learn structural features and design their parameters. In TSSL, the ML models learn that the structural similarity is learned via contrastive learning based on minor short-term variations caused by perturbations in MD simulations. This learning framework is applied to a sophisticated architecture of graph neural network models that use bond angle and length data of the neighboring atoms. TSSL successfully classifies water and ice crystals based on high local ordering, and furthermore, it detects imperfect structures typical of interfaces such as the water-ice and ice-vapor interfaces.
Collapse
Affiliation(s)
- Satoki Ishiai
- Department of Mechanical Engineering, Keio University, Yokohama 223-8522, Japan
| | - Ikki Yasuda
- Department of Mechanical Engineering, Keio University, Yokohama 223-8522, Japan
| | - Katsuhiro Endo
- Department of Mechanical Engineering, Keio University, Yokohama 223-8522, Japan
- National Institute of Advanced Industrial Science and Technology (AIST), Ibaraki 305-8568, Japan
| | - Kenji Yasuoka
- Department of Mechanical Engineering, Keio University, Yokohama 223-8522, Japan
| |
Collapse
|
8
|
Mustali J, Yasuda I, Hirano Y, Yasuoka K, Gautieri A, Arai N. Unsupervised deep learning for molecular dynamics simulations: a novel analysis of protein-ligand interactions in SARS-CoV-2 M pro. RSC Adv 2023; 13:34249-34261. [PMID: 38019981 PMCID: PMC10663885 DOI: 10.1039/d3ra06375e] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2023] [Accepted: 11/06/2023] [Indexed: 12/01/2023] Open
Abstract
Molecular dynamics (MD) simulations, which are central to drug discovery, offer detailed insights into protein-ligand interactions. However, analyzing large MD datasets remains a challenge. Current machine-learning solutions are predominantly supervised and have data labelling and standardisation issues. In this study, we adopted an unsupervised deep-learning framework, previously benchmarked for rigid proteins, to study the more flexible SARS-CoV-2 main protease (Mpro). We ran MD simulations of Mpro with various ligands and refined the data by focusing on binding-site residues and time frames in stable protein conformations. The optimal descriptor chosen was the distance between the residues and the center of the binding pocket. Using this approach, a local dynamic ensemble was generated and fed into our neural network to compute Wasserstein distances across system pairs, revealing ligand-induced conformational differences in Mpro. Dimensionality reduction yielded an embedding map that correlated ligand-induced dynamics and binding affinity. Notably, the high-affinity compounds showed pronounced effects on the protein's conformations. We also identified the key residues that contributed to these differences. Our findings emphasize the potential of combining unsupervised deep learning with MD simulations to extract valuable information and accelerate drug discovery.
Collapse
Affiliation(s)
- Jessica Mustali
- Department of Electronics, Information and Bioengineering, Politecnico di Milano Italy
| | - Ikki Yasuda
- Department of Mechanical Engineering, Keio University Japan
| | | | - Kenji Yasuoka
- Department of Mechanical Engineering, Keio University Japan
| | - Alfonso Gautieri
- Department of Electronics, Information and Bioengineering, Politecnico di Milano Italy
| | - Noriyoshi Arai
- Department of Mechanical Engineering, Keio University Japan
| |
Collapse
|
9
|
Li T, Tian C, Moridi A, Yeo J. Elucidating Interfacial Dynamics of Ti-Al Systems Using Molecular Dynamics Simulation and Markov State Modeling. ACS APPLIED MATERIALS & INTERFACES 2023; 15:50489-50498. [PMID: 37852198 DOI: 10.1021/acsami.3c09868] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/20/2023]
Abstract
Due to their remarkable mechanical and chemical properties, Ti-Al-based materials are attracting considerable interest in numerous fields of engineering, such as automotive, aerospace, and defense. With their low density, high strength, and resistance to corrosion and oxidation, these intermetallic alloys and metal-compound composites have found diverse applications. However, additive manufacturing and heat treatment of Ti-Al alloys frequently lead to brittleness and severe formation of defects. The present study delves into the interfacial dynamics of these Ti-Al systems, particularly focusing on the behavior of Ti and Al atoms in the presence of TiAl3 grain boundaries under experimental heat treatment conditions. Using a combination of molecular dynamics and Markov state modeling, we scrutinize the kinetic processes involved in the formation of TiAl3. The molecular dynamics simulation indicates that at the early stage of heat treatment, the predominating process is the diffusion of Al atoms toward the Ti surface through the TiAl3 grain boundaries. Markov state modeling identifies three distinct dynamic states of Al atoms within the Ti/Al mixture that forms during the process, each exhibiting a unique spatial distribution. Using transition time scales as a qualitative measure of the rapidness of the dynamics, it is observed that the Al dynamics is significantly less rapid near the Ti surface compared to the Al surface. Put together, the results offer a comprehensive understanding of the interfacial dynamics and reveal a three-stage diffusion mechanism. The process initiates with the premelting of Al, proceeds with the prevalent diffusion of Al atoms toward the Ti surface, and eventually ceases as the Ti concentration within the mixture progressively increases. The insights gained from this study could contribute significantly to the control and optimization of manufacturing processes for these high-performing Ti-Al-based materials.
Collapse
Affiliation(s)
- Tianjiao Li
- Sibley School of Mechanical and Aerospace Engineering, Cornell University, Ithaca, New York 14853, United States
| | - Chenxi Tian
- Sibley School of Mechanical and Aerospace Engineering, Cornell University, Ithaca, New York 14853, United States
| | - Atieh Moridi
- Sibley School of Mechanical and Aerospace Engineering, Cornell University, Ithaca, New York 14853, United States
| | - Jingjie Yeo
- Sibley School of Mechanical and Aerospace Engineering, Cornell University, Ithaca, New York 14853, United States
| |
Collapse
|
10
|
Lamb KD, Gentine P. Zero-shot learning of aerosol optical properties with graph neural networks. Sci Rep 2023; 13:18777. [PMID: 37907512 PMCID: PMC10618469 DOI: 10.1038/s41598-023-45235-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Accepted: 10/17/2023] [Indexed: 11/02/2023] Open
Abstract
Black carbon (BC), a strongly absorbing aerosol sourced from combustion, is an important short-lived climate forcer. BC's complex morphology contributes to uncertainty in its direct climate radiative effects, as current methods to accurately calculate the optical properties of these aerosols are too computationally expensive to be used online in models or for observational retrievals. Here we demonstrate that a Graph Neural Network (GNN) trained to predict the optical properties of numerically-generated BC fractal aggregates can accurately generalize to arbitrarily shaped particles, including much larger ([Formula: see text]) aggregates than in the training dataset. This zero-shot learning approach could be used to estimate single particle optical properties of realistically-shaped aerosol and cloud particles for inclusion in radiative transfer codes for atmospheric models and remote sensing inversions. In addition, GNN's can be used to gain physical intuition on the relationship between small-scale interactions (here of the spheres' positions and interactions) and large-scale properties (here of the radiative properties of aerosols).
Collapse
Affiliation(s)
- K D Lamb
- Department of Earth and Environmental Engineering, Columbia University, New York, USA.
| | - P Gentine
- Department of Earth and Environmental Engineering, Columbia University, New York, USA
| |
Collapse
|
11
|
Liu B, Xue M, Qiu Y, Konovalov KA, O’Connor MS, Huang X. GraphVAMPnets for uncovering slow collective variables of self-assembly dynamics. J Chem Phys 2023; 159:094901. [PMID: 37655771 PMCID: PMC11005469 DOI: 10.1063/5.0158903] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Accepted: 08/11/2023] [Indexed: 09/02/2023] Open
Abstract
Uncovering slow collective variables (CVs) of self-assembly dynamics is important to elucidate its numerous kinetic assembly pathways and drive the design of novel structures for advanced materials through the bottom-up approach. However, identifying the CVs for self-assembly presents several challenges. First, self-assembly systems often consist of identical monomers, and the feature representations should be invariant to permutations and rotational symmetries. Physical coordinates, such as aggregate size, lack high-resolution detail, while common geometric coordinates like pairwise distances are hindered by the permutation and rotational symmetry challenges. Second, self-assembly is usually a downhill process, and the trajectories often suffer from insufficient sampling of backward transitions that correspond to the dissociation of self-assembled structures. Popular dimensionality reduction methods, such as time-structure independent component analysis, impose detailed balance constraints, potentially obscuring the true dynamics of self-assembly. In this work, we employ GraphVAMPnets, which combines graph neural networks with a variational approach for Markovian process (VAMP) theory to identify the slow CVs of the self-assembly processes. First, GraphVAMPnets bears the advantages of graph neural networks, in which the graph embeddings can represent self-assembly structures in high-resolution while being invariant to permutations and rotational symmetries. Second, it is built upon VAMP theory, which studies Markov processes without forcing detailed balance constraints, which addresses the out-of-equilibrium challenge in the self-assembly process. We demonstrate GraphVAMPnets for identifying slow CVs of self-assembly kinetics in two systems: the aggregation of two hydrophobic molecules and the self-assembly of patchy particles. We expect that our GraphVAMPnets can be widely applied to molecular self-assembly.
Collapse
Affiliation(s)
- Bojun Liu
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Mingyi Xue
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Yunrui Qiu
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Kirill A. Konovalov
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Michael S. O’Connor
- Biophysics Graduate Program, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Xuhui Huang
- Author to whom correspondence should be addressed:
| |
Collapse
|
12
|
Liu H, Huang Z, Schoenholz SS, Cubuk ED, Smedskjaer MM, Sun Y, Wang W, Bauchy M. Learning molecular dynamics: predicting the dynamics of glasses by a machine learning simulator. MATERIALS HORIZONS 2023; 10:3416-3428. [PMID: 37382413 DOI: 10.1039/d3mh00028a] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/30/2023]
Abstract
Many-body dynamics of atoms such as glass dynamics is generally governed by complex (and sometimes unknown) physics laws. This challenges the construction of atom dynamics simulations that both (i) capture the physics laws and (ii) run with little computation cost. Here, based on graph neural network (GNN), we introduce an observation-based graph network (OGN) framework to "bypass all physics laws" to simulate complex glass dynamics solely from their static structure. By taking the example of molecular dynamics (MD) simulations, we successfully apply the OGN to predict atom trajectories evolving up to a few hundred timesteps and ranging over different families of complex atomistic systems, which implies that the atom dynamics is largely encoded in their static structure in disordered phases and, furthermore, allows us to explore the capacity of OGN simulations that is potentially generic to many-body dynamics. Importantly, unlike traditional numerical simulations, the OGN simulations bypass the numerical constraint of small integration timestep by a multiplier of ≥5 to conserve energy and momentum until hundreds of timesteps, thus leapfrogging the execution speed of MD simulations for a modest timescale.
Collapse
Affiliation(s)
- Han Liu
- SOlids inFormaTics AI-Laboratory (SOFT-AI-Lab), College of Polymer Science and Engineering, Sichuan University, Chengdu 610065, China.
| | - Zijie Huang
- Department of Computer Science, University of California, Los Angeles, California, 90095, USA
| | | | - Ekin D Cubuk
- Brain Team, Google Research, Mountain View, California, 94043, USA
| | - Morten M Smedskjaer
- Department of Chemistry and Bioscience, Aalborg University, Aalborg 9220, Denmark
| | - Yizhou Sun
- Department of Computer Science, University of California, Los Angeles, California, 90095, USA
| | - Wei Wang
- Department of Computer Science, University of California, Los Angeles, California, 90095, USA
| | - Mathieu Bauchy
- Physics of AmoRphous and Inorganic Solids Laboratory (PARISlab), Department of Civil and Environmental Engineering, University of California, Los Angeles, California, 90095, USA.
| |
Collapse
|
13
|
Purnomo A, Hayashibe M. Sparse identification of Lagrangian for nonlinear dynamical systems via proximal gradient method. Sci Rep 2023; 13:7919. [PMID: 37193704 DOI: 10.1038/s41598-023-34931-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2023] [Accepted: 05/10/2023] [Indexed: 05/18/2023] Open
Abstract
The autonomous distillation of physical laws only from data is of great interest in many scientific fields. Data-driven modeling frameworks that adopt sparse regression techniques, such as sparse identification of nonlinear dynamics (SINDy) and its modifications, are developed to resolve difficulties in extracting underlying dynamics from experimental data. However, SINDy faces certain difficulties when the dynamics contain rational functions. The Lagrangian is substantially more concise than the actual equations of motion, especially for complex systems, and it does not usually contain rational functions for mechanical systems. Few proposed methods proposed to date, such as Lagrangian-SINDy we have proposed recently, can extract the true form of the Lagrangian of dynamical systems from data; however, these methods are easily affected by noise as a fact. In this study, we developed an extended version of Lagrangian-SINDy (xL-SINDy) to obtain the Lagrangian of dynamical systems from noisy measurement data. We incorporated the concept of SINDy and used the proximal gradient method to obtain sparse Lagrangian expressions. Further, we demonstrated the effectiveness of xL-SINDy against different noise levels using four mechanical systems. In addition, we compared its performance with SINDy-PI (parallel, implicit) which is a latest robust variant of SINDy that can handle implicit dynamics and rational nonlinearities. The experimental results reveal that xL-SINDy is much more robust than the existing methods for extracting the governing equations of nonlinear mechanical systems from data with noise. We believe this contribution is significant toward noise-tolerant computational method for explicit dynamics law extraction from data.
Collapse
Affiliation(s)
- Adam Purnomo
- Department of Robotics, Graduate School of Engineering, Tohoku University, Sendai, 980-8579, Japan
| | - Mitsuhiro Hayashibe
- Department of Robotics, Graduate School of Engineering, Tohoku University, Sendai, 980-8579, Japan.
| |
Collapse
|
14
|
Qu N, Chen M, Liao M, Cheng Y, Lai Z, Zhou F, Zhu J, Liu Y, Zhang L. Accelerating Density Functional Calculation of Adatom Adsorption on Graphene via Machine Learning. MATERIALS (BASEL, SWITZERLAND) 2023; 16:2633. [PMID: 37048928 PMCID: PMC10095669 DOI: 10.3390/ma16072633] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/02/2023] [Revised: 03/20/2023] [Accepted: 03/22/2023] [Indexed: 06/19/2023]
Abstract
Graphene has attracted significant interest due to its unique properties. Herein, we built an adsorption structure selection workflow based on a density functional theory (DFT) calculation and machine learning to provide a guide for the interfacial properties of graphene. There are two main parts in our workflow. One main part is a DFT calculation routine to generate a dataset automatically. This part includes adatom random selection, modeling adsorption structures automatically, and a calculation of adsorption properties. It provides the dataset for the second main part in our workflow, which is a machine learning model. The inputs are atomic characteristics selected by feature engineering, and the network features are optimized by a genetic algorithm. The mean percentage error of our model was below 35%. Our routine is a general DFT calculation accelerating routine, which could be applied to many other problems. An attempt on graphene/magnesium composites design was carried out. Our predicting results match well with the interfacial properties calculated by DFT. This indicated that our routine presents an option for quick-design graphene-reinforced metal matrix composites.
Collapse
Affiliation(s)
- Nan Qu
- School of Materials Science and Engineering, Harbin Institute of Technology, Harbin 150001, China
| | - Mo Chen
- School of Materials Science and Engineering, Harbin Institute of Technology, Harbin 150001, China
| | - Mingqing Liao
- School of Materials Science and Engineering, Harbin Institute of Technology, Harbin 150001, China
| | - Yuan Cheng
- National Key Laboratory of Science and Technology on Advanced Composites in Special Environments, Harbin Institute of Technology, Harbin 150001, China
| | - Zhonghong Lai
- Center of Analysis, Measurement and Computing, Harbin Institute of Technology, Harbin 150001, China
| | - Fei Zhou
- School of Materials Science and Engineering, Harbin Institute of Technology, Harbin 150001, China
| | - Jingchuan Zhu
- School of Materials Science and Engineering, Harbin Institute of Technology, Harbin 150001, China
| | - Yong Liu
- School of Materials Science and Engineering, Harbin Institute of Technology, Harbin 150001, China
| | - Lin Zhang
- Biological Physics, Department of Physics and Astronomy, University of Manchester, Oxford Road, Manchester M13 9PL, UK
| |
Collapse
|
15
|
Bradford G, Lopez J, Ruza J, Stolberg MA, Osterude R, Johnson JA, Gomez-Bombarelli R, Shao-Horn Y. Chemistry-Informed Machine Learning for Polymer Electrolyte Discovery. ACS CENTRAL SCIENCE 2023; 9:206-216. [PMID: 36844492 PMCID: PMC9951296 DOI: 10.1021/acscentsci.2c01123] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/22/2022] [Indexed: 06/18/2023]
Abstract
Solid polymer electrolytes (SPEs) have the potential to improve lithium-ion batteries by enhancing safety and enabling higher energy densities. However, SPEs suffer from significantly lower ionic conductivity than liquid and solid ceramic electrolytes, limiting their adoption in functional batteries. To facilitate more rapid discovery of high ionic conductivity SPEs, we developed a chemistry-informed machine learning model that accurately predicts ionic conductivity of SPEs. The model was trained on SPE ionic conductivity data from hundreds of experimental publications. Our chemistry-informed model encodes the Arrhenius equation, which describes temperature activated processes, into the readout layer of a state-of-the-art message passing neural network and has significantly improved accuracy over models that do not encode temperature dependence. Chemically informed readout layers are compatible with deep learning for other property prediction tasks and are especially useful where limited training data are available. Using the trained model, ionic conductivity values were predicted for several thousand candidate SPE formulations, allowing us to identify promising candidate SPEs. We also generated predictions for several different anions in poly(ethylene oxide) and poly(trimethylene carbonate), demonstrating the utility of our model in identifying descriptors for SPE ionic conductivity.
Collapse
Affiliation(s)
- Gabriel Bradford
- Department
of Mechanical Engineering, Massachusetts
Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts02139, United States
| | - Jeffrey Lopez
- Research
Laboratory of Electronics, Massachusetts
Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts02139, United States
| | - Jurgis Ruza
- Department
of Materials Science and Engineering, Massachusetts
Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts02139, United States
| | - Michael A. Stolberg
- Department
of Chemistry, Massachusetts Institute of
Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts02139, United States
- Department
of Materials Science and Engineering, Massachusetts
Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts02139, United States
| | - Richard Osterude
- Department
of Materials Science and Engineering, Massachusetts
Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts02139, United States
| | - Jeremiah A. Johnson
- Department
of Chemistry, Massachusetts Institute of
Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts02139, United States
| | - Rafael Gomez-Bombarelli
- Department
of Materials Science and Engineering, Massachusetts
Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts02139, United States
| | - Yang Shao-Horn
- Department
of Mechanical Engineering, Massachusetts
Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts02139, United States
- Department
of Materials Science and Engineering, Massachusetts
Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts02139, United States
| |
Collapse
|
16
|
Soltani S, Sinclair CW, Rottler J. Exploring glassy dynamics with Markov state models from graph dynamical neural networks. Phys Rev E 2022; 106:025308. [PMID: 36109953 DOI: 10.1103/physreve.106.025308] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2022] [Accepted: 07/21/2022] [Indexed: 06/15/2023]
Abstract
Using machine learning techniques, we introduce a Markov state model (MSM) for a model glass former that reveals structural heterogeneities and their slow dynamics by coarse-graining the molecular dynamics into a low-dimensional feature space. The transition timescale between states is larger than the conventional structural relaxation time τ_{α}, but can be obtained from trajectories much shorter than τ_{α}. The learned map of states assigned to the particles corresponds to local excess Voronoi volume. These results resonate with classic free volume theories of the glass transition, singling out local packing fluctuations as one of the dominant slowly relaxing features.
Collapse
Affiliation(s)
- Siavash Soltani
- Department of Materials Engineering, The University of British Columbia, Vancouver, British Columbia, Canada V6T 1Z4
| | - Chad W Sinclair
- Department of Materials Engineering, The University of British Columbia, Vancouver, British Columbia, Canada V6T 1Z4
| | - Jörg Rottler
- Department of Physics and Astronomy, The University of British Columbia, Vancouver, British Columbia, Canada V6T 1Z1
- Stewart Blusson Quantum Matter Institute, The University of British Columbia, Vancouver, British Columbia, Canada V6T 1Z4
| |
Collapse
|
17
|
Mai H, Le TC, Chen D, Winkler DA, Caruso RA. Machine Learning for Electrocatalyst and Photocatalyst Design and Discovery. Chem Rev 2022; 122:13478-13515. [PMID: 35862246 DOI: 10.1021/acs.chemrev.2c00061] [Citation(s) in RCA: 61] [Impact Index Per Article: 30.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Electrocatalysts and photocatalysts are key to a sustainable future, generating clean fuels, reducing the impact of global warming, and providing solutions to environmental pollution. Improved processes for catalyst design and a better understanding of electro/photocatalytic processes are essential for improving catalyst effectiveness. Recent advances in data science and artificial intelligence have great potential to accelerate electrocatalysis and photocatalysis research, particularly the rapid exploration of large materials chemistry spaces through machine learning. Here a comprehensive introduction to, and critical review of, machine learning techniques used in electrocatalysis and photocatalysis research are provided. Sources of electro/photocatalyst data and current approaches to representing these materials by mathematical features are described, the most commonly used machine learning methods summarized, and the quality and utility of electro/photocatalyst models evaluated. Illustrations of how machine learning models are applied to novel electro/photocatalyst discovery and used to elucidate electrocatalytic or photocatalytic reaction mechanisms are provided. The review offers a guide for materials scientists on the selection of machine learning methods for electrocatalysis and photocatalysis research. The application of machine learning to catalysis science represents a paradigm shift in the way advanced, next-generation catalysts will be designed and synthesized.
Collapse
Affiliation(s)
- Haoxin Mai
- Applied Chemistry and Environmental Science, School of Science, STEM College, RMIT University, GPO Box 2476, Melbourne, Victoria 3001, Australia
| | - Tu C Le
- School of Engineering, STEM College, RMIT University, GPO Box 2476, Melbourne, Victoria 3001, Australia
| | - Dehong Chen
- Applied Chemistry and Environmental Science, School of Science, STEM College, RMIT University, GPO Box 2476, Melbourne, Victoria 3001, Australia
| | - David A Winkler
- Monash Institute of Pharmaceutical Sciences, Monash University, Parkville, Victoria 3052, Australia.,Biochemistry and Chemistry, La Trobe University, Kingsbury Drive, Bundoora, Victoria 3042, Australia.,School of Pharmacy, University of Nottingham, Nottingham NG7 2RD, United Kingdom
| | - Rachel A Caruso
- Applied Chemistry and Environmental Science, School of Science, STEM College, RMIT University, GPO Box 2476, Melbourne, Victoria 3001, Australia
| |
Collapse
|
18
|
Kollias L, Rousseau R, Glezakou VA, Salvalaglio M. Understanding Metal-Organic Framework Nucleation from a Solution with Evolving Graphs. J Am Chem Soc 2022; 144:11099-11109. [PMID: 35709413 DOI: 10.1021/jacs.1c13508] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
A mechanistic understanding of metal-organic framework (MOF) synthesis and scale-up remains underexplored due to the complex nature of the interactions of their building blocks. In this work, we investigate the collective assembly of building units at the early stages of MOF nucleation, using MIL-101(Cr) as a prototypical example. Using large-scale molecular dynamics simulations, we observe that the choice of solvent (water and N,N-dimethylformamide), the introduction of ions (Na+ and F-) and the relative populations of MIL-101(Cr) half-secondary building unit (half-SBU) isomers have a strong influence on the cluster formation process. Additionally, the shape, size, nucleation and growth rates, crystallinity, and short and long-range order largely vary depending on the synthesis conditions. We evaluate these properties as they naturally emerge when interpreting the self-assembly of MOF nuclei as the time evolution of an undirected graph. Solution-induced conformational complexity and ionic concentration have a dramatic effect on the morphology of clusters emerging during assembly. While pure solvents lead to the rapid formation of a small number of large clusters, the presence of ions in aqueous solutions results in smaller clusters and slower nucleation. This diversity is captured by the key features of the graph representation. Principle component analysis on graph properties reveals that only a small number of molecular descriptors is needed to deconvolute MOF self-assembly. Descriptors such as the average coordination number between half-SBUs and fractal dimension are of particulalr interest as they can be can be followed experimentally by techniques like by time-resolved spectroscopy. Ultimately, graph theory emerges as an approach that can be used to understand complex processes revealing molecular descriptors accessible by both simulation and experiment.
Collapse
Affiliation(s)
- Loukas Kollias
- Basic and Applied Molecular Foundations, Pacific Northwest National Laboratory, Richland, Washington 99352 United States
| | - Roger Rousseau
- Basic and Applied Molecular Foundations, Pacific Northwest National Laboratory, Richland, Washington 99352 United States
| | - Vassiliki-Alexandra Glezakou
- Basic and Applied Molecular Foundations, Pacific Northwest National Laboratory, Richland, Washington 99352 United States
| | - Matteo Salvalaglio
- Thomas Young Centre and Department of Chemical Engineering, University College London, London WC1E 7JE, United Kingdom
| |
Collapse
|
19
|
Expanding the active charge carriers of polymer electrolytes in lithium-based batteries using an anion-hosting cathode. Nat Commun 2022; 13:3209. [PMID: 35680867 PMCID: PMC9184592 DOI: 10.1038/s41467-022-30788-5] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2021] [Accepted: 05/18/2022] [Indexed: 12/02/2022] Open
Abstract
Ionic-conductive polymers are appealing electrolyte materials for solid-state lithium-based batteries. However, these polymers are detrimentally affected by the electrochemically-inactive anion migration that limits the ionic conductivity and accelerates cell failure. To circumvent this issue, we propose the use of polyvinyl ferrocene (PVF) as positive electrode active material. The PVF acts as an anion-acceptor during redox processes, thus simultaneously setting anions and lithium ions as effective charge carriers. We report the testing of various Li||PVF lab-scale cells using polyethylene oxide (PEO) matrix and Li-containing salts with different anions. Interestingly, the cells using the PEO-lithium bis(trifluoromethanesulfonyl)imide (LiTFSI) solid electrolyte deliver an initial capacity of 108 mAh g−1 at 100 μA cm−2 and 60 °C, and a discharge capacity retention of 70% (i.e., 70 mAh g−1) after 2800 cycles at 300 μA cm−2 and 60 °C. The Li|PEO-LiTFSI|PVF cells tested at 50 μA cm−2 and 30 °C can also deliver an initial discharge capacity of around 98 mAh g−1 with an electrolyte ionic conductivity in the order of 10−5 S cm−1. The energy content of secondary batteries is often limited by the charge carriers available in the system. Here, the authors employed an anion acceptor cathode for simultaneous use of electrolyte anions and cations as effective charge carriers in solid polymer electrolytes for lithium-based batteries.
Collapse
|
20
|
Ghorbani M, Prasad S, Klauda JB, Brooks BR. GraphVAMPNet, using graph neural networks and variational approach to Markov processes for dynamical modeling of biomolecules. J Chem Phys 2022; 156:184103. [PMID: 35568532 PMCID: PMC9094994 DOI: 10.1063/5.0085607] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2022] [Accepted: 04/22/2022] [Indexed: 11/14/2022] Open
Abstract
Finding a low dimensional representation of data from long-timescale trajectories of biomolecular processes, such as protein folding or ligand-receptor binding, is of fundamental importance, and kinetic models, such as Markov modeling, have proven useful in describing the kinetics of these systems. Recently, an unsupervised machine learning technique called VAMPNet was introduced to learn the low dimensional representation and the linear dynamical model in an end-to-end manner. VAMPNet is based on the variational approach for Markov processes and relies on neural networks to learn the coarse-grained dynamics. In this paper, we combine VAMPNet and graph neural networks to generate an end-to-end framework to efficiently learn high-level dynamics and metastable states from the long-timescale molecular dynamics trajectories. This method bears the advantages of graph representation learning and uses graph message passing operations to generate an embedding for each datapoint, which is used in the VAMPNet to generate a coarse-grained dynamical model. This type of molecular representation results in a higher resolution and a more interpretable Markov model than the standard VAMPNet, enabling a more detailed kinetic study of the biomolecular processes. Our GraphVAMPNet approach is also enhanced with an attention mechanism to find the important residues for classification into different metastable states.
Collapse
Affiliation(s)
| | - Samarjeet Prasad
- Laboratory of Computational Biology, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, Maryland 20824, USA
| | - Jeffery B. Klauda
- Department of Chemical and Biomolecular Engineering, University of Maryland, College Park, Maryland 20742, USA
| | - Bernard R. Brooks
- Laboratory of Computational Biology, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, Maryland 20824, USA
| |
Collapse
|
21
|
Portillo JR, Soler-Toscano F, Langa JA. Global structural stability and the role of cooperation in mutualistic systems. PLoS One 2022; 17:e0267404. [PMID: 35439272 PMCID: PMC9017889 DOI: 10.1371/journal.pone.0267404] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2021] [Accepted: 04/07/2022] [Indexed: 11/19/2022] Open
Abstract
Dynamical systems on graphs allow to describe multiple phenomena from different areas of Science. In particular, many complex systems in Ecology are studied by this approach. In this paper we analize the mathematical framework for the study of the structural stability of each stationary point, feasible or not, introducing a generalization for this concept, defined as Global Structural Stability. This approach would fit with the proper mathematical concept of structural stability, in which we find a full description of the complex dynamics on the phase space due to nonlinear dynamics. This fact can be analyzed as an informational field grounded in a global attractor whose structure can be completely characterized. These attractors are stable under perturbation and suppose the minimal structurally stable sets. We also study in detail, mathematically and computationally, the zones characterizing different levels of biodiversity in bipartite graphs describing mutualistic antagonistic systems of population dynamics. In particular, we investigate the dependence of the region of maximal biodiversity of a system on its connectivity matrix. On the other hand, as the network topology does not completely determine the robustness of the dynamics of a complex network, we study the correlation between structural stability and several graph measures. A systematic study on synthetic and biological graphs is presented, including 10 mutualistic networks of plants and seed-dispersal and 1000 random synthetic networks. We compare the role of centrality measures and modularity, concluding the importance of just cooperation strength among nodes when describing areas of maximal biodiversity. Indeed, we show that cooperation parameters are the central role for biodiversity while other measures act as secondary supporting functions.
Collapse
Affiliation(s)
- José R. Portillo
- Department of Applied Mathematics I, University of Seville, Seville, Spain
- Instituto de Matemáticas de la Universidad de Sevilla Antonio de Castro Brzezicki, Seville, Spain
| | - Fernando Soler-Toscano
- Department of Philosophy, Logic and Philosophy of Science, University of Seville, Seville, Spain
| | - José A. Langa
- Department of Differential Equations and Numerical Analysis, University of Seville, Seville, Spain
- Instituto de Matemáticas de la Universidad de Sevilla Antonio de Castro Brzezicki, Seville, Spain
| |
Collapse
|
22
|
Mardt A, Noé F. Progress in deep Markov state modeling: Coarse graining and experimental data restraints. J Chem Phys 2021; 155:214106. [PMID: 34879670 DOI: 10.1063/5.0064668] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Recent advances in deep learning frameworks have established valuable tools for analyzing the long-timescale behavior of complex systems, such as proteins. In particular, the inclusion of physical constraints, e.g., time-reversibility, was a crucial step to make the methods applicable to biophysical systems. Furthermore, we advance the method by incorporating experimental observables into the model estimation showing that biases in simulation data can be compensated for. We further develop a new neural network layer in order to build a hierarchical model allowing for different levels of details to be studied. Finally, we propose an attention mechanism, which highlights important residues for the classification into different states. We demonstrate the new methodology on an ultralong molecular dynamics simulation of the Villin headpiece miniprotein.
Collapse
Affiliation(s)
- Andreas Mardt
- Department of Mathematics and Computer Science, Freie Universität Berlin, Berlin, Germany
| | - Frank Noé
- Department of Mathematics and Computer Science, Freie Universität Berlin, Berlin, Germany
| |
Collapse
|
23
|
Schmidt J, Pettersson L, Verdozzi C, Botti S, Marques MAL. Crystal graph attention networks for the prediction of stable materials. SCIENCE ADVANCES 2021; 7:eabi7948. [PMID: 34860548 PMCID: PMC8641929 DOI: 10.1126/sciadv.abi7948] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/31/2021] [Accepted: 10/14/2021] [Indexed: 05/27/2023]
Abstract
Graph neural networks for crystal structures typically use the atomic positions and the atomic species as input. Unfortunately, this information is not available when predicting new materials, for which the precise geometrical information is unknown. We circumvent this problem by replacing the precise bond distances with embeddings of graph distances. This allows our networks to be applied directly in high-throughput studies based on both composition and crystal structure prototype without using relaxed structures as input. To train these networks, we curate a dataset of over 2 million density functional calculations of crystals with consistent calculation parameters. We apply the resulting model to the high-throughput search of 15 million tetragonal perovskites of composition ABCD2. As a result, we identify several thousand potentially stable compounds and demonstrate that transfer learning from the newly curated dataset reduces the required training data by 50%.
Collapse
Affiliation(s)
- Jonathan Schmidt
- Institut für Physik, Martin-Luther-Universität Halle-Wittenberg, 06120 Halle (Saale), Germany
| | - Love Pettersson
- Department of Physics, Lund University Box 118, 221 00 Lund, Sweden
| | - Claudio Verdozzi
- Department of Physics, Lund University Box 118, 221 00 Lund, Sweden
| | - Silvana Botti
- Institut für Festkörpertheorie und Optik and European Theoretical Spectroscopy Facility, Friedrich-Schiller-Universität Jena, D-07743 Jena, Germany
| | - Miguel A. L. Marques
- Institut für Physik, Martin-Luther-Universität Halle-Wittenberg, 06120 Halle (Saale), Germany
| |
Collapse
|
24
|
Weber JK, Morrone JA, Bagchi S, Pabon JDE, Kang SG, Zhang L, Cornell WD. Simplified, interpretable graph convolutional neural networks for small molecule activity prediction. J Comput Aided Mol Des 2021; 36:391-404. [PMID: 34817762 PMCID: PMC9325818 DOI: 10.1007/s10822-021-00421-6] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2021] [Accepted: 09/24/2021] [Indexed: 12/11/2022]
Abstract
We here present a streamlined, explainable graph convolutional neural network (gCNN) architecture for small molecule activity prediction. We first conduct a hyperparameter optimization across nearly 800 protein targets that produces a simplified gCNN QSAR architecture, and we observe that such a model can yield performance improvements over both standard gCNN and RF methods on difficult-to-classify test sets. Additionally, we discuss how reductions in convolutional layer dimensions potentially speak to the “anatomical” needs of gCNNs with respect to radial coarse graining of molecular substructure. We augment this simplified architecture with saliency map technology that highlights molecular substructures relevant to activity, and we perform saliency analysis on nearly 100 data-rich protein targets. We show that resultant substructural clusters are useful visualization tools for understanding substructure-activity relationships. We go on to highlight connections between our models’ saliency predictions and observations made in the medicinal chemistry literature, focusing on four case studies of past lead finding and lead optimization campaigns.
Collapse
Affiliation(s)
- Jeffrey K Weber
- IBM Thomas J Watson Research Center, Yorktown Heights, NY, USA
| | | | - Sugato Bagchi
- IBM Thomas J Watson Research Center, Yorktown Heights, NY, USA
| | | | - Seung-Gu Kang
- IBM Thomas J Watson Research Center, Yorktown Heights, NY, USA
| | - Leili Zhang
- IBM Thomas J Watson Research Center, Yorktown Heights, NY, USA
| | - Wendy D Cornell
- IBM Thomas J Watson Research Center, Yorktown Heights, NY, USA.
| |
Collapse
|
25
|
Lombardo T, Duquesnoy M, El-Bouysidy H, Årén F, Gallo-Bueno A, Jørgensen PB, Bhowmik A, Demortière A, Ayerbe E, Alcaide F, Reynaud M, Carrasco J, Grimaud A, Zhang C, Vegge T, Johansson P, Franco AA. Artificial Intelligence Applied to Battery Research: Hype or Reality? Chem Rev 2021; 122:10899-10969. [PMID: 34529918 PMCID: PMC9227745 DOI: 10.1021/acs.chemrev.1c00108] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
![]()
This is a critical
review of artificial intelligence/machine learning
(AI/ML) methods applied to battery research. It aims at providing
a comprehensive, authoritative, and critical, yet easily understandable,
review of general interest to the battery community. It addresses
the concepts, approaches, tools, outcomes, and challenges of using
AI/ML as an accelerator for the design and optimization of the next
generation of batteries—a current hot topic. It intends to
create both accessibility of these tools to the chemistry and electrochemical
energy sciences communities and completeness in terms of the different
battery R&D aspects covered.
Collapse
Affiliation(s)
- Teo Lombardo
- Laboratoire de Réactivité et Chimie des Solides (LRCS), UMR CNRS 7314, Université de Picardie Jules Verne, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France.,Réseau sur le Stockage Electrochimique de l'Energie (RS2E), FR CNRS 3459, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France
| | - Marc Duquesnoy
- Laboratoire de Réactivité et Chimie des Solides (LRCS), UMR CNRS 7314, Université de Picardie Jules Verne, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France.,Réseau sur le Stockage Electrochimique de l'Energie (RS2E), FR CNRS 3459, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France
| | - Hassna El-Bouysidy
- Laboratoire de Réactivité et Chimie des Solides (LRCS), UMR CNRS 7314, Université de Picardie Jules Verne, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France.,ALISTORE-European Research Institute, FR CNRS 3104, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France.,Department of Physics, Chalmers University of Technology, SE-41296 Göteborg, Sweden
| | - Fabian Årén
- ALISTORE-European Research Institute, FR CNRS 3104, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France.,Department of Physics, Chalmers University of Technology, SE-41296 Göteborg, Sweden
| | - Alfonso Gallo-Bueno
- ALISTORE-European Research Institute, FR CNRS 3104, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France.,Centre for Cooperative Research on Alternative Energies (CIC energiGUNE), Basque Research and Technology Alliance (BRTA), Alava Technology Park, Albert Einstein 48, 01510 Vitoria-Gasteiz, Spain
| | - Peter Bjørn Jørgensen
- ALISTORE-European Research Institute, FR CNRS 3104, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France.,Department of Energy Conversion and Storage, Technical University of Denmark, Anker Engelunds Vej, Building 301, 2800 Kgs. Lyngby, Denmark
| | - Arghya Bhowmik
- ALISTORE-European Research Institute, FR CNRS 3104, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France.,Department of Energy Conversion and Storage, Technical University of Denmark, Anker Engelunds Vej, Building 301, 2800 Kgs. Lyngby, Denmark
| | - Arnaud Demortière
- Laboratoire de Réactivité et Chimie des Solides (LRCS), UMR CNRS 7314, Université de Picardie Jules Verne, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France.,Réseau sur le Stockage Electrochimique de l'Energie (RS2E), FR CNRS 3459, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France.,ALISTORE-European Research Institute, FR CNRS 3104, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France
| | - Elixabete Ayerbe
- ALISTORE-European Research Institute, FR CNRS 3104, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France.,CIDETEC, Basque Research and Technology Alliance (BRTA), Po. Miramón 196, 20014 Donostia-San Sebastián, Spain
| | - Francisco Alcaide
- ALISTORE-European Research Institute, FR CNRS 3104, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France.,CIDETEC, Basque Research and Technology Alliance (BRTA), Po. Miramón 196, 20014 Donostia-San Sebastián, Spain
| | - Marine Reynaud
- ALISTORE-European Research Institute, FR CNRS 3104, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France.,Centre for Cooperative Research on Alternative Energies (CIC energiGUNE), Basque Research and Technology Alliance (BRTA), Alava Technology Park, Albert Einstein 48, 01510 Vitoria-Gasteiz, Spain
| | - Javier Carrasco
- ALISTORE-European Research Institute, FR CNRS 3104, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France.,Centre for Cooperative Research on Alternative Energies (CIC energiGUNE), Basque Research and Technology Alliance (BRTA), Alava Technology Park, Albert Einstein 48, 01510 Vitoria-Gasteiz, Spain
| | - Alexis Grimaud
- Réseau sur le Stockage Electrochimique de l'Energie (RS2E), FR CNRS 3459, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France.,ALISTORE-European Research Institute, FR CNRS 3104, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France.,UMR CNRS 8260 "Chimie du Solide et Energie", Collège de France, 11 Place Marcelin Berthelot, 75231 Paris Cedex 05, France Sorbonne Universités - UPMC Univ Paris 06, 4 Place Jussieu, F-75005 Paris, France
| | - Chao Zhang
- ALISTORE-European Research Institute, FR CNRS 3104, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France.,Department of Chemistry - Ångström Laboratory, Box 538, 75121 Uppsala, Sweden
| | - Tejs Vegge
- ALISTORE-European Research Institute, FR CNRS 3104, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France.,Department of Energy Conversion and Storage, Technical University of Denmark, Anker Engelunds Vej, Building 301, 2800 Kgs. Lyngby, Denmark
| | - Patrik Johansson
- ALISTORE-European Research Institute, FR CNRS 3104, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France.,Department of Physics, Chalmers University of Technology, SE-41296 Göteborg, Sweden
| | - Alejandro A Franco
- Laboratoire de Réactivité et Chimie des Solides (LRCS), UMR CNRS 7314, Université de Picardie Jules Verne, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France.,Réseau sur le Stockage Electrochimique de l'Energie (RS2E), FR CNRS 3459, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France.,ALISTORE-European Research Institute, FR CNRS 3104, Hub de l'Energie, 15, rue Baudelocque, 80039 Amiens Cedex, France.,Institut Universitaire de France, 103 Boulevard Saint Michel, 75005 Paris, France
| |
Collapse
|
26
|
Xue LY, Guo F, Wen YS, Feng SQ, Huang XN, Guo L, Li HS, Cui SX, Zhang GQ, Wang QL. ReaxFF-MPNN machine learning potential: a combination of reactive force field and message passing neural networks. Phys Chem Chem Phys 2021; 23:19457-19464. [PMID: 34524283 DOI: 10.1039/d1cp01656c] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Reactive force field (ReaxFF) is a powerful computational tool for exploring material properties. In this work, we proposed an enhanced reactive force field model, which uses message passing neural networks (MPNN) to compute the bond order and bond energies. MPNN are a variation of graph neural networks (GNN), which are derived from graph theory. In MPNN or GNN, molecular structures are treated as a graph and atoms and chemical bonds are represented by nodes and edges. The edge states correspond to the bond order in ReaxFF and are updated by message functions according to the message passing algorithms. The results are very encouraging; the investigation of the potential, such as the potential energy surface, reaction energies and equation of state, are greatly improved by this simple improvement. The new potential model, called reactive force field with message passing neural networks (ReaxFF-MPNN), is provided as an interface in an atomic simulation environment (ASE) with which the original ReaxFF and ReaxFF-MPNN potential models can do MD simulations and geometry optimizations within the ASE. Furthermore, machine learning, based on an active learning algorithm and gradient optimizer, is designed to train the model. We found that the active learning machine not only saves the manual work to collect the training data but is also much more effective than the general optimizer.
Collapse
Affiliation(s)
- Li-Yuan Xue
- Shandong Provincial Key Laboratory of Optical Communication Science and Technology, Liaocheng, 252000, China.
| | - Feng Guo
- Shandong Provincial Key Laboratory of Optical Communication Science and Technology, Liaocheng, 252000, China. .,School of Physical Science and Information Technology, Liaocheng University, Liaocheng, 252000, China
| | - Yu-Shi Wen
- Institute of Chemical Materials, China Academy of Engineering Physics (CAEP), Mianyang, Sichuan, 621900, China.
| | - Shi-Quan Feng
- School of Physics and Electronic Engineering, Zhengzhou University of Light Industry, Zhengzhou, 450002, China
| | - Xiao-Na Huang
- School of Power and Mechanical Engineering, Wuhan University, Wuhan, Hubei, 430072, China
| | - Lei Guo
- School of Business, Shandong Normal University, Jinan, 250014, China
| | - Heng-Shuai Li
- Shandong Provincial Key Laboratory of Optical Communication Science and Technology, Liaocheng, 252000, China.
| | - Shou-Xin Cui
- School of Physical Science and Information Technology, Liaocheng University, Liaocheng, 252000, China
| | - Gui-Qing Zhang
- School of Physical Science and Information Technology, Liaocheng University, Liaocheng, 252000, China
| | - Qing-Lin Wang
- Shandong Provincial Key Laboratory of Optical Communication Science and Technology, Liaocheng, 252000, China. .,School of Physical Science and Information Technology, Liaocheng University, Liaocheng, 252000, China
| |
Collapse
|
27
|
Chen J. Advanced Electron Microscopy of Nanophased Synthetic Polymers and Soft Complexes for Energy and Medicine Applications. NANOMATERIALS (BASEL, SWITZERLAND) 2021; 11:2405. [PMID: 34578720 PMCID: PMC8470047 DOI: 10.3390/nano11092405] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/02/2021] [Revised: 08/02/2021] [Accepted: 09/10/2021] [Indexed: 11/23/2022]
Abstract
After decades of developments, electron microscopy has become a powerful and irreplaceable tool in understanding the ionic, electrical, mechanical, chemical, and other functional performances of next-generation polymers and soft complexes. The recent progress in electron microscopy of nanostructured polymers and soft assemblies is important for applications in many different fields, including, but not limited to, mesoporous and nanoporous materials, absorbents, membranes, solid electrolytes, battery electrodes, ion- and electron-transporting materials, organic semiconductors, soft robotics, optoelectronic devices, biomass, soft magnetic materials, and pharmaceutical drug design. For synthetic polymers and soft complexes, there are four main characteristics that differentiate them from their inorganic or biomacromolecular counterparts in electron microscopy studies: (1) lower contrast, (2) abundance of light elements, (3) polydispersity or nanomorphological variations, and (4) large changes induced by electron beams. Since 2011, the Center for Nanophase Materials Sciences (CNMS) at Oak Ridge National Laboratory has been working with numerous facility users on nanostructured polymer composites, block copolymers, polymer brushes, conjugated molecules, organic-inorganic hybrid nanomaterials, organic-inorganic interfaces, organic crystals, and other soft complexes. This review crystalizes some of the essential challenges, successes, failures, and techniques during the process in the past ten years. It also presents some outlooks and future expectations on the basis of these works at the intersection of electron microscopy, soft matter, and artificial intelligence. Machine learning is expected to automate and facilitate image processing and information extraction of polymer and soft hybrid nanostructures in aspects such as dose-controlled imaging and structure analysis.
Collapse
Affiliation(s)
- Jihua Chen
- Center for Nanophase Materials Sciences, Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA
| |
Collapse
|
28
|
Clark AE, Adams H, Hernandez R, Krylov AI, Niklasson AMN, Sarupria S, Wang Y, Wild SM, Yang Q. The Middle Science: Traversing Scale In Complex Many-Body Systems. ACS CENTRAL SCIENCE 2021; 7:1271-1287. [PMID: 34471670 PMCID: PMC8393217 DOI: 10.1021/acscentsci.1c00685] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
A roadmap is developed that integrates simulation methodology and data science methods to target new theories that traverse the multiple length- and time-scale features of many-body phenomena.
Collapse
Affiliation(s)
- Aurora E. Clark
- Department of Chemistry, Washington State University, Pullman, Washington 99163, United States
| | - Henry Adams
- Department of Mathematics, Colorado State
University, Fort Collins, Colorado 80523, United States
| | - Rigoberto Hernandez
- Departments
of Chemistry, Chemical and Biomolecular Engineering, and Materials
Science and Engineering, Johns Hopkins University, Baltimore, Maryland 21218, United States
| | - Anna I. Krylov
- Department of Chemistry, University of Southern California, Los Angeles, California 90089, United States
| | - Anders M. N. Niklasson
- Theoretical
Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
| | - Sapna Sarupria
- Department of Chemical and Biomolecular Engineering, Center for Optical
Materials Science and Engineering Technologies (COMSET), Clemson University, Clemson, South Carolina 29670, United States
- Department
of Chemistry, University of Minnesota, Minneapolis, Minnesota 55455, United States
| | - Yusu Wang
- Halıcıŏglu Data Science Institute, University of California, San Diego, La Jolla, California 92093, United States
| | - Stefan M. Wild
- Mathematics
and Computer Science Division, Argonne National
Laboratory, Lemont, Illinois 60439, United
States
| | - Qian Yang
- Computer Science and Engineering Department, University of Connecticut, Storrs, Connecticut 06269-4155, United States
| |
Collapse
|
29
|
Glielmo A, Husic BE, Rodriguez A, Clementi C, Noé F, Laio A. Unsupervised Learning Methods for Molecular Simulation Data. Chem Rev 2021; 121:9722-9758. [PMID: 33945269 PMCID: PMC8391792 DOI: 10.1021/acs.chemrev.0c01195] [Citation(s) in RCA: 116] [Impact Index Per Article: 38.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2020] [Indexed: 12/21/2022]
Abstract
Unsupervised learning is becoming an essential tool to analyze the increasingly large amounts of data produced by atomistic and molecular simulations, in material science, solid state physics, biophysics, and biochemistry. In this Review, we provide a comprehensive overview of the methods of unsupervised learning that have been most commonly used to investigate simulation data and indicate likely directions for further developments in the field. In particular, we discuss feature representation of molecular systems and present state-of-the-art algorithms of dimensionality reduction, density estimation, and clustering, and kinetic models. We divide our discussion into self-contained sections, each discussing a specific method. In each section, we briefly touch upon the mathematical and algorithmic foundations of the method, highlight its strengths and limitations, and describe the specific ways in which it has been used-or can be used-to analyze molecular simulation data.
Collapse
Affiliation(s)
- Aldo Glielmo
- International
School for Advanced Studies (SISSA) 34014 Trieste, Italy
| | - Brooke E. Husic
- Freie
Universität Berlin, Department of Mathematics
and Computer Science, 14195 Berlin, Germany
| | - Alex Rodriguez
- International Centre for Theoretical
Physics (ICTP), Condensed Matter and Statistical
Physics Section, 34100 Trieste, Italy
| | - Cecilia Clementi
- Freie
Universität Berlin, Department for
Physics, 14195 Berlin, Germany
- Rice
University Houston, Department of Chemistry, Houston, Texas 77005, United States
| | - Frank Noé
- Freie
Universität Berlin, Department of Mathematics
and Computer Science, 14195 Berlin, Germany
- Freie
Universität Berlin, Department for
Physics, 14195 Berlin, Germany
- Rice
University Houston, Department of Chemistry, Houston, Texas 77005, United States
| | - Alessandro Laio
- International
School for Advanced Studies (SISSA) 34014 Trieste, Italy
- International Centre for Theoretical
Physics (ICTP), Condensed Matter and Statistical
Physics Section, 34100 Trieste, Italy
| |
Collapse
|
30
|
Hempel T, Del Razo MJ, Lee CT, Taylor BC, Amaro RE, Noé F. Independent Markov decomposition: Toward modeling kinetics of biomolecular complexes. Proc Natl Acad Sci U S A 2021; 118:e2105230118. [PMID: 34321356 PMCID: PMC8346863 DOI: 10.1073/pnas.2105230118] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
To advance the mission of in silico cell biology, modeling the interactions of large and complex biological systems becomes increasingly relevant. The combination of molecular dynamics (MD) simulations and Markov state models (MSMs) has enabled the construction of simplified models of molecular kinetics on long timescales. Despite its success, this approach is inherently limited by the size of the molecular system. With increasing size of macromolecular complexes, the number of independent or weakly coupled subsystems increases, and the number of global system states increases exponentially, making the sampling of all distinct global states unfeasible. In this work, we present a technique called independent Markov decomposition (IMD) that leverages weak coupling between subsystems to compute a global kinetic model without requiring the sampling of all combinatorial states of subsystems. We give a theoretical basis for IMD and propose an approach for finding and validating such a decomposition. Using empirical few-state MSMs of ion channel models that are well established in electrophysiology, we demonstrate that IMD models can reproduce experimental conductance measurements with a major reduction in sampling compared with a standard MSM approach. We further show how to find the optimal partition of all-atom protein simulations into weakly coupled subunits.
Collapse
Affiliation(s)
- Tim Hempel
- Department of Mathematics and Computer Science, Freie Universität Berlin, 14195 Berlin, Germany
- Department of Physics, Freie Universität Berlin, 14195 Berlin, Germany
| | - Mauricio J Del Razo
- Department of Mathematics and Computer Science, Freie Universität Berlin, 14195 Berlin, Germany
- Van't Hoff Institute for Molecular Sciences, University of Amsterdam, 1090 GD Amsterdam, The Netherlands
- Korteweg-de Vries Institute for Mathematics, University of Amsterdam, 1090 GE Amsterdam, The Netherlands
- Dutch Institute for Emergent Phenomena, 1090 GL Amsterdam, The Netherlands
| | - Christopher T Lee
- Department of Mechanical and Aerospace Engineering, University of California San Diego, La Jolla, CA 92093
| | - Bryn C Taylor
- Biomedical Sciences Graduate Program, University of California San Diego, La Jolla, CA 92093
| | - Rommie E Amaro
- Department of Chemistry & Biochemistry, University of California San Diego, La Jolla, CA 92093;
| | - Frank Noé
- Department of Mathematics and Computer Science, Freie Universität Berlin, 14195 Berlin, Germany;
- Department of Physics, Freie Universität Berlin, 14195 Berlin, Germany
- Department of Chemistry, Rice University, Houston, TX 77005
| |
Collapse
|
31
|
Exploring the octanol-water partition coefficient dataset using deep learning techniques and data augmentation. Commun Chem 2021; 4:90. [PMID: 36697535 PMCID: PMC9814212 DOI: 10.1038/s42004-021-00528-9] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2020] [Accepted: 05/21/2021] [Indexed: 01/28/2023] Open
Abstract
Today more and more data are freely available. Based on these big datasets deep neural networks (DNNs) rapidly gain relevance in computational chemistry. Here, we explore the potential of DNNs to predict chemical properties from chemical structures. We have selected the octanol-water partition coefficient (log P) as an example, which plays an essential role in environmental chemistry and toxicology but also in chemical analysis. The predictive performance of the developed DNN is good with an rmse of 0.47 log units in the test dataset and an rmse of 0.33 for an external dataset from the SAMPL6 challenge. To this end, we trained the DNN using data augmentation considering all potential tautomeric forms of the chemicals. We further demonstrate how DNN models can help in the curation of the log P dataset by identifying potential errors, and address limitations of the dataset itself.
Collapse
|
32
|
Kalinin SV, Zhang S, Valleti M, Pyles H, Baker D, De Yoreo JJ, Ziatdinov M. Disentangling Rotational Dynamics and Ordering Transitions in a System of Self-Organizing Protein Nanorods via Rotationally Invariant Latent Representations. ACS NANO 2021; 15:6471-6480. [PMID: 33861068 DOI: 10.1021/acsnano.0c08914] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
The dynamics of complex ordering systems with active rotational degrees of freedom exemplified by protein self-assembly is explored using a machine learning workflow that combines deep learning-based semantic segmentation and rotationally invariant variational autoencoder-based analysis of orientation and shape evolution. The latter allows for disentanglement of the particle orientation from other degrees of freedom and compensates for lateral shifts. The disentangled representations in the latent space encode the rich spectrum of local transitions that can now be visualized and explored via continuous variables. The time dependence of ensemble averages allows insight into the time dynamics of the system and, in particular, illustrates the presence of the potential ordering transition. Finally, analysis of the latent variables along the single-particle trajectory allows tracing these parameters on a single-particle level. The proposed approach is expected to be universally applicable for the description of the imaging data in optical, scanning probe, and electron microscopy seeking to understand the dynamics of complex systems where rotations are a significant part of the process.
Collapse
Affiliation(s)
- Sergei V Kalinin
- Center for Nanophase Materials Sciences, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831, United States
| | - Shuai Zhang
- Materials Science and Engineering, University of Washington, Seattle, Washington 98195, United States
- Physical Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - Mani Valleti
- Bredesen Center for Interdisciplinary Research, University of Tennessee, Knoxville, Tennessee 37996, United States
| | - Harley Pyles
- Department of Biochemistry, University of Washington, Seattle, Washington 98195, United States
- Institute for Protein Design, University of Washington, Seattle, Washington 98195, United States
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, Washington 98195, United States
- Institute for Protein Design, University of Washington, Seattle, Washington 98195, United States
- Howard Hughes Medical Institute, University of Washington, Seattle, Washington 98195, United States
| | - James J De Yoreo
- Materials Science and Engineering, University of Washington, Seattle, Washington 98195, United States
- Physical Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - Maxim Ziatdinov
- Center for Nanophase Materials Sciences, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831, United States
- Computational Sciences and Engineering Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831, United States
| |
Collapse
|
33
|
Mistry A, Franco AA, Cooper SJ, Roberts SA, Viswanathan V. How Machine Learning Will Revolutionize Electrochemical Sciences. ACS ENERGY LETTERS 2021; 6:1422-1431. [PMID: 33869772 PMCID: PMC8042659 DOI: 10.1021/acsenergylett.1c00194] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/28/2021] [Accepted: 03/08/2021] [Indexed: 05/21/2023]
Abstract
Electrochemical systems function via interconversion of electric charge and chemical species and represent promising technologies for our cleaner, more sustainable future. However, their development time is fundamentally limited by our ability to identify new materials and understand their electrochemical response. To shorten this time frame, we need to switch from the trial-and-error approach of finding useful materials to a more selective process by leveraging model predictions. Machine learning (ML) offers data-driven predictions and can be helpful. Herein we ask if ML can revolutionize the development cycle from decades to a few years. We outline the necessary characteristics of such ML implementations. Instead of enumerating various ML algorithms, we discuss scientific questions about the electrochemical systems to which ML can contribute.
Collapse
Affiliation(s)
- Aashutosh Mistry
- Chemical
Sciences and Engineering Division, Argonne
National Laboratory, Lemont, Illinois 60439, United States
| | - Alejandro A. Franco
- Laboratorie
de Réactivité et Chimie des Solides (LRCS), UMR CNRS
7314, Université de Picardie Jules Verne, Hub de I’Energie, 15 rue Baudelocque, 80039 Amiens Cedex, France
- Réseau
sur le Stockage Electrochimique de l’Energie (RS2E), FR CNRS
3459, Hub de l’Energie, 15 rue Baudelocque, 80039 Amiens Cedex, France
- ALISTORE-European
Research Institute, FR CNRS 3104, Hub de l’Energie, 15 rue Baudelocque, 80039 Amiens Cedex, France
- Institut
Universitaire de France, 103 Boulevard Saint Michel, 75005 Paris, France
| | - Samuel J. Cooper
- Dyson
School of Design Engineering, Imperial College
London, London SW7 2DB, United Kingdom
| | - Scott A. Roberts
- Engineering
Sciences Center, Sandia National Laboratories, Albuquerque, New Mexico 87185, United States
| | | |
Collapse
|
34
|
Huang J, Zhang L, Wang H, Zhao J, Cheng J, E W. Deep potential generation scheme and simulation protocol for the Li 10GeP 2S 12-type superionic conductors. J Chem Phys 2021; 154:094703. [PMID: 33685134 DOI: 10.1063/5.0041849] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Solid-state electrolyte materials with superior lithium ionic conductivities are vital to the next-generation Li-ion batteries. Molecular dynamics could provide atomic scale information to understand the diffusion process of Li-ion in these superionic conductor materials. Here, we implement the deep potential generator to set up an efficient protocol to automatically generate interatomic potentials for Li10GeP2S12-type solid-state electrolyte materials (Li10GeP2S12, Li10SiP2S12, and Li10SnP2S12). The reliability and accuracy of the fast interatomic potentials are validated. With the potentials, we extend the simulation of the diffusion process to a wide temperature range (300 K-1000 K) and systems with large size (∼1000 atoms). Important technical aspects such as the statistical error and size effect are carefully investigated, and benchmark tests including the effect of density functional, thermal expansion, and configurational disorder are performed. The computed data that consider these factors agree well with the experimental results, and we find that the three structures show different behaviors with respect to configurational disorder. Our work paves the way for further research on computation screening of solid-state electrolyte materials.
Collapse
Affiliation(s)
- Jianxing Huang
- State Key Laboratory of Physical Chemistry of Solid Surfaces, iChEM, College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China
| | - Linfeng Zhang
- Program in Applied and Computational Mathematics, Princeton University, Princeton, New Jersey 08544, USA
| | - Han Wang
- Laboratory of Computational Physics, Institute of Applied Physics and Computational Mathematics, Fenghao East Road 2, Beijing 100094, People's Republic of China
| | - Jinbao Zhao
- State Key Laboratory of Physical Chemistry of Solid Surfaces, iChEM, College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China
| | - Jun Cheng
- State Key Laboratory of Physical Chemistry of Solid Surfaces, iChEM, College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China
| | - Weinan E
- Program in Applied and Computational Mathematics, Princeton University, Princeton, New Jersey 08544, USA
| |
Collapse
|
35
|
Planas-Iglesias J, Marques SM, Pinto GP, Musil M, Stourac J, Damborsky J, Bednar D. Computational design of enzymes for biotechnological applications. Biotechnol Adv 2021; 47:107696. [PMID: 33513434 DOI: 10.1016/j.biotechadv.2021.107696] [Citation(s) in RCA: 43] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2020] [Revised: 01/12/2021] [Accepted: 01/13/2021] [Indexed: 12/14/2022]
Abstract
Enzymes are the natural catalysts that execute biochemical reactions upholding life. Their natural effectiveness has been fine-tuned as a result of millions of years of natural evolution. Such catalytic effectiveness has prompted the use of biocatalysts from multiple sources on different applications, including the industrial production of goods (food and beverages, detergents, textile, and pharmaceutics), environmental protection, and biomedical applications. Natural enzymes often need to be improved by protein engineering to optimize their function in non-native environments. Recent technological advances have greatly facilitated this process by providing the experimental approaches of directed evolution or by enabling computer-assisted applications. Directed evolution mimics the natural selection process in a highly accelerated fashion at the expense of arduous laboratory work and economic resources. Theoretical methods provide predictions and represent an attractive complement to such experiments by waiving their inherent costs. Computational techniques can be used to engineer enzymatic reactivity, substrate specificity and ligand binding, access pathways and ligand transport, and global properties like protein stability, solubility, and flexibility. Theoretical approaches can also identify hotspots on the protein sequence for mutagenesis and predict suitable alternatives for selected positions with expected outcomes. This review covers the latest advances in computational methods for enzyme engineering and presents many successful case studies.
Collapse
Affiliation(s)
- Joan Planas-Iglesias
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic; International Clinical Research Center, St. Anne's University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Sérgio M Marques
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic; International Clinical Research Center, St. Anne's University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Gaspar P Pinto
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic; International Clinical Research Center, St. Anne's University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Milos Musil
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic; International Clinical Research Center, St. Anne's University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic; IT4Innovations Centre of Excellence, Faculty of Information Technology, Brno University of Technology, 61266 Brno, Czech Republic
| | - Jan Stourac
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic; International Clinical Research Center, St. Anne's University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Jiri Damborsky
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic; International Clinical Research Center, St. Anne's University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic.
| | - David Bednar
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic.
| |
Collapse
|
36
|
Predicting materials properties without crystal structure: deep representation learning from stoichiometry. Nat Commun 2020; 11:6280. [PMID: 33293567 PMCID: PMC7722901 DOI: 10.1038/s41467-020-19964-7] [Citation(s) in RCA: 59] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2020] [Accepted: 11/04/2020] [Indexed: 01/31/2023] Open
Abstract
Machine learning has the potential to accelerate materials discovery by accurately predicting materials properties at a low computational cost. However, the model inputs remain a key stumbling block. Current methods typically use descriptors constructed from knowledge of either the full crystal structure — therefore only applicable to materials with already characterised structures — or structure-agnostic fixed-length representations hand-engineered from the stoichiometry. We develop a machine learning approach that takes only the stoichiometry as input and automatically learns appropriate and systematically improvable descriptors from data. Our key insight is to treat the stoichiometric formula as a dense weighted graph between elements. Compared to the state of the art for structure-agnostic methods, our approach achieves lower errors with less data. Predicting the structure of unknown materials’ compositions represents a challenge for high-throughput computational approaches. Here the authors introduce a new stoichiometry-based machine learning approach for predicting the properties of inorganic materials from their elemental compositions.
Collapse
|
37
|
Barry MC, Wise KE, Kalidindi SR, Kumar S. Voxelized Atomic Structure Potentials: Predicting Atomic Forces with the Accuracy of Quantum Mechanics Using Convolutional Neural Networks. J Phys Chem Lett 2020; 11:9093-9099. [PMID: 32985196 DOI: 10.1021/acs.jpclett.0c02271] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
This paper introduces voxelized atomic structure (VASt) potentials as a machine learning (ML) framework for developing interatomic potentials. The VASt framework utilizes a voxelized representation of the atomic structure directly as the input to a convolutional neural network (CNN). This allows for high-fidelity representations of highly complex and diverse spatial arrangements of the atomic environments of interest. The CNN implicitly establishes the low-dimensional features needed to correlate each atomic neighborhood to its net atomic force. The selection of the salient features of the atomic structure (i.e., feature engineering) in the VASt framework is implicit, comprehensive, automated, scalable, and highly efficient. The calibrated convolutional layers learn the complex spatial relationships and multibody interactions that govern the physics of atomic systems with remarkable fidelity. We show that VASt potentials predict highly accurate forces on two phases of silicon carbide and the thermal conductivity of silicon over a range of isotropic strain.
Collapse
Affiliation(s)
- Matthew C Barry
- G.W. Woodruff School of Mechanical Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Kristopher E Wise
- Advanced Materials and Processing Branch, NASA Langley Research Center, Hampton, Virginia 23681, United States
| | - Surya R Kalidindi
- G.W. Woodruff School of Mechanical Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Satish Kumar
- G.W. Woodruff School of Mechanical Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| |
Collapse
|
38
|
Yoon J, Ulissi ZW. Differentiable Optimization for the Prediction of Ground State Structures (DOGSS). PHYSICAL REVIEW LETTERS 2020; 125:173001. [PMID: 33156640 DOI: 10.1103/physrevlett.125.173001] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/07/2020] [Accepted: 09/15/2020] [Indexed: 06/11/2023]
Abstract
Ground state or relaxed inorganic structures are the starting point for most computational materials science or surface science analyses. Many of these structure relaxations represent systematic changes to the structure, but there are currently no general methods to improve the initial structure guess based on past calculations. Here we present a method to directly predict the ground state configuration using differentiable optimization and graph neural networks to learn the properties of a simple harmonic force field that approximates the ground state structure and properties. We demonstrate this flexible open source tool for improving the initial configurations for large datasets of inorganic multicomponent surface relaxations across 32 elements and the relaxation of adsorbates (H and CO) on these surfaces. Using these improved initial configurations reduces the expensive adsorbate-covered surface relaxations by approximately 50% and is complementary to other approaches to accelerate the relaxation process.
Collapse
Affiliation(s)
- Junwoong Yoon
- Department of Chemical Engineering, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, USA
| | - Zachary W Ulissi
- Department of Chemical Engineering, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, USA
| |
Collapse
|
39
|
Gu M, Gu Y, Luo W, Xu G, Yang Z, Zhou J, Qu W. From text to graph: a general transition-based AMR parsing using neural network. Neural Comput Appl 2020. [DOI: 10.1007/s00521-020-05378-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
40
|
Melo MCR, Bernardi RC, de la Fuente-Nunez C, Luthey-Schulten Z. Generalized correlation-based dynamical network analysis: a new high-performance approach for identifying allosteric communications in molecular dynamics trajectories. J Chem Phys 2020; 153:134104. [DOI: 10.1063/5.0018980] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Affiliation(s)
- Marcelo C. R. Melo
- Center for Biophysics and Computational Biology, University of Illinois at Urbana-Champaign, Champaign, Illinois 61801, USA
- Department of Chemistry, University of Illinois at Urbana-Champaign, Champaign, Illinois 61801, USA
- Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Champaign, Illinois 61801, USA
- Machine Biology Group, Departments of Psychiatry and Microbiology, Institute for Biomedical Informatics, Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, Penn Institute for Computational Science, and Department of Bioengineering, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
| | - Rafael C. Bernardi
- Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Champaign, Illinois 61801, USA
- Department of Physics, Auburn University, Auburn, Alabama 36849, USA
| | - Cesar de la Fuente-Nunez
- Machine Biology Group, Departments of Psychiatry and Microbiology, Institute for Biomedical Informatics, Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, Penn Institute for Computational Science, and Department of Bioengineering, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
| | - Zaida Luthey-Schulten
- Center for Biophysics and Computational Biology, University of Illinois at Urbana-Champaign, Champaign, Illinois 61801, USA
- Department of Chemistry, University of Illinois at Urbana-Champaign, Champaign, Illinois 61801, USA
- Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Champaign, Illinois 61801, USA
| |
Collapse
|
41
|
Jablonka K, Ongari D, Moosavi SM, Smit B. Big-Data Science in Porous Materials: Materials Genomics and Machine Learning. Chem Rev 2020; 120:8066-8129. [PMID: 32520531 PMCID: PMC7453404 DOI: 10.1021/acs.chemrev.0c00004] [Citation(s) in RCA: 154] [Impact Index Per Article: 38.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2020] [Indexed: 12/16/2022]
Abstract
By combining metal nodes with organic linkers we can potentially synthesize millions of possible metal-organic frameworks (MOFs). The fact that we have so many materials opens many exciting avenues but also create new challenges. We simply have too many materials to be processed using conventional, brute force, methods. In this review, we show that having so many materials allows us to use big-data methods as a powerful technique to study these materials and to discover complex correlations. The first part of the review gives an introduction to the principles of big-data science. We show how to select appropriate training sets, survey approaches that are used to represent these materials in feature space, and review different learning architectures, as well as evaluation and interpretation strategies. In the second part, we review how the different approaches of machine learning have been applied to porous materials. In particular, we discuss applications in the field of gas storage and separation, the stability of these materials, their electronic properties, and their synthesis. Given the increasing interest of the scientific community in machine learning, we expect this list to rapidly expand in the coming years.
Collapse
Affiliation(s)
- Kevin
Maik Jablonka
- Laboratory of Molecular Simulation
(LSMO), Institut des Sciences et Ingénierie Chimiques (ISIC), École Polytechnique Fédérale
de Lausanne (EPFL), Sion, Switzerland
| | - Daniele Ongari
- Laboratory of Molecular Simulation
(LSMO), Institut des Sciences et Ingénierie Chimiques (ISIC), École Polytechnique Fédérale
de Lausanne (EPFL), Sion, Switzerland
| | - Seyed Mohamad Moosavi
- Laboratory of Molecular Simulation
(LSMO), Institut des Sciences et Ingénierie Chimiques (ISIC), École Polytechnique Fédérale
de Lausanne (EPFL), Sion, Switzerland
| | - Berend Smit
- Laboratory of Molecular Simulation
(LSMO), Institut des Sciences et Ingénierie Chimiques (ISIC), École Polytechnique Fédérale
de Lausanne (EPFL), Sion, Switzerland
| |
Collapse
|
42
|
Liu DH, Bai Z, Li M, Yu A, Luo D, Liu W, Yang L, Lu J, Amine K, Chen Z. Developing high safety Li-metal anodes for future high-energy Li-metal batteries: strategies and perspectives. Chem Soc Rev 2020; 49:5407-5445. [DOI: 10.1039/c9cs00636b] [Citation(s) in RCA: 144] [Impact Index Per Article: 36.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Developing high-safety Li-metal anodes (LMAs) are extremely important for the application of high-energy Li-metal batteries. The recently state-of-the-art technologies, strategies and perspectives for developing LMAs are comprehensively summarized in this review.
Collapse
|
43
|
Moradzadeh A, Aluru NR. Molecular Dynamics Properties without the Full Trajectory: A Denoising Autoencoder Network for Properties of Simple Liquids. J Phys Chem Lett 2019; 10:7568-7576. [PMID: 31738568 DOI: 10.1021/acs.jpclett.9b02820] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Molecular dynamics (MD) simulation is a popularly used computational tool to compute microscopic and macroscopic properties of a variety of systems including liquids, solids, biological systems, etc. To determine properties of atomic systems to a good level of accuracy with minimal noise or fluctuation, MD simulations are performed over a long time ranging from a few nanoseconds to several tens to hundreds of nanoseconds depending on the system and the properties of interest. In this study, by considering simple liquids, we explore the feasibility of significantly reducing the MD simulation time to compute various properties of monatomic systems such as the structure, pressure, and isothermal compressibility. To do so, extensive MD simulations are performed on 12 000 distinct Lennard-Jones systems at various thermodynamic states. Then, a deep denoising autoencoder network is trained to take the radial distribution function (RDF) from a single snapshot of a Lennard-Jones liquid to compute the mean, temporally averaged RDF. We show that the method is successful in the prediction of RDF and other properties such as the pressure and isothermal compressibility that can be computed based on the RDF not only for Lennard-Jones liquids at various thermodynamic states but also for various simple liquids described by exponential, Yukawa, and inverse-power-law pair potentials.
Collapse
Affiliation(s)
- Alireza Moradzadeh
- Department of Mechanical Science and Engineering, Beckman Institute for Advanced Science and Technology , University of Illinois at Urbana-Champaign , Urbana , Illinois 61801 , United States
| | - N R Aluru
- Department of Mechanical Science and Engineering, Beckman Institute for Advanced Science and Technology , University of Illinois at Urbana-Champaign , Urbana , Illinois 61801 , United States
| |
Collapse
|
44
|
Abstract
Most current molecular dynamics simulation and analysis methods rely on the idea that the molecular system can be represented by a single global state (e.g., a Markov state in a Markov state model [MSM]). In this approach, molecules can be extensively sampled and analyzed when they only possess a few metastable states, such as small- to medium-sized proteins. However, this approach breaks down in frustrated systems and in large protein assemblies, where the number of global metastable states may grow exponentially with the system size. To address this problem, we here introduce dynamic graphical models (DGMs) that describe molecules as assemblies of coupled subsystems, akin to how spins interact in the Ising model. The change of each subsystem state is only governed by the states of itself and its neighbors. DGMs require fewer parameters than MSMs or other global state models; in particular, we do not need to observe all global system configurations to characterize them. Therefore, DGMs can predict previously unobserved molecular configurations. As a proof of concept, we demonstrate that DGMs can faithfully describe molecular thermodynamics and kinetics and predict previously unobserved metastable states for Ising models and protein simulations.
Collapse
Affiliation(s)
- Simon Olsson
- Department of Mathematics and Computer Science, Freie Universität Berlin, 14195 Berlin, Germany;
| | - Frank Noé
- Department of Mathematics and Computer Science, Freie Universität Berlin, 14195 Berlin, Germany;
- Department of Physics, Freie Universität Berlin, 14195 Berlin, Germany
- Department of Chemistry, Rice University, Houston, TX 77005
| |
Collapse
|