1
|
Lee SC, Z Y. Interpretation of autoencoder-learned collective variables using Morse-Smale complex and sublevelset persistent homology: An application on molecular trajectories. J Chem Phys 2024; 160:144104. [PMID: 38591676 DOI: 10.1063/5.0191446] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2023] [Accepted: 03/22/2024] [Indexed: 04/10/2024] Open
Abstract
Dimensionality reduction often serves as the first step toward a minimalist understanding of physical systems as well as the accelerated simulations of them. In particular, neural network-based nonlinear dimensionality reduction methods, such as autoencoders, have shown promising outcomes in uncovering collective variables (CVs). However, the physical meaning of these CVs remains largely elusive. In this work, we constructed a framework that (1) determines the optimal number of CVs needed to capture the essential molecular motions using an ensemble of hierarchical autoencoders and (2) provides topology-based interpretations to the autoencoder-learned CVs with Morse-Smale complex and sublevelset persistent homology. This approach was exemplified using a series of n-alkanes and can be regarded as a general, explainable nonlinear dimensionality reduction method.
Collapse
Affiliation(s)
- Shao-Chun Lee
- Department of Nuclear, Plasma, and Radiological Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
- Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
| | - Y Z
- Department of Nuclear, Plasma, and Radiological Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
- Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
- Department of Nuclear Engineering and Radiological Sciences, Department of Materials Science and Engineering, Department of Robotics, and Applied Physics Program, University of Michigan, Ann Arbor, Michigan 48105, USA
| |
Collapse
|
2
|
Teramoto H, Saito T, Aoki M, Murayama B, Kobayashi M, Nakamura T, Taketsugu T. Reproducing the Reaction Route Map on the Shape Space from Its Quotient by the Complete Nuclear Permutation-Inversion Group. J Chem Theory Comput 2023; 19:5886-5896. [PMID: 37642714 DOI: 10.1021/acs.jctc.3c00500] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/31/2023]
Abstract
This study develops an algorithm to reproduce reaction route maps (RRMs) in the shape space from the outputs of potential search algorithms. To demonstrate the algorithm, global reaction route mapping is utilized as a potential search algorithm, but the proposed algorithm should work with other potential search algorithms in principle. The proposed algorithm does not require any encoding of the molecular configurations and is thus applicable to complicated realistic molecules for which efficient encoding is not readily available. We show that subgraphs of an RRM mapped to each other by the action of the symmetry group are isomorphic and also provide an algorithm to compute the set of feasible transformations in the sense of Longuet-Higgins. We demonstrate the proposed algorithm in toy models and in more realistic molecules. Finally, we remark on absolute rate theory from our perspective.
Collapse
Affiliation(s)
- Hiroshi Teramoto
- Faculty of Engineering Science, Kansai University, Suita 564-8680, Japan
| | - Takuya Saito
- Department of Mathematics, Faculty of Science, Hokkaido University, Sapporo 060-0810, Japan
- Department of Economics and Statistics, University of Turin, 10124 Turin, Italy
| | - Masamitsu Aoki
- Department of Mathematics, Faculty of Science, Hokkaido University, Sapporo 060-0810, Japan
| | - Burai Murayama
- Graduate School of Chemical Sciences and Engineering, Hokkaido University, Sapporo 060-0810, Japan
| | - Masato Kobayashi
- Department of Chemistry, Faculty of Science, Hokkaido University, Sapporo 060-0810, Japan
- WPI-ICReDD, Hokkaido University, Sapporo 001-0021, Japan
| | - Takenobu Nakamura
- National Institute of Advanced Industrial Science and Technology, Tsukuba 305-8568, Japan
| | - Tetsuya Taketsugu
- Department of Chemistry, Faculty of Science, Hokkaido University, Sapporo 060-0810, Japan
- WPI-ICReDD, Hokkaido University, Sapporo 001-0021, Japan
| |
Collapse
|
3
|
Shipman PD, Sharath T, Bradley RM. Topological measures of order for imperfect two-dimensional Bravais lattices. Phys Rev E 2023; 107:044216. [PMID: 37198833 DOI: 10.1103/physreve.107.044216] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2022] [Accepted: 03/20/2023] [Indexed: 05/19/2023]
Abstract
Motivated by patterns with defects in natural and laboratory systems, we develop two quantitative measures of order for imperfect Bravais lattices in the plane. A tool from topological data analysis called persistent homology combined with the sliced Wasserstein distance, a metric on point distributions, are the key components for defining these measures. The measures generalize previous measures of order using persistent homology that were applicable only to imperfect hexagonal lattices in two dimensions. We illustrate the sensitivities of these measures to the degree of perturbation of perfect hexagonal, square, and rhombic Bravais lattices. We also study imperfect hexagonal, square, and rhombic lattices produced by numerical simulations of pattern-forming partial differential equations. These numerical experiments serve to compare the measures of lattice order and reveal differences in the evolution of the patterns in various partial differential equations.
Collapse
Affiliation(s)
- Patrick D Shipman
- Department of Mathematics and School of Advanced Materials Discovery, Colorado State University, Fort Collins, Colorado 80523, USA
| | - Tejas Sharath
- Department of Physics, Colorado State University, Fort Collins, Colorado 80523, USA
| | - R Mark Bradley
- Departments of Physics and Mathematics, Colorado State University, Fort Collins, Colorado 80523, USA
| |
Collapse
|
4
|
Manuchehrfar F, Li H, Ma A, Liang J. Reactive Vortexes in a Naturally Activated Process: Non-Diffusive Rotational Fluxes at Transition State Uncovered by Persistent Homology. J Phys Chem B 2022; 126:9297-9308. [PMID: 36346639 PMCID: PMC10495042 DOI: 10.1021/acs.jpcb.2c07015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The dynamics of reaction coordinates during barrier-crossing are key to understanding activated processes in complex systems such as proteins. The default assumption from Kramers' physical intuition is that of a diffusion process. However, the dynamics of barrier-crossing in natural complex molecules are largely unexplored. Here we investigate the transition dynamics of alanine dipeptide isomerization, the simplest complex system with a large number of non-reaction coordinates that can serve as an adequate thermal bath feeding energy into the reaction coordinates. We separate conformations along the time axis and construct the dynamic probability surface of reaction. We quantify its topological structure and rotational flux using persistent homology and differential form. Our results uncovered a region with a strong reactive vortex in the configuration-time space, where the highest probability peak and the transition state ensemble are located. This reactive region contains strong rotational fluxes: Most reactive trajectories swirl multiple times around this region in the subspace of the two most important reaction coordinates. Furthermore, the rotational fluxes result from cooperative movement along the isocommitter surfaces and orthogonal barrier-crossing. Overall, our findings offer a first glimpse into the reactive vortex regions that characterize the non-diffusive dynamics of barrier-crossing of a naturally occurring activation process.
Collapse
Affiliation(s)
- Farid Manuchehrfar
- Center for Bioinformatics and Quantiative Biology and Richard and Loan Hill Department of Biomedical Engineering, University of Illinois at Chicago, Chicago, Illinois60607, United States
| | - Huiyu Li
- Center for Bioinformatics and Quantiative Biology and Richard and Loan Hill Department of Biomedical Engineering, University of Illinois at Chicago, Chicago, Illinois60607, United States
| | - Ao Ma
- Center for Bioinformatics and Quantiative Biology and Richard and Loan Hill Department of Biomedical Engineering, University of Illinois at Chicago, Chicago, Illinois60607, United States
| | - Jie Liang
- Center for Bioinformatics and Quantiative Biology and Richard and Loan Hill Department of Biomedical Engineering, University of Illinois at Chicago, Chicago, Illinois60607, United States
| |
Collapse
|
5
|
Clark AE, Adams H, Hernandez R, Krylov AI, Niklasson AMN, Sarupria S, Wang Y, Wild SM, Yang Q. The Middle Science: Traversing Scale In Complex Many-Body Systems. ACS CENTRAL SCIENCE 2021; 7:1271-1287. [PMID: 34471670 PMCID: PMC8393217 DOI: 10.1021/acscentsci.1c00685] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
A roadmap is developed that integrates simulation methodology and data science methods to target new theories that traverse the multiple length- and time-scale features of many-body phenomena.
Collapse
Affiliation(s)
- Aurora E. Clark
- Department of Chemistry, Washington State University, Pullman, Washington 99163, United States
| | - Henry Adams
- Department of Mathematics, Colorado State
University, Fort Collins, Colorado 80523, United States
| | - Rigoberto Hernandez
- Departments
of Chemistry, Chemical and Biomolecular Engineering, and Materials
Science and Engineering, Johns Hopkins University, Baltimore, Maryland 21218, United States
| | - Anna I. Krylov
- Department of Chemistry, University of Southern California, Los Angeles, California 90089, United States
| | - Anders M. N. Niklasson
- Theoretical
Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
| | - Sapna Sarupria
- Department of Chemical and Biomolecular Engineering, Center for Optical
Materials Science and Engineering Technologies (COMSET), Clemson University, Clemson, South Carolina 29670, United States
- Department
of Chemistry, University of Minnesota, Minneapolis, Minnesota 55455, United States
| | - Yusu Wang
- Halıcıŏglu Data Science Institute, University of California, San Diego, La Jolla, California 92093, United States
| | - Stefan M. Wild
- Mathematics
and Computer Science Division, Argonne National
Laboratory, Lemont, Illinois 60439, United
States
| | - Qian Yang
- Computer Science and Engineering Department, University of Connecticut, Storrs, Connecticut 06269-4155, United States
| |
Collapse
|
6
|
Adams H, Moy M. Topology Applied to Machine Learning: From Global to Local. Front Artif Intell 2021; 4:668302. [PMID: 34056580 PMCID: PMC8160457 DOI: 10.3389/frai.2021.668302] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2021] [Accepted: 04/15/2021] [Indexed: 11/24/2022] Open
Abstract
Through the use of examples, we explain one way in which applied topology has evolved since the birth of persistent homology in the early 2000s. The first applications of topology to data emphasized the global shape of a dataset, such as the three-circle model for 3 × 3 pixel patches from natural images, or the configuration space of the cyclo-octane molecule, which is a sphere with a Klein bottle attached via two circles of singularity. In these studies of global shape, short persistent homology bars are disregarded as sampling noise. More recently, however, persistent homology has been used to address questions about the local geometry of data. For instance, how can local geometry be vectorized for use in machine learning problems? Persistent homology and its vectorization methods, including persistence landscapes and persistence images, provide popular techniques for incorporating both local geometry and global topology into machine learning. Our meta-hypothesis is that the short bars are as important as the long bars for many machine learning tasks. In defense of this claim, we survey applications of persistent homology to shape recognition, agent-based modeling, materials science, archaeology, and biology. Additionally, we survey work connecting persistent homology to geometric features of spaces, including curvature and fractal dimension, and various methods that have been used to incorporate persistent homology into machine learning.
Collapse
Affiliation(s)
- Henry Adams
- Department of Mathematics, Colorado State University, Fort Collins, CO, United States
| | - Michael Moy
- Department of Mathematics, Colorado State University, Fort Collins, CO, United States
| |
Collapse
|