1
|
Mirth J, Zhai Y, Bush J, Alvarado EG, Jordan H, Heim M, Krishnamoorthy B, Pflaum M, Clark A, Z Y, Adams H. Representations of energy landscapes by sublevelset persistent homology: An example with n-alkanes. J Chem Phys 2021; 154:114114. [PMID: 33752361 DOI: 10.1063/5.0036747] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Encoding the complex features of an energy landscape is a challenging task, and often, chemists pursue the most salient features (minima and barriers) along a highly reduced space, i.e., two- or three-dimensions. Even though disconnectivity graphs or merge trees summarize the connectivity of the local minima of an energy landscape via the lowest-barrier pathways, there is much information to be gained by also considering the topology of each connected component at different energy thresholds (or sublevelsets). We propose sublevelset persistent homology as an appropriate tool for this purpose. Our computations on the configuration phase space of n-alkanes from butane to octane allow us to conjecture, and then prove, a complete characterization of the sublevelset persistent homology of the alkane CmH2m+2 Potential Energy Landscapes (PELs), for all m, in all homological dimensions. We further compare both the analytical configurational PELs and sampled data from molecular dynamics simulation using the united and all-atom descriptions of the intramolecular interactions. In turn, this supports the application of distance metrics to quantify sampling fidelity and lays the foundation for future work regarding new metrics that quantify differences between the topological features of high-dimensional energy landscapes.
Collapse
Affiliation(s)
- Joshua Mirth
- Department of Mathematics, Colorado State University, Fort Collins, Colorado 80524, USA
| | - Yanqin Zhai
- Department of Nuclear, Plasma, and Radiological Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
| | - Johnathan Bush
- Department of Mathematics, Colorado State University, Fort Collins, Colorado 80524, USA
| | - Enrique G Alvarado
- Department of Mathematics and Statistics, Washington State University, Pullman, Washington 99164, USA
| | - Howie Jordan
- Department of Mathematics, University of Colorado, Boulder, Colorado 80309, USA
| | - Mark Heim
- Department of Mathematics, Colorado State University, Fort Collins, Colorado 80524, USA
| | - Bala Krishnamoorthy
- Department of Mathematics and Statistics, Washington State University, Vancouver, Washington 98686, USA
| | - Markus Pflaum
- Department of Mathematics, University of Colorado, Boulder, Colorado 80309, USA
| | - Aurora Clark
- Department of Chemistry, Washington State University, Pullman, Washington 99164, USA
| | - Y Z
- Department of Nuclear, Plasma, and Radiological Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
| | - Henry Adams
- Department of Mathematics, Colorado State University, Fort Collins, Colorado 80524, USA
| |
Collapse
|
2
|
Tan Q, Duan M, Li M, Han L, Huo S. Approximating dynamic proximity with a hybrid geometry energy-based kernel for diffusion maps. J Chem Phys 2019; 151:105101. [PMID: 31521094 DOI: 10.1063/1.5100968] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The diffusion map is a dimensionality reduction method. The reduction coordinates are associated with the leading eigenfunctions of the backward Fokker-Planck operator, providing a dynamic meaning for these coordinates. One of the key factors that affect the accuracy of diffusion map embedding is the dynamic measure implemented in the Gaussian kernel. A common practice in diffusion map study of molecular systems is to approximate dynamic proximity with RMSD (root-mean-square deviation). In this paper, we present a hybrid geometry-energy based kernel. Since high energy-barriers may exist between geometrically similar conformations, taking both RMSD and energy difference into account in the kernel can better describe conformational transitions between neighboring conformations and lead to accurate embedding. We applied our diffusion map method to the β-hairpin of the B1 domain of streptococcal protein G and to Trp-cage. Our results in β-hairpin show that the diffusion map embedding achieves better results with the hybrid kernel than that with the RMSD-based kernel in terms of free energy landscape characterization and a new correlation measure between the cluster center Euclidean distances in the reduced-dimension space and the reciprocals of the total net flow between these clusters. In addition, our diffusion map analysis of the ultralong molecular dynamics trajectory of Trp-cage has provided a unified view of its folding mechanism. These promising results demonstrate the effectiveness of our diffusion map approach in the analysis of the dynamics and thermodynamics of molecular systems. The hybrid geometry-energy criterion could be also useful as a general dynamic measure for other purposes.
Collapse
Affiliation(s)
- Qingzhe Tan
- Gustaf H. Carlson School of Chemistry and Biochemistry, Clark University, 950 Main Street, Worcester, Massachusetts 01610, USA
| | - Mojie Duan
- Gustaf H. Carlson School of Chemistry and Biochemistry, Clark University, 950 Main Street, Worcester, Massachusetts 01610, USA
| | - Minghai Li
- Gustaf H. Carlson School of Chemistry and Biochemistry, Clark University, 950 Main Street, Worcester, Massachusetts 01610, USA
| | - Li Han
- Department of Math and Computer Science, Clark University, Worcester, Massachusetts 01610, USA
| | - Shuanghong Huo
- Gustaf H. Carlson School of Chemistry and Biochemistry, Clark University, 950 Main Street, Worcester, Massachusetts 01610, USA
| |
Collapse
|
3
|
The Observation of Ligand-Binding-Relevant Open States of Fatty Acid Binding Protein by Molecular Dynamics Simulations and a Markov State Model. Int J Mol Sci 2019; 20:ijms20143476. [PMID: 31311155 PMCID: PMC6678811 DOI: 10.3390/ijms20143476] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2019] [Revised: 07/04/2019] [Accepted: 07/10/2019] [Indexed: 12/21/2022] Open
Abstract
As a member of the fatty acids transporter family, the heart fatty acid binding proteins (HFABPs) are responsible for many important biological activities. The binding mechanism of fatty acid with FABP is critical to the understanding of FABP functions. The uncovering of binding-relevant intermediate states and interactions would greatly increase our knowledge of the binding process. In this work, all-atom molecular dynamics (MD) simulations were performed to characterize the structural properties of nativelike intermediate states. Based on multiple 6 μs MD simulations and Markov state model (MSM) analysis, several "open" intermediate states were observed. The transition rates between these states and the native closed state are in good agreement with the experimental measurements, which indicates that these intermediate states are binding relevant. As a common property in the open states, the partially unfolded α2 helix generates a larger portal and provides the driving force to facilitate ligand binding. On the other side, there are two kinds of open states for the ligand-binding HFABP: one has the partially unfolded α2 helix, and the other has the looser β-barrel with disjointing βD-βE strands. Our results provide atomic-level descriptions of the binding-relevant intermediate states and could improve our understanding of the binding mechanism.
Collapse
|
4
|
Liu H, Li M, Fan J, Huo S. Inherent structure versus geometric metric for state space discretization. J Comput Chem 2016; 37:1251-8. [PMID: 26915811 DOI: 10.1002/jcc.24315] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2015] [Revised: 11/17/2015] [Accepted: 01/06/2016] [Indexed: 01/13/2023]
Abstract
Inherent structure (IS) and geometry-based clustering methods are commonly used for analyzing molecular dynamics trajectories. ISs are obtained by minimizing the sampled conformations into local minima on potential/effective energy surface. The conformations that are minimized into the same energy basin belong to one cluster. We investigate the influence of the applications of these two methods of trajectory decomposition on our understanding of the thermodynamics and kinetics of alanine tetrapeptide. We find that at the microcluster level, the IS approach and root-mean-square deviation (RMSD)-based clustering method give totally different results. Depending on the local features of energy landscape, the conformations with close RMSDs can be minimized into different minima, while the conformations with large RMSDs could be minimized into the same basin. However, the relaxation timescales calculated based on the transition matrices built from the microclusters are similar. The discrepancy at the microcluster level leads to different macroclusters. Although the dynamic models established through both clustering methods are validated approximately Markovian, the IS approach seems to give a meaningful state space discretization at the macrocluster level in terms of conformational features and kinetics.
Collapse
Affiliation(s)
- Hanzhong Liu
- Gustaf H. Carlson School of Chemistry and Biochemistry, Clark University, 950 Main Street, Worcester, Massachusetts, 01610
| | - Minghai Li
- Gustaf H. Carlson School of Chemistry and Biochemistry, Clark University, 950 Main Street, Worcester, Massachusetts, 01610
| | - Jue Fan
- Gustaf H. Carlson School of Chemistry and Biochemistry, Clark University, 950 Main Street, Worcester, Massachusetts, 01610
| | - Shuanghong Huo
- Gustaf H. Carlson School of Chemistry and Biochemistry, Clark University, 950 Main Street, Worcester, Massachusetts, 01610
| |
Collapse
|
5
|
Banerjee R, Yan H, Cukier RI. Conformational Transition in Signal Transduction: Metastable States and Transition Pathways in the Activation of a Signaling Protein. J Phys Chem B 2015; 119:6591-602. [DOI: 10.1021/acs.jpcb.5b02582] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Affiliation(s)
- Rahul Banerjee
- Department of Chemistry and ‡Department of
Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan 48824, United States
| | - Honggao Yan
- Department of Chemistry and ‡Department of
Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan 48824, United States
| | - Robert I. Cukier
- Department of Chemistry and ‡Department of
Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan 48824, United States
| |
Collapse
|
6
|
Yu TQ, Chen PY, Chen M, Samanta A, Vanden-Eijnden E, Tuckerman M. Order-parameter-aided temperature-accelerated sampling for the exploration of crystal polymorphism and solid-liquid phase transitions. J Chem Phys 2015; 140:214109. [PMID: 24907992 DOI: 10.1063/1.4878665] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
The problem of predicting polymorphism in atomic and molecular crystals constitutes a significant challenge both experimentally and theoretically. From the theoretical viewpoint, polymorphism prediction falls into the general class of problems characterized by an underlying rough energy landscape, and consequently, free energy based enhanced sampling approaches can be brought to bear on the problem. In this paper, we build on a scheme previously introduced by two of the authors in which the lengths and angles of the supercell are targeted for enhanced sampling via temperature accelerated adiabatic free energy dynamics [T. Q. Yu and M. E. Tuckerman, Phys. Rev. Lett. 107, 015701 (2011)]. Here, that framework is expanded to include general order parameters that distinguish different crystalline arrangements as target collective variables for enhanced sampling. The resulting free energy surface, being of quite high dimension, is nontrivial to reconstruct, and we discuss one particular strategy for performing the free energy analysis. The method is applied to the study of polymorphism in xenon crystals at high pressure and temperature using the Steinhardt order parameters without and with the supercell included in the set of collective variables. The expected fcc and bcc structures are obtained, and when the supercell parameters are included as collective variables, we also find several new structures, including fcc states with hcp stacking faults. We also apply the new method to the solid-liquid phase transition in copper at 1300 K using the same Steinhardt order parameters. Our method is able to melt and refreeze the system repeatedly, and the free energy profile can be obtained with high efficiency.
Collapse
Affiliation(s)
- Tang-Qing Yu
- Courant Institute of Mathematical Sciences, New York University, New York, New York 10012, USA
| | - Pei-Yang Chen
- Department of Chemistry, New York University, New York, New York 10003, USA
| | - Ming Chen
- Department of Chemistry, New York University, New York, New York 10003, USA
| | - Amit Samanta
- Program in Applied and Computational Mathematics, Princeton University, Princeton, New Jersey 08544, USA and Lawrence Livermore National Laboratory, Livermore, California 94550, USA
| | - Eric Vanden-Eijnden
- Courant Institute of Mathematical Sciences, New York University, New York, New York 10012, USA
| | - Mark Tuckerman
- Courant Institute of Mathematical Sciences, New York University, New York, New York 10012, USA
| |
Collapse
|
7
|
Duan M, Li M, Han L, Huo S. Euclidean sections of protein conformation space and their implications in dimensionality reduction. Proteins 2014; 82:2585-96. [PMID: 24913095 DOI: 10.1002/prot.24622] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2014] [Revised: 05/06/2014] [Accepted: 05/30/2014] [Indexed: 01/05/2023]
Abstract
Dimensionality reduction is widely used in searching for the intrinsic reaction coordinates for protein conformational changes. We find the dimensionality-reduction methods using the pairwise root-mean-square deviation (RMSD) as the local distance metric face a challenge. We use Isomap as an example to illustrate the problem. We believe that there is an implied assumption for the dimensionality-reduction approaches that aim to preserve the geometric relations between the objects: both the original space and the reduced space have the same kind of geometry, such as Euclidean geometry vs. Euclidean geometry or spherical geometry vs. spherical geometry. When the protein free energy landscape is mapped onto a 2D plane or 3D space, the reduced space is Euclidean, thus the original space should also be Euclidean. For a protein with N atoms, its conformation space is a subset of the 3N-dimensional Euclidean space R(3N). We formally define the protein conformation space as the quotient space of R(3N) by the equivalence relation of rigid motions. Whether the quotient space is Euclidean or not depends on how it is parameterized. When the pairwise RMSD is employed as the local distance metric, implicit representations are used for the protein conformation space, leading to no direct correspondence to a Euclidean set. We have demonstrated that an explicit Euclidean-based representation of protein conformation space and the local distance metric associated to it improve the quality of dimensionality reduction in the tetra-peptide and β-hairpin systems.
Collapse
Affiliation(s)
- Mojie Duan
- Gustaf H. Carlson School of Chemistry and Biochemistry, Clark University, Worcester, Massachusetts, 01610
| | | | | | | |
Collapse
|