1
Wang D, Qiu Y, Beyerle ER, Huang X, Tiwary P. Information Bottleneck Approach for Markov Model Construction. J Chem Theory Comput 2024; 20:5352-5367. [PMID: 38859575 PMCID: PMC11199095 DOI: 10.1021/acs.jctc.4c00449] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/12/2024]
Abstract
Markov state models (MSMs) have proven valuable in studying the dynamics of protein conformational changes via statistical analysis of molecular dynamics simulations. In MSMs, the complex configuration space is coarse-grained into conformational states, with dynamics modeled by a series of Markovian transitions among these states at discrete lag times. Constructing the Markovian model at a specific lag time necessitates defining states that circumvent significant internal energy barriers, enabling internal dynamics relaxation within the lag time. This process effectively coarse-grains time and space, integrating out rapid motions within metastable states. Thus, MSMs possess a multiresolution nature, where the granularity of states can be adjusted according to the time-resolution, offering flexibility in capturing system dynamics. This work introduces a continuous embedding approach for molecular conformations using the state predictive information bottleneck (SPIB), a framework that unifies dimensionality reduction and state space partitioning via a continuous, machine learned basis set. Without explicit optimization of the VAMP-based scores, SPIB demonstrates state-of-the-art performance in identifying slow dynamical processes and constructing predictive multiresolution Markovian models. Through applications to well-validated mini-proteins, SPIB showcases unique advantages compared to competing methods. It autonomously and self-consistently adjusts the number of metastable states based on a specified minimal time resolution, eliminating the need for manual tuning. While maintaining efficacy in dynamical properties, SPIB excels in accurately distinguishing metastable states and capturing numerous well-populated macrostates. This contrasts with existing VAMP-based methods, which often emphasize slow dynamics at the expense of incorporating numerous sparsely populated states. 
Furthermore, SPIB's ability to learn a low-dimensional continuous embedding of the underlying MSMs enhances the interpretation of dynamic pathways. With these benefits, we propose SPIB as an easy-to-implement methodology for end-to-end MSM construction.
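The lag-time-dependent Markovian model described in this abstract can be illustrated with a minimal sketch. The Python code below is not SPIB and not the authors' implementation. It assumes a trajectory that has already been assigned to discrete states, and it estimates a transition matrix and the implied timescales at a chosen lag time. The toy two-state chain, the function names, and the symmetrized-count estimator are illustrative assumptions.

```python
import numpy as np

def count_matrix(dtraj, n_states, lag):
    """Count transitions i -> j separated by `lag` frames (sliding window)."""
    counts = np.zeros((n_states, n_states))
    np.add.at(counts, (dtraj[:-lag], dtraj[lag:]), 1.0)
    return counts

def transition_matrix(counts):
    """Row-normalize symmetrized counts: a simple reversible estimate.
    Assumes every state is visited at least once."""
    sym = counts + counts.T
    return sym / sym.sum(axis=1, keepdims=True)

def implied_timescales(T, lag):
    """t_i = -lag / ln(lambda_i) for eigenvalues strictly between 0 and 1."""
    eigvals = np.sort(np.real(np.linalg.eigvals(T)))[::-1]
    slow = eigvals[1:]                      # drop the stationary eigenvalue
    slow = slow[(slow > 0.0) & (slow < 1.0)]
    return -lag / np.log(slow)

# Toy data (assumption): a two-state chain that stays put with probability 0.95.
rng = np.random.default_rng(0)
n_frames = 20000
stay = rng.random(n_frames) < 0.95
dtraj = np.zeros(n_frames, dtype=int)
for t in range(1, n_frames):
    dtraj[t] = dtraj[t - 1] if stay[t] else 1 - dtraj[t - 1]

lag = 1
T = transition_matrix(count_matrix(dtraj, n_states=2, lag=lag))
its = implied_timescales(T, lag)
print(T)
print(its)  # the exact chain has a timescale of -1/ln(0.9), about 9.5 frames
```

Repeating the estimate for increasing `lag` and checking where the implied timescales level off is a common way to pick a lag time. A coarser state definition usually needs a longer lag, which is the multiresolution trade-off the abstract refers to.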
Affiliation(s)
- Dedi Wang
  - Biophysics Program and Institute for Physical Science and Technology, University of Maryland, College Park, MD 20742, United States
- Yunrui Qiu
  - Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, WI 53706, United States
  - Data Science Institute, University of Wisconsin-Madison, Madison, WI 53706, United States
- Eric R. Beyerle
  - Institute for Physical Science and Technology, University of Maryland, College Park, MD 20742, United States
- Xuhui Huang
  - Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, WI 53706, United States
  - Data Science Institute, University of Wisconsin-Madison, Madison, WI 53706, United States
- Pratyush Tiwary
  - Department of Chemistry and Biochemistry and Institute for Physical Science and Technology, University of Maryland, College Park, MD 20742, United States
  - University of Maryland Institute for Health Computing, Bethesda, MD 20852, United States
2
Weigle AT, Shukla D. The Arabidopsis AtSWEET13 transporter discriminates sugars by selective facial and positional substrate recognition. Commun Biol 2024; 7:764. [PMID: 38914639 PMCID: PMC11196581 DOI: 10.1038/s42003-024-06291-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/26/2022] [Accepted: 05/03/2024] [Indexed: 06/26/2024] Open
Abstract
Transporters are targeted by endogenous metabolites and exogenous molecules to reach cellular destinations, but it is generally not understood how different substrate classes exploit the same transporter's mechanism. Any disclosure of plasticity in transporter mechanism when treated with different substrates becomes critical for developing general selectivity principles in membrane transport catalysis. Using extensive molecular dynamics simulations with an enhanced sampling approach, we select the Arabidopsis sugar transporter AtSWEET13 as a model system to identify the basis for glucose versus sucrose molecular recognition and transport. Here we find that AtSWEET13 chemical selectivity originates from a conserved substrate facial selectivity demonstrated when committing alternate access, despite mono-/di-saccharides experiencing differing degrees of conformational and positional freedom throughout other stages of transport. However, substrate interactions with structural hallmarks associated with known functional annotations can help reinforce selective preferences in molecular transport.
Affiliation(s)
- Austin T Weigle
  - Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA
- Diwakar Shukla
  - Department of Chemical & Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA
  - Department of Plant Biology, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA
  - Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA
  - Center for Biophysics and Computational Biology, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA
3
Ghosh D, Biswas A, Radhakrishna M. Advanced computational approaches to understand protein aggregation. Biophysics Reviews 2024; 5:021302. [PMID: 38681860 PMCID: PMC11045254 DOI: 10.1063/5.0180691] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/11/2023] [Accepted: 03/18/2024] [Indexed: 05/01/2024]
Abstract
Protein aggregation is a widespread phenomenon implicated in debilitating diseases like Alzheimer's, Parkinson's, and cataracts, presenting complex hurdles for the field of molecular biology. In this review, we explore the evolving realm of computational methods and bioinformatics tools that have revolutionized our comprehension of protein aggregation. Beginning with a discussion of the multifaceted challenges associated with understanding this process and emphasizing the critical need for precise predictive tools, we highlight how computational techniques have become indispensable for understanding protein aggregation. We focus on molecular simulations, notably molecular dynamics (MD) simulations, spanning from atomistic to coarse-grained levels, which have emerged as pivotal tools in unraveling the complex dynamics governing protein aggregation in diseases such as cataracts, Alzheimer's, and Parkinson's. MD simulations provide microscopic insights into protein interactions and the subtleties of aggregation pathways, with advanced techniques like replica exchange molecular dynamics, Metadynamics (MetaD), and umbrella sampling enhancing our understanding by probing intricate energy landscapes and transition states. We delve into specific applications of MD simulations, elucidating the chaperone mechanism underlying cataract formation using Markov state modeling and the intricate pathways and interactions driving the toxic aggregate formation in Alzheimer's and Parkinson's disease. Transitioning we highlight how computational techniques, including bioinformatics, sequence analysis, structural data, machine learning algorithms, and artificial intelligence have become indispensable for predicting protein aggregation propensity and locating aggregation-prone regions within protein sequences. 
Throughout our exploration, we underscore the symbiotic relationship between computational approaches and empirical data, which has paved the way for potential therapeutic strategies against protein aggregation-related diseases. In conclusion, this review offers a comprehensive overview of advanced computational methodologies and bioinformatics tools that have catalyzed breakthroughs in unraveling the molecular basis of protein aggregation, with significant implications for clinical interventions, standing at the intersection of computational biology and experimental research.
Affiliation(s)
- Deepshikha Ghosh
  - Department of Biological Sciences and Engineering, Indian Institute of Technology (IIT) Gandhinagar, Palaj, Gujarat 382355, India
- Anushka Biswas
  - Department of Chemical Engineering, Indian Institute of Technology (IIT) Gandhinagar, Palaj, Gujarat 382355, India
4
Dutta S, Shukla D. Characterization of binding kinetics and intracellular signaling of new psychoactive substances targeting cannabinoid receptor using transition-based reweighting method. bioRxiv 2024:2023.09.29.560261. [PMID: 37873328 PMCID: PMC10592854 DOI: 10.1101/2023.09.29.560261] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/25/2023]
Abstract
New psychoactive substances (NPS) targeting cannabinoid receptor 1 pose a significant threat to society as recreational abusive drugs that have pronounced physiological side effects. These greater adverse effects compared to classical cannabinoids have been linked to the higher downstream β-arrestin signaling. Thus, understanding the mechanism of differential signaling will reveal important structure-activity relationships essential for identifying and potentially regulating NPS molecules. In this study, we simulate the slow (un)binding process of NPS MDMB-Fubinaca and classical cannabinoid HU-210 from CB1 using multi-ensemble simulation to decipher the effects of ligand binding dynamics on downstream signaling. The transition-based reweighting method is used for the estimation of transition rates and underlying thermodynamics of (un)binding processes of ligands with nanomolar affinities. Our analyses reveal major interaction differences with transmembrane TM7 between NPS and classical cannabinoids. A variational autoencoder-based approach, neural relational inference (NRI), is applied to assess the allosteric effects on intracellular regions attributable to variations in binding pocket interactions. NRI analysis indicates a heightened level of allosteric control of NPxxY motif for NPS-bound receptors, which contributes to the higher probability of formation of a crucial triad interaction (Y7.53-Y5.58-T3.46) necessary for stronger β-arrestin signaling. Hence, in this work, MD simulation, data-driven statistical methods, and deep learning point out the structural basis for the heightened physiological side effects associated with NPS, contributing to efforts aimed at mitigating their public health impact.
Affiliation(s)
- Soumajit Dutta
  - Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, IL, 61801
- Diwakar Shukla
  - Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, IL, 61801
  - Center for Biophysics and Quantitative Biology, University of Illinois at Urbana-Champaign, Urbana, IL, 61801
  - Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, IL, 61801
  - Cancer Center at Illinois, University of Illinois at Urbana-Champaign, Urbana, IL, 61801
5
Wu Y, Cao S, Qiu Y, Huang X. Tutorial on how to build non-Markovian dynamic models from molecular dynamics simulations for studying protein conformational changes. J Chem Phys 2024; 160:121501. [PMID: 38516972 DOI: 10.1063/5.0189429] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/28/2023] [Accepted: 02/20/2024] [Indexed: 03/23/2024] Open
Abstract
Protein conformational changes play crucial roles in their biological functions. In recent years, the Markov State Model (MSM) constructed from extensive Molecular Dynamics (MD) simulations has emerged as a powerful tool for modeling complex protein conformational changes. In MSMs, dynamics are modeled as a sequence of Markovian transitions among metastable conformational states at discrete time intervals (called lag time). A major challenge for MSMs is that the lag time must be long enough to allow transitions among states to become memoryless (or Markovian). However, this lag time is constrained by the length of individual MD simulations available to track these transitions. To address this challenge, we have recently developed Generalized Master Equation (GME)-based approaches, encoding non-Markovian dynamics using a time-dependent memory kernel. In this Tutorial, we introduce the theory behind two recently developed GME-based non-Markovian dynamic models: the quasi-Markov State Model (qMSM) and the Integrative Generalized Master Equation (IGME). We subsequently outline the procedures for constructing these models and provide a step-by-step tutorial on applying qMSM and IGME to study two peptide systems: alanine dipeptide and villin headpiece. This Tutorial is available at https://github.com/xuhuihuang/GME_tutorials. The protocols detailed in this Tutorial aim to be accessible for non-experts interested in studying the biomolecular dynamics using these non-Markovian dynamic models.
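The requirement that the lag time be long enough for transitions to become memoryless is commonly checked with a Chapman-Kolmogorov test, which compares T(lag)^k with T(k*lag) estimated directly from the data. The Python sketch below illustrates that check only. It is not the qMSM or IGME procedure from the tutorial, and the four-state toy chain, the lumping into two states, and all function names are assumptions made for illustration.

```python
import numpy as np

def simulate_chain(P, n_steps, rng):
    """Sample a discrete trajectory from a row-stochastic matrix P."""
    cum = np.cumsum(P, axis=1)
    cum[:, -1] = 1.0  # guard against round-off in the last column
    u = rng.random(n_steps)
    traj = np.zeros(n_steps, dtype=int)
    for t in range(1, n_steps):
        traj[t] = np.searchsorted(cum[traj[t - 1]], u[t], side="right")
    return traj

def estimate_T(dtraj, n_states, lag):
    """Row-normalized sliding-window transition counts at the given lag."""
    counts = np.zeros((n_states, n_states))
    np.add.at(counts, (dtraj[:-lag], dtraj[lag:]), 1.0)
    return counts / counts.sum(axis=1, keepdims=True)

def ck_error(dtraj, n_states, lag, k):
    """Largest deviation between T(lag)^k and T(k*lag) estimated directly."""
    predicted = np.linalg.matrix_power(estimate_T(dtraj, n_states, lag), k)
    observed = estimate_T(dtraj, n_states, k * lag)
    return np.abs(predicted - observed).max()

# Toy model (assumption): a linear four-state chain with slow exchange
# inside each half (0 <-> 1 and 2 <-> 3) and faster hops across the middle.
P = np.array([
    [0.98, 0.02, 0.00, 0.00],
    [0.02, 0.88, 0.10, 0.00],
    [0.00, 0.10, 0.88, 0.02],
    [0.00, 0.00, 0.02, 0.98],
])
rng = np.random.default_rng(1)
micro = simulate_chain(P, 300_000, rng)
lumped = micro // 2  # states {0, 1} -> 0 and {2, 3} -> 1

err_micro = ck_error(micro, 4, lag=1, k=10)    # Markovian by construction
err_lumped = ck_error(lumped, 2, lag=1, k=10)  # memory left at this lag
print(err_micro, err_lumped)
```

The four-state trajectory is Markovian at lag 1, so its error reflects sampling noise only. The lumped trajectory hides the slow exchange inside each lumped state, so at a short lag it carries memory and fails the test more visibly. This is the situation that motivates either a longer lag time or an explicit memory kernel.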
Affiliation(s)
- Yue Wu
  - Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
- Siqin Cao
  - Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
- Yunrui Qiu
  - Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
- Xuhui Huang
  - Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
  - Data Science Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
6
Dorbath E, Gulzar A, Stock G. Log-periodic oscillations as real-time signatures of hierarchical dynamics in proteins. J Chem Phys 2024; 160:074103. [PMID: 38364004 DOI: 10.1063/5.0188220] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/20/2023] [Accepted: 01/23/2024] [Indexed: 02/18/2024] Open
Abstract
The time-dependent relaxation of a dynamical system may exhibit a power-law behavior that is superimposed by log-periodic oscillations. D. Sornette [Phys. Rep. 297, 239 (1998)] showed that this behavior can be explained by a discrete scale invariance of the system, which is associated with discrete and equidistant timescales on a logarithmic scale. Examples include such diverse fields as financial crashes, random diffusion, and quantum topological materials. Recent time-resolved experiments and molecular dynamics simulations suggest that discrete scale invariance may also apply to hierarchical dynamics in proteins, where several fast local conformational changes are a prerequisite for a slow global transition to occur. Employing entropy-based timescale analysis and Markov state modeling to a simple one-dimensional hierarchical model and biomolecular simulation data, it is found that hierarchical systems quite generally give rise to logarithmically spaced discrete timescales. By introducing a one-dimensional reaction coordinate that collectively accounts for the hierarchically coupled degrees of freedom, the free energy landscape exhibits a characteristic staircase shape with two metastable end states, which causes the log-periodic time evolution of the system. The period of the log-oscillations reflects the effective roughness of the energy landscape and can, in simple cases, be interpreted in terms of the barriers of the staircase landscape.
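The link between discrete scale invariance, power-law relaxation, and log-periodic oscillations can be written out in a few lines. The block below is the standard textbook form of the argument, not a formula quoted from the paper. The symbols f, λ, μ, α, G, a_0, a_1, φ, and τ_n are notation assumed here for illustration.

```latex
\begin{align}
  % Discrete scale invariance: self-similarity holds only for one scaling factor \lambda
  f(\lambda t) &= \mu\, f(t) \\
  % General solution: a power law multiplied by a function periodic in \ln t
  f(t) &= t^{\alpha}\, G\!\left(\frac{\ln t}{\ln \lambda}\right),
  \qquad \alpha = \frac{\ln \mu}{\ln \lambda},
  \qquad G(x + 1) = G(x) \\
  % Keeping only the leading Fourier term of G gives the log-periodic correction
  f(t) &\approx t^{\alpha} \left[ a_0 + a_1 \cos\!\left( 2\pi \frac{\ln t}{\ln \lambda} + \varphi \right) \right] \\
  % The associated timescales are equidistant on a logarithmic axis
  \tau_n &= \tau_0\, \lambda^{n}
  \quad\Longrightarrow\quad
  \ln \tau_{n+1} - \ln \tau_n = \ln \lambda
\end{align}
```

Substituting the second line into the first confirms the construction: the factor (λt)^α contributes λ^α = μ, and shifting the argument of G by one period leaves it unchanged. For a decaying relaxation signal α is negative, and ln λ sets the period of the oscillations in ln t.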
Affiliation(s)
- Emanuel Dorbath
  - Biomolecular Dynamics, Institute of Physics, University of Freiburg, 79104 Freiburg, Germany
- Adnan Gulzar
  - Biomolecular Dynamics, Institute of Physics, University of Freiburg, 79104 Freiburg, Germany
- Gerhard Stock
  - Biomolecular Dynamics, Institute of Physics, University of Freiburg, 79104 Freiburg, Germany
7
Colberg M, Schofield J. Diffusive dynamics of a model protein chain in solution. J Chem Phys 2024; 160:075101. [PMID: 38375905 DOI: 10.1063/5.0182607] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/20/2023] [Accepted: 01/24/2024] [Indexed: 02/21/2024] Open
Abstract
A Markov state model is a powerful tool that can be used to track the evolution of populations of configurations in an atomistic representation of a protein. For a coarse-grained linear chain model with discontinuous interactions, the transition rates among states that appear in the Markov model when the monomer dynamics is diffusive can be determined by computing the relative entropy of states and their mean first passage times, quantities that are unchanged by the specification of the energies of the relevant states. In this paper, we verify the folding dynamics described by a diffusive linear chain model of the crambin protein in three distinct solvent systems, each differing in complexity: a hard-sphere solvent, a solvent undergoing multi-particle collision dynamics, and an implicit solvent model. The predicted transition rates among configurations agree quantitatively with those observed in explicit molecular dynamics simulations for all three solvent models. These results suggest that the local monomer-monomer interactions provide sufficient friction for the monomer dynamics to be diffusive on timescales relevant to changes in conformation. Factors such as structural ordering and dynamic hydrodynamic effects appear to have minimal influence on transition rates within the studied solvent densities.
Affiliation(s)
- Margarita Colberg
  - Chemical Physics Theory Group, Department of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
- Jeremy Schofield
  - Chemical Physics Theory Group, Department of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
8
Wakabayashi T, Oide M, Kato T, Nakasako M. Coenzyme-binding pathway on glutamate dehydrogenase suggested from multiple-binding sites visualized by cryo-electron microscopy. FEBS J 2023; 290:5514-5535. [PMID: 37682540 DOI: 10.1111/febs.16951] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/24/2022] [Revised: 08/10/2023] [Accepted: 09/05/2023] [Indexed: 09/09/2023]
Abstract
The structure of hexameric glutamate dehydrogenase (GDH) in the presence of the coenzyme nicotinamide adenine dinucleotide phosphate (NADP) was visualized using cryogenic transmission electron microscopy to investigate the ligand-binding pathways to the active site of the enzyme. Each subunit of GDH comprises one hexamer-forming core domain and one nucleotide-binding domain (NAD domain), which spontaneously opens and closes the active-site cleft situated between the two domains. In the presence of NADP, the potential map of GDH hexamer, assuming D3 symmetry, was determined at a resolution of 2.4 Å, but the NAD domain was blurred due to the conformational variety. After focused classification with respect to the NAD domain, the potential maps interpreted as NADP molecules appeared at five different sites in the active-site cleft. The subunits associated with NADP molecules were close to one of the four metastable conformations in the unliganded state. Three of the five binding sites suggested a pathway of NADP molecules to approach the active-site cleft for initiating the enzymatic reaction. The other two binding modes may rarely appear in the presence of glutamate, as demonstrated by the reaction kinetics. Based on the visualized structures and the results from the enzymatic kinetics, we discussed the binding modes of NADP to GDH in the absence and presence of glutamate.
Grants
- JPMJPR22E2 Japan Science and Technology Agency
- 18J11653 Japan Society for the Promotion of Science
- jp13480214 Japan Society for the Promotion of Science
- jp19204042 Japan Society for the Promotion of Science
- jp21H01050 Japan Society for the Promotion of Science
- jp22244054 Japan Society for the Promotion of Science
- jp26800227 Japan Society for the Promotion of Science
- jp15076210 Ministry of Education, Culture, Sports, Science and Technology
- jp15H01647 Ministry of Education, Culture, Sports, Science and Technology
- jp17H05891 Ministry of Education, Culture, Sports, Science and Technology
- jp20050030 Ministry of Education, Culture, Sports, Science and Technology
- jp22018027 Ministry of Education, Culture, Sports, Science and Technology
- jp23120525 Ministry of Education, Culture, Sports, Science and Technology
- jp25120725 Ministry of Education, Culture, Sports, Science and Technology
- 0436 Japan Agency for Medical Research and Development
Affiliation(s)
- Taiki Wakabayashi
  - Department of Physics, Faculty of Science and Technology, Keio University, Yokohama, Japan
  - RIKEN SPring-8 Center, Sayo-gun, Hyogo, Japan
  - RIKEN Cluster for Pioneering Research, Wako, Japan
- Mao Oide
  - Department of Physics, Faculty of Science and Technology, Keio University, Yokohama, Japan
  - RIKEN SPring-8 Center, Sayo-gun, Hyogo, Japan
  - RIKEN Cluster for Pioneering Research, Wako, Japan
  - PRESTO, Japan Science and Technology Agency, Tokyo, Japan
- Takayuki Kato
  - Protein Research Institute, Osaka University, Suita, Japan
- Masayoshi Nakasako
  - Department of Physics, Faculty of Science and Technology, Keio University, Yokohama, Japan
  - RIKEN SPring-8 Center, Sayo-gun, Hyogo, Japan
9
Cao S, Qiu Y, Kalin ML, Huang X. Integrative generalized master equation: A method to study long-timescale biomolecular dynamics via the integrals of memory kernels. J Chem Phys 2023; 159:134106. [PMID: 37787134 PMCID: PMC11005468 DOI: 10.1063/5.0167287] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Received: 07/11/2023] [Accepted: 09/18/2023] [Indexed: 10/04/2023] Open
Abstract
The generalized master equation (GME) provides a powerful approach to study biomolecular dynamics via non-Markovian dynamic models built from molecular dynamics (MD) simulations. Previously, we have implemented the GME, namely the quasi Markov State Model (qMSM), where we explicitly calculate the memory kernel and propagate dynamics using a discretized GME. qMSM can be constructed with much shorter MD trajectories than the MSM. However, since qMSM needs to explicitly compute the time-dependent memory kernels, it is heavily affected by the numerical fluctuations of simulation data when applied to study biomolecular conformational changes. This can lead to numerical instability of predicted long-time dynamics, greatly limiting the applicability of qMSM in complicated biomolecules. We present a new method, the Integrative GME (IGME), in which we analytically solve the GME under the condition when the memory kernels have decayed to zero. Our IGME overcomes the challenges of the qMSM by using the time integrations of memory kernels, thereby avoiding the numerical instability caused by explicit computation of time-dependent memory kernels. Using our solutions of the GME, we have developed a new approach to compute long-time dynamics based on MD simulations in a numerically stable, accurate and efficient way. To demonstrate its effectiveness, we have applied the IGME in three biomolecules: the alanine dipeptide, FIP35 WW-domain, and Taq RNA polymerase. In each system, the IGME achieves significantly smaller fluctuations for both memory kernels and long-time dynamics compared to the qMSM. We anticipate that the IGME can be widely applied to investigate biomolecular conformational changes.
Affiliation(s)
- Siqin Cao
  - Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
- Yunrui Qiu
  - Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
- Michael L. Kalin
  - Biophysics Graduate Program, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
- Xuhui Huang
  - Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
10
Mostofian B, Martin HJ, Razavi A, Patel S, Allen B, Sherman W, Izaguirre JA. Targeted Protein Degradation: Advances, Challenges, and Prospects for Computational Methods. J Chem Inf Model 2023; 63:5408-5432. [PMID: 37602861 PMCID: PMC10498452 DOI: 10.1021/acs.jcim.3c00603] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/27/2023] [Indexed: 08/22/2023]
Abstract
The therapeutic approach of targeted protein degradation (TPD) is gaining momentum due to its potentially superior effects compared with protein inhibition. Recent advancements in the biotech and pharmaceutical sectors have led to the development of compounds that are currently in human trials, with some showing promising clinical results. However, the use of computational tools in TPD is still limited, as it has distinct characteristics compared with traditional computational drug design methods. TPD involves creating a ternary structure (protein-degrader-ligase) responsible for the biological function, such as ubiquitination and subsequent proteasomal degradation, which depends on the spatial orientation of the protein of interest (POI) relative to E2-loaded ubiquitin. Modeling this structure necessitates a unique blend of tools initially developed for small molecules (e.g., docking) and biologics (e.g., protein-protein interaction modeling). Additionally, degrader molecules, particularly heterobifunctional degraders, are generally larger than conventional small molecule drugs, leading to challenges in determining drug-like properties like solubility and permeability. Furthermore, the catalytic nature of TPD makes occupancy-based modeling insufficient. TPD consists of multiple interconnected yet distinct steps, such as POI binding, E3 ligase binding, ternary structure interactions, ubiquitination, and degradation, along with traditional small molecule properties. A comprehensive set of tools is needed to address the dynamic nature of the induced proximity ternary complex and its implications for ubiquitination. In this Perspective, we discuss the current state of computational tools for TPD. We start by describing the series of steps involved in the degradation process and the experimental methods used to characterize them. Then, we delve into a detailed analysis of the computational tools employed in TPD. 
We also present an integrative approach that has proven successful for degrader design and its impact on project decisions. Finally, we examine the future prospects of computational methods in TPD and the areas with the greatest potential for impact.
Affiliation(s)
- Barmak Mostofian
  - OpenEye, Cadence Molecular Sciences, Boston, Massachusetts 02114 United States
- Holli-Joi Martin
  - Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599 United States
- Asghar Razavi
  - ENKO Chem, Inc, Mystic, Connecticut 06355 United States
- Shivam Patel
  - Psivant Therapeutics, Boston, Massachusetts 02210 United States
- Bryce Allen
  - Differentiated Therapeutics, San Diego, California 92056 United States
- Woody Sherman
  - Psivant Therapeutics, Boston, Massachusetts 02210 United States
- Jesus A Izaguirre
  - Differentiated Therapeutics, San Diego, California 92056 United States
  - Atommap Corporation, New York, New York 10013 United States
11
|
Nagel D, Sartore S, Stock G. Toward a Benchmark for Markov State Models: The Folding of HP35. J Phys Chem Lett 2023; 14:6956-6967. [PMID: 37504674 DOI: 10.1021/acs.jpclett.3c01561] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Indexed: 07/29/2023]
Abstract
Adopting a 300 μs long MD trajectory of the folding of villin headpiece (HP35) by D. E. Shaw Research, we recently constructed a Markov state model (MSM) based on inter-residue contacts. The model reproduces the folding time and predicts that the native basin and unfolded region consist of metastable substates that are structurally well-characterized. Recognizing the need to establish well-defined benchmark problems, we study to what extent and in what sense this MSM can be employed as a reference model. Hence, we test the robustness of the MSM by comparing it to models that use alternative combinations of features, dimensionality reduction methods, and clustering schemes. The study suggests some main characteristics of the folding of HP35 that should be reproduced by other competitive models. Moreover, the discussion reveals which parts of the MSM workflow matter most for the considered problem and illustrates the promises and pitfalls of state-based models for the interpretation of biomolecular simulations.
Affiliation(s)
- Daniel Nagel
  - Biomolecular Dynamics, Institute of Physics, University of Freiburg, 79104 Freiburg, Germany
- Sofia Sartore
  - Biomolecular Dynamics, Institute of Physics, University of Freiburg, 79104 Freiburg, Germany
- Gerhard Stock
  - Biomolecular Dynamics, Institute of Physics, University of Freiburg, 79104 Freiburg, Germany
12
Qiu Y, O’Connor MS, Xue M, Liu B, Huang X. An Efficient Path Classification Algorithm Based on Variational Autoencoder to Identify Metastable Path Channels for Complex Conformational Changes. J Chem Theory Comput 2023; 19:4728-4742. [PMID: 37382437 PMCID: PMC11042546 DOI: 10.1021/acs.jctc.3c00318] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Indexed: 06/30/2023]
Abstract
Conformational changes (i.e., dynamic transitions between pairs of conformational states) play important roles in many chemical and biological processes. Constructing the Markov state model (MSM) from extensive molecular dynamics (MD) simulations is an effective approach to dissect the mechanism of conformational changes. When combined with transition path theory (TPT), MSM can be applied to elucidate the ensemble of kinetic pathways connecting pairs of conformational states. However, the application of TPT to analyze complex conformational changes often results in a vast number of kinetic pathways with comparable fluxes. This obstacle is particularly pronounced in heterogeneous self-assembly and aggregation processes. The large number of kinetic pathways makes it challenging to comprehend the molecular mechanisms underlying conformational changes of interest. To address this challenge, we have developed a path classification algorithm named latent-space path clustering (LPC) that efficiently lumps parallel kinetic pathways into distinct metastable path channels, making them easier to comprehend. In our algorithm, MD conformations are first projected onto a low-dimensional space containing a small set of collective variables (CVs) by time-structure-based independent component analysis (tICA) with kinetic mapping. Then, MSM and TPT are constructed to obtain the ensemble of pathways, and a deep learning architecture named the variational autoencoder (VAE) is used to learn the spatial distributions of kinetic pathways in the continuous CV space. Based on the trained VAE model, the TPT-generated ensemble of kinetic pathways can be embedded into a latent space, where the classification becomes clear. We show that LPC can efficiently and accurately identify the metastable path channels in three systems: a 2D potential, the aggregation of two hydrophobic particles in water, and the folding of the Fip35 WW domain. 
Using the 2D potential, we further demonstrate that our LPC algorithm outperforms the previous path-lumping algorithms by making substantially fewer incorrect assignments of individual pathways to four path channels. We expect that LPC can be widely applied to identify the dominant kinetic pathways underlying complex conformational changes.
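The MSM-plus-TPT step named in this abstract can be illustrated with a minimal NumPy sketch. It is not the authors' LPC code: the function names are hypothetical, the transition matrix is a toy example, and the backward committor is taken as 1 - q, which assumes the dynamics obey detailed balance.

```python
import numpy as np

def stationary_distribution(T):
    """Left eigenvector of T for eigenvalue 1, normalized to sum to 1."""
    evals, evecs = np.linalg.eig(T.T)
    pi = np.real(evecs[:, np.argmax(np.real(evals))])
    return pi / pi.sum()

def forward_committor(T, source, sink):
    """Probability of reaching `sink` before `source` from each state."""
    n = T.shape[0]
    q = np.zeros(n)
    q[sink] = 1.0
    inter = [i for i in range(n) if i not in source and i not in sink]
    if inter:
        # Solve (I - T_II) q_I = sum over sink states of T_I,sink
        lhs = np.eye(len(inter)) - T[np.ix_(inter, inter)]
        rhs = T[np.ix_(inter, sink)].sum(axis=1)
        q[inter] = np.linalg.solve(lhs, rhs)
    return q

def net_flux(T, source, sink):
    """Net reactive flux matrix and forward committor.

    Assumption: reversible dynamics, so the backward committor is 1 - q.
    """
    pi = stationary_distribution(T)
    q = forward_committor(T, source, sink)
    flux = pi[:, None] * (1.0 - q)[:, None] * T * q[None, :]
    np.fill_diagonal(flux, 0.0)
    return np.maximum(flux - flux.T, 0.0), q

# Toy three-state chain: state 0 is the source, state 2 the sink.
T = np.array([[0.9, 0.1, 0.0],
              [0.1, 0.8, 0.1],
              [0.0, 0.1, 0.9]])
net, q = net_flux(T, source=[0], sink=[2])
print("committor:", q)
print("total reactive flux:", net[0].sum())
```

Individual pathways would then be extracted by decomposing the net flux matrix; that step, and the VAE embedding of the resulting pathways, is not shown here.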
Affiliation(s)
- Yunrui Qiu
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Michael S. O’Connor
- Biophysics Graduate Program, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Mingyi Xue
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Bojun Liu
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Xuhui Huang
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, WI, 53706, USA
- Biophysics Graduate Program, University of Wisconsin-Madison, Madison, WI, 53706, USA
| |
|
13
|
Nagel D, Sartore S, Stock G. Selecting Features for Markov Modeling: A Case Study on HP35. J Chem Theory Comput 2023. [PMID: 37167425 DOI: 10.1021/acs.jctc.3c00240] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Indexed: 05/13/2023]
Abstract
Markov state models represent a popular means to interpret molecular dynamics trajectories in terms of memoryless transitions between metastable conformational states. To provide a mechanistic understanding of the considered biomolecular process, these states should reflect structurally distinct conformations and ensure a time scale separation between fast intrastate and slow interstate dynamics. Adopting the folding of villin headpiece (HP35) as a well-established model problem, here we discuss the selection of suitable input coordinates or "features", such as backbone dihedral angles and interresidue distances. We show that dihedral angles account accurately for the structure of the native energy basin of HP35, while the unfolded region of the free energy landscape and the folding process are best described by tertiary contacts of the protein. To construct a contact-based model, we consider various ways to define and select contact distances and introduce a low-pass filtering of the feature trajectory as well as a correlation-based characterization of states. Relying on input data that faithfully account for the mechanistic origin of the studied process, the states of the resulting Markov model are clearly discriminated by the features, describe consistently the hierarchical structure of the free energy landscape, and─as a consequence─correctly reproduce the slow time scales of the process.
Affiliation(s)
- Daniel Nagel
- Biomolecular Dynamics, Institute of Physics, University of Freiburg, 79104 Freiburg, Germany
| | - Sofia Sartore
- Biomolecular Dynamics, Institute of Physics, University of Freiburg, 79104 Freiburg, Germany
| | - Gerhard Stock
- Biomolecular Dynamics, Institute of Physics, University of Freiburg, 79104 Freiburg, Germany
| |
|
14
|
Kubař T, Elstner M, Cui Q. Hybrid Quantum Mechanical/Molecular Mechanical Methods For Studying Energy Transduction in Biomolecular Machines. Annu Rev Biophys 2023; 52:525-551. [PMID: 36791746 PMCID: PMC10810093 DOI: 10.1146/annurev-biophys-111622-091140] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 02/17/2023]
Abstract
Hybrid quantum mechanical/molecular mechanical (QM/MM) methods have become indispensable tools for the study of biomolecules. In this article, we briefly review the basic methodological details of QM/MM approaches and discuss their applications to various energy transduction problems in biomolecular machines, such as long-range proton transports, fast electron transfers, and mechanochemical coupling. We highlight the particular importance for these applications of balancing computational efficiency and accuracy. Using several recent examples, we illustrate the value and limitations of QM/MM methodologies for both ground and excited states, as well as strategies for calibrating them in specific applications. We conclude with brief comments on several areas that can benefit from further efforts to make QM/MM analyses more quantitative and applicable to increasingly complex biological problems.
Affiliation(s)
- T Kubař
- Institute of Physical Chemistry, Karlsruhe Institute of Technology, Karlsruhe, Germany
| | - M Elstner
- Institute of Physical Chemistry, Karlsruhe Institute of Technology, Karlsruhe, Germany
- Institute of Biological Interfaces (IBG-2), Karlsruhe Institute of Technology, Karlsruhe, Germany
| | - Q Cui
- Department of Chemistry, Boston University, Boston, Massachusetts, USA
- Department of Physics, Boston University, Boston, Massachusetts, USA
- Department of Biomedical Engineering, Boston University, Boston, Massachusetts, USA
| |
|
15
|
Dominic AJ, Cao S, Montoya-Castillo A, Huang X. Memory Unlocks the Future of Biomolecular Dynamics: Transformative Tools to Uncover Physical Insights Accurately and Efficiently. J Am Chem Soc 2023; 145:9916-9927. [PMID: 37104720 DOI: 10.1021/jacs.3c01095] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Indexed: 04/29/2023]
Abstract
Conformational changes underpin function and encode complex biomolecular mechanisms. Gaining atomic-level detail of how such changes occur has the potential to reveal these mechanisms and is of critical importance in identifying drug targets, facilitating rational drug design, and enabling bioengineering applications. While the past two decades have brought Markov state model techniques to the point where practitioners can regularly use them to glimpse the long-time dynamics of slow conformations in complex systems, many systems are still beyond their reach. In this Perspective, we discuss how including memory (i.e., non-Markovian effects) can reduce the computational cost to predict the long-time dynamics in these complex systems by orders of magnitude and with greater accuracy and resolution than state-of-the-art Markov state models. We illustrate how memory lies at the heart of successful and promising techniques, ranging from the Fokker-Planck and generalized Langevin equations to deep-learning recurrent neural networks and generalized master equations. We delineate how these techniques work, identify insights that they can offer in biomolecular systems, and discuss their advantages and disadvantages in practical settings. We show how generalized master equations can enable the investigation of, for example, the gate-opening process in RNA polymerase II and demonstrate how our recent advances tame the deleterious influence of statistical underconvergence of the molecular dynamics simulations used to parameterize these techniques. This represents a significant leap forward that will enable our memory-based techniques to interrogate systems that are currently beyond the reach of even the best Markov state models. We conclude by discussing some current challenges and future prospects for how exploiting memory will open the door to many exciting opportunities.
Affiliation(s)
- Anthony J Dominic
- Department of Chemistry, University of Colorado Boulder, Boulder, Colorado 80309, USA
| | - Siqin Cao
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | | | - Xuhui Huang
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| |
|
16
|
Hong X, Song K, Rahman MU, Wei T, Zhang Y, Da LT, Chen HF. Phosphorylation Regulation Mechanism of β2 Integrin for the Binding of Filamin Revealed by Markov State Model. J Chem Inf Model 2023; 63:605-618. [PMID: 36607244 DOI: 10.1021/acs.jcim.2c01177] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/07/2023]
Abstract
Leukocyte adhesion deficiency-1 (LAD-1) disorder is a severe immunodeficiency syndrome caused by deficiency or mutation of β2 integrin. Phosphorylation of threonine 758 of β2 integrin acts as a molecular switch that inhibits the binding of filamin. However, the switch mechanism of this site-specific phosphorylation at the atomic level is still poorly understood. To resolve the regulation mechanism, all-atom molecular dynamics simulations and a Markov state model were used to study the dynamic regulation pathway of phosphorylation. The wild-type system possessed a lower binding free energy and fewer states than the phosphorylated system. Both systems underwent a local disorder-to-order conformational conversion when reaching steady states. To reach steady states, the wild type adopted fewer transition paths/shortest paths, according to transition path theory, than the phosphorylated system. The underlying phosphorylation regulation pathway ran from the P1 state to P0 and then to P4, and the main driving forces appear to be hydrogen bonding and hydrophobic interactions that disturb the secondary structure of the phosphorylated states. These studies will shed light on the pathogenesis of LAD-1 disease and lay a foundation for drug development.
Affiliation(s)
- Xiaokun Hong
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Center for Systems Biomedicine, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Kaiyuan Song
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Center for Systems Biomedicine, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Mueed Ur Rahman
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Center for Systems Biomedicine, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Ting Wei
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Center for Systems Biomedicine, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Yan Zhang
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Center for Systems Biomedicine, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Lin-Tai Da
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Center for Systems Biomedicine, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Hai-Feng Chen
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Center for Systems Biomedicine, Shanghai Jiao Tong University, Shanghai 200240, China
- Shanghai Center for Bioinformation Technology, Shanghai 200240, China
| |
|
17
|
Unarta IC, Goonetilleke EC, Wang D, Huang X. Nucleotide addition and cleavage by RNA polymerase II: Coordination of two catalytic reactions using a single active site. J Biol Chem 2022; 299:102844. [PMID: 36581202 PMCID: PMC9860460 DOI: 10.1016/j.jbc.2022.102844] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 07/03/2022] [Revised: 12/19/2022] [Accepted: 12/22/2022] [Indexed: 12/28/2022] Open
Abstract
RNA polymerase II (Pol II) incorporates complementary ribonucleotides into the growing RNA chain one at a time via the nucleotide addition cycle. The nucleotide addition cycle, however, is prone to misincorporation of noncomplementary nucleotides. Thus, to ensure transcriptional fidelity, Pol II backtracks and then cleaves the misincorporated nucleotides. These two reverse reactions, nucleotide addition and cleavage, are catalyzed in the same active site of Pol II, which is different from DNA polymerases or other endonucleases. Recently, substantial progress has been made to understand how Pol II effectively performs its dual role in the same active site. Our review highlights these recent studies and provides an overall model of the catalytic mechanisms of Pol II. In particular, RNA extension follows the two-metal-ion mechanism, and several Pol II residues play important roles to facilitate the catalysis. In sharp contrast, the cleavage reaction is independent of any Pol II residues. Interestingly, Pol II relies on its residues to recognize the misincorporated nucleotides during the backtracking process, prior to cleavage. In this way, Pol II efficiently compartmentalizes its two distinct catalytic functions using the same active site. Lastly, we also discuss a new perspective on the potential third Mg2+ in the nucleotide addition and intrinsic cleavage reactions.
Affiliation(s)
- Ilona Christy Unarta
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Eshani C Goonetilleke
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Dong Wang
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, California, USA; Department of Cellular and Molecular Medicine, School of Medicine, University of California, San Diego, La Jolla, California, USA; Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, California, USA.
| | - Xuhui Huang
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin, USA.
| |
|
18
|
Wang H, Zhu X, Zhao Y, Zang Y, Zhang J, Kang Y, Yang Z, Lin P, Zhang L, Zhang S. Markov State Models Underlying the N-Terminal Premodel of TOPK/PBK. J Phys Chem B 2022; 126:10662-10671. [PMID: 36512332 DOI: 10.1021/acs.jpcb.2c06559] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 12/15/2022]
Abstract
Lymphokine-activated killer T-cell-originated protein kinase (TOPK) is a potential target for cancer therapy. To explore the micromechanism, we proposed the N-terminal premodel (NTPM) of the TOPK monomer via homology modeling and molecular dynamic simulations and analyzed the conformational dynamics by Markov state model analysis. The electronegative insert (ENI) motif of the NTPM can be opened with a small probability under wild type, regulated by the so-called "N-C" interaction zone consisting of the N-terminal head, the coil between β3-strand and αC-helix, and the ENI motif. Glutamate substitution at threonine residue 9 or tyrosine residue 74 promotes the closed-open transition, revealing the details of phosphorylation. Allosteric effects induce functionally relevant structural changes, such as increased structural flexibility and active sites, which are thought to be necessary for further activation or binding. These findings provide rational structural templates for designing state-dependent inhibitors and give insight into the molecular regulatory mechanisms of TOPK monomers.
Affiliation(s)
- He Wang
- MOE Key Laboratory for Nonequilibrium Synthesis and Modulation of Condensed Matter, School of Physics, Xi'an Jiaotong University, Xi'an 710049, China
| | - Xun Zhu
- MOE Key Laboratory for Nonequilibrium Synthesis and Modulation of Condensed Matter, School of Physics, Xi'an Jiaotong University, Xi'an 710049, China
| | - Yizhen Zhao
- MOE Key Laboratory for Nonequilibrium Synthesis and Modulation of Condensed Matter, School of Physics, Xi'an Jiaotong University, Xi'an 710049, China
| | - Yongjian Zang
- MOE Key Laboratory for Nonequilibrium Synthesis and Modulation of Condensed Matter, School of Physics, Xi'an Jiaotong University, Xi'an 710049, China
| | - Jianwen Zhang
- MOE Key Laboratory for Nonequilibrium Synthesis and Modulation of Condensed Matter, School of Physics, Xi'an Jiaotong University, Xi'an 710049, China
| | - Ying Kang
- MOE Key Laboratory for Nonequilibrium Synthesis and Modulation of Condensed Matter, School of Physics, Xi'an Jiaotong University, Xi'an 710049, China
| | - Zhiwei Yang
- MOE Key Laboratory for Nonequilibrium Synthesis and Modulation of Condensed Matter, School of Physics, Xi'an Jiaotong University, Xi'an 710049, China
| | - Peng Lin
- National Translational Science Center for Molecular Medicine & Department of Cell Biology, Fourth Military Medical University, Xi'an 710032, China
| | - Lei Zhang
- MOE Key Laboratory for Nonequilibrium Synthesis and Modulation of Condensed Matter, School of Physics, Xi'an Jiaotong University, Xi'an 710049, China
| | - Shengli Zhang
- MOE Key Laboratory for Nonequilibrium Synthesis and Modulation of Condensed Matter, School of Physics, Xi'an Jiaotong University, Xi'an 710049, China
| |
|
19
|
Xi K, Zhu L. Automated Path Searching Reveals the Mechanism of Hydrolysis Enhancement by T4 Lysozyme Mutants. Int J Mol Sci 2022; 23:ijms232314628. [PMID: 36498954 PMCID: PMC9736071 DOI: 10.3390/ijms232314628] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 10/05/2022] [Revised: 11/16/2022] [Accepted: 11/19/2022] [Indexed: 11/25/2022] Open
Abstract
Bacteriophage T4 lysozyme (T4L) is a glycosidase that is widely applied as a natural antimicrobial agent in the food industry. Due to its wide applications and small size, T4L has been regarded as a model system for understanding protein dynamics and for large-scale protein engineering. Through structural insights from the single conformation of T4L, a series of mutations (L99A,G113A,R119P) have been introduced, which have successfully raised the fractional population of its only hydrolysis-competent excited state to 96%. However, the actual impact of these substitutions on its dynamics remains unclear, largely due to the lack of highly efficient sampling algorithms. Here, using our recently developed travelling-salesman-based automated path searching (TAPS), we located the minimum-free-energy path (MFEP) for the transition of three T4L mutants from their ground states to their excited states. All three mutants share a three-step transition: the flipping of F114, the rearrangement of α0/α1 helices, and final refinement. Remarkably, the MFEP revealed that the effects of the mutations are drastically beyond the expectations of their original design: (a) the G113A substitution not only enhances helicity but also fills the hydrophobic Cavity I and reduces the free energy barrier for flipping F114; (b) R119P barely changes the stability of the ground state but stabilizes the excited state through rarely reported polar contacts S117OG:N132ND2, E11OE1:R145NH1, and E11OE2:Q105NE2; (c) the residue W138 flips into Cavity I and further stabilizes the excited state for the triple mutant L99A,G113A,R119P. These novel insights that were unexpected in the original mutant design indicated the necessity of incorporating path searching into the workflow of rational protein engineering.
|
20
|
Protein Function Analysis through Machine Learning. Biomolecules 2022; 12:biom12091246. [PMID: 36139085 PMCID: PMC9496392 DOI: 10.3390/biom12091246] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Received: 07/16/2022] [Revised: 08/22/2022] [Accepted: 08/31/2022] [Indexed: 11/16/2022] Open
Abstract
Machine learning (ML) has been an important arsenal in computational biology used to elucidate protein function for decades. With the recent burgeoning of novel ML methods and applications, new ML approaches have been incorporated into many areas of computational biology dealing with protein function. We examine how ML has been integrated into a wide range of computational models to improve prediction accuracy and gain a better understanding of protein function. The applications discussed are protein structure prediction, protein engineering using sequence modifications to achieve stability and druggability characteristics, molecular docking in terms of protein–ligand binding, including allosteric effects, protein–protein interactions and protein-centric drug discovery. To quantify the mechanisms underlying protein function, a holistic approach that takes structure, flexibility, stability, and dynamics into account is required, as these aspects become inseparable through their interdependence. Another key component of protein function is conformational dynamics, which often manifest as protein kinetics. Computational methods that use ML to generate representative conformational ensembles and quantify differences in conformational ensembles important for function are included in this review. Future opportunities are highlighted for each of these topics.
|
21
|
Oide M, Sugita Y. Protein Folding Intermediates on the Dimensionality Reduced Landscape with UMAP and Native Contact Likelihood. J Chem Phys 2022; 157:075101. [DOI: 10.1063/5.0099094] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/14/2022] Open
Abstract
To understand protein folding mechanisms from molecular dynamics (MD) simulations, it is important to explore not only folded/unfolded states but also representative intermediate structures on the conformational landscape. Here, we propose a novel approach to construct the landscape using the uniform manifold approximation and projection (UMAP) method, which reduces the dimensionality without losing data-point proximity. In the approach, native contact likelihood is used as feature variables rather than the conventional Cartesian coordinates or dihedral angles of protein structures. We tested the performance of UMAP for coarse-grained MD simulation trajectories of B1 domain in protein G and observed on-pathway transient structures and other metastable states on the UMAP conformational landscape. In contrast, these structures were not clearly distinguished on the dimensionality reduced landscape using principal component analysis (PCA) or time-lagged independent component analysis (tICA). This approach is also useful to obtain dynamical information through Markov State Modeling and would be applicable to large-scale conformational changes in many other biomacromolecules.
Affiliation(s)
| | - Yuji Sugita
- Theoretical Molecular Science Laboratory, RIKEN, Japan
| |
|
22
|
Yang X, Lu ZY. Nanoparticle cluster formation mechanisms elucidated via Markov state modeling: Attraction range effects, aggregation pathways, and counterintuitive transition rates. J Chem Phys 2022; 156:214902. [DOI: 10.1063/5.0086110] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/14/2022] Open
Abstract
Nanoparticle clusters are promising candidates for developing functional materials. However, it is still a challenging task to fabricate them in a predictable and controllable way, which requires investigation of the possible mechanisms underlying cluster formation at the nanoscale. By constructing Markov state models (MSMs) at the microstate level, we find that for highly dispersed particles to form a highly aggregated cluster, there are multiple coexisting pathways, which correspond to direct aggregation, or pathways that need to pass through partially aggregated, intermediate states. Varying the range of attraction between nanoparticles is found to significantly affect pathways. As the attraction range becomes narrower, compared to direct aggregation, some pathways that need to pass through partially aggregated intermediate states become more competitive. In addition, from MSMs constructed at the macrostate level, the aggregation rate is found to be counterintuitively lower with a lower free-energy barrier, which is also discussed.
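As a rough illustration of what constructing an MSM from a microstate trajectory involves, the sketch below counts lagged transitions, row-normalizes them into a transition matrix, and converts eigenvalues into implied timescales. The function names are hypothetical, and the naive count symmetrization used to enforce detailed balance is an assumption made for brevity, not the estimator used in the paper.

```python
import numpy as np

def estimate_msm(dtraj, n_states, lag):
    """Row-stochastic transition matrix from a discrete trajectory.

    Assumption: counts are symmetrized to enforce detailed balance, and
    every state is visited (otherwise a row sum would be zero).
    """
    dtraj = np.asarray(dtraj)
    counts = np.zeros((n_states, n_states))
    # Accumulate transitions i -> j observed `lag` steps apart.
    np.add.at(counts, (dtraj[:-lag], dtraj[lag:]), 1.0)
    counts = 0.5 * (counts + counts.T)
    return counts / counts.sum(axis=1, keepdims=True)

def implied_timescales(T, lag):
    """Relaxation timescales t_k = -lag / ln|lambda_k| for k >= 2."""
    evals = np.sort(np.real(np.linalg.eigvals(T)))[::-1]
    return -lag / np.log(np.abs(evals[1:]))

# Toy two-state trajectory, in units of saved frames.
dtraj = [0, 0, 1, 1, 0, 0, 1, 1]
T = estimate_msm(dtraj, n_states=2, lag=1)
print(T)
print(implied_timescales(T, lag=1))
```

In practice the lag time is chosen where the implied timescales level off, and macrostate models are obtained by lumping the microstates afterwards.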
Affiliation(s)
- Xi Yang
- Institute of Theoretical Chemistry, State Key Laboratory of Supramolecular Structure and Materials, Jilin University, Changchun 130021, China
| | - Zhong-Yuan Lu
- Institute of Theoretical Chemistry, State Key Laboratory of Supramolecular Structure and Materials, Jilin University, Changchun 130021, China
| |
|
23
|
Ji L, Li Y, Wang J, Ning A, Zhang N, Liang S, He J, Zhang T, Qu Z, Gao J. Community Reaction Network Reduction for Constructing a Coarse-Grained Representation of Combustion Reaction Mechanisms. J Chem Inf Model 2022; 62:2352-2364. [PMID: 35442657 DOI: 10.1021/acs.jcim.2c00240] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/30/2022]
Abstract
A community-reaction network reduction (CNR) approach is presented for mechanism reduction on the basis of a network-based community detection technique, a concept related to pre-equilibrium in chemical kinetics. In this method, the detailed combustion mechanism is first transformed into a weighted network, in which communities of species that have dense inner connections under the critical ignition conditions are identified. By analyzing the community partitions in different regions, we determine the effective functional groups and driving processes. Then, a skeletal model for the overall mechanism is deduced according to the network centrality data, including transition pathway identification and reaction-path flux. The CNR method is illustrated on the hydrogen autoignition system which has been extensively investigated, and a new reduced mechanism involving seven processes is proposed. Dynamics simulations employing the present CNR model show that the computed ignition time and distribution of major species on a wide range of temperature and pressure conditions are in accord with the experiments and results from other methods.
Affiliation(s)
- Lin Ji
- Department of Chemistry, Capital Normal University, Beijing 100048, China
| | - Yue Li
- Department of Chemistry, Capital Normal University, Beijing 100048, China
| | - Jie Wang
- Department of Chemistry, Capital Normal University, Beijing 100048, China
| | - An Ning
- Department of Chemistry, Capital Normal University, Beijing 100048, China
| | - Naixin Zhang
- Department of Chemistry, Capital Normal University, Beijing 100048, China
| | - Shengyao Liang
- Department of Chemistry, Capital Normal University, Beijing 100048, China
| | - Jiyun He
- Department of Chemistry, Capital Normal University, Beijing 100048, China
| | - Tianyu Zhang
- Department of Chemistry, Capital Normal University, Beijing 100048, China
| | - Zexing Qu
- Laboratory of Theoretical and Computational Chemistry, Jilin University, Changchun 130015, China
| | - Jiali Gao
- Institute of Systems and Physical Biology, Shenzhen Bay Laboratory, Shenzhen 518055, China; Department of Chemistry and Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota 55455, United States
| |
|
24
|
Cannariato M, Miceli M, Cavaglià M, Deriu MA. Prediction of Protein–Protein Interactions Between Alsin DH/PH and Rac1 and Resulting Protein Dynamics. Front Mol Neurosci 2022; 14:772122. [PMID: 35126051 PMCID: PMC8811474 DOI: 10.3389/fnmol.2021.772122] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 09/07/2021] [Accepted: 12/15/2021] [Indexed: 11/13/2022] Open
Abstract
Alsin is a protein of 1,657 amino acids known for its crucial role in vesicular trafficking in neurons thanks to its ability to interact with two guanosine triphosphatases, Rac1 and Rab5. Evidence suggests that Rac1 can bind Alsin central region, composed by a Dbl Homology (DH) domain followed by a Pleckstrin Homology (PH) domain, leading to Alsin relocalization. However, Alsin three-dimensional structure and its relationship with known biological functions of this protein are still unknown. In this work, a homology model of the Alsin DH/PH domain was developed and studied through molecular dynamics both in the presence and in the absence of its binding partner, Rac1. Due to different conformations of DH domain, the presence of Rac1 seems to stabilize an open state of the protein, while the absence of its binding partner results in closed conformations. Furthermore, Rac1 interaction was able to reduce the fluctuations in the second conserved region of DH motif, which may be involved in the formation of a homodimer. Moreover, the dynamics of DH/PH was described through a Markov State Model to study the pathways linking the open and closed states. In conclusion, this work provided an all-atom model for the DH/PH domain of Alsin protein; moreover, molecular dynamics investigations suggested underlying molecular mechanisms in the signal transduction between Rac1 and Alsin, providing the basis for a deeper understanding of the whole structure–function relationship for Alsin protein.
|
25
|
Konovalov K, Unarta IC, Cao S, Goonetilleke EC, Huang X. Markov State Models to Study the Functional Dynamics of Proteins in the Wake of Machine Learning. JACS Au 2021; 1:1330-1341. [PMID: 34604842 PMCID: PMC8479766 DOI: 10.1021/jacsau.1c00254] [Citation(s) in RCA: 47] [Impact Index Per Article: 15.7] [Received: 06/07/2021] [Indexed: 05/19/2023]
Abstract
Markov state models (MSMs) based on molecular dynamics (MD) simulations are routinely employed to study protein folding; however, their application to functional conformational changes of biomolecules is still limited. In the past few years, the field of computational chemistry has experienced a surge of advancements stemming from machine learning algorithms, and MSMs have not been left out. Unlike global processes, such as protein folding, the application of MSMs to functional conformational changes is challenging because they mostly consist of localized structural transitions. Therefore, it is critical to properly select a subset of structural features that can describe the slowest dynamics of these functional conformational changes. To address this challenge, we recommend several automatic feature selection methods such as Spectral-OASIS. To identify states in MSMs, the chosen features can be subjected to dimensionality reduction methods such as TICA or deep-learning-based VAMPNets to project MD conformations onto a few collective variables for subsequent clustering. Another challenge for the application of MSMs to the study of functional conformational changes is the ability to comprehend their biophysical mechanisms, as MSMs built for these processes often require a large number of states. We recommend the recently developed quasi-MSMs (qMSMs) to address this issue. Compared to MSMs, qMSMs encode the non-Markovian dynamics via the generalized master equation and can significantly reduce the number of states. As a result, qMSMs can be built with a handful of states to facilitate the interpretation of functional conformational changes. In the wake of machine learning, we believe that the rapid advancement in the MSM methodology will lead to their wider application in studying functional conformational changes of biomolecules.
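The TICA projection mentioned in this abstract can be sketched in a few lines of NumPy. This is an illustrative implementation under stated assumptions rather than any package's API: the function name `tica` is hypothetical, the covariance matrices are symmetrized (which presumes reversible dynamics), and the generalized eigenproblem is solved by whitening the instantaneous covariance.

```python
import numpy as np

def tica(X, lag, n_components):
    """Time-lagged independent component analysis on a feature trajectory.

    X has shape (n_frames, n_features). Returns the leading eigenvalues
    (autocorrelations at `lag`) and the projection vectors as columns.
    """
    X = np.asarray(X, dtype=float)
    X = X - X.mean(axis=0)
    X0, Xt = X[:-lag], X[lag:]
    n = len(X0)
    # Symmetrized instantaneous and time-lagged covariance estimates.
    C0 = (X0.T @ X0 + Xt.T @ Xt) / (2.0 * n)
    Ct = (X0.T @ Xt + Xt.T @ X0) / (2.0 * n)
    # Whiten with C0, dropping near-singular directions.
    s, U = np.linalg.eigh(C0)
    keep = s > 1e-10 * s.max()
    W = U[:, keep] / np.sqrt(s[keep])
    # Ordinary symmetric eigenproblem in the whitened space.
    evals, V = np.linalg.eigh(W.T @ Ct @ W)
    order = np.argsort(evals)[::-1][:n_components]
    return evals[order], W @ V[:, order]
```

Projecting the centered features onto the returned vectors gives the slow collective variables, which would then be clustered into microstates.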
Affiliation(s)
- Kirill A. Konovalov
- Department of Chemistry, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
- Hong Kong Center for Neurodegenerative Diseases, Hong Kong Science Park, Hong Kong
| | - Ilona Christy Unarta
- Department of Chemical and Biological Engineering, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
- Hong Kong Center for Neurodegenerative Diseases, Hong Kong Science Park, Hong Kong
| | - Siqin Cao
- Department of Chemistry, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
- Hong Kong Center for Neurodegenerative Diseases, Hong Kong Science Park, Hong Kong
| | - Eshani C. Goonetilleke
- Department of Chemistry, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
- Hong Kong Center for Neurodegenerative Diseases, Hong Kong Science Park, Hong Kong
| | - Xuhui Huang
- Department of Chemistry, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
- Department of Chemical and Biological Engineering, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
- Hong Kong Center for Neurodegenerative Diseases, Hong Kong Science Park, Hong Kong
|
26
|
Xi K, Hu Z, Wu Q, Wei M, Qian R, Zhu L. Assessing the Performance of Traveling-salesman based Automated Path Searching (TAPS) on Complex Biomolecular Systems. J Chem Theory Comput 2021; 17:5301-5311. [PMID: 34270241 DOI: 10.1021/acs.jctc.1c00182] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Indexed: 12/12/2022]
Abstract
Though crucial for understanding the function of large biomolecular systems, locating the minimum free energy paths (MFEPs) between their key conformational states is far from trivial due to their high-dimensional nature. Most existing path-searching methods require a static collective variable space as input, encoding intuition or prior knowledge of the transition mechanism. Such information is, however, hardly available a priori and expensive to validate. To alleviate this issue, we have previously introduced a Traveling-salesman based Automated Path Searching method (TAPS) and demonstrated its efficiency on simple peptide systems. Having implemented a parallel version of this method, here we assess the performance of TAPS on three realistic systems (tens to hundreds of residues) in explicit solvents. We show that TAPS successfully located the MFEP for the ground/excited state transition of the T4 lysozyme L99A variant, consistent with previous findings. TAPS also helped identify the important role of the two polar contacts in directing the loop-in/loop-out transition of the mitogen-activated protein kinase kinase (MEK1), which explained previous mutant experiments. Remarkably, at a minimal cost of 126 ns sampling, TAPS revealed that the Ltn40/Ltn10 transition of lymphotactin needs no complete unfolding/refolding of its β-sheets and that five polar contacts are sufficient to stabilize the various partially unfolded intermediates along the MFEP. These results present TAPS as a general and promising tool for studying the functional dynamics of complex biomolecular systems.
Affiliation(s)
- Kun Xi
- Warshel Institute for Computational Biology, School of Life and Health Sciences, The Chinese University of Hong Kong (Shenzhen), Shenzhen, Guangdong 518172, P. R. China; School of Chemistry and Materials Science, University of Science and Technology of China, Hefei, Anhui 230026, P. R. China
| | - Zhenquan Hu
- Warshel Institute for Computational Biology, School of Life and Health Sciences, The Chinese University of Hong Kong (Shenzhen), Shenzhen, Guangdong 518172, P. R. China; School of Chemistry and Materials Science, University of Science and Technology of China, Hefei, Anhui 230026, P. R. China
| | - Qiang Wu
- School of Science and Engineering, The Chinese University of Hong Kong (Shenzhen), Shenzhen, Guangdong 518172, P. R. China
| | - Meihan Wei
- Warshel Institute for Computational Biology, School of Life and Health Sciences, The Chinese University of Hong Kong (Shenzhen), Shenzhen, Guangdong 518172, P. R. China
| | - Runtong Qian
- Warshel Institute for Computational Biology, School of Life and Health Sciences, The Chinese University of Hong Kong (Shenzhen), Shenzhen, Guangdong 518172, P. R. China
| | - Lizhe Zhu
- Warshel Institute for Computational Biology, School of Life and Health Sciences, The Chinese University of Hong Kong (Shenzhen), Shenzhen, Guangdong 518172, P. R. China
|
27
|
Jiang H, Fan X. The Two-Step Clustering Approach for Metastable States Learning. Int J Mol Sci 2021; 22:6576. [PMID: 34205252 PMCID: PMC8233889 DOI: 10.3390/ijms22126576] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 05/31/2021] [Revised: 06/14/2021] [Accepted: 06/14/2021] [Indexed: 01/20/2023] Open
Abstract
Understanding the energy landscape and the conformational dynamics is crucial for studying many biological or chemical processes, such as protein-protein interaction and RNA folding. Molecular Dynamics (MD) simulations have been a major source of dynamic structure. Although many methods have been proposed for learning metastable states from MD data, some key problems still need further investigation. Here, we give a brief review of recent progress in this field, with an emphasis on some popular methods belonging to a two-step clustering framework, and hope to draw more researchers to contribute to this area.
Affiliation(s)
- Hangjin Jiang
- Center for Data Science, Zhejiang University, Hangzhou 310058, China
| | - Xiaodan Fan
- Department of Statistics, The Chinese University of Hong Kong, Hong Kong, China
|
28
|
Träger S, Tamò G, Aydin D, Fonti G, Audagnotto M, Dal Peraro M. CLoNe: automated clustering based on local density neighborhoods for application to biomolecular structural ensembles. Bioinformatics 2021; 37:921-928. [PMID: 32821900 PMCID: PMC8128458 DOI: 10.1093/bioinformatics/btaa742] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Received: 01/08/2020] [Revised: 07/14/2020] [Accepted: 08/18/2020] [Indexed: 11/14/2022] Open
Abstract
Motivation: Proteins are intrinsically dynamic entities. Flexibility sampling methods, such as molecular dynamics or those arising from integrative modeling strategies, are now commonplace and enable the study of molecular conformational landscapes in many contexts. Resulting structural ensembles increase in size as technological and algorithmic advancements take place, making their analysis increasingly demanding. In this regard, cluster analysis remains a go-to approach for their classification. However, many state-of-the-art algorithms are restricted to specific cluster properties. Combined with tedious parameter fine-tuning, cluster analysis of protein structural ensembles suffers from the lack of a generally applicable and easy-to-use clustering scheme. Results: We present CLoNe, an original Python-based clustering scheme that builds on the Density Peaks algorithm of Rodriguez and Laio. CLoNe relies on a probabilistic analysis of local density distributions derived from nearest neighbors to find relevant clusters regardless of cluster shape, size, distribution and amount. We show its capabilities on many toy datasets with properties otherwise dividing state-of-the-art approaches, and show that it improves on the original algorithm in key aspects. Applied to structural ensembles, CLoNe was able to extract meaningful conformations from membrane binding events and ligand-binding pocket opening as well as identify dominant dimerization motifs or inter-domain organization. CLoNe additionally saves clusters as individual trajectories for further analysis and provides scripts for automated use with molecular visualization software. Availability and implementation: www.epfl.ch/labs/lbm/resources, github.com/LBM-EPFL/CLoNe. Supplementary information: Supplementary data are available at Bioinformatics online.
Affiliation(s)
- Sylvain Träger
- Institute of Bioengineering, School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne, Lausanne 1025, Switzerland; Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland
| | - Giorgio Tamò
- Institute of Bioengineering, School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne, Lausanne 1025, Switzerland; Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland
| | - Deniz Aydin
- Institute of Bioengineering, School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne, Lausanne 1025, Switzerland; Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland
| | - Giulia Fonti
- Institute of Bioengineering, School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne, Lausanne 1025, Switzerland; Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland
| | - Martina Audagnotto
- Institute of Bioengineering, School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne, Lausanne 1025, Switzerland; Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland
| | - Matteo Dal Peraro
- Institute of Bioengineering, School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne, Lausanne 1025, Switzerland; Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland
|
29
|
A comprehensive mechanism for 5-carboxylcytosine-induced transcriptional pausing revealed by Markov state models. J Biol Chem 2021; 296:100735. [PMID: 33991521 PMCID: PMC8191312 DOI: 10.1016/j.jbc.2021.100735] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/17/2020] [Revised: 04/27/2021] [Accepted: 04/28/2021] [Indexed: 11/23/2022] Open
Abstract
RNA polymerase II (Pol II) surveils the genome, pausing as it encounters DNA lesions and base modifications and initiating signals for DNA repair among other important regulatory events. Recent work suggests that Pol II pauses at 5-carboxycytosine (5caC), an epigenetic modification of cytosine, because of a specific hydrogen bond between the carboxyl group of 5caC and a specific residue in fork loop 3 of Pol II. This hydrogen bond compromises productive NTP binding and slows down elongation. Apart from this specific interaction, the carboxyl group of 5caC can potentially interact with numerous charged residues in the cleft of Pol II. However, it is not clear how other interactions between Pol II and 5caC contribute to pausing. In this study, we use Markov state models (a type of kinetic network models) built from extensive molecular dynamics simulations to comprehensively study the impact of 5caC on Pol II translocation. We describe two translocation intermediates with specific interactions that prevent the template base from loading into the Pol II active site. In addition to the previously observed state with 5caC constrained by fork loop 3, we discovered a new intermediate state with a hydrogen bond between 5caC and fork loop 2. Surprisingly, we find that 5caC may curb translocation by suppressing kinking of the helix bordering the active site (the bridge helix) because its high flexibility is critical to translocation. Our work provides new insights into how epigenetic modifications of genomic DNA can modulate Pol II translocation, inducing pauses in transcription.
|
30
|
Suárez E, Wiewiora RP, Wehmeyer C, Noé F, Chodera JD, Zuckerman DM. What Markov State Models Can and Cannot Do: Correlation versus Path-Based Observables in Protein-Folding Models. J Chem Theory Comput 2021; 17:3119-3133. [PMID: 33904312 PMCID: PMC8127341 DOI: 10.1021/acs.jctc.0c01154] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Indexed: 11/28/2022]
Abstract
Markov state models (MSMs) have been widely applied to study the kinetics and pathways of protein conformational dynamics based on statistical analysis of molecular dynamics (MD) simulations. These MSMs coarse-grain both configuration space and time in ways that limit what kinds of observables they can reproduce with high fidelity over different spatial and temporal resolutions. Despite their popularity, there is still limited understanding of which biophysical observables can be computed from these MSMs in a robust and unbiased manner, and which suffer from the space-time coarse-graining intrinsic to the MSM. Most theoretical arguments and practical validity tests for MSMs rely on long-time equilibrium kinetics, such as the slowest relaxation time scales and experimentally observable time-correlation functions. Here, we perform an extensive assessment of the ability of well-validated protein folding MSMs to accurately reproduce path-based observables such as mean first-passage times (MFPTs) and transition path mechanisms compared to a direct trajectory analysis. We also assess a recently proposed class of history-augmented MSMs (haMSMs) that exploit additional information not accounted for in standard MSMs. We conclude with some practical guidance on the use of MSMs to study various problems in conformational dynamics of biomolecules. In brief, MSMs can accurately reproduce correlation functions slower than the lag time, but path-based observables can only be reliably reproduced if the lifetimes of states exceed the lag time, which is a much stricter requirement. Even in the presence of short-lived states, we find that haMSMs reproduce path-based observables more reliably.
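To make the path-based observable in this abstract concrete, the following is a minimal sketch of how an MFPT is commonly obtained from an MSM transition matrix, by solving the linear system m_i = τ + Σ_{j≠target} T_ij m_j. The 3-state matrix, the function name `mfpt_to_target`, and the unit lag time are illustrative assumptions, not taken from the paper; the sketch gives only the MSM-predicted value, not the direct trajectory estimate the authors compare against.

```python
import numpy as np

def mfpt_to_target(T, target, lag=1.0):
    """MSM-predicted mean first-passage time from every state into `target`.

    T is a row-stochastic transition matrix estimated at lag time `lag`.
    """
    n = T.shape[0]
    others = [i for i in range(n) if i != target]
    # For i != target: m_i = lag + sum_{j != target} T_ij * m_j
    A = np.eye(len(others)) - T[np.ix_(others, others)]
    m = np.zeros(n)  # MFPT from the target to itself is defined as 0 here
    m[others] = np.linalg.solve(A, lag * np.ones(len(others)))
    return m

# Hypothetical 3-state chain (0 <-> 1 <-> 2), chosen only for illustration
T = np.array([[0.9, 0.1, 0.0],
              [0.1, 0.8, 0.1],
              [0.0, 0.1, 0.9]])
m = mfpt_to_target(T, target=2, lag=1.0)
print(m)
```

The result is expressed in the same time units as `lag`. Following the abstract's guidance, such a value should be trusted only when the state lifetimes exceed the lag time.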
Affiliation(s)
- Ernesto Suárez
- Advanced Biomedical Computational Science, Frederick National Laboratory for Cancer Research, Frederick, MD 21702
| | - Rafal P. Wiewiora
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY 10065
| | - John D. Chodera
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY 10065
| | - Daniel M. Zuckerman
- Department of Biomedical Engineering, Oregon Health and Science University, Portland, OR 97239
|
31
|
Role of bacterial RNA polymerase gate opening dynamics in DNA loading and antibiotics inhibition elucidated by quasi-Markov State Model. Proc Natl Acad Sci U S A 2021; 118:2024324118. [PMID: 33883282 DOI: 10.1073/pnas.2024324118] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Indexed: 11/18/2022] Open
Abstract
To initiate transcription, the holoenzyme (RNA polymerase [RNAP] in complex with σ factor) loads the promoter DNA via the flexible loading gate created by the clamp and β-lobe, yet their roles in DNA loading have not been characterized. We used a quasi-Markov State Model (qMSM) built from extensive molecular dynamics simulations to elucidate the dynamics of Thermus aquaticus holoenzyme's gate opening. We showed that during gate opening, β-lobe oscillates four orders of magnitude faster than the clamp, whose opening depends on the Switch 2's structure. Myxopyronin, an antibiotic that binds to Switch 2, was shown to undergo a conformational selection mechanism to inhibit clamp opening. Importantly, we reveal a critical but undiscovered role of β-lobe, whose opening is sufficient for DNA loading even when the clamp is partially closed. These findings open the opportunity for the development of antibiotics targeting β-lobe of RNAP. Finally, we have shown that our qMSMs, which encode non-Markovian dynamics based on the generalized master equation formalism, hold great potential to be widely applied to study biomolecular dynamics.
|
32
|
Donati L, Weber M, Keller BG. Markov models from the square root approximation of the Fokker-Planck equation: calculating the grid-dependent flux. J Phys Condens Matter 2021; 33:115902. [PMID: 33352543 DOI: 10.1088/1361-648x/abd5f7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Indexed: 06/12/2023]
Abstract
Molecular dynamics (MD) are extremely complex, yet understanding the slow components of their dynamics is essential to understanding their macroscopic properties. To achieve this, one models the MD as a stochastic process and analyses the dominant eigenfunctions of the associated Fokker-Planck operator, or of closely related transfer operators. So far, the calculation of the discretized operators requires extensive MD simulations. The square-root approximation of the Fokker-Planck equation is a method to calculate transition rates as a ratio of the Boltzmann densities of neighboring grid cells times a flux, and can in principle be calculated without a simulation. In a previous work we still used MD simulations to determine the flux. Here, we propose several methods to calculate the exact or approximate flux for various grid types, and thus estimate the rate matrix without a simulation. Using model potentials we test computational efficiency of the methods, and the accuracy with which they reproduce the dominant eigenfunctions and eigenvalues. For these model potentials, rate matrices with up to [Formula: see text] states can be obtained within seconds on a single high-performance compute server if regular grids are used.
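As a rough illustration of the rate-matrix construction this abstract describes, the sketch below builds a square-root-approximation rate matrix on a one-dimensional regular grid, using Q_ij = Φ·sqrt(π_j/π_i) for adjacent cells with π_i ∝ exp(−V_i/kT). The double-well potential, the choice Φ = D/h² with D = 1, and the function name `sqra_rate_matrix` are assumptions made for this example; the paper's own flux expressions for other grid types are not reproduced here.

```python
import numpy as np

def sqra_rate_matrix(V, flux, kT=1.0):
    """Rate matrix on a 1D regular grid from cell potentials V.

    Adjacent cells get Q_ij = flux * sqrt(pi_j / pi_i), where pi is the
    normalized Boltzmann weight of each cell; no simulation is needed.
    """
    pi = np.exp(-V / kT)
    pi /= pi.sum()
    n = len(V)
    Q = np.zeros((n, n))
    for i in range(n - 1):
        Q[i, i + 1] = flux * np.sqrt(pi[i + 1] / pi[i])
        Q[i + 1, i] = flux * np.sqrt(pi[i] / pi[i + 1])
    # Diagonal entries are set so that every row sums to zero
    np.fill_diagonal(Q, -Q.sum(axis=1))
    return Q, pi

x = np.linspace(-2.0, 2.0, 41)      # grid cell centers (illustrative)
V = (x**2 - 1.0)**2                 # double-well model potential (assumed)
h = x[1] - x[0]
Q, pi = sqra_rate_matrix(V, flux=1.0 / h**2)  # assumes D = 1, flux = D / h^2

# Dominant eigenvalues of Q approximate the slowest relaxation rates
rates = np.sort(np.linalg.eigvals(Q).real)[::-1]
```

By construction the matrix satisfies detailed balance with respect to π, so π is its stationary distribution and the largest eigenvalue is zero.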
Affiliation(s)
- Luca Donati
- Department of Biology, Chemistry, Pharmacy, Freie Universität Berlin, Takustraße 3, D-14195 Berlin, Germany
| | - Marcus Weber
- Zuse Institute Berlin, Takustr. 7, 14195 Berlin, Germany
| | - Bettina G Keller
- Department of Biology, Chemistry, Pharmacy, Freie Universität Berlin, Takustraße 3, D-14195 Berlin, Germany
|
33
|
Cao S, Montoya-Castillo A, Wang W, Markland TE, Huang X. On the advantages of exploiting memory in Markov state models for biomolecular dynamics. J Chem Phys 2020; 153:014105. [PMID: 32640825 DOI: 10.1063/5.0010787] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Indexed: 01/24/2023] Open
Abstract
Biomolecular dynamics play an important role in numerous biological processes. Markov State Models (MSMs) provide a powerful approach to study these dynamic processes by predicting long time scale dynamics based on many short molecular dynamics (MD) simulations. In an MSM, protein dynamics are modeled as a kinetic process consisting of a series of Markovian transitions between different conformational states at discrete time intervals (called "lag time"). To achieve this, a master equation must be constructed with a sufficiently long lag time to allow interstate transitions to become truly Markovian. This imposes a major challenge for MSM studies of proteins since the lag time is bound by the length of relatively short MD simulations available to estimate the frequency of transitions. Here, we show how one can employ the generalized master equation formalism to obtain an exact description of protein conformational dynamics both at short and long time scales without the time resolution restrictions imposed by the MSM lag time. Using a simple kinetic model, alanine dipeptide, and WW domain, we demonstrate that it is possible to construct these quasi-Markov State Models (qMSMs) using MD simulations that are 5-10 times shorter than those required by MSMs. These qMSMs only contain a handful of metastable states and, thus, can greatly facilitate the interpretation of mechanisms associated with protein dynamics. A qMSM opens the door to the study of conformational changes of complex biomolecules where a Markovian model with a few states is often difficult to construct due to the limited length of available MD simulations.
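The standard MSM construction summarized in this abstract (counting transitions between discrete states at a fixed lag time, then normalizing) can be sketched as follows. This is a plain maximum-likelihood estimate without reversibility constraints, and it is not the qMSM/generalized master equation method the paper introduces; the toy trajectory and the function names are hypothetical.

```python
import numpy as np

def estimate_msm(dtraj, n_states, lag):
    """Row-stochastic transition matrix from a discrete state trajectory."""
    counts = np.zeros((n_states, n_states))
    # Count transitions i -> j separated by `lag` frames (sliding window)
    for i, j in zip(dtraj[:-lag], dtraj[lag:]):
        counts[i, j] += 1
    row_sums = counts.sum(axis=1, keepdims=True)
    if np.any(row_sums == 0):
        raise ValueError("every state needs at least one outgoing transition")
    return counts / row_sums

def implied_timescales(T, lag):
    """t_k = -lag / ln|lambda_k| for the non-stationary eigenvalues of T."""
    eigvals = np.sort(np.abs(np.linalg.eigvals(T)))[::-1]
    return -lag / np.log(eigvals[1:])

# Toy two-state trajectory, for illustration only
dtraj = np.array([0, 0, 0, 1, 1, 0, 0, 1, 1, 1, 0, 0])
T = estimate_msm(dtraj, n_states=2, lag=1)
its = implied_timescales(T, lag=1)
print(T)
print(its)
```

In practice the implied timescales are recomputed over a range of lag times, and the lag is chosen where they level off; the abstract's point is that this plateau can require trajectories longer than those available.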
Affiliation(s)
- Siqin Cao
- Department of Chemistry, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
| | - Wei Wang
- Department of Chemistry, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
| | - Thomas E Markland
- Department of Chemistry, Stanford University, Stanford, California 94305, USA
| | - Xuhui Huang
- Department of Chemistry, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
|
34
|
Linker SM, Weiß RG, Riniker S. Connecting dynamic reweighting Algorithms: Derivation of the dynamic reweighting family tree. J Chem Phys 2020; 153:234106. [PMID: 33353335 DOI: 10.1063/5.0019687] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Indexed: 11/14/2022] Open
Abstract
Thermally driven processes of molecular systems include transitions of energy barriers on the microsecond timescales and higher. Sufficient sampling of such processes with molecular dynamics simulations is challenging and often requires accelerating slow transitions using external biasing potentials. Different dynamic reweighting algorithms have been proposed in the past few years to recover the unbiased kinetics from biased systems. However, it remains an open question if and how these dynamic reweighting approaches are connected. In this work, we establish the link between the two main reweighting types, i.e., path-based and energy-based reweighting. We derive a path-based correction factor for the energy-based dynamic histogram analysis method, thus connecting the previously separate reweighting types. We show that the correction factor can be used to combine the advantages of path-based and energy-based reweighting algorithms: it is integrator independent, more robust, and at the same time able to reweight time-dependent biases. We can furthermore demonstrate the relationship between two independently derived path-based reweighting algorithms. Our theoretical findings are verified on a one-dimensional four-well system. By connecting different dynamic reweighting algorithms, this work helps to clarify the strengths and limitations of the different methods and enables a more robust usage of the combined types.
Affiliation(s)
- Stephanie M Linker
- Laboratory of Physical Chemistry, ETH Zurich, Vladimir-Prelog-Weg 2, 8093 Zurich, Switzerland
| | - R Gregor Weiß
- Laboratory of Physical Chemistry, ETH Zurich, Vladimir-Prelog-Weg 2, 8093 Zurich, Switzerland
| | - Sereina Riniker
- Laboratory of Physical Chemistry, ETH Zurich, Vladimir-Prelog-Weg 2, 8093 Zurich, Switzerland
|
35
|
Wang Z, Zhou X, Zuo G. EspcTM: Kinetic Transition Network Based on Trajectory Mapping in Effective Energy Rescaling Space. Front Mol Biosci 2020; 7:589718. [PMID: 33195438 PMCID: PMC7653181 DOI: 10.3389/fmolb.2020.589718] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/31/2020] [Accepted: 09/24/2020] [Indexed: 11/27/2022] Open
Abstract
The transition network provides a key to reveal the thermodynamic and kinetic properties of biomolecular systems. In this paper, we introduce a new method, named effective energy rescaling space trajectory mapping (EspcTM), to detect metastable states and construct transition networks based on the simulation trajectories of the complex biomolecular system. It mapped simulation trajectories into an orthogonal function space, whose bases were rescaled by effective energy, and clustered the interrelation between these trajectories to locate metastable states. By using the EspcTM method, we identified the metastable states and elucidated interstate transition kinetics of a Brownian particle and a dodecapeptide. It was found that the scaling parameters of effective energy also provided a clue to the dominating factors in dynamics. We believe that the EspcTM method is a useful tool for the studies of dynamics of the complex system and may provide new insight into the understanding of thermodynamics and kinetics of biomolecular systems.
Affiliation(s)
- Zhenyu Wang
- T-Life Research Center, State Key Laboratory of Surface Physics, Department of Physics, Fudan University, Shanghai, China
| | - Xin Zhou
- School of Physical Sciences, University of Chinese Academy of Sciences, Beijing, China
| | - Guanghong Zuo
- T-Life Research Center, State Key Laboratory of Surface Physics, Department of Physics, Fudan University, Shanghai, China
|
36
|
Nagel D, Weber A, Stock G. MSMPathfinder: Identification of Pathways in Markov State Models. J Chem Theory Comput 2020; 16:7874-7882. [PMID: 33141565 DOI: 10.1021/acs.jctc.0c00774] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Indexed: 12/19/2022]
Abstract
Markov state models represent a popular means to interpret biomolecular processes in terms of memoryless transitions between metastable conformational states. To gain insight into the underlying mechanism, it is instructive to determine all relevant pathways between initial and final states of the process. Currently available methods, such as Markov chain Monte Carlo and transition path theory, are convenient for identifying the most frequented pathways. They are less suited to account for the typically huge number of pathways with low probability which, though, may dominate the cumulative flux of the reaction. On the basis of a systematic construction of all possible pathways, the method proposed here, MSMPathfinder, is able to characterize the multitude of unique pathways (say, up to 10^10) in a complex system and to quantitatively calculate their correct weights and associated waiting times with predefined accuracy. Adopting the chiral transitions of a peptide helix and the folding of the villin headpiece as model problems, mechanisms and associated waiting times of these processes are discussed using a kinetic network representation. The analysis reveals that the waiting time distribution may yield only little insight into the diversity of pathways, because the measured folding times typically do not reflect the most probable path lengths but rather the cumulative effect of many different pathways.
Affiliation(s)
- Daniel Nagel
- Biomolecular Dynamics, Institute of Physics, Albert Ludwigs University, 79104 Freiburg, Germany
| | - Anna Weber
- Biomolecular Dynamics, Institute of Physics, Albert Ludwigs University, 79104 Freiburg, Germany
| | - Gerhard Stock
- Biomolecular Dynamics, Institute of Physics, Albert Ludwigs University, 79104 Freiburg, Germany
|
37
|
Wang X, Unarta IC, Cheung PPH, Huang X. Elucidating molecular mechanisms of functional conformational changes of proteins via Markov state models. Curr Opin Struct Biol 2020; 67:69-77. [PMID: 33126140 DOI: 10.1016/j.sbi.2020.10.005] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Received: 08/29/2020] [Revised: 09/28/2020] [Accepted: 10/07/2020] [Indexed: 01/01/2023]
Abstract
Functional conformational changes of proteins can facilitate numerous biological events in cells. Markov state models (MSMs) built from molecular dynamics simulations provide a powerful approach to study them. We here introduce a protocol that is tailor-made for constructing MSMs to study the functional conformational changes of proteins. In this protocol, one of the important steps is to select proper molecular features that can collectively describe the slowest timescales of conformational changes of interest. We recommend spectral oASIS, the modified version of oASIS, as a promising approach for automatic feature selection. Recently developed deep learning methods could also serve as efficient approaches for selecting features and finding collective variables. Using DNA repair enzymes and RNA polymerases as examples, we review recent applications of MSMs to elucidate molecular mechanisms of functional conformational changes. Finally, we discuss remaining challenges and future perspectives for constructing MSMs to study functional conformational changes of proteins.
Affiliation(s)
- Xiaowei Wang
- The Hong Kong University of Science and Technology-Shenzhen Research Institute, Hi-Tech Park, Nanshan, Shenzhen 518057, China; Department of Chemistry, Center of Systems Biology and Human Health, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
| | - Ilona Christy Unarta
- Bioengineering Graduate Program, The Hong Kong University of Science and Technology, Kowloon; Hong Kong Center for Neurodegenerative Diseases, Hong Kong
| | - Peter Pak-Hang Cheung
- The Hong Kong University of Science and Technology-Shenzhen Research Institute, Hi-Tech Park, Nanshan, Shenzhen 518057, China; Department of Chemistry, Center of Systems Biology and Human Health, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
| | - Xuhui Huang
- The Hong Kong University of Science and Technology-Shenzhen Research Institute, Hi-Tech Park, Nanshan, Shenzhen 518057, China; Department of Chemistry, Center of Systems Biology and Human Health, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Kowloon, Hong Kong; Bioengineering Graduate Program, The Hong Kong University of Science and Technology, Kowloon; Hong Kong Center for Neurodegenerative Diseases, Hong Kong
|
38
|
Liu H, Deng J, Luo Z, Lin Y, Merz KM, Zheng Z. Receptor–Ligand Binding Free Energies from a Consecutive Histograms Monte Carlo Sampling Method. J Chem Theory Comput 2020; 16:6645-6655. [DOI: 10.1021/acs.jctc.0c00457] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 11/28/2022]
Affiliation(s)
- Hao Liu
- School of Mechanical and Electronic Engineering, Wuhan University of Technology, 122 Luoshi Road, Wuhan 430070, PR China
| | - Jianpeng Deng
- School of Chemistry, Chemical Engineering and Life Science, Wuhan University of Technology, 122 Luoshi Road, Wuhan 430070, PR China
| | - Zhou Luo
- School of Chemistry, Chemical Engineering and Life Science, Wuhan University of Technology, 122 Luoshi Road, Wuhan 430070, PR China
| | - Yawei Lin
- School of Chemistry, Chemical Engineering and Life Science, Wuhan University of Technology, 122 Luoshi Road, Wuhan 430070, PR China
| | - Kenneth M. Merz
- Department of Chemistry, Michigan State University, 578 S. Shaw Lane, East Lansing, Michigan 48824, United States
| | - Zheng Zheng
- School of Chemistry, Chemical Engineering and Life Science, Wuhan University of Technology, 122 Luoshi Road, Wuhan 430070, PR China
|
39
|
Zhang Y, Cao Z, Zhang JZ, Xia F. Double-Well Ultra-Coarse-Grained Model to Describe Protein Conformational Transitions. J Chem Theory Comput 2020; 16:6678-6689. [PMID: 32926616 DOI: 10.1021/acs.jctc.0c00551] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Indexed: 11/28/2022]
Abstract
The double-well model is usually used to describe the conformational transition between two states of a protein. Since conformational changes usually occur within a relatively large time scale, coarse-grained models are often used to accelerate the dynamic process due to their inexpensive computational cost. In this work, we develop a double-well ultra-coarse-grained (DW-UCG) model to describe the conformational transitions of the adenylate kinase, glutamine-binding protein, and lactoferrin. The coarse-grained simulation results show that the DW-UCG model of adenylate kinase captures the crucial intermediate states in the LID-closing and NMP-closing pathways, reflecting the key secondary structural changes in the conformational transition. A comparison of the different DW-UCG models of adenylate kinase indicates that an appropriate choice of bead resolution could generate the free energy landscape that is comparable to that from the residue-based model. The coarse-grained simulations for the glutamine-binding protein and lactoferrin also demonstrate that the DW-UCG model is valid in reproducing the correct two-state behavior for their functional study, which indicates the potential application of the DW-UCG model in investigating the mechanism of conformational changes of large proteins.
Affiliation(s)
- Yuwei Zhang
- State Key Laboratory of Physical Chemistry of Solid Surfaces and Fujian Provincial Key Laboratory of Theoretical and Computational Chemistry, College of Chemistry and Chemistry Engineering, Xiamen University, Xiamen 361005, China; School of Chemistry and Molecular Engineering, East China Normal University, Shanghai 200062, China
- Zexing Cao
- State Key Laboratory of Physical Chemistry of Solid Surfaces and Fujian Provincial Key Laboratory of Theoretical and Computational Chemistry, College of Chemistry and Chemistry Engineering, Xiamen University, Xiamen 361005, China
- John Zenghui Zhang
- School of Chemistry and Molecular Engineering, East China Normal University, Shanghai 200062, China; Shanghai Engineering Research Center of Molecular Therapeutics and New Drug Development, NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai 200062, China
- Fei Xia
- School of Chemistry and Molecular Engineering, East China Normal University, Shanghai 200062, China; Shanghai Engineering Research Center of Molecular Therapeutics and New Drug Development, NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai 200062, China
|
40
|
Bernetti M, Bertazzo M, Masetti M. Data-Driven Molecular Dynamics: A Multifaceted Challenge. Pharmaceuticals (Basel) 2020; 13:E253. [PMID: 32961909 PMCID: PMC7557855 DOI: 10.3390/ph13090253] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Received: 08/25/2020] [Revised: 09/14/2020] [Accepted: 09/16/2020] [Indexed: 12/18/2022] Open
Abstract
The big data concept is currently revolutionizing several fields of science including drug discovery and development. While opening up new perspectives for better drug design and related strategies, big data analysis strongly challenges our current ability to manage and exploit an extraordinarily large and possibly diverse amount of information. The recent renewal of machine learning (ML)-based algorithms is key in providing the proper framework for addressing this issue. In this respect, the impact on the exploitation of molecular dynamics (MD) simulations, which have recently reached mainstream status in computational drug discovery, can be remarkable. Here, we review the recent progress in the use of ML methods coupled to biomolecular simulations with potentially relevant implications for drug design. Specifically, we show how different ML-based strategies can be applied to the outcome of MD simulations for gaining knowledge and enhancing sampling. Finally, we discuss how intrinsic limitations of MD in accurately modeling biomolecular systems can be alleviated by including information coming from experimental data.
Affiliation(s)
- Mattia Bernetti
- Scuola Internazionale Superiore di Studi Avanzati (SISSA), via Bonomea 265, I-34136 Trieste, Italy
- Martina Bertazzo
- Computational Sciences, Istituto Italiano di Tecnologia, via Morego 30, I-16163 Genova, Italy
- Matteo Masetti
- Department of Pharmacy and Biotechnology, Alma Mater Studiorum—Università di Bologna, via Belmeloro 6, I-40126 Bologna, Italy
|
41
|
Target search and recognition mechanisms of glycosylase AlkD revealed by scanning FRET-FCS and Markov state models. Proc Natl Acad Sci U S A 2020; 117:21889-21895. [PMID: 32820079 PMCID: PMC7486748 DOI: 10.1073/pnas.2002971117] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Indexed: 12/14/2022] Open
Abstract
DNA glycosylase repairs DNA damage to maintain genome integrity, and thus it is essential for the survival of all organisms. However, it remains a long-standing puzzle how glycosylase diffuses along the genomic DNA to locate the sparse and aberrant lesion sites efficiently and accurately in a genome containing numerous base pairs. Previously, only the high-speed–low-accuracy search mode had been characterized experimentally, while the low-speed–high-accuracy mode remained undetectable. Here, we observed the low-speed mode of glycosylase AlkD translocating, and further dissected its molecular mechanisms. To achieve this, we developed an integrated platform by combining scanning FRET-FCS with a Markov state model. We expect that this platform can be widely applied to investigate other glycosylases and DNA-binding proteins.
DNA glycosylase is responsible for repairing DNA damage to maintain genome stability and integrity. However, how glycosylase can efficiently and accurately recognize DNA lesions across the enormous DNA genome remains elusive. It has been hypothesized that glycosylase translocates along the DNA by alternating between a fast but low-accuracy diffusion mode and a slow but high-accuracy mode when searching for DNA lesions. However, the slow mode has not been successfully characterized owing to the limited spatial and temporal resolution of current experimental techniques. Using a newly developed scanning fluorescence resonance energy transfer (FRET)–fluorescence correlation spectroscopy (FCS) platform, we were able to observe both slow and fast modes of glycosylase AlkD translocating on double-stranded DNA (dsDNA), reaching microsecond temporal resolution and subnanometer spatial resolution. The underlying molecular mechanism of the slow mode was further elucidated by a Markov state model built from extensive all-atom molecular dynamics simulations.
We found that in the slow mode, AlkD follows an asymmetric diffusion pathway, i.e., rotation followed by translation. Furthermore, the essential role of Y27 in AlkD diffusion dynamics was identified both experimentally and computationally. Our results provided mechanistic insights on how conformational dynamics of AlkD–dsDNA complex coordinate different diffusion modes to accomplish the search for DNA lesions with high efficiency and accuracy. We anticipate that the mechanism adopted by AlkD to search for DNA lesions could be a general one utilized by other glycosylases and DNA binding proteins.
|
42
|
Narayan B, Yuan Y, Fathizadeh A, Elber R, Buchete NV. Long-time methods for molecular dynamics simulations: Markov State Models and Milestoning. Progress in Molecular Biology and Translational Science 2020; 170:215-237. [PMID: 32145946 DOI: 10.1016/bs.pmbts.2020.01.002] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Indexed: 12/12/2022]
Abstract
Molecular dynamics (MD) studies of biomolecules require the ability to simulate complex biochemical systems with an increasingly larger number of particles and for longer time scales, a problem that cannot be overcome by computational hardware advances alone. A main problem springs from the intrinsically high-dimensional and complex nature of the underlying free energy landscape of most systems, and from the necessity to sample such landscapes accurately for identifying kinetic and thermodynamic states in the configuration space, and for accurate calculations of both free energy differences and of the corresponding transition rates between states. Here, we review and present applications of two increasingly popular methods that allow long-time MD simulations of biomolecular systems that can open a broad spectrum of new studies. A first approach, Markov State Models (MSMs), relies on identifying a set of configuration states in which the system resides sufficiently long to relax and lose the memory of previous transitions, and on using simulations for mapping the underlying complex energy landscape and for extracting accurate thermodynamic and kinetic information. The Markovian independence of the underlying transition probabilities creates the opportunity to increase the sampling efficiency by using sets of appropriately initialized short simulations rather than typically long MD trajectories, which also enhances sampling. This allows MSM-based studies to unveil bio-molecular mechanisms and to estimate free energy barriers with high accuracy, in a manner that is both systematic and relatively automatic, which accounts for their increasing popularity. The second approach presented, Milestoning, targets accurate studies of the ensemble of pathways connecting specific end-states (e.g., reactants and products) in a similarly systematic, accurate and highly automatic manner.
Applications presented range from studies of the conformational dynamics and binding of amyloid-forming peptides and cell-penetrating peptides to the DFG-flip dynamics in Abl kinase. As highlighted by the increasing number of studies using both methods, we anticipate that they will open new avenues for the investigation of systematic sampling of reaction pathways and mechanisms occurring on longer time scales than currently accessible by purely computational hardware developments.
Affiliation(s)
- Brajesh Narayan
- School of Physics, University College Dublin, Dublin, Ireland; Institute for Discovery, University College Dublin, Dublin, Ireland
- Ye Yuan
- School of Physics, University College Dublin, Dublin, Ireland; Institute for Discovery, University College Dublin, Dublin, Ireland
- Arman Fathizadeh
- Oden Institute for Computational Engineering and Sciences, University of Texas at Austin, Austin, TX, United States
- Ron Elber
- Oden Institute for Computational Engineering and Sciences, University of Texas at Austin, Austin, TX, United States; Department of Chemistry, University of Texas at Austin, Austin, TX, United States
- Nicolae-Viorel Buchete
- School of Physics, University College Dublin, Dublin, Ireland; Institute for Discovery, University College Dublin, Dublin, Ireland
|
43
|
Beyerle ER, Guenza MG. Kinetics analysis of ubiquitin local fluctuations with Markov state modeling of the LE4PD normal modes. J Chem Phys 2019; 151:164119. [PMID: 31675886 DOI: 10.1063/1.5123513] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Indexed: 11/14/2022] Open
Abstract
Local fluctuations are important for protein binding and molecular recognition because they provide conformational states that can be trapped through a selection mechanism of binding. Thus, an accurate characterization of local fluctuations may be important for modeling the kinetic mechanism that leads to the biological activity of a protein. In this paper, we study the fluctuation dynamics of the regulatory protein ubiquitin and propose a novel theoretical approach to model its fluctuations. A coarse-grained, diffusive, mode-dependent description of fluctuations is accomplished using the Langevin Equation for Protein Dynamics (LE4PD). This equation decomposes the dynamics of a protein, simulated by molecular dynamics, into dynamical pathways that explore mode-dependent free energy surfaces. We calculate the time scales of the slow, high-amplitude fluctuations by modeling the kinetics of barrier crossing in the two-dimensional free energy surfaces using Markov state modeling. We find that the LE4PD predicts slow fluctuations in three important binding regions in ubiquitin: the C-terminal tail, the Lys11 loop, and the 50s loop. These results suggest that the LE4PD can provide useful information on the role of fluctuations in the process of molecular recognition regulating the biological activity of ubiquitin.
Affiliation(s)
- Eric R Beyerle
- Department of Chemistry and Biochemistry and Institute of Theoretical Science, University of Oregon, Eugene, Oregon 97403, USA
- Marina G Guenza
- Department of Chemistry and Biochemistry and Institute of Theoretical Science, University of Oregon, Eugene, Oregon 97403, USA
|
44
|
Tian J, Liu F, Xu Z, Shi J, Liang T, Zhang Y, Da LT. Regulatory Role of One Critical Catalytic Loop of Polypeptide N-Acetyl-Galactosaminyltransferase-2 in Substrate Binding and Catalysis during Mucin-Type O-Glycosylation. ACS Catal 2019. [DOI: 10.1021/acscatal.9b03782] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Indexed: 12/18/2022]
Affiliation(s)
- Jiaqi Tian
- Key Laboratory of Systems Biomedicine (Ministry of Education), Shanghai Center for Systems Biomedicine, Shanghai Jiao Tong University, 800 Dongchuan Road, Shanghai 200240, China
- Feng Liu
- Key Laboratory of Systems Biomedicine (Ministry of Education), Shanghai Center for Systems Biomedicine, Shanghai Jiao Tong University, 800 Dongchuan Road, Shanghai 200240, China
- Zhijue Xu
- Key Laboratory of Systems Biomedicine (Ministry of Education), Shanghai Center for Systems Biomedicine, Shanghai Jiao Tong University, 800 Dongchuan Road, Shanghai 200240, China
- Jingjing Shi
- Key Laboratory of Systems Biomedicine (Ministry of Education), Shanghai Center for Systems Biomedicine, Shanghai Jiao Tong University, 800 Dongchuan Road, Shanghai 200240, China
- Tao Liang
- Key Laboratory of Systems Biomedicine (Ministry of Education), Shanghai Center for Systems Biomedicine, Shanghai Jiao Tong University, 800 Dongchuan Road, Shanghai 200240, China
- Yan Zhang
- Key Laboratory of Systems Biomedicine (Ministry of Education), Shanghai Center for Systems Biomedicine, Shanghai Jiao Tong University, 800 Dongchuan Road, Shanghai 200240, China
- Lin-Tai Da
- Key Laboratory of Systems Biomedicine (Ministry of Education), Shanghai Center for Systems Biomedicine, Shanghai Jiao Tong University, 800 Dongchuan Road, Shanghai 200240, China
|
45
|
Sharpe DJ, Wales DJ. Identifying mechanistically distinct pathways in kinetic transition networks. J Chem Phys 2019; 151:124101. [DOI: 10.1063/1.5111939] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Indexed: 11/14/2022] Open
Affiliation(s)
- Daniel J. Sharpe
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
- David J. Wales
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
|
46
|
Encounter complexes and hidden poses of kinase-inhibitor binding on the free-energy landscape. Proc Natl Acad Sci U S A 2019; 116:18404-18409. [PMID: 31451651 PMCID: PMC6744929 DOI: 10.1073/pnas.1904707116] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Indexed: 12/22/2022] Open
Abstract
Modern drug discovery increasingly focuses on drug-target binding kinetics, which depend on drug (un)binding pathways. Conventional molecular dynamics simulations can observe only a few binding events, even on the fastest supercomputers. Here, we develop a 2D gREST/REUS simulation with enhanced flexibility of the ligand and the protein binding site. The simulation (43 μs in total), applied to an inhibitor binding to c-Src kinase, covers 100 binding and unbinding events. On the statistically converged free-energy landscapes, we succeed in predicting the X-ray binding structure, including water positions. Furthermore, we characterize hidden semibound poses and transient encounter complexes on the free-energy landscapes. Regulatory residues distant from the catalytic core are responsible for the initial inhibitor uptake and regulation of subsequent bindings, which was unresolved by experiments. Stabilizing or blocking either the semibound poses or the encounter complexes can be an effective strategy to optimize drug-target residence time.
|
47
|
Lu S, Ni D, Wang C, He X, Lin H, Wang Z, Zhang J. Deactivation Pathway of Ras GTPase Underlies Conformational Substates as Targets for Drug Design. ACS Catal 2019. [DOI: 10.1021/acscatal.9b02556] [Citation(s) in RCA: 61] [Impact Index Per Article: 12.2] [Indexed: 12/12/2022]
Affiliation(s)
- Shaoyong Lu
- Key Laboratory of Cell Differentiation and Apoptosis of Chinese Ministry of Education, Clinical and Fundamental Research Center, Department of Pharmacy, Renji Hospital, Shanghai Jiao Tong University, School of Medicine, Shanghai 200127, China
- Duan Ni
- Key Laboratory of Cell Differentiation and Apoptosis of Chinese Ministry of Education, Clinical and Fundamental Research Center, Department of Pharmacy, Renji Hospital, Shanghai Jiao Tong University, School of Medicine, Shanghai 200127, China
- Chengxiang Wang
- Key Laboratory of Cell Differentiation and Apoptosis of Chinese Ministry of Education, Clinical and Fundamental Research Center, Department of Pharmacy, Renji Hospital, Shanghai Jiao Tong University, School of Medicine, Shanghai 200127, China
- Xinheng He
- Key Laboratory of Cell Differentiation and Apoptosis of Chinese Ministry of Education, Clinical and Fundamental Research Center, Department of Pharmacy, Renji Hospital, Shanghai Jiao Tong University, School of Medicine, Shanghai 200127, China
- Houwen Lin
- Research Center for Marine Drugs, State Key Laboratory of Oncogenes and Related Genes, Department of Pharmacy, Renji Hospital, Shanghai Jiao Tong University, School of Medicine, Shanghai 200127, China
- Zheng Wang
- Department of Gastrointestinal Surgery, Renji Hospital, Shanghai Jiao Tong University, School of Medicine, Shanghai 200127, China
- Jian Zhang
- Key Laboratory of Cell Differentiation and Apoptosis of Chinese Ministry of Education, Clinical and Fundamental Research Center, Department of Pharmacy, Renji Hospital, Shanghai Jiao Tong University, School of Medicine, Shanghai 200127, China
|
48
|
Thiede EH, Giannakis D, Dinner AR, Weare J. Galerkin approximation of dynamical quantities using trajectory data. J Chem Phys 2019; 150:244111. [PMID: 31255053 PMCID: PMC6824902 DOI: 10.1063/1.5063730] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Received: 09/30/2018] [Accepted: 05/13/2019] [Indexed: 11/14/2022] Open
Abstract
Understanding chemical mechanisms requires estimating dynamical statistics such as expected hitting times, reaction rates, and committors. Here, we present a general framework for calculating these dynamical quantities by approximating boundary value problems using dynamical operators with a Galerkin expansion. A specific choice of basis set in the expansion corresponds to the estimation of dynamical quantities using a Markov state model. More generally, the boundary conditions impose restrictions on the choice of basis sets. We demonstrate how an alternative basis can be constructed using ideas from diffusion maps. In our numerical experiments, this basis gives results of accuracy comparable to or better than that of Markov state models. Additionally, we show that delay embedding can reduce the information lost when projecting the system's dynamics for model construction; this improves estimates of dynamical statistics considerably over the standard practice of increasing the lag time.
Affiliation(s)
- Erik H Thiede
- Department of Chemistry and James Franck Institute, The University of Chicago, Chicago, Illinois 60637, USA
- Dimitrios Giannakis
- Courant Institute of Mathematical Sciences, New York University, New York, New York 10012, USA
- Aaron R Dinner
- Department of Chemistry and James Franck Institute, The University of Chicago, Chicago, Illinois 60637, USA
- Jonathan Weare
- Courant Institute of Mathematical Sciences, New York University, New York, New York 10012, USA
|
49
|
Li W, Wang J, Zhang J, Takada S, Wang W. Overcoming the Bottleneck of the Enzymatic Cycle by Steric Frustration. Physical Review Letters 2019; 122:238102. [PMID: 31298900 DOI: 10.1103/physrevlett.122.238102] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Received: 01/10/2019] [Revised: 04/10/2019] [Indexed: 06/10/2023]
Abstract
The enormous catalytic power of natural enzymes relies on the ability to overcome the bottleneck event in the enzymatic cycle, yet the underlying physical mechanisms are not fully understood. Here, by performing molecular simulations of the whole enzymatic cycle for a model multisubstrate enzyme with a dynamic energy landscape model, we show that multisubstrate enzymes can utilize steric frustration to facilitate the rate-limiting product-release step. During the enzymatic cycles, the bottleneck product is actively squeezed out by the binding of a new substrate at the neighboring site through the population of a substrate-product cobound complex, in which the binding pockets are frustrated due to steric incompatibility. Such steric frustration thereby enables an active mechanism of product release driven by substrate-binding energy, facilitating the enzymatic cycle.
Affiliation(s)
- Wenfei Li
- Department of Physics, National Laboratory of Solid State Microstructure, and Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing 210093, China
- Jun Wang
- Department of Physics, National Laboratory of Solid State Microstructure, and Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing 210093, China
- Jian Zhang
- Department of Physics, National Laboratory of Solid State Microstructure, and Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing 210093, China
- Shoji Takada
- Department of Biophysics, Graduate School of Science, Kyoto University, Kyoto 606-8502, Japan
- Wei Wang
- Department of Physics, National Laboratory of Solid State Microstructure, and Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing 210093, China
|
50
|
Zhu L, Sheong FK, Cao S, Liu S, Unarta IC, Huang X. TAPS: A traveling-salesman based automated path searching method for functional conformational changes of biological macromolecules. J Chem Phys 2019; 150:124105. [DOI: 10.1063/1.5082633] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Indexed: 01/18/2023] Open
Affiliation(s)
- Lizhe Zhu
- Department of Chemistry, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
- Warshel Institute for Computational Biology, School of Life and Health Sciences, The Chinese University of Hong Kong (Shenzhen), Shenzhen, Guangdong 518172, China
- Fu Kit Sheong
- Department of Chemistry, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
- Siqin Cao
- Department of Chemistry, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
- Song Liu
- Department of Chemistry, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
- Ilona C. Unarta
- Department of Chemistry, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
- Xuhui Huang
- Department of Chemistry, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
- Bioengineering Program, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
- HKUST-Shenzhen Research Institute, Hi-Tech Park, Nanshan, Shenzhen 518057, China
|