1
|
Using Gaussian accelerated molecular dynamics combined with Markov state models to explore the mechanism of action of new oral inhibitors on Complex I. Comput Biol Med 2024; 177:108598. [PMID: 38776729 DOI: 10.1016/j.compbiomed.2024.108598] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2024] [Revised: 04/15/2024] [Accepted: 05/11/2024] [Indexed: 05/25/2024]
Abstract
In this study, our focus was on investigating H-1,2,3-triazole derivative HP661 as a novel and highly efficient oral OXPHOS inhibitor, with its molecular-level inhibitory mechanism not yet fully understood. We selected the ND1, NDUFS2, and NDUFS7 subunits of Mitochondrial Complex I as the receptor proteins and established three systems for comparative analysis: protein-IACS-010759, protein-lead compound 10, and protein-HP661. Through extensive analysis involving 500 ns Gaussian molecular dynamics simulations, we gained insights into these systems. Additionally, we constructed a Markov State Models to examine changes in secondary structures during the motion processes. The research findings suggest that the inhibitor HP661 enhances the extensibility and hydrophilicity of the receptor protein. Furthermore, HP661 induces the unwinding of the α-helical structure in the region of residues 726-730. Notably, key roles were identified for Met37, Phe53, and Pro212 in the binding of various inhibitors. In conclusion, we delved into the potential molecular mechanisms of triazole derivative HP661 in inhibiting Complex I. These research outcomes provide crucial information for a deeper understanding of the mechanisms underlying OXPHOS inhibition, offering valuable theoretical support for drug development and disease treatment design.
Collapse
|
2
|
Toward physics-based precision medicine: Exploiting protein dynamics to design new therapeutics and interpret variants. Protein Sci 2024; 33:e4902. [PMID: 38358129 PMCID: PMC10868452 DOI: 10.1002/pro.4902] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Revised: 12/01/2023] [Accepted: 01/04/2024] [Indexed: 02/16/2024]
Abstract
The goal of precision medicine is to utilize our knowledge of the molecular causes of disease to better diagnose and treat patients. However, there is a substantial mismatch between the small number of food and drug administration (FDA)-approved drugs and annotated coding variants compared to the needs of precision medicine. This review introduces the concept of physics-based precision medicine, a scalable framework that promises to improve our understanding of sequence-function relationships and accelerate drug discovery. We show that accounting for the ensemble of structures a protein adopts in solution with computer simulations overcomes many of the limitations imposed by assuming a single protein structure. We highlight studies of protein dynamics and recent methods for the analysis of structural ensembles. These studies demonstrate that differences in conformational distributions predict functional differences within protein families and between variants. Thanks to new computational tools that are providing unprecedented access to protein structural ensembles, this insight may enable accurate predictions of variant pathogenicity for entire libraries of variants. We further show that explicitly accounting for protein ensembles, with methods like alchemical free energy calculations or docking to Markov state models, can uncover novel lead compounds. To conclude, we demonstrate that cryptic pockets, or cavities absent in experimental structures, provide an avenue to target proteins that are currently considered undruggable. Taken together, our review provides a roadmap for the field of protein science to accelerate precision medicine.
Collapse
|
3
|
Unraveling the Interplay of Extracellular Domain Conformational Changes and Parathyroid Hormone Type 1 Receptor Activation in Class B1 G Protein-Coupled Receptors: Integrating Enhanced Sampling Molecular Dynamics Simulations and Markov State Models. ACS Chem Neurosci 2024; 15:844-853. [PMID: 38314550 DOI: 10.1021/acschemneuro.3c00747] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2024] Open
Abstract
Parathyroid hormone (PTH) type 1 receptor (PTH1R), as a typical class B1 G protein-coupled receptor (GPCR), is responsible for regulating bone turnover and maintaining calcium homeostasis, and its dysregulation has been implicated in the development of several diseases. The extracellular domain (ECD) of PTH1R is crucial for the recognition and binding of ligands, and the receptor may exhibit an autoinhibited state with the closure of the ECD in the absence of ligands. However, the correlation between ECD conformations and PTH1R activation remains unclear. Thus, this study combines enhanced sampling molecular dynamics (MD) simulations and Markov state models (MSMs) to reveal the possible relevance between the ECD conformations and the activation of PTH1R. First, 22 intermediate structures are generated from the autoinhibited state to the active state and conducted for 10 independent 200 ns simulations each. Then, the MSM is constructed based on the cumulative 44 μs simulations with six identified microstates. Finally, the potential interplay between ECD conformational changes and PTH1R activation as well as cryptic allosteric pockets in the intermediate states during receptor activation is revealed. Overall, our findings reveal that the activation of PTH1R has a specific correlation with ECD conformational changes and provide essential insights for GPCR biology and developing novel allosteric modulators targeting cryptic sites.
Collapse
|
4
|
Folding-upon-binding pathways of an intrinsically disordered protein from a deep Markov state model. Proc Natl Acad Sci U S A 2024; 121:e2313360121. [PMID: 38294935 PMCID: PMC10861926 DOI: 10.1073/pnas.2313360121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2023] [Accepted: 11/22/2023] [Indexed: 02/02/2024] Open
Abstract
A central challenge in the study of intrinsically disordered proteins is the characterization of the mechanisms by which they bind their physiological interaction partners. Here, we utilize a deep learning-based Markov state modeling approach to characterize the folding-upon-binding pathways observed in a long timescale molecular dynamics simulation of a disordered region of the measles virus nucleoprotein NTAIL reversibly binding the X domain of the measles virus phosphoprotein complex. We find that folding-upon-binding predominantly occurs via two distinct encounter complexes that are differentiated by the binding orientation, helical content, and conformational heterogeneity of NTAIL. We observe that folding-upon-binding predominantly proceeds through a multi-step induced fit mechanism with several intermediates and do not find evidence for the existence of canonical conformational selection pathways. We observe four kinetically separated native-like bound states that interconvert on timescales of eighty to five hundred nanoseconds. These bound states share a core set of native intermolecular contacts and stable NTAIL helices and are differentiated by a sequential formation of native and non-native contacts and additional helical turns. Our analyses provide an atomic resolution structural description of intermediate states in a folding-upon-binding pathway and elucidate the nature of the kinetic barriers between metastable states in a dynamic and heterogenous, or "fuzzy", protein complex.
Collapse
|
5
|
Decoupling the dynamic mechanism revealed by FGFR2 mutation-induced population shift. J Biomol Struct Dyn 2024; 42:1940-1951. [PMID: 37254996 DOI: 10.1080/07391102.2023.2217924] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2023] [Accepted: 04/08/2023] [Indexed: 06/01/2023]
Abstract
The fibroblast growth factor receptor 2 (FGFR2) is a key component in cellular signaling networks, and its dysfunctional activation has been implicated in various diseases including cancer and developmental disorders. Mutations at the activation loop (A-loop) have been suggested to trigger an increased basal kinase activity. However, the molecular mechanism underlying this highly dynamic process has not been fully understood due to the limitation of static structural information. Here, we conducted multiple, large-scale Gaussian accelerated molecular dynamics simulations of five (K659E, K659N, K659M, K659Q, and K659T) FGFR2 mutants at the A-loop, and comprehensively analyzed the dynamic molecular basis of FGFR2 activation. The results quantified the population shift of each system, revealing that all mutants had a higher proportion of active-like states. Using Markov state models, we extracted the representative structure of different conformational states and identified key residues related to the increased kinase activity. Furthermore, community network analysis showed enhanced information connections in the mutants, highlighting the long-range allosteric communication between the A-loop and the hinge region. Our findings may provide insights into the dynamic mechanism for FGFR2 dysfunctional activation and allosteric drug discovery.Communicated by Ramaswamy H. Sarma.
Collapse
|
6
|
Mechanism of β-hairpin formation in AzoChignolin and Chignolin. J Comput Chem 2023; 44:988-1001. [PMID: 36575994 DOI: 10.1002/jcc.27059] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2022] [Revised: 11/23/2022] [Accepted: 11/30/2022] [Indexed: 12/29/2022]
Abstract
AzoChignolin is a photoswitchable variant of the mini-protein Chignolin with an azobenzene (AMPP) replacing the central loop. AzoChignolin is unfolded with AMPP in the trans-isomer. Transition to the cis-isomer causes β-hairpin folding similar to Chignolin. The AzoChignolin system is excellently suited for comprehensive analysis of folding nucleation kinetics. Utilizing multiple long-time MD simulations of AzoChignolin and Chignolin in MeOH and water, we estimated Markov models to examine folding kinetics of both peptides. We show that while AzoChignolin mimics Chignolin's structure well, the folding kinetics are quite different. Not only folding times but also intermediate states differ, particularly Chignolin is able to fold in MeOH into an α-helical intermediate which is impossible to form in AzoChignolin. The Markov models demonstrate that AzoChignolin's kinetics are generally faster, specifically when comparing the two main microfolding processes of hydrophobic collapse and turn formation. Photoswitchable loops are used frequently to understand the kinetics of elementary protein folding nucleation. However, our results indicate that intermediates and folding kinetics may differ between natural loops and photoswitchable variants.
Collapse
|
7
|
Building insightful, memory-enriched models to capture long-time biochemical processes from short-time simulations. Proc Natl Acad Sci U S A 2023; 120:e2221048120. [PMID: 36920924 PMCID: PMC10041170 DOI: 10.1073/pnas.2221048120] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2022] [Accepted: 02/21/2023] [Indexed: 03/16/2023] Open
Abstract
The ability to predict and understand complex molecular motions occurring over diverse timescales ranging from picoseconds to seconds and even hours in biological systems remains one of the largest challenges to chemical theory. Markov state models (MSMs), which provide a memoryless description of the transitions between different states of a biochemical system, have provided numerous important physically transparent insights into biological function. However, constructing these models often necessitates performing extremely long molecular simulations to converge the rates. Here, we show that by incorporating memory via the time-convolutionless generalized master equation (TCL-GME) one can build a theoretically transparent and physically intuitive memory-enriched model of biochemical processes with up to a three order of magnitude reduction in the simulation data required while also providing a higher temporal resolution. We derive the conditions under which the TCL-GME provides a more efficient means to capture slow dynamics than MSMs and rigorously prove when the two provide equally valid and efficient descriptions of the slow configurational dynamics. We further introduce a simple averaging procedure that enables our TCL-GME approach to quickly converge and accurately predict long-time dynamics even when parameterized with noisy reference data arising from short trajectories. We illustrate the advantages of the TCL-GME using alanine dipeptide, the human argonaute complex, and FiP35 WW domain.
Collapse
|
8
|
Dynamic conformational states of apo, ATP and cabozantinib bound TAM kinases to differentiate active-inactive kinetic models. J Biomol Struct Dyn 2023; 41:11394-11414. [PMID: 36591700 DOI: 10.1080/07391102.2022.2162128] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2022] [Accepted: 12/18/2022] [Indexed: 01/03/2023]
Abstract
The dynamically active and inactive conformations of kinases play a crucial role in the activation of intracellular downstream signaling pathways. The all-atom molecular dynamics (MD) simulations at microsecond (µs) timescale and longer provide robust insights into the structural details of conformational alterations in kinases that contribute to their cellular metabolic activities and signaling pathways. Tyro3, Axl and Mer (TAM) receptor tyrosine kinases (RTKs) are overexpressed in several types of human cancers. Cabozantinib, a small molecule inhibitor constrains the activity of TAM kinases at nanomolar concentrations. The apo, complexes of ATP (active state) and cabozantinib (active and inactive states) with TAM RTKs were studied by 1 µs MD simulations followed by trajectory analyses. The dynamic mechanistic pathways intrinsic to the kinase activity and protein conformational landscape in the cabozantinib bound TAM kinases are revealed due to the alterations in the P-loop, α-helix and activation loop that result in breaking the regulatory (R) and catalytic (C) spines, while the active states of ATP bound TAM kinases are retained. The co-existence of dynamical states when bound to cabozantinib was observed and the long-lived kinetic transition states of distinct active and inactive structural models were deciphered from MD simulation trajectories that have not been revealed so far.Communicated by Ramaswamy H. Sarma.
Collapse
|
9
|
Structural Rearrangement of the Serotonin Transporter Intracellular Gate Induced by Thr276 Phosphorylation. ACS Chem Neurosci 2022; 13:933-945. [PMID: 35258286 DOI: 10.1021/acschemneuro.1c00714] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
The reuptake of the neurotransmitter serotonin from the synaptic cleft by the serotonin transporter, SERT, is essential for proper neurological signaling. Biochemical studies have shown that Thr276 of transmembrane helix 5 is a site of PKG-mediated SERT phosphorylation, which has been proposed to shift the SERT conformational equilibria to promote inward-facing states, thus enhancing 5-HT transport. Recent structural and simulation studies have provided insights into the conformation transitions during substrate transport but have not shed light on SERT regulation via post-translational modifications. Using molecular dynamics simulations and Markov state models, we investigate how Thr276 phosphorylation impacts the SERT mechanism and its role in enhancing transporter stability and function. Our simulations show that Thr276 phosphorylation alters the hydrogen-bonding network involving residues on transmembrane helix 5. This in turn decreases the free energy barriers for SERT to transition to the inward-facing state, thus facilitating 5-HT import. The results provide atomistic insights into in vivo SERT regulation and can be extended to other pharmacologically important transporters in the solute carrier family.
Collapse
|
10
|
Molecular Dynamics Simulations of Protein Aggregation: Protocols for Simulation Setup and Analysis with Markov State Models and Transition Networks. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2022; 2340:235-279. [PMID: 35167078 DOI: 10.1007/978-1-0716-1546-1_12] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Abstract
Protein disorder and aggregation play significant roles in the pathogenesis of numerous neurodegenerative diseases, such as Alzheimer's and Parkinson's diseases. The end products of the aggregation process in these diseases are highly structured amyloid fibrils. Though in most cases, small, soluble oligomers formed during amyloid aggregation are the toxic species. A full understanding of the physicochemical forces that drive protein aggregation is thus required if one aims for the rational design of drugs targeting the formation of amyloid oligomers. Among a multitude of biophysical and biochemical techniques that are employed for studying protein aggregation, molecular dynamics (MD) simulations at the atomic level provide the highest temporal and spatial resolution of this process, capturing key steps during the formation of amyloid oligomers. Here we provide a step-by-step guide for setting up, running, and analyzing MD simulations of aggregating peptides using GROMACS. For the analysis, we provide the scripts that were developed in our lab, which allow to determine the oligomer size and inter-peptide contacts that drive the aggregation process. Moreover, we explain and provide the tools to derive Markov state models and transition networks from MD data of peptide aggregation.
Collapse
|
11
|
Abstract
INTRODUCTION Hidden allosteric sites are not visible in apo-crystal structures, but they may be visible in holo-structures when a certain ligand binds and maintains the ligand intended conformation. Several computational and experimental techniques have been used to investigate these hidden sites but identifying them remains a challenge. AREAS COVERED This review provides a summary of the many theoretical approaches for predicting hidden allosteric sites in disease-related proteins. Furthermore, promising cases have been thoroughly examined to reveal the hidden allosteric site and its modulator. EXPERT OPINION In the recent past, with the development in scientific techniques and bioinformatics tools, the number of drug targets for complex human diseases has significantly increased but unfortunately most of these targets are undruggable due to several reasons. Alternative strategies such as finding cryptic (hidden) allosteric sites are an attractive approach for exploitation of the discovery of new targets. These hidden sites are difficult to recognize compared to allosteric sites, mainly due to a lack of visibility in the crystal structure. In our opinion, after many years of development, MD simulations are finally becoming successful for obtaining a detailed molecular description of drug-target interaction.
Collapse
|
12
|
Development of enhanced conformational sampling methods to probe the activation landscape of GPCRs. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2021; 128:325-359. [PMID: 35034722 DOI: 10.1016/bs.apcsb.2021.11.001] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/29/2023]
Abstract
G protein-coupled receptors (GPCRs) make up the largest superfamily of integral membrane proteins and play critical signal transduction roles in many physiological processes. Developments in molecular biology, biophysical, biochemical, pharmacological, and computational techniques aimed at these important therapeutic targets are beginning to provide unprecedented details on the structural as well as functional basis of their pleiotropic signaling mediated by G proteins, β arrestins, and other transducers. This pleiotropy presents a pharmacological challenge as the same ligand-receptor interaction can cause a therapeutic effect as well as an undesirable on-target side-effect through different downstream pathways. GPCRs don't function as simple binary on-off switches but as finely tuned shape-shifting machines described by conformational ensembles, where unique subsets of conformations may be responsible for specific signaling cascades. X-ray crystallography and more recently cryo-electron microscopy are providing snapshots of some of these functionally-important receptor conformations bound to ligands and/or transducers, which are being utilized by computational methods to describe the dynamic conformational energy landscape of GPCRs. In this chapter, we review the progress in computational conformational sampling methods based on molecular dynamics and discrete sampling approaches that have been successful in complementing biophysical and biochemical studies on these receptors in terms of their activation mechanisms, allosteric effects, actions of biased ligands, and effects of pathological mutations. Some of the sampled simulation time scales are beginning to approach receptor activation time scales. The list of conformational sampling methods and example uses discussed is not exhaustive but includes representative examples that have pushed the limits of classical molecular dynamics and discrete sampling methods to describe the activation energy landscape of GPCRs.
Collapse
|
13
|
Physics-based protein structure refinement in the era of artificial intelligence. Proteins 2021; 89:1870-1887. [PMID: 34156124 PMCID: PMC8616793 DOI: 10.1002/prot.26161] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2021] [Revised: 05/31/2021] [Accepted: 06/08/2021] [Indexed: 12/21/2022]
Abstract
Protein structure refinement is the last step in protein structure prediction pipelines. Physics-based refinement via molecular dynamics (MD) simulations has made significant progress during recent years. During CASP14, we tested a new refinement protocol based on an improved sampling strategy via MD simulations. MD simulations were carried out at an elevated temperature (360 K). An optimized use of biasing restraints and the use of multiple starting models led to enhanced sampling. The new protocol generally improved the model quality. In comparison with our previous protocols, the CASP14 protocol showed clear improvements. Our approach was successful with most initial models, many based on deep learning methods. However, we found that our approach was not able to refine machine-learning models from the AlphaFold2 group, often decreasing already high initial qualities. To better understand the role of refinement given new types of models based on machine-learning, a detailed analysis via MD simulations and Markov state modeling is presented here. We continue to find that MD-based refinement has the potential to improve AI predictions. We also identified several practical issues that make it difficult to realize that potential. Increasingly important is the consideration of inter-domain and oligomeric contacts in simulations; the presence of large kinetic barriers in refinement pathways also continues to present challenges. Finally, we provide a perspective on how physics-based refinement could continue to play a role in the future for improving initial predictions based on machine learning-based methods.
Collapse
|
14
|
Modulation of adenosine A2a receptor oligomerization by receptor activation and PIP 2 interactions. Structure 2021; 29:1312-1325.e3. [PMID: 34270937 PMCID: PMC8581623 DOI: 10.1016/j.str.2021.06.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2021] [Revised: 05/28/2021] [Accepted: 06/25/2021] [Indexed: 11/23/2022]
Abstract
GPCRs have been shown to form oligomers, which generate distinctive signaling outcomes. However, the structural nature of the oligomerization process remains uncertain. We have characterized oligomeric configurations of the adenosine A2a receptor (A2aR) by combining large-scale molecular dynamics simulations with Markov state models. These oligomeric structures may also serve as templates for studying oligomerization of other class A GPCRs. Our simulation data revealed that receptor activation results in enhanced oligomerization, more diverse oligomer populations, and a more connected oligomerization network. The active state conformation of the A2aR shifts protein-protein association interfaces to those involving intracellular loop ICL3 and transmembrane helix TM6. Binding of PIP2 to A2aR stabilizes protein-protein interactions via PIP2-mediated association interfaces. These results indicate that A2aR oligomerization is responsive to the local membrane lipid environment. This, in turn, suggests a modulatory effect on A2aR whereby a given oligomerization profile favors the dynamic formation of specific supramolecular signaling complexes.
Collapse
|
15
|
Markov State Models and Molecular Dynamics Simulations Provide Understanding of the Nucleotide-Dependent Dimerization-Based Activation of LRRK2 ROC Domain. Molecules 2021; 26:5647. [PMID: 34577121 PMCID: PMC8467336 DOI: 10.3390/molecules26185647] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2021] [Revised: 09/11/2021] [Accepted: 09/14/2021] [Indexed: 01/26/2023] Open
Abstract
Mutations in leucine-rich repeat kinase 2 (LRRK2) are recognized as the most frequent cause of Parkinson's disease (PD). As a multidomain ROCO protein, LRRK2 is characterized by the presence of both a Ras-of-complex (ROC) GTPase domain and a kinase domain connected through the C-terminal of an ROC domain (COR). The bienzymatic ROC-COR-kinase catalytic triad indicated the potential role of GTPase domain in regulating kinase activity. However, as a functional GTPase, the detailed intrinsic regulation of the ROC activation cycle remains poorly understood. Here, combining extensive molecular dynamics simulations and Markov state models, we disclosed the dynamic structural rearrangement of ROC's homodimer during nucleotide turnover. Our study revealed the coupling between dimerization extent and nucleotide-binding state, indicating a nucleotide-dependent dimerization-based activation scheme adopted by ROC GTPase. Furthermore, inspired by the well-known R1441C/G/H PD-relevant mutations within the ROC domain, we illuminated the potential allosteric molecular mechanism for its pathogenetic effects through enabling faster interconversion between inactive and active states, thus trapping ROC in a prolonged activated state, while the implicated allostery could provide further guidance for identification of regulatory allosteric pockets on the ROC complex. Our investigations illuminated the thermodynamics and kinetics of ROC homodimer during nucleotide-dependent activation for the first time and provided guidance for further exploiting ROC as therapeutic targets for controlling LRRK2 functionality in PD treatment.
Collapse
|
16
|
Role of substrate recognition in modulating strigolactone receptor selectivity in witchweed. J Biol Chem 2021; 297:101092. [PMID: 34437903 PMCID: PMC8487064 DOI: 10.1016/j.jbc.2021.101092] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2021] [Revised: 07/26/2021] [Accepted: 08/16/2021] [Indexed: 01/14/2023] Open
Abstract
Witchweed, or Striga hermonthica, is a parasitic weed that destroys billions of dollars' worth of crops globally every year. Its germination is stimulated by strigolactones exuded by its host plants. Despite high sequence, structure, and ligand-binding site conservation across different plant species, one strigolactone receptor in witchweed, ShHTL7, uniquely exhibits a picomolar EC50 for downstream signaling. Previous biochemical and structural analyses have hypothesized that this unique ligand sensitivity can be attributed to a large binding pocket volume in ShHTL7 resulting in enhanced ability to bind substrates, but additional structural details of the substrate-binding process would help explain its role in modulating the ligand selectivity. Using long-timescale molecular dynamics simulations, we demonstrate that mutations at the entrance of the binding pocket facilitate a more direct ligand-binding pathway to ShHTL7, whereas hydrophobicity at the binding pocket entrance results in a stable “anchored” state. We also demonstrate that several residues on the D-loop of AtD14 stabilize catalytically inactive conformations. Finally, we show that strigolactone selectivity is not modulated by binding pocket volume. Our results indicate that while ligand binding is not the sole modulator of strigolactone receptor selectivity, it is a significant contributing factor. These results can be used to inform the design of selective antagonists for strigolactone receptors in witchweed.
Collapse
|
17
|
Independent Markov decomposition: Toward modeling kinetics of biomolecular complexes. Proc Natl Acad Sci U S A 2021; 118:e2105230118. [PMID: 34321356 PMCID: PMC8346863 DOI: 10.1073/pnas.2105230118] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
To advance the mission of in silico cell biology, modeling the interactions of large and complex biological systems becomes increasingly relevant. The combination of molecular dynamics (MD) simulations and Markov state models (MSMs) has enabled the construction of simplified models of molecular kinetics on long timescales. Despite its success, this approach is inherently limited by the size of the molecular system. With increasing size of macromolecular complexes, the number of independent or weakly coupled subsystems increases, and the number of global system states increases exponentially, making the sampling of all distinct global states unfeasible. In this work, we present a technique called independent Markov decomposition (IMD) that leverages weak coupling between subsystems to compute a global kinetic model without requiring the sampling of all combinatorial states of subsystems. We give a theoretical basis for IMD and propose an approach for finding and validating such a decomposition. Using empirical few-state MSMs of ion channel models that are well established in electrophysiology, we demonstrate that IMD models can reproduce experimental conductance measurements with a major reduction in sampling compared with a standard MSM approach. We further show how to find the optimal partition of all-atom protein simulations into weakly coupled subunits.
Collapse
|
18
|
In Silico Modeling of the Mitochondrial Pumping Complexes with Markov State Models. Methods Mol Biol 2021. [PMID: 34060059 DOI: 10.1007/978-1-0716-1266-8_31] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register]
Abstract
The mechanism of proton pumping by the mitochondrial electron transport chain complexes is still enigmatic after decades of research. Recently, there has been interest in in silico Markov state models to model the mitochondrial pumping complexes at the microscopic level, and this chapter describes the methods of constructing and simulating such models.
Collapse
|
19
|
How do antiporters exchange substrates across the cell membrane? An atomic-level description of the complete exchange cycle in NarK. Structure 2021; 29:922-933.e3. [PMID: 33836147 DOI: 10.1016/j.str.2021.03.014] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2020] [Revised: 01/07/2021] [Accepted: 03/19/2021] [Indexed: 11/19/2022]
Abstract
Major facilitator superfamily (MFS) proteins operate via three different mechanisms: uniport, symport, and antiport. Despite extensive investigations, the molecular understanding of antiporters is less advanced than that of other transporters due to the complex coupling between two substrates and the lack of distinct structures. We employ extensive all-atom molecular dynamics simulations to dissect the complete substrate exchange cycle of the bacterial NO3-/NO2- antiporter, NarK. We show that paired basic residues in the binding site prevent the closure of unbound protein and ensure the exchange of two substrates. Conformational transition occurs only in the presence of substrate, which weakens the electrostatic repulsion and stabilizes the transporter. Furthermore, we propose a state-dependent substrate exchange model, in which the relative spacing between the paired basic residues determines whether NO3- and NO2- bind simultaneously or sequentially. Overall, this work presents a general working model for the antiport mechanism within the MFS.
Collapse
|
20
|
Abstract
Dynamical spectral estimation is a well-established numerical approach for estimating eigenvalues and eigenfunctions of the Markov transition operator from trajectory data. Although the approach has been widely applied in biomolecular simulations, its error properties remain poorly understood. Here we analyze the error of a dynamical spectral estimation method called "the variational approach to conformational dynamics" (VAC). We bound the approximation error and estimation error for VAC estimates. Our analysis establishes VAC's convergence properties and suggests new strategies for tuning VAC to improve accuracy.
Collapse
|
21
|
Data-Driven Molecular Dynamics: A Multifaceted Challenge. Pharmaceuticals (Basel) 2020; 13:E253. [PMID: 32961909 PMCID: PMC7557855 DOI: 10.3390/ph13090253] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2020] [Revised: 09/14/2020] [Accepted: 09/16/2020] [Indexed: 12/18/2022] Open
Abstract
The big data concept is currently revolutionizing several fields of science including drug discovery and development. While opening up new perspectives for better drug design and related strategies, big data analysis strongly challenges our current ability to manage and exploit an extraordinarily large and possibly diverse amount of information. The recent renewal of machine learning (ML)-based algorithms is key in providing the proper framework for addressing this issue. In this respect, the impact on the exploitation of molecular dynamics (MD) simulations, which have recently reached mainstream status in computational drug discovery, can be remarkable. Here, we review the recent progress in the use of ML methods coupled to biomolecular simulations with potentially relevant implications for drug design. Specifically, we show how different ML-based strategies can be applied to the outcome of MD simulations for gaining knowledge and enhancing sampling. Finally, we discuss how intrinsic limitations of MD in accurately modeling biomolecular systems can be alleviated by including information coming from experimental data.
Collapse
|
22
|
Allosteric Regulation at the Crossroads of New Technologies: Multiscale Modeling, Networks, and Machine Learning. Front Mol Biosci 2020; 7:136. [PMID: 32733918 PMCID: PMC7363947 DOI: 10.3389/fmolb.2020.00136] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2020] [Accepted: 06/08/2020] [Indexed: 12/12/2022] Open
Abstract
Allosteric regulation is a common mechanism employed by complex biomolecular systems for regulation of activity and adaptability in the cellular environment, serving as an effective molecular tool for cellular communication. As an intrinsic but elusive property, allostery is a ubiquitous phenomenon where binding or disturbing of a distal site in a protein can functionally control its activity and is considered as the "second secret of life." The fundamental biological importance and complexity of these processes require a multi-faceted platform of synergistically integrated approaches for prediction and characterization of allosteric functional states, atomistic reconstruction of allosteric regulatory mechanisms and discovery of allosteric modulators. The unifying theme and overarching goal of allosteric regulation studies in recent years have been integration between emerging experiment and computational approaches and technologies to advance quantitative characterization of allosteric mechanisms in proteins. Despite significant advances, the quantitative characterization and reliable prediction of functional allosteric states, interactions, and mechanisms continue to present highly challenging problems in the field. In this review, we discuss simulation-based multiscale approaches, experiment-informed Markovian models, and network modeling of allostery and information-theoretical approaches that can describe the thermodynamics and hierarchy allosteric states and the molecular basis of allosteric mechanisms. The wealth of structural and functional information along with diversity and complexity of allosteric mechanisms in therapeutically important protein families have provided a well-suited platform for development of data-driven research strategies. Data-centric integration of chemistry, biology and computer science using artificial intelligence technologies has gained a significant momentum and at the forefront of many cross-disciplinary efforts. We discuss new developments in the machine learning field and the emergence of deep learning and deep reinforcement learning applications in modeling of molecular mechanisms and allosteric proteins. The experiment-guided integrated approaches empowered by recent advances in multiscale modeling, network science, and machine learning can lead to more reliable prediction of allosteric regulatory mechanisms and discovery of allosteric modulators for therapeutically important protein targets.
Collapse
|
23
|
Dynamics Rationalize Proteolytic Susceptibility of the Major Birch Pollen Allergen Bet v 1. Front Mol Biosci 2020; 7:18. [PMID: 32154264 PMCID: PMC7045072 DOI: 10.3389/fmolb.2020.00018] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2019] [Accepted: 01/31/2020] [Indexed: 12/21/2022] Open
Abstract
Proteolytic susceptibility during endolysosomal degradation is decisive for allergic sensitization. In the major birch pollen allergen Bet v 1 most protease cleavage sites are located within its secondary structure elements, which are inherently inaccessible to proteases. The allergen thus must unfold locally, exposing the cleavage sites to become susceptible to proteolysis. Hence, allergen cleavage rates are presumed to be linked to their fold stability, i.e., unfolding probability. Yet, these locally unfolded structures have neither been captured in experiment nor simulation due to limitations in resolution and sampling time, respectively. Here, we perform classic and enhanced molecular dynamics (MD) simulations to quantify fold dynamics on extended timescales of Bet v 1a and two variants with higher and lower cleavage rates. Already at the nanosecond-timescale we observe a significantly higher flexibility for the destabilized variant compared to Bet v 1a and the proteolytically stabilized mutant. Estimating the thermodynamics and kinetics of local unfolding around an initial cleavage site, we find that the Bet v 1 variant with the highest cleavage rate also shows the highest probability for local unfolding. For the stabilized mutant on the other hand we only find minimal unfolding probability. These results strengthen the link between the conformational dynamics of allergen proteins and their stability during endolysosomal degradation. The presented approach further allows atomistic insights in the conformational ensemble of allergen proteins and provides probability estimates below experimental detection limits.
Collapse
|
24
|
Markov models for the elucidation of allosteric regulation. Philos Trans R Soc Lond B Biol Sci 2019; 373:rstb.2017.0178. [PMID: 29735732 DOI: 10.1098/rstb.2017.0178] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/14/2018] [Indexed: 11/12/2022] Open
Abstract
Allosteric regulation refers to the process where the effect of binding of a ligand at one site of a protein is transmitted to another, often distant, functional site. In recent years, it has been demonstrated that allosteric mechanisms can be understood by the conformational ensembles of a protein. Molecular dynamics (MD) simulations are often used for the study of protein allostery as they provide an atomistic view of the dynamics of a protein. However, given the wealth of detailed information hidden in MD data, one has to apply a method that allows extraction of the conformational ensembles underlying allosteric regulation from these data. Markov state models are one of the most promising methods for this purpose. We provide a short introduction to the theory of Markov state models and review their application to various examples of protein allostery studied by MD simulations. We also include a discussion of studies where Markov modelling has been employed to analyse experimental data on allosteric regulation. We conclude our review by advertising the wider application of Markov state models to elucidate allosteric mechanisms, especially since in recent years it has become straightforward to construct such models thanks to software programs like PyEMMA and MSMBuilder.This article is part of a discussion meeting issue 'Allostery and molecular machines'.
Collapse
|
25
|
Simulation of spontaneous G protein activation reveals a new intermediate driving GDP unbinding. eLife 2018; 7:e38465. [PMID: 30289386 PMCID: PMC6224197 DOI: 10.7554/elife.38465] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2018] [Accepted: 10/04/2018] [Indexed: 12/12/2022] Open
Abstract
Activation of heterotrimeric G proteins is a key step in many signaling cascades. However, a complete mechanism for this process, which requires allosteric communication between binding sites that are ~30 Å apart, remains elusive. We construct an atomically detailed model of G protein activation by combining three powerful computational methods: metadynamics, Markov state models (MSMs), and CARDS analysis of correlated motions. We uncover a mechanism that is consistent with a wide variety of structural and biochemical data. Surprisingly, the rate-limiting step for GDP release correlates with tilting rather than translation of the GPCR-binding helix 5. β-Strands 1 - 3 and helix 1 emerge as hubs in the allosteric network that links conformational changes in the GPCR-binding site to disordering of the distal nucleotide-binding site and consequent GDP release. Our approach and insights provide foundations for understanding disease-implicated G protein mutants, illuminating slow events in allosteric networks, and examining unbinding processes with slow off-rates.
Collapse
|
26
|
Simulations of the regulatory ACT domain of human phenylalanine hydroxylase (PAH) unveil its mechanism of phenylalanine binding. J Biol Chem 2018; 293:19532-19543. [PMID: 30287685 DOI: 10.1074/jbc.ra118.004909] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2018] [Revised: 09/17/2018] [Indexed: 12/20/2022] Open
Abstract
Phenylalanine hydroxylase (PAH) regulates phenylalanine (Phe) levels in mammals to prevent neurotoxicity resulting from high Phe concentrations as observed in genetic disorders leading to hyperphenylalaninemia and phenylketonuria. PAH senses elevated Phe concentrations by transient allosteric Phe binding to a protein-protein interface between ACT domains of different subunits in a PAH tetramer. This interface is present in an activated PAH (A-PAH) tetramer and absent in a resting-state PAH (RS-PAH) tetramer. To investigate this allosteric sensing mechanism, here we used the GROMACS molecular dynamics simulation suite on the Folding@home computing platform to perform extensive molecular simulations and Markov state model (MSM) analysis of Phe binding to ACT domain dimers. These simulations strongly implicated a conformational selection mechanism for Phe association with ACT domain dimers and revealed protein motions that act as a gating mechanism for Phe binding. The MSMs also illuminate a highly mobile hairpin loop, consistent with experimental findings also presented here that the PAH variant L72W does not shift the PAH structural equilibrium toward the activated state. Finally, simulations of ACT domain monomers are presented, in which spontaneous transitions between resting-state and activated conformations are observed, also consistent with a mechanism of conformational selection. These mechanistic details provide detailed insight into the regulation of PAH activation and provide testable hypotheses for the development of new allosteric effectors to correct structural and functional defects in PAH.
Collapse
|
27
|
Uncovering universal rules governing the selectivity of the archetypal DNA glycosylase TDG. Proc Natl Acad Sci U S A 2018; 115:5974-5979. [PMID: 29784784 DOI: 10.1073/pnas.1803323115] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Thymine DNA glycosylase (TDG) is a pivotal enzyme with dual roles in both genome maintenance and epigenetic regulation. TDG is involved in cytosine demethylation at CpG sites in DNA. Here we have used molecular modeling to delineate the lesion search and DNA base interrogation mechanisms of TDG. First, we examined the capacity of TDG to interrogate not only DNA substrates with 5-carboxyl cytosine modifications but also G:T mismatches and nonmismatched (A:T) base pairs using classical and accelerated molecular dynamics. To determine the kinetics, we constructed Markov state models. Base interrogation was found to be highly stochastic and proceeded through insertion of an arginine-containing loop into the DNA minor groove to transiently disrupt Watson-Crick pairing. Next, we employed chain-of-replicas path-sampling methodologies to compute minimum free energy paths for TDG base extrusion. We identified the key intermediates imparting selectivity and determined effective free energy profiles for the lesion search and base extrusion into the TDG active site. Our results show that DNA sculpting, dynamic glycosylase interactions, and stabilizing contacts collectively provide a powerful mechanism for the detection and discrimination of modified bases and epigenetic marks in DNA.
Collapse
|
28
|
Abstract
Since the availability of the first crystal structure of a bacterial Na+ channel in 2011, understanding selectivity across this family of membrane proteins has been the subject of intense research efforts. Initially, free energy calculations based on molecular dynamics simulations revealed that although sodium ions can easily permeate the channel with their first hydration shell almost intact, the selectivity filter is too narrow for efficient conduction of hydrated potassium ions. This steric view of selectivity was subsequently questioned by microsecond atomic trajectories, which proved that the selectivity filter appears to the permeating ions as a highly degenerate, liquid-like environment. Although this liquid-like environment looks optimal for rapid conduction of Na+, it seems incompatible with efficient discrimination between similar ion species, such as Na+ and K+, through steric effects. Here extensive molecular dynamics simulations, combined with Markov state model analyses, reveal that at positive membrane potentials, potassium ions trigger a conformational change of the selectivity toward a nonconductive metastable state. It is this transition of the selectivity filter, and not steric effects, that prevents the outward flux of K+ at positive membrane potentials. This description of selectivity, triggered by the nature of the permeating ions, might have implications on the current understanding of how ion channels, and in particular bacterial Na+ channels, operate at the atomic scale.
Collapse
|
29
|
Abstract
Background Much of the structure-based mechanistic understandings of the function of SLC6A neurotransmitter transporters emerged from the study of their bacterial LeuT-fold homologs. It has become evident, however, that structural differences such as the long N- and C-termini of the eukaryotic neurotransmitter transporters are involved in an expanded set of functional properties to the eukaryotic transporters. These functional properties are not shared by the bacterial homologs, which lack the structural elements that appeared later in evolution. However, mechanistic insights into some of the measured functional properties of the eukaryotic transporters that have been suggested to involve these structural elements are sparse or merely descriptive. Results To learn how the structural elements added in evolution enable mechanisms of the eukaryotic transporters in ways not shared with their bacterial LeuT-like homologs, we focused on the human dopamine transporter (hDAT) as a prototype. We present the results of a study employing large-scale molecular dynamics simulations and comparative Markov state model analysis of experimentally determined properties of the wild-type and mutant hDAT constructs. These offer a quantitative outline of mechanisms in which a rich spectrum of interactions of the hDAT N-terminus and C-terminus contribute to the regulation of transporter function (e.g., by phosphorylation) and/or to entirely new phenotypes (e.g., reverse uptake (efflux)) that were added in evolution. Conclusions The findings are consistent with the proposal that the size of eukaryotic neurotransmitter transporter termini increased during evolution to enable more functions (e.g., efflux) not shared with the bacterial homologs. The mechanistic explanations for the experimental findings about the modulation of function in DAT, the serotonin transporter, and other eukaryotic transporters reveal separate roles for the distal and proximal segments of the much larger N-terminus in eukaryotic transporters compared to the bacterial ones. The involvement of the proximal and distal segments — such as the role of the proximal segment in sustaining transport in phosphatidylinositol 4,5-bisphosphate-depleted membranes and of the distal segment in modulating efflux — may represent an evolutionary adaptation required for the function of eukaryotic transporters expressed in various cell types of the same organism that differ in the lipid composition and protein complement of their membrane environment. Electronic supplementary material The online version of this article (10.1186/s12915-018-0495-6) contains supplementary material, which is available to authorized users.
Collapse
|
30
|
What makes it difficult to refine protein models further via molecular dynamics simulations? Proteins 2018; 86 Suppl 1:177-188. [PMID: 28975670 PMCID: PMC5820117 DOI: 10.1002/prot.25393] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2017] [Revised: 09/11/2017] [Accepted: 09/29/2017] [Indexed: 01/20/2023]
Abstract
Protein structure refinement remains a challenging yet important problem as it has the potential to bring already accurate template-based models to near-native resolution. Refinement based on molecular dynamics simulations has been a highly promising approach and the performance of MD-based refinement in the Feig group during CASP12 is described here. During CASP12, sampling was extended well into the microsecond scale, an improved force field was applied, and new protocol variations were tested. Progress over previous rounds of CASP was found to be limited which is analyzed in terms of the quality of the initial models and dependency on the amount of sampling and refinement protocol variations. As current MD-based refinement protocols appear to be reaching a plateau, detailed analysis is presented to provide new insight into the major challenges towards more extensive structure refinement, focusing in particular on sampling with and without restraints.
Collapse
|
31
|
Mechanisms of Lipid Scrambling by the G Protein-Coupled Receptor Opsin. Structure 2017; 26:356-367.e3. [PMID: 29290486 DOI: 10.1016/j.str.2017.11.020] [Citation(s) in RCA: 43] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2017] [Revised: 10/29/2017] [Accepted: 11/27/2017] [Indexed: 01/05/2023]
Abstract
Several class-A G protein-coupled receptor (GPCR) proteins act as constitutive phospholipid scramblases catalyzing the transbilayer translocation of >10,000 phospholipids per second when reconstituted into synthetic vesicles. To address the molecular mechanism by which these proteins facilitate rapid lipid scrambling, we carried out large-scale ensemble atomistic molecular dynamics simulations of the opsin GPCR. We report that, in the process of scrambling, lipid head groups traverse a dynamically revealed hydrophilic pathway in the region between transmembrane helices 6 and 7 of the protein while their hydrophobic tails remain in the bilayer environment. We present quantitative kinetic models of the translocation process based on Markov State Model analysis. As key residues on the lipid translocation pathway are conserved within the class-A GPCR family, our results illuminate unique aspects of GPCR structure and dynamics while providing a rigorous basis for the design of variants of these proteins with defined scramblase activity.
Collapse
|
32
|
Combining experimental and simulation data of molecular processes via augmented Markov models. Proc Natl Acad Sci U S A 2017; 114:8265-8270. [PMID: 28716931 DOI: 10.1073/pnas.1704803114] [Citation(s) in RCA: 62] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
Accurate mechanistic description of structural changes in biomolecules is an increasingly important topic in structural and chemical biology. Markov models have emerged as a powerful way to approximate the molecular kinetics of large biomolecules while keeping full structural resolution in a divide-and-conquer fashion. However, the accuracy of these models is limited by that of the force fields used to generate the underlying molecular dynamics (MD) simulation data. Whereas the quality of classical MD force fields has improved significantly in recent years, remaining errors in the Boltzmann weights are still on the order of a few [Formula: see text], which may lead to significant discrepancies when comparing to experimentally measured rates or state populations. Here we take the view that simulations using a sufficiently good force-field sample conformations that are valid but have inaccurate weights, yet these weights may be made accurate by incorporating experimental data a posteriori. To do so, we propose augmented Markov models (AMMs), an approach that combines concepts from probability theory and information theory to consistently treat systematic force-field error and statistical errors in simulation and experiment. Our results demonstrate that AMMs can reconcile conflicting results for protein mechanisms obtained by different force fields and correct for a wide range of stationary and dynamical observables even when only equilibrium measurements are incorporated into the estimation process. This approach constitutes a unique avenue to combine experiment and computation into integrative models of biomolecular structure and dynamics.
Collapse
|
33
|
Molecular dynamics simulations reveal the conformational dynamics of Arabidopsis thaliana BRI1 and BAK1 receptor-like kinases. J Biol Chem 2017; 292:12643-12652. [PMID: 28559283 DOI: 10.1074/jbc.m117.792762] [Citation(s) in RCA: 38] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2017] [Revised: 05/18/2017] [Indexed: 11/06/2022] Open
Abstract
The structural motifs responsible for activation and regulation of eukaryotic protein kinases in animals have been studied extensively in recent years, and a coherent picture of their activation mechanisms has begun to emerge. In contrast, non-animal eukaryotic protein kinases are not as well understood from a structural perspective, representing a large knowledge gap. To this end, we investigated the conformational dynamics of two key Arabidopsis thaliana receptor-like kinases, brassinosteroid-insensitive 1 (BRI1) and BRI1-associated kinase 1 (BAK1), through extensive molecular dynamics simulations of their fully phosphorylated kinase domains. Molecular dynamics simulations calculate the motion of each atom in a protein based on classical approximations of interatomic forces, giving researchers insight into protein function at unparalleled spatial and temporal resolutions. We found that in an otherwise "active" BAK1 the αC helix is highly disordered, a hallmark of deactivation, whereas the BRI1 αC helix is moderately disordered and displays swinging behavior similar to numerous animal kinases. An analysis of all known sequences in the A. thaliana kinome found that αC helix disorder may be a common feature of plant kinases.
Collapse
|
34
|
Toward a direct and scalable identification of reduced models for categorical processes. Proc Natl Acad Sci U S A 2017; 114:4863-4868. [PMID: 28432182 DOI: 10.1073/pnas.1612619114] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The applicability of many computational approaches is dwelling on the identification of reduced models defined on a small set of collective variables (colvars). A methodology for scalable probability-preserving identification of reduced models and colvars directly from the data is derived-not relying on the availability of the full relation matrices at any stage of the resulting algorithm, allowing for a robust quantification of reduced model uncertainty and allowing us to impose a priori available physical information. We show two applications of the methodology: (i) to obtain a reduced dynamical model for a polypeptide dynamics in water and (ii) to identify diagnostic rules from a standard breast cancer dataset. For the first example, we show that the obtained reduced dynamical model can reproduce the full statistics of spatial molecular configurations-opening possibilities for a robust dimension and model reduction in molecular dynamics. For the breast cancer data, this methodology identifies a very simple diagnostics rule-free of any tuning parameters and exhibiting the same performance quality as the state of the art machine-learning applications with multiple tuning parameters reported for this problem.
Collapse
|
35
|
Abstract
Hidden Markov models (HMMs) provide a framework to analyze large trajectories of biomolecular simulation datasets. HMMs decompose the conformational space of a biological molecule into finite number of states that interconvert among each other with certain rates. HMMs simplify long timescale trajectories for human comprehension, and allow comparison of simulations with experimental data. In this chapter, we provide an overview of building HMMs for analyzing bimolecular simulation datasets. We demonstrate the procedure for building a Hidden Markov model for Met-enkephalin peptide simulation dataset and compare the timescales of the process.
Collapse
|
36
|
Abstract
Nonreceptor tyrosine kinases of the Src family are large multidomain allosteric proteins that are crucial to cellular signaling pathways. In a previous study, we generated a Markov state model (MSM) to simulate the activation of c-Src catalytic domain, used as a prototypical tyrosine kinase. The long-time kinetics of transition predicted by the MSM was in agreement with experimental observations. In the present study, we apply the framework of transition path theory (TPT) to the previously constructed MSM to characterize the main features of the activation pathway. The analysis indicates that the activating transition, in which the activation loop first opens up followed by an inward rotation of the αC-helix, takes place via a dense set of intermediate microstates distributed within a fairly broad "transition tube" in a multidimensional conformational subspace connecting the two end-point conformations. Multiple microstates with negligible equilibrium probabilities carry a large transition flux associated with the activating transition, which explains why extensive conformational sampling is necessary to accurately determine the kinetics of activation. Our results suggest that the combination of MSM with TPT provides an effective framework to represent conformational transitions in complex biomolecular systems.
Collapse
|
37
|
Abstract
We introduce the general transition-based reweighting analysis method (TRAM), a statistically optimal approach to integrate both unbiased and biased molecular dynamics simulations, such as umbrella sampling or replica exchange. TRAM estimates a multiensemble Markov model (MEMM) with full thermodynamic and kinetic information at all ensembles. The approach combines the benefits of Markov state models-clustering of high-dimensional spaces and modeling of complex many-state systems-with those of the multistate Bennett acceptance ratio of exploiting biased or high-temperature ensembles to accelerate rare-event sampling. TRAM does not depend on any rate model in addition to the widely used Markov state model approximation, but uses only fundamental relations such as detailed balance and binless reweighting of configurations between ensembles. Previous methods, including the multistate Bennett acceptance ratio, discrete TRAM, and Markov state models are special cases and can be derived from the TRAM equations. TRAM is demonstrated by efficiently computing MEMMs in cases where other estimators break down, including the full thermodynamics and rare-event kinetics from high-dimensional simulation data of an all-atom protein-ligand binding model.
Collapse
|
38
|
Abstract
Life is fundamentally a nonequilibrium phenomenon. At the expense of dissipated energy, living things perform irreversible processes that allow them to propagate and reproduce. Within cells, evolution has designed nanoscale machines to do meaningful work with energy harnessed from a continuous flux of heat and particles. As dictated by the Second Law of Thermodynamics and its fluctuation theorem corollaries, irreversibility in nonequilibrium processes can be quantified in terms of how much entropy such dynamics produce. In this work, we seek to address a fundamental question linking biology and nonequilibrium physics: can the evolved dissipative pathways that facilitate biomolecular function be identified by their extent of entropy production in general relaxation processes? We here synthesize massive molecular dynamics simulations, Markov state models (MSMs), and nonequilibrium statistical mechanical theory to probe dissipation in two key classes of signaling proteins: kinases and G-protein-coupled receptors (GPCRs). Applying machinery from large deviation theory, we use MSMs constructed from protein simulations to generate dynamics conforming to positive levels of entropy production. We note the emergence of an array of peaks in the dynamical response (transient analogs of phase transitions) that draw the proteins between distinct levels of dissipation, and we see that the binding of ATP and agonist molecules modifies the observed dissipative landscapes. Overall, we find that dissipation is tightly coupled to activation in these signaling systems: dominant entropy-producing trajectories become localized near important barriers along known biological activation pathways. We go on to classify an array of equilibrium and nonequilibrium molecular switches that harmonize to promote functional dynamics.
Collapse
|
39
|
Abstract
Molecular dynamics simulations hold the promise to be an important tool for biological research and drug discovery. Historically, however, there were several obstacles for it to become a practical research tool. Limitations in computer hardware had previously made it difficult to simulate for long enough to see interesting biological processes. Recent improvements in hardware and algorithms have largely removed this issue, leaving data analysis as the main obstacle. Advances in Markov state modeling appear to be on the way to remove this obstacle. We outline these advances here and discuss numerous recent studies that demonstrate that molecular dynamics simulations will start to be an important tool for pharmaceutical research.
Collapse
|