1
|
Stulajter MM, Rappoport D. Reaction Networks Resemble Low-Dimensional Regular Lattices. J Chem Theory Comput 2024. [PMID: 39236261 DOI: 10.1021/acs.jctc.4c00810] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/07/2024]
Abstract
The computational exploration, manipulation, and design of complex chemical reactions face fundamental challenges related to the high-dimensional nature of potential energy surfaces (PESs) that govern reactivity. Accurately modeling complex reactions is crucial for understanding the chemical processes involved in, for example, organocatalysis, autocatalytic cycles, and one-pot molecular assembly. Our prior research demonstrated that discretizing PESs using heuristics based on bond breaking and bond formation produces a reaction network representation with a low-dimensional structure (metric space). We now find that these stoichiometry-preserving reaction networks possess additional, though approximate, structure and resemble low-dimensional regular lattices with a small amount of random edge rewiring. The heuristics-based discretization thus generates a nonlinear dimensionality reduction by a factor of 10 with an a posteriori error measure (probability of random rewiring). The structure becomes evident through a comparative analysis of CHNO reaction networks of varying stoichiometries against a panel of size-matched generative network models, taking into account their local, metric, and global properties. The generative models include random networks (Erdős-Rényi and bipartite random networks), regular lattices (periodic and nonperiodic), and network models with a tunable level of "randomness" (Watts-Strogatz graphs and regular lattices with random rewiring). The CHNO networks are simultaneously closely matched in all these properties by 3-4-dimensional regular lattices with 10% or less of edges randomly rewired. The effective dimensionality reduction is found to be independent of the system size, stoichiometry, and ruleset, suggesting that search and sampling algorithms for PESs of complex chemical reactions can be effectively leveraged.
Collapse
Affiliation(s)
- Miko M Stulajter
- Department of Chemistry, University of California Irvine, Irvine, California 92697, United States
- Computational Science Research Center, San Diego State University, San Diego, California 92182, United States
| | - Dmitrij Rappoport
- Department of Chemistry, University of California Irvine, Irvine, California 92697, United States
| |
Collapse
|
2
|
Jani V, Sonavane U, Joshi R. Insight into structural dynamics involved in activation mechanism of full length KRAS wild type and P-loop mutants. Heliyon 2024; 10:e36161. [PMID: 39247361 PMCID: PMC11379609 DOI: 10.1016/j.heliyon.2024.e36161] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2023] [Revised: 08/06/2024] [Accepted: 08/11/2024] [Indexed: 09/10/2024] Open
Abstract
KRAS protein is known to be frequently mutated in various cancers. The most common mutations being at position 12, 13 and 61. The positions 12 and 13 form part of the phosphate binding region (P-loop) of KRAS. Owing to mutation, the protein remains in continuous active state and affects the normal cellular process. Understanding the structural changes owing to mutations in GDP-bound (inactive state) and GTP-bound (active state) may help in the design of better therapeutics. To understand the structural flexibility due to the mutations specifically located at P-loop regions (G12D, G12V and G13D), extensive molecular dynamics simulations (24 μs) have been carried for both inactive (GDP-bound) and active (GTP-bound) structures for the wild type and these mutants. The study revealed that the local structural changes at the site of mutations allosterically guide changes in distant regions of the protein through hydrogen bond and hydrophobic signalling network. The dynamic cross correlation analysis and the comparison of the correlated motions among different systems manifested that changes in SW-I, SW-II, α3 and the loop preceding α3 affects the interactions of GDP/GTP with different regions of the protein thereby affecting its hydrolysis. Further, the Markov state modelling analysis confirmed that the mutations, especially G13D imparts rigidity to structure compared to wild type and thus limiting its conformational state in either intermediate state or active state. The study suggests that along with SW-I and SW-II regions, the loop region preceding the α3 helix and α3 helix are also involved in affecting the hydrolysis of nucleotides and may be considered while designing therapeutics against KRAS.
Collapse
Affiliation(s)
- Vinod Jani
- Centre for Development of Advanced Computing (C-DAC), Panchavati, Pashan, Pune, India
| | - Uddhavesh Sonavane
- Centre for Development of Advanced Computing (C-DAC), Panchavati, Pashan, Pune, India
| | - Rajendra Joshi
- Centre for Development of Advanced Computing (C-DAC), Panchavati, Pashan, Pune, India
| |
Collapse
|
3
|
Schäfer JL, Keller BG. Implementation of Girsanov Reweighting in OpenMM and Deeptime. J Phys Chem B 2024; 128:6014-6027. [PMID: 38865491 PMCID: PMC11215775 DOI: 10.1021/acs.jpcb.4c01702] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2024] [Revised: 05/22/2024] [Accepted: 05/22/2024] [Indexed: 06/14/2024]
Abstract
Classical molecular dynamics (MD) simulations provide invaluable insights into complex molecular systems but face limitations in capturing phenomena occurring on time scales beyond their reach. To bridge this gap, various enhanced sampling techniques have been developed, which are complemented by reweighting techniques to recover the unbiased dynamics. Girsanov reweighting is a reweighting technique that reweights simulation paths, generated by a stochastic MD integrator, without evoking an effective model of the dynamics. Instead, it calculates the relative path probability density at the time resolution of the MD integrator. Efficient implementation of Girsanov reweighting requires that the reweighting factors are calculated on-the-fly during the simulations and thus needs to be implemented within the MD integrator. Here, we present a comprehensive guide for implementing Girsanov reweighting into MD simulations. We demonstrate the implementation in the MD simulation package OpenMM by extending the library openmmtools. Additionally, we implemented a reweighted Markov state model estimator within the time series analysis package Deeptime.
Collapse
Affiliation(s)
- Joana-Lysiane Schäfer
- Department of Biology, Chemistry, and
Pharmacy, Freie Universität Berlin, Berlin 14195, Germany
| | - Bettina G. Keller
- Department of Biology, Chemistry, and
Pharmacy, Freie Universität Berlin, Berlin 14195, Germany
| |
Collapse
|
4
|
Wang D, Qiu Y, Beyerle ER, Huang X, Tiwary P. Information Bottleneck Approach for Markov Model Construction. J Chem Theory Comput 2024; 20:5352-5367. [PMID: 38859575 PMCID: PMC11199095 DOI: 10.1021/acs.jctc.4c00449] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/12/2024]
Abstract
Markov state models (MSMs) have proven valuable in studying the dynamics of protein conformational changes via statistical analysis of molecular dynamics simulations. In MSMs, the complex configuration space is coarse-grained into conformational states, with dynamics modeled by a series of Markovian transitions among these states at discrete lag times. Constructing the Markovian model at a specific lag time necessitates defining states that circumvent significant internal energy barriers, enabling internal dynamics relaxation within the lag time. This process effectively coarse-grains time and space, integrating out rapid motions within metastable states. Thus, MSMs possess a multiresolution nature, where the granularity of states can be adjusted according to the time-resolution, offering flexibility in capturing system dynamics. This work introduces a continuous embedding approach for molecular conformations using the state predictive information bottleneck (SPIB), a framework that unifies dimensionality reduction and state space partitioning via a continuous, machine learned basis set. Without explicit optimization of the VAMP-based scores, SPIB demonstrates state-of-the-art performance in identifying slow dynamical processes and constructing predictive multiresolution Markovian models. Through applications to well-validated mini-proteins, SPIB showcases unique advantages compared to competing methods. It autonomously and self-consistently adjusts the number of metastable states based on a specified minimal time resolution, eliminating the need for manual tuning. While maintaining efficacy in dynamical properties, SPIB excels in accurately distinguishing metastable states and capturing numerous well-populated macrostates. This contrasts with existing VAMP-based methods, which often emphasize slow dynamics at the expense of incorporating numerous sparsely populated states. Furthermore, SPIB's ability to learn a low-dimensional continuous embedding of the underlying MSMs enhances the interpretation of dynamic pathways. With these benefits, we propose SPIB as an easy-to-implement methodology for end-to-end MSM construction.
Collapse
Affiliation(s)
- Dedi Wang
- Biophysics Program and Institute for Physical Science and Technology, University of Maryland, College Park, MD 20742, United States
| | - Yunrui Qiu
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, WI 53706, United States
- Data Science Institute, University of Wisconsin-Madison, Madison, WI, 53706, United States
| | - Eric R. Beyerle
- Institute for Physical Science and Technology, University of Maryland, College Park, MD 20742, United States
| | - Xuhui Huang
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, WI 53706, United States
- Data Science Institute, University of Wisconsin-Madison, Madison, WI, 53706, United States
| | - Pratyush Tiwary
- Department of Chemistry and Biochemistry and Institute for Physical Science and Technology, University of Maryland, College Park, MD 20742, United States
- University of Maryland Institute for Health Computing, Bethesda, MD 20852, United States
| |
Collapse
|
5
|
Ghosh D, Biswas A, Radhakrishna M. Advanced computational approaches to understand protein aggregation. BIOPHYSICS REVIEWS 2024; 5:021302. [PMID: 38681860 PMCID: PMC11045254 DOI: 10.1063/5.0180691] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/11/2023] [Accepted: 03/18/2024] [Indexed: 05/01/2024]
Abstract
Protein aggregation is a widespread phenomenon implicated in debilitating diseases like Alzheimer's, Parkinson's, and cataracts, presenting complex hurdles for the field of molecular biology. In this review, we explore the evolving realm of computational methods and bioinformatics tools that have revolutionized our comprehension of protein aggregation. Beginning with a discussion of the multifaceted challenges associated with understanding this process and emphasizing the critical need for precise predictive tools, we highlight how computational techniques have become indispensable for understanding protein aggregation. We focus on molecular simulations, notably molecular dynamics (MD) simulations, spanning from atomistic to coarse-grained levels, which have emerged as pivotal tools in unraveling the complex dynamics governing protein aggregation in diseases such as cataracts, Alzheimer's, and Parkinson's. MD simulations provide microscopic insights into protein interactions and the subtleties of aggregation pathways, with advanced techniques like replica exchange molecular dynamics, Metadynamics (MetaD), and umbrella sampling enhancing our understanding by probing intricate energy landscapes and transition states. We delve into specific applications of MD simulations, elucidating the chaperone mechanism underlying cataract formation using Markov state modeling and the intricate pathways and interactions driving the toxic aggregate formation in Alzheimer's and Parkinson's disease. Transitioning we highlight how computational techniques, including bioinformatics, sequence analysis, structural data, machine learning algorithms, and artificial intelligence have become indispensable for predicting protein aggregation propensity and locating aggregation-prone regions within protein sequences. Throughout our exploration, we underscore the symbiotic relationship between computational approaches and empirical data, which has paved the way for potential therapeutic strategies against protein aggregation-related diseases. In conclusion, this review offers a comprehensive overview of advanced computational methodologies and bioinformatics tools that have catalyzed breakthroughs in unraveling the molecular basis of protein aggregation, with significant implications for clinical interventions, standing at the intersection of computational biology and experimental research.
Collapse
Affiliation(s)
- Deepshikha Ghosh
- Department of Biological Sciences and Engineering, Indian Institute of Technology (IIT) Gandhinagar, Palaj, Gujarat 382355, India
| | - Anushka Biswas
- Department of Chemical Engineering, Indian Institute of Technology (IIT) Gandhinagar, Palaj, Gujarat 382355, India
| | | |
Collapse
|
6
|
Shewani K, Madhu MK, Murarka RK. Mechanistic insights into G-protein activation via phosphorylation mediated non-canonical pathway. Biophys Chem 2024; 309:107234. [PMID: 38603989 DOI: 10.1016/j.bpc.2024.107234] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2024] [Revised: 03/21/2024] [Accepted: 04/02/2024] [Indexed: 04/13/2024]
Abstract
Activation of heterotrimeric G-proteins (Gαβγ) downstream to receptor tyrosine kinases (RTKs) is a well-established crosstalk between the signaling pathways mediated by G-protein coupled receptors (GPCRs) and RTKs. While GPCR serves as a guanine exchange factor (GEF) in the canonical activation of Gα that facilitates the exchange of GDP for GTP, the mechanism through which RTK phosphorylations induce Gα activation remains unclear. Recent experimental studies revealed that the epidermal growth factor receptor (EGFR), a well-known RTK, phosphorylates the helical domain tyrosine residues Y154 and Y155 and accelerates the GDP release from the Gαi3, a subtype of Gα-protein. Using well-tempered metadynamics and extensive unbiased molecular dynamics simulations, we captured the GDP release event and identified the intermediates between bound and unbound states through Markov state models. In addition to weakened salt bridges at the domain interface, phosphorylations induced the unfolding of helix αF, which contributed to increased flexibility near the hinge region, facilitating a greater distance between domains in the phosphorylated Gαi3. Although the larger domain separation in the phosphorylated system provided an unobstructed path for the nucleotide, the accelerated release of GDP was attributed to increased fluctuations in several conserved regions like P-loop, switch 1, and switch 2. Overall, this study provides atomistic insights into the activation of G-proteins induced by RTK phosphorylations and identifies the specific structural motifs involved in the process. The knowledge gained from the study could establish a foundation for targeting non-canonical signaling pathways and developing therapeutic strategies against the ailments associated with dysregulated G-protein signaling.
Collapse
Affiliation(s)
- Kunal Shewani
- Department of Chemistry, Indian Institute of Science Education and Research Bhopal, Bhopal Bypass Road, Bhopal 462066, MP, India
| | - Midhun K Madhu
- Department of Biological Sciences, Indian Institute of Science Education and Research Bhopal, Bhopal Bypass Road, Bhopal 462066, MP, India
| | - Rajesh K Murarka
- Department of Chemistry, Indian Institute of Science Education and Research Bhopal, Bhopal Bypass Road, Bhopal 462066, MP, India.
| |
Collapse
|
7
|
Muduli S, Karmakar S, Mishra S. Conformational Dynamics in Corynebacterium glutamicum Diaminopimelate Epimerase: Insights from Ligand Parameterization, Atomistic Simulation, and Markov State Modeling. J Chem Inf Model 2024; 64:4250-4262. [PMID: 38701175 DOI: 10.1021/acs.jcim.4c00480] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/05/2024]
Abstract
The microbial enzyme diaminopimelate epimerase (DapF), a vital enzyme in the lysine biosynthetic pathway, catalyzes the conversion of L, L-diaminopimelate (L, L-DAP) to D, L-diaminopimelate (D, L-DAP) using a catalytic cysteine dyad with one cysteine in thiol state and another in thiolate. Under oxidizing conditions, the catalytic cysteines of apo DapF form a disulfide bond that alters the structure and function of DapF. Given its potential as a target for antimicrobial resistance treatments, understanding DapF's functional dynamics is imperative. In the present work, we employ microsecond-scale all-atom molecular dynamics simulations of product-bound DapF and apo-DapF under oxidized and reduced conditions. We employ a polarized charge model for the ligand and the active site residues, which was necessary to preserve the electrostatic environment in the active site and retain the ligand in the active site. The product-bound DapF and apo-DapF in oxidized and reduced conditions exhibit a closed, semi-open, and open conformation, respectively, as identified using the internal coordinates of the dimeric enzyme and the principal component analysis. The conformational switch is guided by the dynamic catalytic (DC) loop, loop II, and loop III movements in the active site. The time scale of the close-to-open conformational transition is estimated to be 0.8 μs through Markov state modeling (MSM) and transition path theory (TPT). The present study explains the role of various active site residues and loops in ligand binding and protein dynamics in the DapF enzyme under different redox conditions. Such information will be helpful in future inhibitor design studies targeting the DapF enzyme.
Collapse
Affiliation(s)
- Sunita Muduli
- Department of Chemistry, Indian Institute of Technology Kharagpur, Kharagpur 721302, India
| | - Soumyajit Karmakar
- Department of Chemistry, Indian Institute of Technology Kharagpur, Kharagpur 721302, India
| | - Sabyashachi Mishra
- Department of Chemistry, Indian Institute of Technology Kharagpur, Kharagpur 721302, India
| |
Collapse
|
8
|
Wu Y, Cao S, Qiu Y, Huang X. Tutorial on how to build non-Markovian dynamic models from molecular dynamics simulations for studying protein conformational changes. J Chem Phys 2024; 160:121501. [PMID: 38516972 DOI: 10.1063/5.0189429] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Accepted: 02/20/2024] [Indexed: 03/23/2024] Open
Abstract
Protein conformational changes play crucial roles in their biological functions. In recent years, the Markov State Model (MSM) constructed from extensive Molecular Dynamics (MD) simulations has emerged as a powerful tool for modeling complex protein conformational changes. In MSMs, dynamics are modeled as a sequence of Markovian transitions among metastable conformational states at discrete time intervals (called lag time). A major challenge for MSMs is that the lag time must be long enough to allow transitions among states to become memoryless (or Markovian). However, this lag time is constrained by the length of individual MD simulations available to track these transitions. To address this challenge, we have recently developed Generalized Master Equation (GME)-based approaches, encoding non-Markovian dynamics using a time-dependent memory kernel. In this Tutorial, we introduce the theory behind two recently developed GME-based non-Markovian dynamic models: the quasi-Markov State Model (qMSM) and the Integrative Generalized Master Equation (IGME). We subsequently outline the procedures for constructing these models and provide a step-by-step tutorial on applying qMSM and IGME to study two peptide systems: alanine dipeptide and villin headpiece. This Tutorial is available at https://github.com/xuhuihuang/GME_tutorials. The protocols detailed in this Tutorial aim to be accessible for non-experts interested in studying the biomolecular dynamics using these non-Markovian dynamic models.
Collapse
Affiliation(s)
- Yue Wu
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Siqin Cao
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Yunrui Qiu
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Xuhui Huang
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
- Data Science Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| |
Collapse
|
9
|
Liu X, Xing J, Fu H, Shao X, Cai W. Analyzing Molecular Dynamics Trajectories Thermodynamically through Artificial Intelligence. J Chem Theory Comput 2024; 20:665-676. [PMID: 38193858 DOI: 10.1021/acs.jctc.3c00975] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2024]
Abstract
Molecular dynamics simulations produce trajectories that correspond to vast amounts of structure when exploring biochemical processes. Extracting valuable information, e.g., important intermediate states and collective variables (CVs) that describe the major movement modes, from molecular trajectories to understand the underlying mechanisms of biological processes presents a significant challenge. To achieve this goal, we introduce a deep learning approach, coined DIKI (deep identification of key intermediates), to determine low-dimensional CVs distinguishing key intermediate conformations without a-priori assumptions. DIKI dynamically plans the distribution of latent space and groups together similar conformations within the same cluster. Moreover, by incorporating two user-defined parameters, namely, coarse focus knob and fine focus knob, to help identify conformations with low free energy and differentiate the subtle distinctions among these conformations, resolution-tunable clustering was achieved. Furthermore, the integration of DIKI with a path-finding algorithm contributes to the identification of crucial intermediates along the lowest free-energy pathway. We postulate that DIKI is a robust and flexible tool that can find widespread applications in the analysis of complex biochemical processes.
Collapse
Affiliation(s)
- Xuyang Liu
- Research Center for Analytical Sciences, Tianjin Key Laboratory of Biosensing and Molecular Recognition, State Key Laboratory of Medicinal Chemical Biology, College of Chemistry, Nankai University, Tianjin 300071, China
- Haihe Laboratory of Sustainable Chemical Transformations, Tianjin 300192, China
| | - Jingya Xing
- Research Center for Analytical Sciences, Tianjin Key Laboratory of Biosensing and Molecular Recognition, State Key Laboratory of Medicinal Chemical Biology, College of Chemistry, Nankai University, Tianjin 300071, China
- Haihe Laboratory of Sustainable Chemical Transformations, Tianjin 300192, China
| | - Haohao Fu
- Research Center for Analytical Sciences, Tianjin Key Laboratory of Biosensing and Molecular Recognition, State Key Laboratory of Medicinal Chemical Biology, College of Chemistry, Nankai University, Tianjin 300071, China
- Haihe Laboratory of Sustainable Chemical Transformations, Tianjin 300192, China
| | - Xueguang Shao
- Research Center for Analytical Sciences, Tianjin Key Laboratory of Biosensing and Molecular Recognition, State Key Laboratory of Medicinal Chemical Biology, College of Chemistry, Nankai University, Tianjin 300071, China
- Haihe Laboratory of Sustainable Chemical Transformations, Tianjin 300192, China
| | - Wensheng Cai
- Research Center for Analytical Sciences, Tianjin Key Laboratory of Biosensing and Molecular Recognition, State Key Laboratory of Medicinal Chemical Biology, College of Chemistry, Nankai University, Tianjin 300071, China
- Haihe Laboratory of Sustainable Chemical Transformations, Tianjin 300192, China
| |
Collapse
|
10
|
Woods EJ, Wales DJ. Analysis and interpretation of first passage time distributions featuring rare events. Phys Chem Chem Phys 2024; 26:1640-1657. [PMID: 38059562 DOI: 10.1039/d3cp04199a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/08/2023]
Abstract
In this contribution we consider theory and associated computational tools to treat the kinetics associated with competing pathways on multifunnel energy landscapes. Multifunnel landscapes are associated with molecular switches and multifunctional materials, and are expected to exhibit multiple relaxation time scales and associated thermodynamic signatures in the heat capacity. Our focus here is on the first passage time distribution, which is encoded in a kinetic transition network containing all the locally stable states and the pathways between them. This network can be renormalised to reduce the dimensionality, while exactly conserving the mean first passage time and approximately conserving the full distribution. The structure of the reduced network can be visualised using disconnectivity graphs. We show how features in the first passage time distribution can be associated with specific kinetic traps, and how the appearance of competing relaxation time scales depends on the starting conditions. The theory is tested for two model landscapes and applied to an atomic cluster and a disordered peptide. Our most important contribution is probably the reconstruction of the full distribution for long time scales, where numerical problems prevent direct calculations. Here we combine accurate treatment of the mean first passage time with the reliable part of the distribution corresponding to faster time scales. Hence we now have a fundamental understanding of both thermodynamic and kinetic signatures of multifunnel landscapes.
Collapse
Affiliation(s)
- Esmae J Woods
- Cavendish Laboratory, Department of Physics, University of Cambridge, Cambridge CB3 0HE, UK
- Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, UK.
| | - David J Wales
- Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, UK.
| |
Collapse
|
11
|
Tian J, Dong X, Wu T, Wen P, Liu X, Zhang M, An X, Shi D. Revealing the conformational dynamics of UDP-GlcNAc recognition by O-GlcNAc transferase via Markov state model. Int J Biol Macromol 2024; 256:128405. [PMID: 38016609 DOI: 10.1016/j.ijbiomac.2023.128405] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2023] [Revised: 11/20/2023] [Accepted: 11/22/2023] [Indexed: 11/30/2023]
Abstract
The O-linked N-acetylglucosamine (O-GlcNAc) glycosylation is a critical post-translational modification and closely linked to various physiological and pathological conditions. The O-GlcNAc transferase (OGT) functions as the only glycosyltransferase of O-GlcNAc glycosylation by transferring GlcNAc from UDP-GlcNAc to serine or threonine residues on protein substrates. The interaction mode of UDP-GlcNAc against OGT has been preliminarily revealed by the crystal structures, yet an atomic-level comprehension for the conformational dynamics of the recognition process remains elusive. Here, we construct the Markov state model based on extensive all-atom molecular dynamics (MD) simulations with an aggregated simulation time of ∼9 μs, and reveal that the UDP-GlcNAc recognition process by OGT encompasses four key metastable states, occurring within an estimated timescale of ∼10 μs. During UDP-GlcNAc recognition process, we find the pyrophosphate moiety (P2O52-) initially anchors to the active pocket via salt bridge and hydrogen bonds, facilitating subsequent binding of the uridine and GlcNAc moieties. Furthermore, the functional roles of K842 involved in the salt bridge with P2O52- were evaluated through extra mutant MD simulations. Overall, our study provides valuable insights into the UDP-GlcNAc recognition mechanism by OGT, which could further aid in mechanistic studies of O-GlcNAc glycosylation and drug development targeting on OGT.
Collapse
Affiliation(s)
- Jiaqi Tian
- School of Medical Informatics and Engineering, Xuzhou Medical University, Xuzhou, Jiangsu Province, China
| | - Xin Dong
- School of Medical Informatics and Engineering, Xuzhou Medical University, Xuzhou, Jiangsu Province, China
| | - Tianshuo Wu
- School of Medical Informatics and Engineering, Xuzhou Medical University, Xuzhou, Jiangsu Province, China
| | - Pengbo Wen
- School of Medical Informatics and Engineering, Xuzhou Medical University, Xuzhou, Jiangsu Province, China
| | - Xin Liu
- School of Medical Informatics and Engineering, Xuzhou Medical University, Xuzhou, Jiangsu Province, China
| | - Mengying Zhang
- School of Medical Informatics and Engineering, Xuzhou Medical University, Xuzhou, Jiangsu Province, China
| | - Xiaoli An
- School of Chemical Engineering, Institute of Pharmaceutical Engineering Technology and Application, Sichuan University of Science & Engineering, Xueyuan Street 180, Huixing Road, Zigong 643000, Sichuan, China.
| | - Danfeng Shi
- Warshel Institute for Computational Biology, School of Life and Health Sciences, School of Medicine, The Chinese University of Hong Kong, Shenzhen 518172, Guangdong, China.
| |
Collapse
|
12
|
Mollaei P, Barati Farimani A. Unveiling Switching Function of Amino Acids in Proteins Using a Machine Learning Approach. J Chem Theory Comput 2023; 19:8472-8480. [PMID: 37933128 PMCID: PMC10688191 DOI: 10.1021/acs.jctc.3c00665] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2023] [Revised: 10/13/2023] [Accepted: 10/16/2023] [Indexed: 11/08/2023]
Abstract
Dynamics of individual amino acids play key roles in the overall properties of proteins. However, the knowledge of protein structural features at the residue level is limited due to the current resolutions of experimental and computational techniques. To address this issue, we designed a novel machine-learning (ML) framework that uses Molecular Dynamics (MD) trajectories to identify the major conformational states of individual amino acids, classify amino acids switching between two distinct modes, and evaluate their degree of dynamic stability. The Random Forest model achieved 96.94% classification accuracy in identifying switch residues within proteins. Additionally, our framework distinguishes between the stable switch (SS) residues, which remain stable in one angular state and jump once to another state during protein dynamics, and unstable switch (US) residues, which constantly fluctuate between the two angular states. This study also illustrates the correlation between the dynamics of SS residues and the protein's global properties.
Collapse
Affiliation(s)
- Parisa Mollaei
- Department
of Mechanical Engineering, Carnegie Mellon
University, 5000 Forbes Avenue, Pittsburgh, Pennsylvania 15213, United States
| | - Amir Barati Farimani
- Department
of Mechanical Engineering, Carnegie Mellon
University, 5000 Forbes Avenue, Pittsburgh, Pennsylvania 15213, United States
- Department
of Biomedical Engineering, Carnegie Mellon
University, 5000 Forbes Avenue, Pittsburgh, Pennsylvania 15213, United States
- Machine
Learning Department, Carnegie Mellon University, 5000 Forbes Avenue, Pittsburgh, Pennsylvania 15213, United States
| |
Collapse
|
13
|
Kasahara K, Masayama R, Okita K, Matubayasi N. Elucidating protein-ligand binding kinetics based on returning probability theory. J Chem Phys 2023; 159:134103. [PMID: 37787130 DOI: 10.1063/5.0165692] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2023] [Accepted: 09/14/2023] [Indexed: 10/04/2023] Open
Abstract
The returning probability (RP) theory, a rigorous diffusion-influenced reaction theory, enables us to analyze the binding process systematically in terms of thermodynamics and kinetics using molecular dynamics (MD) simulations. Recently, the theory was extended to atomistically describe binding processes by adopting the host-guest interaction energy as the reaction coordinate. The binding rate constants can be estimated by computing the thermodynamic and kinetic properties of the reactive state existing in the binding processes. Here, we propose a methodology based on the RP theory in conjunction with the energy representation theory of solution, applicable to complex binding phenomena, such as protein-ligand binding. The derived scheme of calculating the equilibrium constant between the reactive and dissociate states, required in the RP theory, can be used for arbitrary types of reactive states. We apply the present method to the bindings of small fragment molecules [4-hydroxy-2-butanone (BUT) and methyl methylthiomethyl sulphoxide (DSS)] to FK506 binding protein (FKBP) in an aqueous solution. Estimated binding rate constants are consistent with those obtained from long-timescale MD simulations. Furthermore, by decomposing the rate constants to the thermodynamic and kinetic contributions, we clarify that the higher thermodynamic stability of the reactive state for DSS causes the faster binding kinetics compared with BUT.
Collapse
Affiliation(s)
- Kento Kasahara
- Division of Chemical Engineering, Graduate School of Engineering Science, Osaka University, Toyonaka, Osaka 560-8531, Japan
| | - Ren Masayama
- Division of Chemical Engineering, Graduate School of Engineering Science, Osaka University, Toyonaka, Osaka 560-8531, Japan
| | - Kazuya Okita
- Division of Chemical Engineering, Graduate School of Engineering Science, Osaka University, Toyonaka, Osaka 560-8531, Japan
| | - Nobuyuki Matubayasi
- Division of Chemical Engineering, Graduate School of Engineering Science, Osaka University, Toyonaka, Osaka 560-8531, Japan
| |
Collapse
|
14
|
Cao S, Qiu Y, Kalin ML, Huang X. Integrative generalized master equation: A method to study long-timescale biomolecular dynamics via the integrals of memory kernels. J Chem Phys 2023; 159:134106. [PMID: 37787134 PMCID: PMC11005468 DOI: 10.1063/5.0167287] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Accepted: 09/18/2023] [Indexed: 10/04/2023] Open
Abstract
The generalized master equation (GME) provides a powerful approach to study biomolecular dynamics via non-Markovian dynamic models built from molecular dynamics (MD) simulations. Previously, we have implemented the GME, namely the quasi Markov State Model (qMSM), where we explicitly calculate the memory kernel and propagate dynamics using a discretized GME. qMSM can be constructed with much shorter MD trajectories than the MSM. However, since qMSM needs to explicitly compute the time-dependent memory kernels, it is heavily affected by the numerical fluctuations of simulation data when applied to study biomolecular conformational changes. This can lead to numerical instability of predicted long-time dynamics, greatly limiting the applicability of qMSM in complicated biomolecules. We present a new method, the Integrative GME (IGME), in which we analytically solve the GME under the condition when the memory kernels have decayed to zero. Our IGME overcomes the challenges of the qMSM by using the time integrations of memory kernels, thereby avoiding the numerical instability caused by explicit computation of time-dependent memory kernels. Using our solutions of the GME, we have developed a new approach to compute long-time dynamics based on MD simulations in a numerically stable, accurate and efficient way. To demonstrate its effectiveness, we have applied the IGME in three biomolecules: the alanine dipeptide, FIP35 WW-domain, and Taq RNA polymerase. In each system, the IGME achieves significantly smaller fluctuations for both memory kernels and long-time dynamics compared to the qMSM. We anticipate that the IGME can be widely applied to investigate biomolecular conformational changes.
Collapse
Affiliation(s)
- Siqin Cao
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Yunrui Qiu
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Michael L. Kalin
- Biophysics Graduate Program, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Xuhui Huang
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| |
Collapse
|
15
|
Liu B, Xue M, Qiu Y, Konovalov KA, O’Connor MS, Huang X. GraphVAMPnets for uncovering slow collective variables of self-assembly dynamics. J Chem Phys 2023; 159:094901. [PMID: 37655771 PMCID: PMC11005469 DOI: 10.1063/5.0158903] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Accepted: 08/11/2023] [Indexed: 09/02/2023] Open
Abstract
Uncovering slow collective variables (CVs) of self-assembly dynamics is important to elucidate its numerous kinetic assembly pathways and drive the design of novel structures for advanced materials through the bottom-up approach. However, identifying the CVs for self-assembly presents several challenges. First, self-assembly systems often consist of identical monomers, and the feature representations should be invariant to permutations and rotational symmetries. Physical coordinates, such as aggregate size, lack high-resolution detail, while common geometric coordinates like pairwise distances are hindered by the permutation and rotational symmetry challenges. Second, self-assembly is usually a downhill process, and the trajectories often suffer from insufficient sampling of backward transitions that correspond to the dissociation of self-assembled structures. Popular dimensionality reduction methods, such as time-structure independent component analysis, impose detailed balance constraints, potentially obscuring the true dynamics of self-assembly. In this work, we employ GraphVAMPnets, which combines graph neural networks with a variational approach for Markovian process (VAMP) theory to identify the slow CVs of the self-assembly processes. First, GraphVAMPnets bears the advantages of graph neural networks, in which the graph embeddings can represent self-assembly structures in high-resolution while being invariant to permutations and rotational symmetries. Second, it is built upon VAMP theory, which studies Markov processes without forcing detailed balance constraints, which addresses the out-of-equilibrium challenge in the self-assembly process. We demonstrate GraphVAMPnets for identifying slow CVs of self-assembly kinetics in two systems: the aggregation of two hydrophobic molecules and the self-assembly of patchy particles. We expect that our GraphVAMPnets can be widely applied to molecular self-assembly.
Collapse
Affiliation(s)
- Bojun Liu
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Mingyi Xue
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Yunrui Qiu
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Kirill A. Konovalov
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Michael S. O’Connor
- Biophysics Graduate Program, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Xuhui Huang
- Author to whom correspondence should be addressed:
| |
Collapse
|
16
|
Kozlowski N, Grubmüller H. Uncertainties in Markov State Models of Small Proteins. J Chem Theory Comput 2023; 19:5516-5524. [PMID: 37540193 PMCID: PMC10448719 DOI: 10.1021/acs.jctc.3c00372] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2023] [Indexed: 08/05/2023]
Abstract
Markov state models are widely used to describe and analyze protein dynamics based on molecular dynamics simulations, specifically to extract functionally relevant characteristic time scales and motions. Particularly for larger biomolecules such as proteins, however, insufficient sampling is a notorious concern and often the source of large uncertainties that are difficult to quantify. Furthermore, there are several other sources of uncertainty, such as choice of the number of Markov states and lag time, choice and parameters of dimension reduction preprocessing step, and uncertainty due to the limited number of observed transitions; the latter is often estimated via a Bayesian approach. Here, we quantified and ranked all of these uncertainties for four small globular test proteins. We found that the largest uncertainty is due to insufficient sampling and initially increases with the total trajectory length T up to a critical tipping point, after which it decreases as 1 / T , thus providing guidelines for how much sampling is required for given accuracy. We also found that single long trajectories yielded better sampling accuracy than many shorter trajectories starting from the same structure. In comparison, the remaining sources of the above uncertainties are generally smaller by a factor of about 5, rendering them less of a concern but certainly not negligible. Importantly, the Bayes uncertainty, commonly used as the only uncertainty estimate, captures only a relatively small part of the true uncertainty, which is thus often drastically underestimated.
Collapse
Affiliation(s)
- Nicolai Kozlowski
- Department of Theoretical and Computational
Biophysics, Max-Planck-Institute for Multidisciplinary
Sciences, Göttingen 37077, Germany
| | - Helmut Grubmüller
- Department of Theoretical and Computational
Biophysics, Max-Planck-Institute for Multidisciplinary
Sciences, Göttingen 37077, Germany
| |
Collapse
|
17
|
Nagel D, Sartore S, Stock G. Toward a Benchmark for Markov State Models: The Folding of HP35. J Phys Chem Lett 2023; 14:6956-6967. [PMID: 37504674 DOI: 10.1021/acs.jpclett.3c01561] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]
Abstract
Adopting a 300 μs long MD trajectory of the folding of villin headpiece (HP35) by D. E. Shaw Research, we recently constructed a Markov state model (MSM) based on inter-residue contacts. The model reproduces the folding time and predicts that the native basin and unfolded region consist of metastable substates that are structurally well-characterized. Recognizing the need to establish well-defined benchmark problems, we study to what extent and in what sense this MSM can be employed as a reference model. Hence, we test the robustness of the MSM by comparing it to models that use alternative combinations of features, dimensionality reduction methods, and clustering schemes. The study suggests some main characteristics of the folding of HP35 that should be reproduced by other competitive models. Moreover, the discussion reveals which parts of the MSM workflow matter most for the considered problem and illustrates the promises and pitfalls of state-based models for the interpretation of biomolecular simulations.
Collapse
Affiliation(s)
- Daniel Nagel
- Biomolecular Dynamics, Institute of Physics, University of Freiburg, 79104 Freiburg, Germany
| | - Sofia Sartore
- Biomolecular Dynamics, Institute of Physics, University of Freiburg, 79104 Freiburg, Germany
| | - Gerhard Stock
- Biomolecular Dynamics, Institute of Physics, University of Freiburg, 79104 Freiburg, Germany
| |
Collapse
|
18
|
Voelz VA, Pande VS, Bowman GR. Folding@home: Achievements from over 20 years of citizen science herald the exascale era. Biophys J 2023; 122:2852-2863. [PMID: 36945779 PMCID: PMC10398258 DOI: 10.1016/j.bpj.2023.03.028] [Citation(s) in RCA: 18] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2022] [Revised: 01/26/2023] [Accepted: 03/16/2023] [Indexed: 03/23/2023] Open
Abstract
Simulations of biomolecules have enormous potential to inform our understanding of biology but require extremely demanding calculations. For over 20 years, the Folding@home distributed computing project has pioneered a massively parallel approach to biomolecular simulation, harnessing the resources of citizen scientists across the globe. Here, we summarize the scientific and technical advances this perspective has enabled. As the project's name implies, the early years of Folding@home focused on driving advances in our understanding of protein folding by developing statistical methods for capturing long-timescale processes and facilitating insight into complex dynamical processes. Success laid a foundation for broadening the scope of Folding@home to address other functionally relevant conformational changes, such as receptor signaling, enzyme dynamics, and ligand binding. Continued algorithmic advances, hardware developments such as graphics processing unit (GPU)-based computing, and the growing scale of Folding@home have enabled the project to focus on new areas where massively parallel sampling can be impactful. While previous work sought to expand toward larger proteins with slower conformational changes, new work focuses on large-scale comparative studies of different protein sequences and chemical compounds to better understand biology and inform the development of small-molecule drugs. Progress on these fronts enabled the community to pivot quickly in response to the COVID-19 pandemic, expanding to become the world's first exascale computer and deploying this massive resource to provide insight into the inner workings of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) virus and aid the development of new antivirals. This success provides a glimpse of what is to come as exascale supercomputers come online and as Folding@home continues its work.
Collapse
Affiliation(s)
- Vincent A Voelz
- Department of Chemistry, Temple University, Philadelphia, Pennsylvania
| | | | - Gregory R Bowman
- Departments of Biochemistry & Biophysics and of Bioengineering, University of Pennsylvania, Philadelphia, Pennsylvania.
| |
Collapse
|
19
|
Qiu Y, O’Connor MS, Xue M, Liu B, Huang X. An Efficient Path Classification Algorithm Based on Variational Autoencoder to Identify Metastable Path Channels for Complex Conformational Changes. J Chem Theory Comput 2023; 19:4728-4742. [PMID: 37382437 PMCID: PMC11042546 DOI: 10.1021/acs.jctc.3c00318] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/30/2023]
Abstract
Conformational changes (i.e., dynamic transitions between pairs of conformational states) play important roles in many chemical and biological processes. Constructing the Markov state model (MSM) from extensive molecular dynamics (MD) simulations is an effective approach to dissect the mechanism of conformational changes. When combined with transition path theory (TPT), MSM can be applied to elucidate the ensemble of kinetic pathways connecting pairs of conformational states. However, the application of TPT to analyze complex conformational changes often results in a vast number of kinetic pathways with comparable fluxes. This obstacle is particularly pronounced in heterogeneous self-assembly and aggregation processes. The large number of kinetic pathways makes it challenging to comprehend the molecular mechanisms underlying conformational changes of interest. To address this challenge, we have developed a path classification algorithm named latent-space path clustering (LPC) that efficiently lumps parallel kinetic pathways into distinct metastable path channels, making them easier to comprehend. In our algorithm, MD conformations are first projected onto a low-dimensional space containing a small set of collective variables (CVs) by time-structure-based independent component analysis (tICA) with kinetic mapping. Then, MSM and TPT are constructed to obtain the ensemble of pathways, and a deep learning architecture named the variational autoencoder (VAE) is used to learn the spatial distributions of kinetic pathways in the continuous CV space. Based on the trained VAE model, the TPT-generated ensemble of kinetic pathways can be embedded into a latent space, where the classification becomes clear. We show that LPC can efficiently and accurately identify the metastable path channels in three systems: a 2D potential, the aggregation of two hydrophobic particles in water, and the folding of the Fip35 WW domain. Using the 2D potential, we further demonstrate that our LPC algorithm outperforms the previous path-lumping algorithms by making substantially fewer incorrect assignments of individual pathways to four path channels. We expect that LPC can be widely applied to identify the dominant kinetic pathways underlying complex conformational changes.
Collapse
Affiliation(s)
- Yunrui Qiu
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Michael S. O’Connor
- Biophysics Graduate Program, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Mingyi Xue
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Bojun Liu
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Xuhui Huang
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, WI, 53706, USA
- Biophysics Graduate Program, University of Wisconsin-Madison, Madison, WI, 53706, USA
| |
Collapse
|
20
|
Yang J, Singer SJ. A New Approach for Estimating the Free Energy Differences among Multiple Thermodynamic States in Statistical Simulations. J Phys Chem Lett 2023; 14:5127-5133. [PMID: 37249593 PMCID: PMC10493164 DOI: 10.1021/acs.jpclett.3c00620] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
In this letter, a new approach to compute free energy differences (FEDs) between multiple thermodynamics states is introduced. The method directly uses energy probability densities, which can be extracted with high accuracy from equilibrium simulations to obtain FEDs. Methods in current use, such as Bennett acceptance ratio (BAR), its multistate generalization (MBAR), or the weighted histogram analysis method (WHAM), require iterative solution of nonlinear equations which are known to be slowly convergent. The equations providing MBAR FEDs are identical to those derived earlier by Souaille and Roux in a method that has become known informally as "binless WHAM". In contrast, we obtain FEDs by solution of linear equations. For the classic two-state problem, the statistical error of our method, solving linear equations, is shown analytically to match that of BAR under common conditions.
Collapse
Affiliation(s)
- Jaehoon Yang
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, Ohio 43210, United States
| | - Sherwin J Singer
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, Ohio 43210, United States
| |
Collapse
|
21
|
Bhat V, Callaway CP, Risko C. Computational Approaches for Organic Semiconductors: From Chemical and Physical Understanding to Predicting New Materials. Chem Rev 2023. [PMID: 37141497 DOI: 10.1021/acs.chemrev.2c00704] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/06/2023]
Abstract
While a complete understanding of organic semiconductor (OSC) design principles remains elusive, computational methods─ranging from techniques based in classical and quantum mechanics to more recent data-enabled models─can complement experimental observations and provide deep physicochemical insights into OSC structure-processing-property relationships, offering new capabilities for in silico OSC discovery and design. In this Review, we trace the evolution of these computational methods and their application to OSCs, beginning with early quantum-chemical methods to investigate resonance in benzene and building to recent machine-learning (ML) techniques and their application to ever more sophisticated OSC scientific and engineering challenges. Along the way, we highlight the limitations of the methods and how sophisticated physical and mathematical frameworks have been created to overcome those limitations. We illustrate applications of these methods to a range of specific challenges in OSCs derived from π-conjugated polymers and molecules, including predicting charge-carrier transport, modeling chain conformations and bulk morphology, estimating thermomechanical properties, and describing phonons and thermal transport, to name a few. Through these examples, we demonstrate how advances in computational methods accelerate the deployment of OSCsin wide-ranging technologies, such as organic photovoltaics (OPVs), organic light-emitting diodes (OLEDs), organic thermoelectrics, organic batteries, and organic (bio)sensors. We conclude by providing an outlook for the future development of computational techniques to discover and assess the properties of high-performing OSCs with greater accuracy.
Collapse
Affiliation(s)
- Vinayak Bhat
- Department of Chemistry & Center for Applied Energy Research, University of Kentucky, Lexington, Kentucky 40506-0055, United States
| | - Connor P Callaway
- Department of Chemistry & Center for Applied Energy Research, University of Kentucky, Lexington, Kentucky 40506-0055, United States
| | - Chad Risko
- Department of Chemistry & Center for Applied Energy Research, University of Kentucky, Lexington, Kentucky 40506-0055, United States
| |
Collapse
|
22
|
Dominic AJ, Cao S, Montoya-Castillo A, Huang X. Memory Unlocks the Future of Biomolecular Dynamics: Transformative Tools to Uncover Physical Insights Accurately and Efficiently. J Am Chem Soc 2023; 145:9916-9927. [PMID: 37104720 DOI: 10.1021/jacs.3c01095] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/29/2023]
Abstract
Conformational changes underpin function and encode complex biomolecular mechanisms. Gaining atomic-level detail of how such changes occur has the potential to reveal these mechanisms and is of critical importance in identifying drug targets, facilitating rational drug design, and enabling bioengineering applications. While the past two decades have brought Markov state model techniques to the point where practitioners can regularly use them to glimpse the long-time dynamics of slow conformations in complex systems, many systems are still beyond their reach. In this Perspective, we discuss how including memory (i.e., non-Markovian effects) can reduce the computational cost to predict the long-time dynamics in these complex systems by orders of magnitude and with greater accuracy and resolution than state-of-the-art Markov state models. We illustrate how memory lies at the heart of successful and promising techniques, ranging from the Fokker-Planck and generalized Langevin equations to deep-learning recurrent neural networks and generalized master equations. We delineate how these techniques work, identify insights that they can offer in biomolecular systems, and discuss their advantages and disadvantages in practical settings. We show how generalized master equations can enable the investigation of, for example, the gate-opening process in RNA polymerase II and demonstrate how our recent advances tame the deleterious influence of statistical underconvergence of the molecular dynamics simulations used to parameterize these techniques. This represents a significant leap forward that will enable our memory-based techniques to interrogate systems that are currently beyond the reach of even the best Markov state models. We conclude by discussing some current challenges and future prospects for how exploiting memory will open the door to many exciting opportunities.
Collapse
Affiliation(s)
- Anthony J Dominic
- Department of Chemistry, University of Colorado Boulder, Boulder, Colorado 80309, USA
| | - Siqin Cao
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | | | - Xuhui Huang
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| |
Collapse
|
23
|
Jiang H, Li H, Wong WH, Fan X. Revealing Free Energy Landscape From MD Data via Conditional Angle Partition Tree. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:1384-1394. [PMID: 35503836 DOI: 10.1109/tcbb.2022.3172352] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Deciphering the free energy landscape of biomolecular structure space is crucial for understanding many complex molecular processes, such as protein-protein interaction, RNA folding, and protein folding. A major source of current dynamic structure data is Molecular Dynamics (MD) simulations. Several methods have been proposed to investigate the free energy landscape from MD data, but all of them rely on the assumption that kinetic similarity is associated with global geometric similarity, which may lead to unsatisfactory results. In this paper, we proposed a new method called Conditional Angle Partition Tree to reveal the hierarchical free energy landscape by correlating local geometric similarity with kinetic similarity. Its application on the benchmark alanine dipeptide MD data showed a much better performance than existing methods in exploring and understanding the free energy landscape. We also applied it to the MD data of Villin HP35. Our results are more reasonable on various aspects than those from other methods and very informative on the hierarchical structure of its energy landscape.
Collapse
|
24
|
Ojha AA, Thakur S, Ahn SH, Amaro RE. DeepWEST: Deep Learning of Kinetic Models with the Weighted Ensemble Simulation Toolkit for Enhanced Sampling. J Chem Theory Comput 2023; 19:1342-1359. [PMID: 36719802 DOI: 10.1021/acs.jctc.2c00282] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Recent advances in computational power and algorithms have enabled molecular dynamics (MD) simulations to reach greater time scales. However, for observing conformational transitions associated with biomolecular processes, MD simulations still have limitations. Several enhanced sampling techniques seek to address this challenge, including the weighted ensemble (WE) method, which samples transitions between metastable states using many weighted trajectories to estimate kinetic rate constants. However, initial sampling of the potential energy surface has a significant impact on the performance of WE, i.e., convergence and efficiency. We therefore introduce deep-learned kinetic modeling approaches that extract statistically relevant information from short MD trajectories to provide a well-sampled initial state distribution for WE simulations. This hybrid approach overcomes any statistical bias to the system, as it runs short unbiased MD trajectories and identifies meaningful metastable states of the system. It is shown to provide a more refined free energy landscape closer to the steady state that could efficiently sample kinetic properties such as rate constants.
Collapse
Affiliation(s)
- Anupam Anand Ojha
- Department of Chemistry, University of California San Diego, La Jolla, California92093, United States
| | - Saumya Thakur
- Department of Chemistry, Indian Institute of Technology Bombay, Mumbai, Maharashtra400076, India
| | - Surl-Hee Ahn
- Department of Chemical Engineering, University of California Davis, Davis, California95616, United States
| | - Rommie E Amaro
- Department of Chemistry, University of California San Diego, La Jolla, California92093, United States
| |
Collapse
|
25
|
Galama MM, Wu H, Krämer A, Sadeghi M, Noé F. Stochastic Approximation to MBAR and TRAM: Batchwise Free Energy Estimation. J Chem Theory Comput 2023; 19:758-766. [PMID: 36689637 DOI: 10.1021/acs.jctc.2c00976] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
The dynamics of molecules are governed by rare event transitions between long-lived (metastable) states. To explore these transitions efficiently, many enhanced sampling protocols have been introduced that involve using simulations with biases or changed temperatures. Two established statistically optimal estimators for obtaining unbiased equilibrium properties from such simulations are the multistate Bennett acceptance ratio (MBAR) and the transition-based reweighting analysis method (TRAM). Both MBAR and TRAM are solved iteratively and can suffer from long convergence times. Here, we introduce stochastic approximators (SA) for both estimators, resulting in SAMBAR and SATRAM, which are shown to converge faster than their deterministic counterparts, without significant accuracy loss. Both methods are demonstrated on different molecular systems.
Collapse
Affiliation(s)
- Maaike M Galama
- Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 12, 14195Berlin, Germany
| | - Hao Wu
- School of Mathematical Sciences, Institute of Natural Sciences, and MOE-LSC, Shanghai Jiao Tong University, 200240Shanghai, China.,School of Mathematical Sciences, Tongji University, 200092Shanghai, China
| | - Andreas Krämer
- Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 12, 14195Berlin, Germany
| | - Mohsen Sadeghi
- Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 12, 14195Berlin, Germany
| | - Frank Noé
- Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 12, 14195Berlin, Germany.,Microsoft Research AI4Science, Karl Liebknecht Str 32, 10178Berlin, Germany.,Department of Physics, Freie Universität Berlin, Arnimallee 14, 14195Berlin, Germany.,Department of Chemistry, Rice University, 6100 Main St., Houston, Texas77005-1827, United States
| |
Collapse
|
26
|
Hong X, Song K, Rahman MU, Wei T, Zhang Y, Da LT, Chen HF. Phosphorylation Regulation Mechanism of β2 Integrin for the Binding of Filamin Revealed by Markov State Model. J Chem Inf Model 2023; 63:605-618. [PMID: 36607244 DOI: 10.1021/acs.jcim.2c01177] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
Leukocyte adhesion deficiency-1 (LAD-1) disorder is a severe immunodeficiency syndrome caused by deficiency or mutation of β2 integrin. The phosphorylation on threonine 758 of β2 integrin acts as a molecular switch inhibiting the binding of filamin. However, the switch mechanism of site-specific phosphorylation at the atom level is still poorly understood. To resolve the regulation mechanism, all-atom molecular dynamics simulation and Markov state model were used to study the dynamic regulation pathway of phosphorylation. Wild type system possessed lower binding free energy and fewer number of states than the phosphorylated system. Both systems underwent local disorder-to-order conformation conversion when achieving steady states. To reach steady states, wild type adopted less number of transition paths/shortest path according to the transition path theory than the phosphorylated system. The underlying phosphorylated regulation pathway was from P1 to P0 and then P4 state, and the main driving force should be hydrogen bond and hydrophobic interaction disturbing the secondary structure of phosphorylated states. These studies will shed light on the pathogenesis of LAD-1 disease and lay a foundation for drug development.
Collapse
Affiliation(s)
- Xiaokun Hong
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Center for Systems Biomedicine, Shanghai Jiao Tong University, Shanghai200240, China
| | - Kaiyuan Song
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Center for Systems Biomedicine, Shanghai Jiao Tong University, Shanghai200240, China
| | - Mueed Ur Rahman
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Center for Systems Biomedicine, Shanghai Jiao Tong University, Shanghai200240, China
| | - Ting Wei
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Center for Systems Biomedicine, Shanghai Jiao Tong University, Shanghai200240, China
| | - Yan Zhang
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Center for Systems Biomedicine, Shanghai Jiao Tong University, Shanghai200240, China
| | - Lin-Tai Da
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Center for Systems Biomedicine, Shanghai Jiao Tong University, Shanghai200240, China
| | - Hai-Feng Chen
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Center for Systems Biomedicine, Shanghai Jiao Tong University, Shanghai200240, China
- Shanghai Center for Bioinformation Technology, Shanghai200240, China
| |
Collapse
|
27
|
Koulgi S, Achalere A, Sonavane U, Joshi R. Markov State Modeling Analysis Captures Changes in the Temperature-Sensitive N-Terminal and β-Turn Regions of the p53 DNA-Binding Domain. J Chem Inf Model 2022; 62:6449-6461. [PMID: 35614540 DOI: 10.1021/acs.jcim.2c00380] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
The transcription factor p53 is one of the most widely studied cancer proteins. Its temperature-sensitive nature suggests reduction in functionality at physiological temperatures. Temperature-induced conformational variations and their impact on its functional ability still remain unexplored. A total of 20.8 μs molecular dynamics simulations of wildtype p53 in the apo and the DNA-bound states have been performed at 300 K and 310 K. Further, Markov State Modeling (MSM) analyses were performed, considering Cα-Cα distances as reaction coordinates. Filtering of these distances based on correlation with the time-independent components (tICs) resulted in 16 and 32 distances for apo and DNA-bound systems, respectively. Individual MSM analyses using these filtered distances were performed for both p53 systems. These Cα-Cα residue pairs belonged to the N-terminal, S6/7 β-turn, loop L2, loop L3, and hydrophobic core residues. At physiological temperatures, apo-p53 exhibits exposure of its hydrophobic core, where the temperature-sensitive hotspot residues were also located. This exposure was the result of the S6/7 β-turn and N-terminal moving apart. In the DNA-bound p53 system, loop L1 attains an open conformation at physiological temperatures, which weakens the DNA binding. It is already known that p53 mutants that lack DNA binding also tend to show similar conformational variations. The S6/7 β-turn along with the already known functionally important loop L2 may pose as regions to be targeted to overcome the loss in binding of temperature-sensitive wildtype p53. Rescue strategies directed toward these temperature-sensitive regions may be useful to recuperate its strong binding at physiological temperatures.
Collapse
Affiliation(s)
- Shruti Koulgi
- High Performance Computing - Medical and Bioinformatics Applications Group, Centre for Development for Advanced Computing (C-DAC), Panchawati, Pashan, Pune 411 008, India
| | - Archana Achalere
- High Performance Computing - Medical and Bioinformatics Applications Group, Centre for Development for Advanced Computing (C-DAC), Panchawati, Pashan, Pune 411 008, India
| | - Uddhavesh Sonavane
- High Performance Computing - Medical and Bioinformatics Applications Group, Centre for Development for Advanced Computing (C-DAC), Panchawati, Pashan, Pune 411 008, India
| | - Rajendra Joshi
- High Performance Computing - Medical and Bioinformatics Applications Group, Centre for Development for Advanced Computing (C-DAC), Panchawati, Pashan, Pune 411 008, India
| |
Collapse
|
28
|
Unarta IC, Goonetilleke EC, Wang D, Huang X. Nucleotide addition and cleavage by RNA polymerase II: Coordination of two catalytic reactions using a single active site. J Biol Chem 2022; 299:102844. [PMID: 36581202 PMCID: PMC9860460 DOI: 10.1016/j.jbc.2022.102844] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2022] [Revised: 12/19/2022] [Accepted: 12/22/2022] [Indexed: 12/28/2022] Open
Abstract
RNA polymerase II (Pol II) incorporates complementary ribonucleotides into the growing RNA chain one at a time via the nucleotide addition cycle. The nucleotide addition cycle, however, is prone to misincorporation of noncomplementary nucleotides. Thus, to ensure transcriptional fidelity, Pol II backtracks and then cleaves the misincorporated nucleotides. These two reverse reactions, nucleotide addition and cleavage, are catalyzed in the same active site of Pol II, which is different from DNA polymerases or other endonucleases. Recently, substantial progress has been made to understand how Pol II effectively performs its dual role in the same active site. Our review highlights these recent studies and provides an overall model of the catalytic mechanisms of Pol II. In particular, RNA extension follows the two-metal-ion mechanism, and several Pol II residues play important roles to facilitate the catalysis. In sharp contrast, the cleavage reaction is independent of any Pol II residues. Interestingly, Pol II relies on its residues to recognize the misincorporated nucleotides during the backtracking process, prior to cleavage. In this way, Pol II efficiently compartmentalizes its two distinct catalytic functions using the same active site. Lastly, we also discuss a new perspective on the potential third Mg2+ in the nucleotide addition and intrinsic cleavage reactions.
Collapse
Affiliation(s)
- Ilona Christy Unarta
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Eshani C Goonetilleke
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Dong Wang
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, California, USA; Department of Cellular and Molecular Medicine, School of Medicine, University of California, San Diego, La Jolla, California, USA; Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, California, USA.
| | - Xuhui Huang
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin, USA.
| |
Collapse
|
29
|
Donati L, Weber M. Assessing transition rates as functions of environmental variables. J Chem Phys 2022; 157:224103. [PMID: 36546809 DOI: 10.1063/5.0109555] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
We present a method to estimate the transition rates of molecular systems under different environmental conditions that cause the formation or the breaking of bonds and require the sampling of the Grand Canonical Ensemble. For this purpose, we model the molecular system in terms of probable "scenarios," governed by different potential energy functions, which are separately sampled by classical MD simulations. Reweighting the canonical distribution of each scenario according to specific environmental variables, we estimate the grand canonical distribution, then use the Square Root Approximation method to discretize the Fokker-Planck operator into a rate matrix and the robust Perron Cluster Cluster Analysis method to coarse-grain the kinetic model. This permits efficiently estimating the transition rates of conformational states as functions of environmental variables, for example, the local pH at a cell membrane. In this work, we formalize the theoretical framework of the procedure, and we present a numerical experiment comparing the results with those provided by a constant-pH method based on non-equilibrium Molecular Dynamics Monte Carlo simulations. The method is relevant for the development of new drug design strategies that take into account how the cellular environment influences biochemical processes.
Collapse
Affiliation(s)
- Luca Donati
- Zuse Institute Berlin, Takustr. 7, D-14195 Berlin, Germany
| | - Marcus Weber
- Zuse Institute Berlin, Takustr. 7, D-14195 Berlin, Germany
| |
Collapse
|
30
|
Gong S, He X, Meng Q, Ma Z, Shao B, Wang T, Liu TY. Stochastic Lag Time Parameterization for Markov State Models of Protein Dynamics. J Phys Chem B 2022; 126:9465-9475. [PMID: 36345778 DOI: 10.1021/acs.jpcb.2c03711] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Markov state models (MSMs) play a key role in studying protein conformational dynamics. A sliding count window with a fixed lag time is widely used to sample sub-trajectories for transition counting and MSM construction. However, sub-trajectories sampled with a fixed lag time may not perform well under different selections of lag time, which requires strong prior practice and leads to less robust estimation. To alleviate it, we propose a novel stochastic method from a Poisson process to generate perturbative lag time for sub-trajectory sampling and utilize it to construct a Markov chain. Comprehensive evaluations on the double-well system, WW domain, BPTI, and RBD-ACE2 complex of SARS-CoV-2 reveal that our algorithm significantly increases the robustness and power of a constructed MSM without disturbing the Markovian properties. Furthermore, the superiority of our algorithm is amplified for slow dynamic modes in complex biological processes.
Collapse
Affiliation(s)
- Shiqi Gong
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Zhongguancun East Road, Beijing100190, China.,University of Chinese Academy of Sciences, No. 19 Yuquan Road, Beijing100049, China.,Microsoft Research AI4Science, Beijing100080, China
| | - Xinheng He
- University of Chinese Academy of Sciences, No. 19 Yuquan Road, Beijing100049, China.,Microsoft Research AI4Science, Beijing100080, China.,The CAS Key Laboratory of Receptor Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai201203, China
| | - Qi Meng
- Microsoft Research AI4Science, Beijing100080, China
| | - Zhiming Ma
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Zhongguancun East Road, Beijing100190, China.,University of Chinese Academy of Sciences, No. 19 Yuquan Road, Beijing100049, China
| | - Bin Shao
- Microsoft Research AI4Science, Beijing100080, China
| | - Tong Wang
- Microsoft Research AI4Science, Beijing100080, China
| | - Tie-Yan Liu
- Microsoft Research AI4Science, Beijing100080, China
| |
Collapse
|
31
|
Xi K, Zhu L. Automated Path Searching Reveals the Mechanism of Hydrolysis Enhancement by T4 Lysozyme Mutants. Int J Mol Sci 2022; 23:ijms232314628. [PMID: 36498954 PMCID: PMC9736071 DOI: 10.3390/ijms232314628] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2022] [Revised: 11/16/2022] [Accepted: 11/19/2022] [Indexed: 11/25/2022] Open
Abstract
Bacteriophage T4 lysozyme (T4L) is a glycosidase that is widely applied as a natural antimicrobial agent in the food industry. Due to its wide applications and small size, T4L has been regarded as a model system for understanding protein dynamics and for large-scale protein engineering. Through structural insights from the single conformation of T4L, a series of mutations (L99A,G113A,R119P) have been introduced, which have successfully raised the fractional population of its only hydrolysis-competent excited state to 96%. However, the actual impact of these substitutions on its dynamics remains unclear, largely due to the lack of highly efficient sampling algorithms. Here, using our recently developed travelling-salesman-based automated path searching (TAPS), we located the minimum-free-energy path (MFEP) for the transition of three T4L mutants from their ground states to their excited states. All three mutants share a three-step transition: the flipping of F114, the rearrangement of α0/α1 helices, and final refinement. Remarkably, the MFEP revealed that the effects of the mutations are drastically beyond the expectations of their original design: (a) the G113A substitution not only enhances helicity but also fills the hydrophobic Cavity I and reduces the free energy barrier for flipping F114; (b) R119P barely changes the stability of the ground state but stabilizes the excited state through rarely reported polar contacts S117OG:N132ND2, E11OE1:R145NH1, and E11OE2:Q105NE2; (c) the residue W138 flips into Cavity I and further stabilizes the excited state for the triple mutant L99A,G113A,R119P. These novel insights that were unexpected in the original mutant design indicated the necessity of incorporating path searching into the workflow of rational protein engineering.
Collapse
|
32
|
Mardt A, Hempel T, Clementi C, Noé F. Deep learning to decompose macromolecules into independent Markovian domains. Nat Commun 2022; 13:7101. [PMID: 36402768 PMCID: PMC9675806 DOI: 10.1038/s41467-022-34603-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2022] [Accepted: 10/27/2022] [Indexed: 11/21/2022] Open
Abstract
The increasing interest in modeling the dynamics of ever larger proteins has revealed a fundamental problem with models that describe the molecular system as being in a global configuration state. This notion limits our ability to gather sufficient statistics of state probabilities or state-to-state transitions because for large molecular systems the number of metastable states grows exponentially with size. In this manuscript, we approach this challenge by introducing a method that combines our recent progress on independent Markov decomposition (IMD) with VAMPnets, a deep learning approach to Markov modeling. We establish a training objective that quantifies how well a given decomposition of the molecular system into independent subdomains with Markovian dynamics approximates the overall dynamics. By constructing an end-to-end learning framework, the decomposition into such subdomains and their individual Markov state models are simultaneously learned, providing a data-efficient and easily interpretable summary of the complex system dynamics. While learning the dynamical coupling between Markovian subdomains is still an open issue, the present results are a significant step towards learning Ising models of large molecular complexes from simulation data.
Collapse
Affiliation(s)
- Andreas Mardt
- grid.14095.390000 0000 9116 4836Freie Universität Berlin, Department of Mathematics and Computer Science, Berlin, Germany
| | - Tim Hempel
- grid.14095.390000 0000 9116 4836Freie Universität Berlin, Department of Mathematics and Computer Science, Berlin, Germany ,grid.14095.390000 0000 9116 4836Freie Universität Berlin, Department of Physics, Berlin, Germany
| | - Cecilia Clementi
- grid.14095.390000 0000 9116 4836Freie Universität Berlin, Department of Physics, Berlin, Germany ,grid.21940.3e0000 0004 1936 8278Rice University, Department of Chemistry, Houston, TX USA ,grid.509984.90000 0004 5907 3802Rice University, Center for Theoretical Biological Physics, Houston, TX USA
| | - Frank Noé
- grid.14095.390000 0000 9116 4836Freie Universität Berlin, Department of Mathematics and Computer Science, Berlin, Germany ,grid.14095.390000 0000 9116 4836Freie Universität Berlin, Department of Physics, Berlin, Germany ,grid.21940.3e0000 0004 1936 8278Rice University, Department of Chemistry, Houston, TX USA ,Microsoft Research AI4Science, Berlin, Germany
| |
Collapse
|
33
|
Bone RA, Sharpe DJ, Wales DJ, Green JR. Stochastic paths controlling speed and dissipation. Phys Rev E 2022; 106:054151. [PMID: 36559408 DOI: 10.1103/physreve.106.054151] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2021] [Accepted: 10/28/2022] [Indexed: 11/24/2022]
Abstract
Natural processes occur in a finite amount of time and dissipate energy, entropy, and matter. Near equilibrium, thermodynamic intuition suggests that fast irreversible processes will dissipate more energy and entropy than slow quasistatic processes connecting the same initial and final states. For small systems, recently discovered thermodynamic speed limits suggest that faster processes will dissipate more than slower processes. Here, we test the hypothesis that this relationship between speed and dissipation holds for stochastic paths far from equilibrium. To analyze stochastic paths on finite timescales, we derive an exact expression for the path probabilities of continuous-time Markov chains from the path summation solution to the master equation. We present a minimal model for a driven system in which relative energies of the initial and target states control the speed, and the nonequilibrium currents of a cycle control the dissipation. Although the hypothesis holds near equilibrium, we find that faster processes can dissipate less under far-from-equilibrium conditions because of strong currents. This model serves as a minimal prototype for designing kinetics to sculpt the nonequilibrium path space so that faster paths produce less dissipation.
Collapse
Affiliation(s)
- Rebecca A Bone
- Department of Chemistry, University of Massachusetts Boston, Boston, Massachusetts 02125, USA
| | - Daniel J Sharpe
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, Cambridge, United Kingdom
| | - David J Wales
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, Cambridge, United Kingdom
| | - Jason R Green
- Department of Chemistry, University of Massachusetts Boston, Boston, Massachusetts 02125, USA.,Department of Physics, University of Massachusetts Boston, Boston, Massachusetts 02125, USA
| |
Collapse
|
34
|
Zhang H, Chu G, Wang G, Yao M, Lu S, Chen T. Mechanistic Understanding of the Palmitoylation of G o Protein in the Allosteric Regulation of Adhesion Receptor GPR97. Pharmaceutics 2022; 14:pharmaceutics14091856. [PMID: 36145604 PMCID: PMC9504338 DOI: 10.3390/pharmaceutics14091856] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2022] [Revised: 08/24/2022] [Accepted: 08/30/2022] [Indexed: 11/21/2022] Open
Abstract
Adhesion G-protein-coupled receptors (aGPCRs)—a major family of GPCRs—play critical roles in the regulation of tissue development and cancer progression. The orphan receptor GPR97, activated by glucocorticoid stress hormones, is a prototypical aGPCR. Although it has been established that the palmitoylation of the C-terminal Go protein is essential for Go’s efficient engagement with the active GPR97, the detailed allosteric mechanism remains to be clarified. Hence, we performed extensive large-scale molecular dynamics (MD) simulations of the GPR97−Go complex in the presence or absence of Go palmitoylation. The conformational landscapes analyzed by Markov state models revealed that the overall conformation of GPR97 is preferred to be fully active when interacting with palmitoylated Go protein. Structural and energetic analyses indicated that the palmitoylation of Go can allosterically stabilize the critical residues in the ligand-binding pocket of GPR97 and increase the affinity of the ligand for GPR97. Furthermore, the community network analysis suggests that the palmitoylation of Go not only allosterically strengthens the internal interactions between Gαo and Gβγ, but also enhances the coupling between Go and GPR97. Our study provides mechanistic insights into the regulation of aGPCRs via post-translational modifications of the Go protein, and offers guidance for future drug design of aGPCRs.
Collapse
Affiliation(s)
- Hao Zhang
- Department of Plastic and Reconstructive Surgery, Shanghai Ninth People’s Hospital, Shanghai Jiao Tong University, School of Medicine, Shanghai 200011, China
| | - Guojun Chu
- Department of Cardiology, Changzheng Hospital, Naval Medical University, Shanghai 200003, China
| | - Gaoming Wang
- Medicinal Chemistry and Bioinformatics Center, Shanghai Jiao Tong University, School of Medicine, Shanghai 200025, China
| | - Min Yao
- Department of Plastic and Reconstructive Surgery, Shanghai Ninth People’s Hospital, Shanghai Jiao Tong University, School of Medicine, Shanghai 200011, China
- Correspondence: (M.Y.); (S.L.); (T.C.)
| | - Shaoyong Lu
- Medicinal Chemistry and Bioinformatics Center, Shanghai Jiao Tong University, School of Medicine, Shanghai 200025, China
- Correspondence: (M.Y.); (S.L.); (T.C.)
| | - Ting Chen
- Department of Cardiology, Changzheng Hospital, Naval Medical University, Shanghai 200003, China
- Correspondence: (M.Y.); (S.L.); (T.C.)
| |
Collapse
|
35
|
Zhang H, Ni D, Fan J, Li M, Zhang J, Hua C, Nussinov R, Lu S. Markov State Models and Molecular Dynamics Simulations Reveal the Conformational Transition of the Intrinsically Disordered Hypervariable Region of K-Ras4B to the Ordered Conformation. J Chem Inf Model 2022; 62:4222-4231. [PMID: 35994329 DOI: 10.1021/acs.jcim.2c00591] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
K-Ras4B, the most frequently mutated Ras isoform in human tumors, plays a vital part in cell growth, differentiation, and survival. Its tail, the C-terminal hypervariable region (HVR), is involved in anchoring K-Ras4B at the cellular plasma membrane and in isoform-specific protein-protein interactions and signaling. In the inactive guanosine diphosphate-bound state, the intrinsically disordered HVR interacts with the catalytic domain at the effector-binding region, rendering K-Ras4B in its autoinhibited state. Activation releases the HVR from the catalytic domain, with its ensemble favoring an ordered α-helical structure. The large-scale conformational transition of the HVR from the intrinsically disordered to the ordered conformation remains poorly understood. Here, we deploy a computational scheme that integrates a transition path-generation algorithm, extensive molecular dynamics simulation, and Markov state model analysis to investigate the conformational landscape of the HVR transition pathway. Our findings reveal a stepwise pathway for the HVR transition and uncover several key conformational substates along the transition pathway. Importantly, key interactions between the HVR and the catalytic domain are unraveled, highlighting the pathogenesis of K-Ras4B mild mutations in several congenital developmental anomaly syndromes. Together, these findings provide a deeper understanding of the HVR transition mechanism and the regulation of K-Ras4B activity at an atomic level.
Collapse
Affiliation(s)
- Hao Zhang
- Department of Plastic and Reconstructive Surgery, Shanghai Ninth People's Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai 200011, China
| | - Duan Ni
- The Charles Perkins Centre, University of Sydney, Sydney, New South Wales 2006, Australia
| | - Jigang Fan
- Department of Pathophysiology, Key Laboratory of Cell Differentiation and Apoptosis of Chinese Ministry of Education, School of Medicine, Shanghai Jiao Tong University, Shanghai 200025, China
| | - Minyu Li
- Department of Pathophysiology, Key Laboratory of Cell Differentiation and Apoptosis of Chinese Ministry of Education, School of Medicine, Shanghai Jiao Tong University, Shanghai 200025, China
| | - Jian Zhang
- Department of Pathophysiology, Key Laboratory of Cell Differentiation and Apoptosis of Chinese Ministry of Education, School of Medicine, Shanghai Jiao Tong University, Shanghai 200025, China
| | - Chen Hua
- Department of Plastic and Reconstructive Surgery, Shanghai Ninth People's Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai 200011, China
| | - Ruth Nussinov
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research, Cancer Innovation Laboratory, National Cancer Institute, Frederick, Maryland 21702, United States.,Department of Human Genetics and Molecular Medicine, Sackler School of Medicine, Sackler Institute of Molecular Medicine, Tel Aviv University, Tel Aviv 69978, Israel
| | - Shaoyong Lu
- Department of Pathophysiology, Key Laboratory of Cell Differentiation and Apoptosis of Chinese Ministry of Education, School of Medicine, Shanghai Jiao Tong University, Shanghai 200025, China.,Medicinal Chemistry and Bioinformatics Centre, School of Medicine, Shanghai Jiao Tong University, Shanghai 200025, China
| |
Collapse
|
36
|
Christoforou E, Leontiadou H, Noé F, Samios J, Emiris IZ, Cournia Z. Investigating the Bioactive Conformation of Angiotensin II Using Markov State Modeling Revisited with Web-Scale Clustering. J Chem Theory Comput 2022; 18:5636-5648. [PMID: 35944098 DOI: 10.1021/acs.jctc.1c00881] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Molecular dynamics simulation is a powerful technique for studying the structure and dynamics of biomolecules in atomic-level detail by sampling their various conformations in real time. Because of the long timescales that need to be sampled to study biomolecular processes and the big and complex nature of the corresponding data, relevant analyses of important biophysical phenomena are challenging. Clustering and Markov state models (MSMs) are efficient computational techniques that can be used to extract dominant conformational states and to connect those with kinetic information. In this work, we perform Molecular Dynamics simulations to investigate the free energy landscape of Angiotensin II (AngII) in order to unravel its bioactive conformations using different clustering techniques and Markov state modeling. AngII is an octapeptide hormone, which binds to the AT1 transmembrane receptor, and plays a vital role in the regulation of blood pressure, conservation of total blood volume, and salt homeostasis. To mimic the water-membrane interface as AngII approaches the AT1 receptor and to compare our findings with available experimental results, the simulations were performed in water as well as in water-ethanol mixtures. Our results show that in the water-ethanol environment, AngII adopts more compact U-shaped (folded) conformations than in water, which resembles its structure when bound to the AT1 receptor. For clustering of the conformations, we validate the efficiency of an inverted-quantized k-means algorithm, as a fast approximate clustering technique for web-scale data (millions of points into thousands or millions of clusters) compared to k-means, on data from trajectories of molecular dynamics simulations with reasonable trade-offs between time and accuracy. Finally, we extract MSMs using various clustering techniques for the generation of microstates and macrostates and for the selection of the macrostate representatives.
Collapse
Affiliation(s)
- Emmanouil Christoforou
- ITMB, Department of Informatics & Telecommunications, National and Kapodistrian University of Athens, Athens 15772, Greece.,Biomedical Research Foundation, Academy of Athens, 4 Soranou Ephessiou, Athens 11527, Greece
| | - Hari Leontiadou
- Biomedical Research Foundation, Academy of Athens, 4 Soranou Ephessiou, Athens 11527, Greece
| | - Frank Noé
- Fachbereich Mathematik und Informatik, Freie Universität Berlin, Arnimallee 6, Berlin 14195, Germany
| | - Jannis Samios
- Department of Chemistry, Laboratory of Physical Chemistry, National & Kapodistrian University of Athens, Athens 15772, Greece
| | - Ioannis Z Emiris
- ITMB, Department of Informatics & Telecommunications, National and Kapodistrian University of Athens, Athens 15772, Greece.,Athena Research Center, Marousi 15125, Greece
| | - Zoe Cournia
- ITMB, Department of Informatics & Telecommunications, National and Kapodistrian University of Athens, Athens 15772, Greece.,Biomedical Research Foundation, Academy of Athens, 4 Soranou Ephessiou, Athens 11527, Greece
| |
Collapse
|
37
|
Oide M, Sugita Y. Protein Folding Intermediates on the Dimensionality Reduced Landscape with UMAP and Native Contact Likelihood. J Chem Phys 2022; 157:075101. [DOI: 10.1063/5.0099094] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
To understand protein folding mechanisms from molecular dynamics (MD) simulations, it is important to explore not only folded/unfolded states but also representative intermediate structures on the conformational landscape. Here, we propose a novel approach to construct the landscape using the uniform manifold approximation and projection (UMAP) method, which reduces the dimensionality without losing data-point proximity. In the approach, native contact likelihood is used as feature variables rather than the conventional Cartesian coordinates or dihedral angles of protein structures. We tested the performance of UMAP for coarse-grained MD simulation trajectories of B1 domain in protein G and observed on-pathway transient structures and other metastable states on the UMAP conformational landscape. In contrast, these structures were not clearly distinguished on the dimensionality reduced landscape using principal component analysis (PCA) or time-lagged independent component analysis (tICA). This approach is also useful to obtain dynamical information through Markov State Modeling and would be applicable to large-scale conformational changes in many other biomacromolecules.
Collapse
Affiliation(s)
| | - Yuji Sugita
- Theoretical Molecular Science Laboratory, RIKEN, Japan
| |
Collapse
|
38
|
Abstract
Multifunctional systems, such as molecular switches, exhibit multifunnel energy landscapes associated with the alternative functional states. In this contribution the multifunnel organization is decoded from dynamical signatures in the first passage time distribution between reactants and products. Characteristic relaxation rates are revealed by analyzing the kinetics as a function of the observation time scale, which scans the underlying distribution. Extracting the corresponding dynamical signatures provides direct insight into the organization of the molecular energy landscape, which will facilitate a rational design of target functionality. Examples are illustrated for multifunnel landscapes in biomolecular systems and an atomic cluster.
Collapse
|
39
|
Koehl P, Orland H. Sampling constrained stochastic trajectories using Brownian bridges. J Chem Phys 2022; 157:054105. [DOI: 10.1063/5.0102295] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
We present a new method to sample conditioned trajectories of a system evolving under Langevin dynamics, based on Brownian bridges. <p>The trajectories are conditioned to end at a certain point (or in a certain region) in space.</p> <p>The bridge equations can be recast exactly in the form of a non linear stochastic integro-differential equation.</p> <p>This equation can be very well approximated when the trajectories are closely bundled together in space, i.e. at low temperature, or for transition paths. The approximate equation can be solved iteratively, using a fixed point method.</p> <p>We discuss how to choose the initial trajectories and show some examples of the performance of this method on some simple problems.</p> <p>The method allows to generate conditioned trajectories with a high accuracy.
Collapse
Affiliation(s)
- Patrice Koehl
- Computer Science and Genome Center, University of California Davis, United States of America
| | - Henri Orland
- Institut de Physique Theorique, CEA, Saclay, France
| |
Collapse
|
40
|
Kamberaj H. Random walks in a free energy landscape combining augmented molecular dynamics simulations with a dynamic graph neural network model. J Mol Graph Model 2022; 114:108199. [DOI: 10.1016/j.jmgm.2022.108199] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2021] [Revised: 04/09/2022] [Accepted: 04/11/2022] [Indexed: 10/18/2022]
|
41
|
Yang X, Lu ZY. Nanoparticle cluster formation mechanisms elucidated via Markov state modeling: Attraction range effects, aggregation pathways, and counterintuitive transition rates. J Chem Phys 2022; 156:214902. [DOI: 10.1063/5.0086110] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Nanoparticle clusters are promising candidates for developing functional materials. However, it is still a challenging task to fabricate them in a predictable and controllable way, which requires investigation of the possible mechanisms underlying cluster formation at the nanoscale. By constructing Markov state models (MSMs) at the microstate level, we find that for highly dispersed particles to form a highly aggregated cluster, there are multiple coexisting pathways, which correspond to direct aggregation, or pathways that need to pass through partially aggregated, intermediate states. Varying the range of attraction between nanoparticles is found to significantly affect pathways. As the attraction range becomes narrower, compared to direct aggregation, some pathways that need to pass through partially aggregated intermediate states become more competitive. In addition, from MSMs constructed at the macrostate level, the aggregation rate is found to be counterintuitively lower with a lower free-energy barrier, which is also discussed.
Collapse
Affiliation(s)
- Xi Yang
- Institute of Theoretical Chemistry, State Key Laboratory of Supramolecular Structure and Materials, Jilin University, Changchun 130021, China
| | - Zhong-Yuan Lu
- Institute of Theoretical Chemistry, State Key Laboratory of Supramolecular Structure and Materials, Jilin University, Changchun 130021, China
| |
Collapse
|
42
|
Lan J, He X, Ren Y, Wang Z, Zhou H, Fan S, Zhu C, Liu D, Shao B, Liu TY, Wang Q, Zhang L, Ge J, Wang T, Wang X. Structural insights into the SARS-CoV-2 Omicron RBD-ACE2 interaction. Cell Res 2022; 32:593-595. [PMID: 35418218 DOI: 10.1101/2022.01.03.474855] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2022] [Accepted: 03/03/2022] [Indexed: 05/25/2023] Open
Abstract
ABSTRACTSince SARS-CoV-2 Omicron variant (B.1.1.529) was reported in November 2021, it has quickly spread to many countries and outcompeted the globally dominant Delta variant in several countries. The Omicron variant contains the largest number of mutations to date, with 32 mutations located at spike (S) glycoprotein, which raised great concern for its enhanced viral fitness and immune escape[1–4]. In this study, we reported the crystal structure of the receptor binding domain (RBD) of Omicron variant S glycoprotein bound to human ACE2 at a resolution of 2.6 Å. Structural comparison, molecular dynamics simulation and binding free energy calculation collectively identified four key mutations (S477N, G496S, Q498R and N501Y) for the enhanced binding of ACE2 by the Omicron RBD compared to the WT RBD. Representative states of the WT and Omicron RBD-ACE2 systems were identified by Markov State Model, which provides a dynamic explanation for the enhanced binding of Omicron RBD. The effects of the mutations in the RBD for antibody recognition were analyzed, especially for the S371L/S373P/S375F substitutions significantly changing the local conformation of the residing loop to deactivate several class IV neutralizing antibodies.
Collapse
Affiliation(s)
- Jun Lan
- The Ministry of Education Key Laboratory of Protein Science, Beijing Advanced Innovation Center for Structural Biology, Beijing Frontier Research Center for Biological Structure, Collaborative Innovation Center for Biotherapy, School of Life Sciences, Tsinghua University, Beijing, China
| | | | - Yifei Ren
- The Ministry of Education Key Laboratory of Protein Science, Beijing Advanced Innovation Center for Structural Biology, Beijing Frontier Research Center for Biological Structure, Collaborative Innovation Center for Biotherapy, School of Life Sciences, Tsinghua University, Beijing, China
| | - Ziyi Wang
- The Ministry of Education Key Laboratory of Protein Science, Beijing Advanced Innovation Center for Structural Biology, Beijing Frontier Research Center for Biological Structure, Collaborative Innovation Center for Biotherapy, School of Life Sciences, Tsinghua University, Beijing, China
| | - Huan Zhou
- Shanghai Synchrotron Radiation Facility, Shanghai Advanced Research Institute, Chinese Academy of Sciences, Shanghai, China
| | - Shilong Fan
- The Ministry of Education Key Laboratory of Protein Science, Beijing Advanced Innovation Center for Structural Biology, Beijing Frontier Research Center for Biological Structure, Collaborative Innovation Center for Biotherapy, School of Life Sciences, Tsinghua University, Beijing, China
| | - Chenyou Zhu
- Department of Chemistry, Tsinghua University, Beijing, China
| | - Dongsheng Liu
- Department of Chemistry, Tsinghua University, Beijing, China
| | - Bin Shao
- Microsoft Research Asia, Beijing, China
| | | | - Qisheng Wang
- Shanghai Synchrotron Radiation Facility, Shanghai Advanced Research Institute, Chinese Academy of Sciences, Shanghai, China
| | - Linqi Zhang
- Center for Global Health and Infectious Diseases, Comprehensive AIDS Research Center, and Beijing Advanced Innovation Center for Structural Biology, School of Medicine, Tsinghua University, Beijing, China.
| | - Jiwan Ge
- The Ministry of Education Key Laboratory of Protein Science, Beijing Advanced Innovation Center for Structural Biology, Beijing Frontier Research Center for Biological Structure, Collaborative Innovation Center for Biotherapy, School of Life Sciences, Tsinghua University, Beijing, China.
| | - Tong Wang
- Microsoft Research Asia, Beijing, China.
| | - Xinquan Wang
- The Ministry of Education Key Laboratory of Protein Science, Beijing Advanced Innovation Center for Structural Biology, Beijing Frontier Research Center for Biological Structure, Collaborative Innovation Center for Biotherapy, School of Life Sciences, Tsinghua University, Beijing, China.
| |
Collapse
|
43
|
Cong X, Zhang X, Liang X, He X, Tang Y, Zheng X, Lu S, Zhang J, Chen T. Delineating the conformational landscape and intrinsic properties of the angiotensin II type 2 receptor using a computational study. Comput Struct Biotechnol J 2022; 20:2268-2279. [PMID: 35615027 PMCID: PMC9117689 DOI: 10.1016/j.csbj.2022.05.012] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2022] [Revised: 05/04/2022] [Accepted: 05/06/2022] [Indexed: 12/22/2022] Open
Abstract
As a key regulator for the renin-angiotensin system, a class A G protein-coupled receptor (GPCR), AngII type 2 receptor (AT2R), plays a pivotal role in the homeostasis of the cardiovascular system. Compared with other GPCRs, AT2R has a unique antagonist-bound conformation and its mechanism is still an enigma. Here, we applied combined dynamic and evolutional approaches to investigate the conformational space and intrinsic properties of AT2R. With molecular dynamic simulations, Markov State Models, and statistics coupled analysis, we captured the conformational landscape of AT2R and identified its uniquity from both dynamical and evolutional viewpoints. A cryptic pocket was also discovered in the intermediate state during conformation transitions. These findings offer a deeper understanding of the AT2R mechanism at an atomic level and provide hints for the design of novel AT2R modulators.
Collapse
Affiliation(s)
- Xiaoliang Cong
- Department of Cardiology, Shanghai Changzheng Hospital, the Second Affiliated Hospital of Naval Medical University, Shanghai 200003, China
| | - Xiaogang Zhang
- Department of Cardiology, Shanghai University of Medicine & Health Sciences Affiliated Zhoupu Hospital, Shanghai 201318, China
| | - Xin Liang
- Department of Cardiology, Shanghai Changzheng Hospital, the Second Affiliated Hospital of Naval Medical University, Shanghai 200003, China
| | - Xinheng He
- Medicinal Chemistry and Bioinformatics Centre, Shanghai Jiao Tong University, School of Medicine, Shanghai 200025, China
| | - Yehua Tang
- Department of Cardiology, Shanghai Changzheng Hospital, the Second Affiliated Hospital of Naval Medical University, Shanghai 200003, China
| | - Xing Zheng
- Department of Cardiology, Changhai Hospital, Naval Medical University, Shanghai 200433, China
| | - Shaoyong Lu
- Medicinal Chemistry and Bioinformatics Centre, Shanghai Jiao Tong University, School of Medicine, Shanghai 200025, China
- Corresponding authors.
| | - Jiayou Zhang
- Department of Cardiology, Shanghai Changzheng Hospital, the Second Affiliated Hospital of Naval Medical University, Shanghai 200003, China
- Corresponding authors.
| | - Ting Chen
- Department of Cardiology, Shanghai Changzheng Hospital, the Second Affiliated Hospital of Naval Medical University, Shanghai 200003, China
- Corresponding authors.
| |
Collapse
|
44
|
Xu H, Song K, Da LT. Dynamics of peptide loading into major histocompatibility complex class I molecules chaperoned by TAPBPR. Phys Chem Chem Phys 2022; 24:12397-12409. [PMID: 35575131 DOI: 10.1039/d2cp00423b] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Major histocompatibility complex class I (MHC-I) molecules display antigenic peptides on the cell surface for T cell receptor scanning, thereby activating the immune response. Peptide loading into MHC-I molecules is thus a critical step during the antigen presentation process. Chaperone TAP-binding protein related (TAPBPR) plays a critical role in promoting high-affinity peptide loading into MHC-I, by discriminating against the low-affinity ones. However, the complete peptide loading dynamics into TAPBPR-bound MHC-I is still elusive. Here, we constructed kinetic network models based on hundreds of short-time MD simulations with an aggregated simulation time of ∼21.7 μs, and revealed, at atomic level, four key intermediate states of one antigenic peptide derived from melanoma-associated MART-1/Melan-A protein during its loading process into TAPBPR-bound MHC-I. We find that the TAPBPR binding at the MHC-I pocket-F can substantially reshape the distant pocket-B via allosteric regulations, which in turn promotes the following peptide N-terminal loading. Intriguingly, the partially loaded peptide could profoundly weaken the TAPBPR-MHC stability, promoting the dissociation of the TAPBPR scoop-loop (SL) region from the pocket-F to a more solvent-exposed conformation. Structural inspections further indicate that the peptide loading could remotely affect the SL binding site through both allosteric perturbations and direct contacts. In addition, another structural motif of TAPBPR, the jack hairpin region, was also found to participate in mediating the peptide editing. Our study sheds light on the detailed molecular mechanisms underlying the peptide loading process into TAPBPR-bound MHC-I and pinpoints the key structural factors responsible for dictating the peptide-loading dynamics.
Collapse
Affiliation(s)
- Honglin Xu
- Key Laboratory of Systems Biomedicine (Ministry of Education), Shanghai Center for Systems Biomedicine, Shanghai Jiao Tong University, 800 Dongchuan Road, Shanghai 200240, China.
| | - Kaiyuan Song
- Key Laboratory of Systems Biomedicine (Ministry of Education), Shanghai Center for Systems Biomedicine, Shanghai Jiao Tong University, 800 Dongchuan Road, Shanghai 200240, China.
| | - Lin-Tai Da
- Key Laboratory of Systems Biomedicine (Ministry of Education), Shanghai Center for Systems Biomedicine, Shanghai Jiao Tong University, 800 Dongchuan Road, Shanghai 200240, China.
| |
Collapse
|
45
|
Ghorbani M, Prasad S, Klauda JB, Brooks BR. GraphVAMPNet, using graph neural networks and variational approach to Markov processes for dynamical modeling of biomolecules. J Chem Phys 2022; 156:184103. [PMID: 35568532 PMCID: PMC9094994 DOI: 10.1063/5.0085607] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2022] [Accepted: 04/22/2022] [Indexed: 11/14/2022] Open
Abstract
Finding a low dimensional representation of data from long-timescale trajectories of biomolecular processes, such as protein folding or ligand-receptor binding, is of fundamental importance, and kinetic models, such as Markov modeling, have proven useful in describing the kinetics of these systems. Recently, an unsupervised machine learning technique called VAMPNet was introduced to learn the low dimensional representation and the linear dynamical model in an end-to-end manner. VAMPNet is based on the variational approach for Markov processes and relies on neural networks to learn the coarse-grained dynamics. In this paper, we combine VAMPNet and graph neural networks to generate an end-to-end framework to efficiently learn high-level dynamics and metastable states from the long-timescale molecular dynamics trajectories. This method bears the advantages of graph representation learning and uses graph message passing operations to generate an embedding for each datapoint, which is used in the VAMPNet to generate a coarse-grained dynamical model. This type of molecular representation results in a higher resolution and a more interpretable Markov model than the standard VAMPNet, enabling a more detailed kinetic study of the biomolecular processes. Our GraphVAMPNet approach is also enhanced with an attention mechanism to find the important residues for classification into different metastable states.
Collapse
Affiliation(s)
| | - Samarjeet Prasad
- Laboratory of Computational Biology, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, Maryland 20824, USA
| | - Jeffery B. Klauda
- Department of Chemical and Biomolecular Engineering, University of Maryland, College Park, Maryland 20742, USA
| | - Bernard R. Brooks
- Laboratory of Computational Biology, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, Maryland 20824, USA
| |
Collapse
|
46
|
Rahman MU, Song K, Da LT, Chen HF. Early aggregation mechanism of Aβ 16-22 revealed by Markov state models. Int J Biol Macromol 2022; 204:606-616. [PMID: 35134456 DOI: 10.1016/j.ijbiomac.2022.02.001] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Revised: 01/24/2022] [Accepted: 02/01/2022] [Indexed: 12/19/2022]
Abstract
Aβ16-22 is believed to have critical role in early aggregation of full length amyloids that are associated with the Alzheimer's disease and can aggregate to form amyloid fibrils. However, the early aggregation mechanism is still unsolved. Here, multiple long-term molecular dynamics simulations combining with Markov state model were used to probe the early oligomerization mechanism of Aβ16-22 peptides. The identified dimeric form adopted either globular random-coil or extended β-strand like conformations. The observed dimers of these variants shared many overall conformational characteristics but differed in several aspects at detailed level. In all cases, the most common type of secondary structure was intermolecular antiparallel β-sheets. The inter-state transitions were very frequent ranges from few to hundred nanoseconds. More strikingly, those states which contain fraction of β secondary structure and significant amount of extended coiled structures, therefore exposed to the solvent, were majorly participated in aggregation. The assembly of low-energy dimers, in which the peptides form antiparallel β sheets, occurred by multiple pathways with the formation of an obligatory intermediates. We proposed that these states might facilitate the Aβ16-22 aggregation through a significant component of the conformational selection mechanism, because they might increase the aggregates population by promoting the inter-chain hydrophobic and the hydrogen bond contacts. The formation of early stage antiparallel β sheet structures is critical for oligomerization, and at the same time provided a flat geometry to seed the ordered β-strand packing of the fibrils. Our findings hint at reorganization of this part of the molecule as a potentially critical step in Aβ aggregation and will insight into early oligomerization for large β amyloids.
Collapse
Affiliation(s)
- Mueed Ur Rahman
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Kaiyuan Song
- Key Laboratory of System Biomedicine (Ministry of Education), Shanghai Center for Systems Biomedicine, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Lin-Tai Da
- Key Laboratory of System Biomedicine (Ministry of Education), Shanghai Center for Systems Biomedicine, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Hai-Feng Chen
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China; Shanghai Center for Bioinformation Technology, Shanghai, 200235, China.
| |
Collapse
|
47
|
Gu H, Wang W, Cao S, Unarta IC, Yao Y, Sheong FK, Huang X. RPnet: a reverse-projection-based neural network for coarse-graining metastable conformational states for protein dynamics. Phys Chem Chem Phys 2022; 24:1462-1474. [PMID: 34985469 DOI: 10.1039/d1cp03622j] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
The Markov State Model (MSM) is a powerful tool for modeling long timescale dynamics based on numerous short molecular dynamics (MD) simulation trajectories, which makes it a useful tool for elucidating the conformational changes of biological macromolecules. By partitioning the phase space into discretized states and estimating the probabilities of inter-state transitions based on short MD trajectories, one can construct a kinetic network model that could be used to extrapolate long-timescale kinetics if the Markovian condition is met. However, meeting the Markovian condition often requires hundreds or even thousands of states (microstates), which greatly hinders the comprehension of the conformational dynamics of complex biomolecules. Kinetic lumping algorithms can coarse grain numerous microstates into a handful of metastable states (macrostates), which would greatly facilitate the elucidation of biological mechanisms. In this work, we have developed a reverse-projection-based neural network (RPnet) to lump microstates into macrostates, by making use of a physics-based loss function that is based on the projection operator framework of conformational dynamics. By recognizing that microstate and macrostate transition modes can be related through a projection process, we have developed a reverse-projection scheme to directly compare the microstate and macrostate dynamics. Based on this reverse-projection scheme, we designed a loss function that allows the effective assessment of the quality of a given kinetic lumping. We then make use of a neural network to efficiently minimize this loss function to obtain an optimized set of macrostates. We have demonstrated the power of our RPnet in analyzing the dynamics of a numerical 2D potential, alanine dipeptide, and the clamp opening of an RNA polymerase. In all these systems, we have illustrated that our method could yield comparable or better results than competing methods in terms of state partitioning and reproduction of slow dynamics. We expect that our RPnet holds promise in analyzing the conformational dynamics of biological macromolecules.
Collapse
Affiliation(s)
- Hanlin Gu
- Department of Mathematics, Hong Kong University of Science and Technology, Kowloon, Hong Kong
| | - Wei Wang
- Department of Chemistry, Hong Kong University of Science and Technology, Kowloon, Hong Kong.
| | - Siqin Cao
- Department of Chemistry, Hong Kong University of Science and Technology, Kowloon, Hong Kong.
| | - Ilona Christy Unarta
- Department of Chemical and Biological Engineering, Hong Kong University of Science and Technology, Kowloon, Hong Kong
| | - Yuan Yao
- Department of Mathematics, Hong Kong University of Science and Technology, Kowloon, Hong Kong
| | - Fu Kit Sheong
- Department of Chemistry, Hong Kong University of Science and Technology, Kowloon, Hong Kong. .,Institute for Advanced Study, Hong Kong University of Science and Technology, Kowloon, Hong Kong
| | - Xuhui Huang
- Department of Chemistry, Hong Kong University of Science and Technology, Kowloon, Hong Kong. .,Department of Chemical and Biological Engineering, Hong Kong University of Science and Technology, Kowloon, Hong Kong
| |
Collapse
|
48
|
Mardt A, Noé F. Progress in deep Markov state modeling: Coarse graining and experimental data restraints. J Chem Phys 2021; 155:214106. [PMID: 34879670 DOI: 10.1063/5.0064668] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Recent advances in deep learning frameworks have established valuable tools for analyzing the long-timescale behavior of complex systems, such as proteins. In particular, the inclusion of physical constraints, e.g., time-reversibility, was a crucial step to make the methods applicable to biophysical systems. Furthermore, we advance the method by incorporating experimental observables into the model estimation showing that biases in simulation data can be compensated for. We further develop a new neural network layer in order to build a hierarchical model allowing for different levels of details to be studied. Finally, we propose an attention mechanism, which highlights important residues for the classification into different states. We demonstrate the new methodology on an ultralong molecular dynamics simulation of the Villin headpiece miniprotein.
Collapse
Affiliation(s)
- Andreas Mardt
- Department of Mathematics and Computer Science, Freie Universität Berlin, Berlin, Germany
| | - Frank Noé
- Department of Mathematics and Computer Science, Freie Universität Berlin, Berlin, Germany
| |
Collapse
|
49
|
Zhu L, Jiang H, Cao S, Unarta IC, Gao X, Huang X. Critical role of backbone coordination in the mRNA recognition by RNA induced silencing complex. Commun Biol 2021; 4:1345. [PMID: 34848812 PMCID: PMC8632932 DOI: 10.1038/s42003-021-02822-7] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2021] [Accepted: 10/26/2021] [Indexed: 01/02/2023] Open
Abstract
Despite its functional importance, the molecular mechanism underlying target mRNA recognition by Argonaute (Ago) remains largely elusive. Based on extensive all-atom molecular dynamics simulations, we constructed quasi-Markov State Model (qMSM) to reveal the dynamics during recognition at position 6-7 in the seed region of human Argonaute 2 (hAgo2). Interestingly, we found that the slowest mode of motion therein is not the gRNA-target base-pairing, but the coordination of the target phosphate groups with a set of positively charged residues of hAgo2. Moreover, the ability of Helix-7 to approach the PIWI and MID domains was found to reduce the effective volume accessible to the target mRNA and therefore facilitate both the backbone coordination and base-pair formation. Further mutant simulations revealed that alanine mutation of the D358 residue on Helix-7 enhanced a trap state to slow down the loading of target mRNA. Similar trap state was also observed when wobble pairs were introduced in g6 and g7, indicating the role of Helix-7 in suppressing non-canonical base-paring. Our study pointed to a general mechanism for mRNA recognition by eukaryotic Agos and demonstrated the promise of qMSM in investigating complex conformational changes of biomolecular systems.
Collapse
Affiliation(s)
- Lizhe Zhu
- Warshel Institute for Computational Biology, School of Life and Health Sciences, The Chinese University of Hong Kong (Shenzhen), Shenzhen, Guangdong, 518172, China
- Department of Chemistry, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
| | - Hanlun Jiang
- Department of Chemical and Biological Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
- Department of Biochemistry, Institute for Protein Design, University of Washington, Seattle, WA, 98195, USA
| | - Siqin Cao
- Department of Chemistry, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
- Center of Systems Biology and Human Health, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
| | - Ilona Christy Unarta
- Department of Chemical and Biological Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
- Center of Systems Biology and Human Health, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
| | - Xin Gao
- Computational Bioscience Research Center, Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia.
| | - Xuhui Huang
- Department of Chemistry, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong.
- Department of Chemical and Biological Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong.
- Center of Systems Biology and Human Health, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong.
| |
Collapse
|
50
|
Ghorbani M, Prasad S, Klauda JB, Brooks BR. Variational embedding of protein folding simulations using Gaussian mixture variational autoencoders. J Chem Phys 2021; 155:194108. [PMID: 34800961 DOI: 10.1063/5.0069708] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Conformational sampling of biomolecules using molecular dynamics simulations often produces a large amount of high dimensional data that makes it difficult to interpret using conventional analysis techniques. Dimensionality reduction methods are thus required to extract useful and relevant information. Here, we devise a machine learning method, Gaussian mixture variational autoencoder (GMVAE), that can simultaneously perform dimensionality reduction and clustering of biomolecular conformations in an unsupervised way. We show that GMVAE can learn a reduced representation of the free energy landscape of protein folding with highly separated clusters that correspond to the metastable states during folding. Since GMVAE uses a mixture of Gaussians as its prior, it can directly acknowledge the multi-basin nature of the protein folding free energy landscape. To make the model end-to-end differentiable, we use a Gumbel-softmax distribution. We test the model on three long-timescale protein folding trajectories and show that GMVAE embedding resembles the folding funnel with folded states down the funnel and unfolded states outside the funnel path. Additionally, we show that the latent space of GMVAE can be used for kinetic analysis and Markov state models built on this embedding produce folding and unfolding timescales that are in close agreement with other rigorous dynamical embeddings such as time independent component analysis.
Collapse
Affiliation(s)
- Mahdi Ghorbani
- Laboratory of Computational Biology, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, Maryland 20824, USA
| | - Samarjeet Prasad
- Laboratory of Computational Biology, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, Maryland 20824, USA
| | - Jeffery B Klauda
- Department of Chemical and Biomolecular Engineering, University of Maryland, College Park, Maryland 20742, USA
| | - Bernard R Brooks
- Laboratory of Computational Biology, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, Maryland 20824, USA
| |
Collapse
|