1
|
Harunari PE, Dal Cengio S, Lecomte V, Polettini M. Mutual Linearity of Nonequilibrium Network Currents. PHYSICAL REVIEW LETTERS 2024; 133:047401. [PMID: 39121407 DOI: 10.1103/physrevlett.133.047401] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/27/2024] [Revised: 05/02/2024] [Accepted: 06/10/2024] [Indexed: 08/11/2024]
Abstract
For continuous-time Markov chains and open unimolecular chemical reaction networks, we prove that any two stationary currents are linearly related upon perturbations of a single edge's transition rates, arbitrarily far from equilibrium. We extend the result to nonstationary currents in the frequency domain, provide and discuss an explicit expression for the current-current susceptibility in terms of the network topology, and discuss possible generalizations. In practical scenarios, the mutual linearity relation has predictive power and can be used as a tool for inference or model proof testing.
Collapse
|
2
|
Yuan Y, Mao X, Pan X, Zhang R, Su W. Kinetic Ensemble of Tau Protein through the Markov State Model and Deep Learning Analysis. J Chem Theory Comput 2024; 20:2947-2958. [PMID: 38501645 DOI: 10.1021/acs.jctc.3c01211] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/20/2024]
Abstract
The ordered assembly of Tau protein into filaments characterizes Alzheimer's and other neurodegenerative diseases, and thus, stabilization of Tau protein is a promising avenue for tauopathies therapy. To dissect the underlying aggregation mechanisms on Tau, we employ a set of molecular simulations and the Markov state model to determine the kinetics of ensemble of K18. K18 is the microtubule-binding domain of Tau protein and plays a vital role in the microtubule assembly, recycling processes, and amyloid fibril formation. Here, we efficiently explore the conformation of K18 with about 150 μs lifetimes in silico. Our results observe that all four repeat regions (R1-R4) are very dynamic, featuring frequent conformational conversion and lacking stable conformations, and the R2 region is more flexible than the R1, R3, and R4 regions. Additionally, it is worth noting that residues 300-310 in R2-R3 and residues 319-336 in R3 tend to form sheet structures, indicating that K18 has a broader functional role than individual repeat monomers. Finally, the simulations combined with Markov state models and deep learning reveal 5 key conformational states along the transition pathway and provide the information on the microsecond time scale interstate transition rates. Overall, this study offers significant insights into the molecular mechanism of Tau pathological aggregation and develops novel strategies for both securing tauopathies and advancing drug discovery.
Collapse
Affiliation(s)
- Yongna Yuan
- School of Information Science & Engineering, Lanzhou University, South Tianshui Road, Lanzhou 730000, Gansu, China
| | - Xuqi Mao
- School of Information Science & Engineering, Lanzhou University, South Tianshui Road, Lanzhou 730000, Gansu, China
| | - Xiaohang Pan
- School of Information Science & Engineering, Lanzhou University, South Tianshui Road, Lanzhou 730000, Gansu, China
| | - Ruisheng Zhang
- School of Information Science & Engineering, Lanzhou University, South Tianshui Road, Lanzhou 730000, Gansu, China
| | - Wei Su
- School of Information Science & Engineering, Lanzhou University, South Tianshui Road, Lanzhou 730000, Gansu, China
| |
Collapse
|
3
|
Xu T, Li Y, Gao X, Zhang L. Understanding the Fast-Triggering Unfolding Dynamics of FK-11 upon Photoexcitation of Azobenzene. J Phys Chem Lett 2024; 15:3531-3540. [PMID: 38526058 DOI: 10.1021/acs.jpclett.4c00091] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/26/2024]
Abstract
Photoswitchable molecules can control the activity and functions of biomolecules by triggering conformational changes. However, it is still challenging to fully understand such fast-triggering conformational evolution from nonequilibrium to equilibrium distribution at the molecular level. Herein, we successfully simulated the unfolding of the FK-11 peptide upon the photoinduced trans-to-cis isomerization of azobenzene based on the Markov state model. We found that the ensemble of FK-11 contains five conformational states, constituting two unfolding pathways. More intriguingly, we observed the microsecond-scale conformational propagation of the FK-11 peptide from the fully folded state to the equilibrium populations of the five states. The computed CD spectra match well with the experimental data, validating our simulation method. Overall, our study not only offers a protocol to study the photoisomerization-induced conformational changes of enzymes but also could orientate the rational design of a photoswitchable molecule to manipulate biological functions.
Collapse
Affiliation(s)
- Tiantian Xu
- State Key Laboratory of Structural Chemistry, Fujian Institute of Research on the Structure of Matter, Chinese Academy of Sciences, Fuzhou, Fujian 350002, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yongfang Li
- State Key Laboratory of Structural Chemistry, Fujian Institute of Research on the Structure of Matter, Chinese Academy of Sciences, Fuzhou, Fujian 350002, China
| | - Xin Gao
- Computational Bioscience Research Center, King Abdullah University of Science and Technology (KAUST), Thuwal 23955, Saudi Arabia
- Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955, Saudi Arabia
| | - Lu Zhang
- State Key Laboratory of Structural Chemistry, Fujian Institute of Research on the Structure of Matter, Chinese Academy of Sciences, Fuzhou, Fujian 350002, China
- University of Chinese Academy of Sciences, Beijing 100049, China
- Fujian Provincial Key Laboratory of Theoretical and Computational Chemistry, Fuzhou, Fujian 361005, China
| |
Collapse
|
4
|
Wu Y, Cao S, Qiu Y, Huang X. Tutorial on how to build non-Markovian dynamic models from molecular dynamics simulations for studying protein conformational changes. J Chem Phys 2024; 160:121501. [PMID: 38516972 DOI: 10.1063/5.0189429] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Accepted: 02/20/2024] [Indexed: 03/23/2024] Open
Abstract
Protein conformational changes play crucial roles in their biological functions. In recent years, the Markov State Model (MSM) constructed from extensive Molecular Dynamics (MD) simulations has emerged as a powerful tool for modeling complex protein conformational changes. In MSMs, dynamics are modeled as a sequence of Markovian transitions among metastable conformational states at discrete time intervals (called lag time). A major challenge for MSMs is that the lag time must be long enough to allow transitions among states to become memoryless (or Markovian). However, this lag time is constrained by the length of individual MD simulations available to track these transitions. To address this challenge, we have recently developed Generalized Master Equation (GME)-based approaches, encoding non-Markovian dynamics using a time-dependent memory kernel. In this Tutorial, we introduce the theory behind two recently developed GME-based non-Markovian dynamic models: the quasi-Markov State Model (qMSM) and the Integrative Generalized Master Equation (IGME). We subsequently outline the procedures for constructing these models and provide a step-by-step tutorial on applying qMSM and IGME to study two peptide systems: alanine dipeptide and villin headpiece. This Tutorial is available at https://github.com/xuhuihuang/GME_tutorials. The protocols detailed in this Tutorial aim to be accessible for non-experts interested in studying the biomolecular dynamics using these non-Markovian dynamic models.
Collapse
Affiliation(s)
- Yue Wu
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Siqin Cao
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Yunrui Qiu
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Xuhui Huang
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
- Data Science Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| |
Collapse
|
5
|
Arshi SA, Chauhan M, Sharma A. Disruption of the FMN-A524 interaction cascade and Glu513-induced collapse of the hydrophobic barrier promotes light-induced Jα-helix unfolding in AsLOV2. Biophys J 2023; 122:4670-4685. [PMID: 37978801 PMCID: PMC10754690 DOI: 10.1016/j.bpj.2023.11.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2023] [Revised: 10/10/2023] [Accepted: 11/14/2023] [Indexed: 11/19/2023] Open
Abstract
The C-terminal Jα-helix of the Avena sativa's Light Oxygen and Voltage (AsLOV2) protein, unfolds on exposure to blue light. This characteristic seeks relevance in applications related to engineering novel biological photoswitches. Using molecular dynamics simulations and the Markov state modeling (MSM) approach we provide the mechanism that explains the stepwise unfolding of the Jα-helix. The unfolding was resolved into seven steps represented by the structurally distinguishable states distributed over the initiation and the post initiation phases. Whereas, the initiation phase occurs due to the collapse of the interaction cascade FMN-Q513-N492-L480-W491-Q479-V520-A524, the onset of the post initiation phase is marked by breaking of the hydrophobic interactions between the Jα-helix and the Iβ-strand. This study indicates that the displacement of N492 out of the FMN binding pocket, not necessarily requiring Q513, is essential for the initiation of the Jα-helix unfolding. Rather, the structural reorientation of Q513 activates the protein to cross the hydrophobic barrier and enter the post initiation phase. Similarly, the structural deviations in N482, rather than its integral role in unfolding, could enhance the unfolding rates. Furthermore, the MSM studies on the wild-type and the Q513 mutant, provide the spatiotemporal roadmap that lay out the possible pathways of structural transition between the dark and the light states of the protein. Overall, the study provides insights useful to enhance the performance of AsLOV2-based photoswitches.
Collapse
Affiliation(s)
- Syeda Amna Arshi
- Multidisciplinary Centre for Advance Research and Studies, Jamia Millia Islamia, New Delhi, India
| | - Manisha Chauhan
- Multidisciplinary Centre for Advance Research and Studies, Jamia Millia Islamia, New Delhi, India
| | - Amit Sharma
- Multidisciplinary Centre for Advance Research and Studies, Jamia Millia Islamia, New Delhi, India.
| |
Collapse
|
6
|
Wang X, Xu T, Yao Y, Cheung PPH, Gao X, Zhang L. SARS-CoV-2 RNA-Dependent RNA Polymerase Follows Asynchronous Translocation Pathway for Viral Transcription and Replication. J Phys Chem Lett 2023; 14:10119-10128. [PMID: 37922192 DOI: 10.1021/acs.jpclett.3c01249] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2023]
Abstract
Translocation is one essential step for the SARS-CoV-2 RNA-dependent RNA polymerase (RdRp) to exert viral replication and transcription. Although cryo-EM structures of SARS-CoV-2 RdRp are available, the molecular mechanisms of dynamic translocation remain elusive. Herein, we constructed a Markov state model based on extensive molecular dynamics simulations to elucidate the translocation dynamics of the SARS-CoV-2 RdRp. We identified two intermediates that pinpoint the rate-limiting step of translocation and characterize the asynchronous movement of the template-primer duplex. The 3'-terminal nucleotide in the primer strand lags behind due to the uneven distribution of protein-RNA interactions, while the translocation of the template strand is delayed by the hurdle residue K500. Even so, the two strands share the same "ratchet" to stabilize the polymerase in the post-translocation state, suggesting a Brownian-ratchet model. Overall, our study provides intriguing insights into SARS-CoV-2 replication and transcription, which would open a new avenue for drug discoveries.
Collapse
Affiliation(s)
- Xiaowei Wang
- Department of Chemical and Biological Engineering and Department of Mathematics, Hong Kong University of Science and Technology Kowloon, Clear Water Bay, Hong Kong
| | - Tiantian Xu
- State Key Laboratory of Structural Chemistry, Fujian Institute of Research on the Structure of Matter, Chinese Academy of Sciences, Fuzhou, Fujian 350002, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yuan Yao
- Department of Chemical and Biological Engineering and Department of Mathematics, Hong Kong University of Science and Technology Kowloon, Clear Water Bay, Hong Kong
| | - Peter Pak-Hang Cheung
- Li Ka Shing Institute of Health Sciences, Department of Chemical Pathology, Chinese University of Hong Kong, 999077, Hong Kong
| | - Xin Gao
- Computational Bioscience Research Center, King Abdullah University of Science and Technology (KAUST), Thuwal 23955, Saudi Arabia
- Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955, Saudi Arabia
| | - Lu Zhang
- State Key Laboratory of Structural Chemistry, Fujian Institute of Research on the Structure of Matter, Chinese Academy of Sciences, Fuzhou, Fujian 350002, China
- University of Chinese Academy of Sciences, Beijing 100049, China
- Fujian Provincial Key Laboratory of Theoretical and Computational Chemistry, Fuzhou, Fujian 361005, China
| |
Collapse
|
7
|
Cao S, Qiu Y, Kalin ML, Huang X. Integrative generalized master equation: A method to study long-timescale biomolecular dynamics via the integrals of memory kernels. J Chem Phys 2023; 159:134106. [PMID: 37787134 PMCID: PMC11005468 DOI: 10.1063/5.0167287] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Accepted: 09/18/2023] [Indexed: 10/04/2023] Open
Abstract
The generalized master equation (GME) provides a powerful approach to study biomolecular dynamics via non-Markovian dynamic models built from molecular dynamics (MD) simulations. Previously, we have implemented the GME, namely the quasi Markov State Model (qMSM), where we explicitly calculate the memory kernel and propagate dynamics using a discretized GME. qMSM can be constructed with much shorter MD trajectories than the MSM. However, since qMSM needs to explicitly compute the time-dependent memory kernels, it is heavily affected by the numerical fluctuations of simulation data when applied to study biomolecular conformational changes. This can lead to numerical instability of predicted long-time dynamics, greatly limiting the applicability of qMSM in complicated biomolecules. We present a new method, the Integrative GME (IGME), in which we analytically solve the GME under the condition when the memory kernels have decayed to zero. Our IGME overcomes the challenges of the qMSM by using the time integrations of memory kernels, thereby avoiding the numerical instability caused by explicit computation of time-dependent memory kernels. Using our solutions of the GME, we have developed a new approach to compute long-time dynamics based on MD simulations in a numerically stable, accurate and efficient way. To demonstrate its effectiveness, we have applied the IGME in three biomolecules: the alanine dipeptide, FIP35 WW-domain, and Taq RNA polymerase. In each system, the IGME achieves significantly smaller fluctuations for both memory kernels and long-time dynamics compared to the qMSM. We anticipate that the IGME can be widely applied to investigate biomolecular conformational changes.
Collapse
Affiliation(s)
- Siqin Cao
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Yunrui Qiu
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Michael L. Kalin
- Biophysics Graduate Program, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Xuhui Huang
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| |
Collapse
|
8
|
Liu B, Xue M, Qiu Y, Konovalov KA, O’Connor MS, Huang X. GraphVAMPnets for uncovering slow collective variables of self-assembly dynamics. J Chem Phys 2023; 159:094901. [PMID: 37655771 PMCID: PMC11005469 DOI: 10.1063/5.0158903] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Accepted: 08/11/2023] [Indexed: 09/02/2023] Open
Abstract
Uncovering slow collective variables (CVs) of self-assembly dynamics is important to elucidate its numerous kinetic assembly pathways and drive the design of novel structures for advanced materials through the bottom-up approach. However, identifying the CVs for self-assembly presents several challenges. First, self-assembly systems often consist of identical monomers, and the feature representations should be invariant to permutations and rotational symmetries. Physical coordinates, such as aggregate size, lack high-resolution detail, while common geometric coordinates like pairwise distances are hindered by the permutation and rotational symmetry challenges. Second, self-assembly is usually a downhill process, and the trajectories often suffer from insufficient sampling of backward transitions that correspond to the dissociation of self-assembled structures. Popular dimensionality reduction methods, such as time-structure independent component analysis, impose detailed balance constraints, potentially obscuring the true dynamics of self-assembly. In this work, we employ GraphVAMPnets, which combines graph neural networks with a variational approach for Markovian process (VAMP) theory to identify the slow CVs of the self-assembly processes. First, GraphVAMPnets bears the advantages of graph neural networks, in which the graph embeddings can represent self-assembly structures in high-resolution while being invariant to permutations and rotational symmetries. Second, it is built upon VAMP theory, which studies Markov processes without forcing detailed balance constraints, which addresses the out-of-equilibrium challenge in the self-assembly process. We demonstrate GraphVAMPnets for identifying slow CVs of self-assembly kinetics in two systems: the aggregation of two hydrophobic molecules and the self-assembly of patchy particles. We expect that our GraphVAMPnets can be widely applied to molecular self-assembly.
Collapse
Affiliation(s)
- Bojun Liu
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Mingyi Xue
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Yunrui Qiu
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Kirill A. Konovalov
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Michael S. O’Connor
- Biophysics Graduate Program, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Xuhui Huang
- Author to whom correspondence should be addressed:
| |
Collapse
|
9
|
Qiu Y, O’Connor MS, Xue M, Liu B, Huang X. An Efficient Path Classification Algorithm Based on Variational Autoencoder to Identify Metastable Path Channels for Complex Conformational Changes. J Chem Theory Comput 2023; 19:4728-4742. [PMID: 37382437 PMCID: PMC11042546 DOI: 10.1021/acs.jctc.3c00318] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/30/2023]
Abstract
Conformational changes (i.e., dynamic transitions between pairs of conformational states) play important roles in many chemical and biological processes. Constructing the Markov state model (MSM) from extensive molecular dynamics (MD) simulations is an effective approach to dissect the mechanism of conformational changes. When combined with transition path theory (TPT), MSM can be applied to elucidate the ensemble of kinetic pathways connecting pairs of conformational states. However, the application of TPT to analyze complex conformational changes often results in a vast number of kinetic pathways with comparable fluxes. This obstacle is particularly pronounced in heterogeneous self-assembly and aggregation processes. The large number of kinetic pathways makes it challenging to comprehend the molecular mechanisms underlying conformational changes of interest. To address this challenge, we have developed a path classification algorithm named latent-space path clustering (LPC) that efficiently lumps parallel kinetic pathways into distinct metastable path channels, making them easier to comprehend. In our algorithm, MD conformations are first projected onto a low-dimensional space containing a small set of collective variables (CVs) by time-structure-based independent component analysis (tICA) with kinetic mapping. Then, MSM and TPT are constructed to obtain the ensemble of pathways, and a deep learning architecture named the variational autoencoder (VAE) is used to learn the spatial distributions of kinetic pathways in the continuous CV space. Based on the trained VAE model, the TPT-generated ensemble of kinetic pathways can be embedded into a latent space, where the classification becomes clear. We show that LPC can efficiently and accurately identify the metastable path channels in three systems: a 2D potential, the aggregation of two hydrophobic particles in water, and the folding of the Fip35 WW domain. Using the 2D potential, we further demonstrate that our LPC algorithm outperforms the previous path-lumping algorithms by making substantially fewer incorrect assignments of individual pathways to four path channels. We expect that LPC can be widely applied to identify the dominant kinetic pathways underlying complex conformational changes.
Collapse
Affiliation(s)
- Yunrui Qiu
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Michael S. O’Connor
- Biophysics Graduate Program, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Mingyi Xue
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Bojun Liu
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Xuhui Huang
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, WI, 53706, USA
- Biophysics Graduate Program, University of Wisconsin-Madison, Madison, WI, 53706, USA
| |
Collapse
|
10
|
Dominic AJ, Cao S, Montoya-Castillo A, Huang X. Memory Unlocks the Future of Biomolecular Dynamics: Transformative Tools to Uncover Physical Insights Accurately and Efficiently. J Am Chem Soc 2023; 145:9916-9927. [PMID: 37104720 DOI: 10.1021/jacs.3c01095] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/29/2023]
Abstract
Conformational changes underpin function and encode complex biomolecular mechanisms. Gaining atomic-level detail of how such changes occur has the potential to reveal these mechanisms and is of critical importance in identifying drug targets, facilitating rational drug design, and enabling bioengineering applications. While the past two decades have brought Markov state model techniques to the point where practitioners can regularly use them to glimpse the long-time dynamics of slow conformations in complex systems, many systems are still beyond their reach. In this Perspective, we discuss how including memory (i.e., non-Markovian effects) can reduce the computational cost to predict the long-time dynamics in these complex systems by orders of magnitude and with greater accuracy and resolution than state-of-the-art Markov state models. We illustrate how memory lies at the heart of successful and promising techniques, ranging from the Fokker-Planck and generalized Langevin equations to deep-learning recurrent neural networks and generalized master equations. We delineate how these techniques work, identify insights that they can offer in biomolecular systems, and discuss their advantages and disadvantages in practical settings. We show how generalized master equations can enable the investigation of, for example, the gate-opening process in RNA polymerase II and demonstrate how our recent advances tame the deleterious influence of statistical underconvergence of the molecular dynamics simulations used to parameterize these techniques. This represents a significant leap forward that will enable our memory-based techniques to interrogate systems that are currently beyond the reach of even the best Markov state models. We conclude by discussing some current challenges and future prospects for how exploiting memory will open the door to many exciting opportunities.
Collapse
Affiliation(s)
- Anthony J Dominic
- Department of Chemistry, University of Colorado Boulder, Boulder, Colorado 80309, USA
| | - Siqin Cao
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | | | - Xuhui Huang
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| |
Collapse
|
11
|
Dominic AJ, Sayer T, Cao S, Markland TE, Huang X, Montoya-Castillo A. Building insightful, memory-enriched models to capture long-time biochemical processes from short-time simulations. Proc Natl Acad Sci U S A 2023; 120:e2221048120. [PMID: 36920924 PMCID: PMC10041170 DOI: 10.1073/pnas.2221048120] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2022] [Accepted: 02/21/2023] [Indexed: 03/16/2023] Open
Abstract
The ability to predict and understand complex molecular motions occurring over diverse timescales ranging from picoseconds to seconds and even hours in biological systems remains one of the largest challenges to chemical theory. Markov state models (MSMs), which provide a memoryless description of the transitions between different states of a biochemical system, have provided numerous important physically transparent insights into biological function. However, constructing these models often necessitates performing extremely long molecular simulations to converge the rates. Here, we show that by incorporating memory via the time-convolutionless generalized master equation (TCL-GME) one can build a theoretically transparent and physically intuitive memory-enriched model of biochemical processes with up to a three order of magnitude reduction in the simulation data required while also providing a higher temporal resolution. We derive the conditions under which the TCL-GME provides a more efficient means to capture slow dynamics than MSMs and rigorously prove when the two provide equally valid and efficient descriptions of the slow configurational dynamics. We further introduce a simple averaging procedure that enables our TCL-GME approach to quickly converge and accurately predict long-time dynamics even when parameterized with noisy reference data arising from short trajectories. We illustrate the advantages of the TCL-GME using alanine dipeptide, the human argonaute complex, and FiP35 WW domain.
Collapse
Affiliation(s)
| | - Thomas Sayer
- Department of Chemistry, University of Colorado, Boulder, CO80309
| | - Siqin Cao
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI53706
| | | | - Xuhui Huang
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI53706
| | | |
Collapse
|
12
|
Unarta IC, Goonetilleke EC, Wang D, Huang X. Nucleotide addition and cleavage by RNA polymerase II: Coordination of two catalytic reactions using a single active site. J Biol Chem 2022; 299:102844. [PMID: 36581202 PMCID: PMC9860460 DOI: 10.1016/j.jbc.2022.102844] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2022] [Revised: 12/19/2022] [Accepted: 12/22/2022] [Indexed: 12/28/2022] Open
Abstract
RNA polymerase II (Pol II) incorporates complementary ribonucleotides into the growing RNA chain one at a time via the nucleotide addition cycle. The nucleotide addition cycle, however, is prone to misincorporation of noncomplementary nucleotides. Thus, to ensure transcriptional fidelity, Pol II backtracks and then cleaves the misincorporated nucleotides. These two reverse reactions, nucleotide addition and cleavage, are catalyzed in the same active site of Pol II, which is different from DNA polymerases or other endonucleases. Recently, substantial progress has been made to understand how Pol II effectively performs its dual role in the same active site. Our review highlights these recent studies and provides an overall model of the catalytic mechanisms of Pol II. In particular, RNA extension follows the two-metal-ion mechanism, and several Pol II residues play important roles to facilitate the catalysis. In sharp contrast, the cleavage reaction is independent of any Pol II residues. Interestingly, Pol II relies on its residues to recognize the misincorporated nucleotides during the backtracking process, prior to cleavage. In this way, Pol II efficiently compartmentalizes its two distinct catalytic functions using the same active site. Lastly, we also discuss a new perspective on the potential third Mg2+ in the nucleotide addition and intrinsic cleavage reactions.
Collapse
Affiliation(s)
- Ilona Christy Unarta
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Eshani C Goonetilleke
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Dong Wang
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, California, USA; Department of Cellular and Molecular Medicine, School of Medicine, University of California, San Diego, La Jolla, California, USA; Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, California, USA.
| | - Xuhui Huang
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin, USA.
| |
Collapse
|
13
|
Xi K, Zhu L. Automated Path Searching Reveals the Mechanism of Hydrolysis Enhancement by T4 Lysozyme Mutants. Int J Mol Sci 2022; 23:ijms232314628. [PMID: 36498954 PMCID: PMC9736071 DOI: 10.3390/ijms232314628] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2022] [Revised: 11/16/2022] [Accepted: 11/19/2022] [Indexed: 11/25/2022] Open
Abstract
Bacteriophage T4 lysozyme (T4L) is a glycosidase that is widely applied as a natural antimicrobial agent in the food industry. Due to its wide applications and small size, T4L has been regarded as a model system for understanding protein dynamics and for large-scale protein engineering. Through structural insights from the single conformation of T4L, a series of mutations (L99A,G113A,R119P) have been introduced, which have successfully raised the fractional population of its only hydrolysis-competent excited state to 96%. However, the actual impact of these substitutions on its dynamics remains unclear, largely due to the lack of highly efficient sampling algorithms. Here, using our recently developed travelling-salesman-based automated path searching (TAPS), we located the minimum-free-energy path (MFEP) for the transition of three T4L mutants from their ground states to their excited states. All three mutants share a three-step transition: the flipping of F114, the rearrangement of α0/α1 helices, and final refinement. Remarkably, the MFEP revealed that the effects of the mutations are drastically beyond the expectations of their original design: (a) the G113A substitution not only enhances helicity but also fills the hydrophobic Cavity I and reduces the free energy barrier for flipping F114; (b) R119P barely changes the stability of the ground state but stabilizes the excited state through rarely reported polar contacts S117OG:N132ND2, E11OE1:R145NH1, and E11OE2:Q105NE2; (c) the residue W138 flips into Cavity I and further stabilizes the excited state for the triple mutant L99A,G113A,R119P. These novel insights that were unexpected in the original mutant design indicated the necessity of incorporating path searching into the workflow of rational protein engineering.
Collapse
|
14
|
Wu F, Jin S, Jiang Y, Jin X, Tang B, Niu Z, Liu X, Zhang Q, Zeng X, Li SZ. Pre-Training of Equivariant Graph Matching Networks with Conformation Flexibility for Drug Binding. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2022; 9:e2203796. [PMID: 36202759 PMCID: PMC9685463 DOI: 10.1002/advs.202203796] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/01/2022] [Revised: 09/07/2022] [Indexed: 05/16/2023]
Abstract
The latest biological findings observe that the motionless "lock-and-key" theory is not generally applicable and that changes in atomic sites and binding pose can provide important information for understanding drug binding. However, the computational expenditure limits the growth of protein trajectory-related studies, thus hindering the possibility of supervised learning. A spatial-temporal pre-training method based on the modified equivariant graph matching networks, dubbed ProtMD which has two specially designed self-supervised learning tasks: atom-level prompt-based denoising generative task and conformation-level snapshot ordering task to seize the flexibility information inside molecular dynamics (MD) trajectories with very fine temporal resolutions is presented. The ProtMD can grant the encoder network the capacity to capture the time-dependent geometric mobility of conformations along MD trajectories. Two downstream tasks are chosen to verify the effectiveness of ProtMD through linear detection and task-specific fine-tuning. A huge improvement from current state-of-the-art methods, with a decrease of 4.3% in root mean square error for the binding affinity problem and an average increase of 13.8% in the area under receiver operating characteristic curve and the area under the precision-recall curve for the ligand efficacy problem is observed. The results demonstrate a strong correlation between the magnitude of conformation's motion in the 3D space and the strength with which the ligand binds with its receptor.
Collapse
Affiliation(s)
- Fang Wu
- School of EngineeringWestlake UniversityHangzhou310024China
- MindRank AI Ltd.Hangzhou310000China
| | - Shuting Jin
- MindRank AI Ltd.Hangzhou310000China
- School of InformaticsXiamen UniversityXiamen361005China
| | | | | | | | | | - Xiangrong Liu
- School of InformaticsXiamen UniversityXiamen361005China
| | - Qiang Zhang
- ZJU‐Hangzhou Global Scientific and Technological Innovation CenterHangzhou311200China
- College of Computer Science and TechnologyZhejiang UniversityHangzhou310013China
| | - Xiangxiang Zeng
- School of Information Science and EngineeringHunan UniversityHunan410082China
| | - Stan Z. Li
- School of EngineeringWestlake UniversityHangzhou310024China
| |
Collapse
|
15
|
Masson P, Lushchekina S. Conformational Stability and Denaturation Processes of Proteins Investigated by Electrophoresis under Extreme Conditions. Molecules 2022; 27:6861. [PMID: 36296453 PMCID: PMC9610776 DOI: 10.3390/molecules27206861] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2022] [Revised: 10/10/2022] [Accepted: 10/10/2022] [Indexed: 11/17/2022] Open
Abstract
The functional structure of proteins results from marginally stable folded conformations. Reversible unfolding, irreversible denaturation, and deterioration can be caused by chemical and physical agents due to changes in the physicochemical conditions of pH, ionic strength, temperature, pressure, and electric field or due to the presence of a cosolvent that perturbs the delicate balance between stabilizing and destabilizing interactions and eventually induces chemical modifications. For most proteins, denaturation is a complex process involving transient intermediates in several reversible and eventually irreversible steps. Knowledge of protein stability and denaturation processes is mandatory for the development of enzymes as industrial catalysts, biopharmaceuticals, analytical and medical bioreagents, and safe industrial food. Electrophoresis techniques operating under extreme conditions are convenient tools for analyzing unfolding transitions, trapping transient intermediates, and gaining insight into the mechanisms of denaturation processes. Moreover, quantitative analysis of electrophoretic mobility transition curves allows the estimation of the conformational stability of proteins. These approaches include polyacrylamide gel electrophoresis and capillary zone electrophoresis under cold, heat, and hydrostatic pressure and in the presence of non-ionic denaturing agents or stabilizers such as polyols and heavy water. Lastly, after exposure to extremes of physical conditions, electrophoresis under standard conditions provides information on irreversible processes, slow conformational drifts, and slow renaturation processes. The impressive developments of enzyme technology with multiple applications in fine chemistry, biopharmaceutics, and nanomedicine prompted us to revisit the potentialities of these electrophoretic approaches. This feature review is illustrated with published and unpublished results obtained by the authors on cholinesterases and paraoxonase, two physiologically and toxicologically important enzymes.
Collapse
Affiliation(s)
- Patrick Masson
- Biochemical Neuropharmacology Laboratory, Kazan Federal University, Kremlievskaya Str. 18, 420111 Kazan, Russia
| | - Sofya Lushchekina
- Emanuel Institute of Biochemical Physics, Russian Academy of Sciences, Kosygin Str. 4, 119334 Moscow, Russia
| |
Collapse
|
16
|
Mishra RP, Gupta S, Rathore AS, Goel G. Multi-Level High-Throughput Screening for Discovery of Ligands That Inhibit Insulin Aggregation. Mol Pharm 2022; 19:3770-3783. [PMID: 36173709 DOI: 10.1021/acs.molpharmaceut.2c00219] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
We have developed a multi-level virtual screening protocol to identify lead molecules from the FDA inactives database that can inhibit insulin aggregation. The method is based on the presence of structural and interaction specificity in non-native aggregation pathway protein-protein interactions. Some key challenges specific to the present problem, when compared with native protein association, include structural heterogeneity of the protein species involved, multiple association pathways, and relatively higher probability of conformational rearrangement of the association complex. In this multi-step method, the inactives database was first screened using the dominant pharmacophore features of previously identified molecules shown to significantly inhibit insulin aggregation nucleation by binding to its aggregation-prone conformers. We then performed ensemble docking of several low-energy ligand conformations on these aggregation-prone conformers followed by molecular dynamics simulations and binding affinity calculations on a subset of docked complexes to identify a final set of five potential lead molecules to inhibit insulin aggregation nucleation. Their effect on aggregation inhibition was extensively investigated by incubating insulin under aggregation-prone aqueous buffer conditions (low pH, high temperature). Aggregation kinetics were characterized using size exclusion chromatography and Thioflavin T fluorescence assay, and the secondary structure was determined using circular dichroism spectroscopy. Riboflavin provided the best aggregation inhibition, with 85% native monomer retention after 48 h incubation under aggregation-prone conditions, whereas the no-ligand formulation showed complete monomer loss after 36 h. Further, insulin incubated with two of the screened inactives (aspartame, riboflavin) had the characteristic α-helical dip in CD spectra, while the no-ligand formulation showed a change to β-sheet rich conformations.
Collapse
Affiliation(s)
- Rit Pratik Mishra
- Department of Chemical Engineering, Indian Institute Technology Delhi, New Delhi, 110016, India
| | - Surbhi Gupta
- Department of Chemical Engineering, Indian Institute Technology Delhi, New Delhi, 110016, India
| | - Anurag Singh Rathore
- Department of Chemical Engineering, Indian Institute Technology Delhi, New Delhi, 110016, India
| | - Gaurav Goel
- Department of Chemical Engineering, Indian Institute Technology Delhi, New Delhi, 110016, India
| |
Collapse
|
17
|
Kabir KL, Ma B, Nussinov R, Shehu A. Fewer Dimensions, More Structures for Improved Discrete Models of Dynamics of Free versus Antigen-Bound Antibody. Biomolecules 2022; 12:biom12071011. [PMID: 35883567 PMCID: PMC9313177 DOI: 10.3390/biom12071011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2022] [Revised: 07/12/2022] [Accepted: 07/19/2022] [Indexed: 12/10/2022] Open
Abstract
Over the past decade, Markov State Models (MSM) have emerged as powerful methodologies to build discrete models of dynamics over structures obtained from Molecular Dynamics trajectories. The identification of macrostates for the MSM is a central decision that impacts the quality of the MSM but depends on both the selected representation of a structure and the clustering algorithm utilized over the featurized structures. Motivated by a large molecular system in its free and bound state, this paper investigates two directions of research, further reducing the representation dimensionality in a non-parametric, data-driven manner and including more structures in the computation. Rigorous evaluation of the quality of obtained MSMs via various statistical tests in a comparative setting firmly shows that fewer dimensions and more structures result in a better MSM. Many interesting findings emerge from the best MSM, advancing our understanding of the relationship between antibody dynamics and antibody–antigen recognition.
Collapse
Affiliation(s)
- Kazi Lutful Kabir
- Department of Computer Science, George Mason University, Fairfax, VA 22030, USA;
- Correspondence: ; Tel.: +1-571-201-5070
| | - Buyong Ma
- Engineering Research Center of Cell & Therapeutic Antibody School of Pharmacy, Shanghai Jiaotong University, Shanghai 200240, China;
| | - Ruth Nussinov
- Computational Structural Biology Section, Cancer Innovation Laboratory, Frederick National Laboratory for Cancer Research, National Cancer Institute, Frederick, MD 21702, USA;
| | - Amarda Shehu
- Department of Computer Science, George Mason University, Fairfax, VA 22030, USA;
| |
Collapse
|
18
|
Gu H, Wang W, Cao S, Unarta IC, Yao Y, Sheong FK, Huang X. RPnet: a reverse-projection-based neural network for coarse-graining metastable conformational states for protein dynamics. Phys Chem Chem Phys 2022; 24:1462-1474. [PMID: 34985469 DOI: 10.1039/d1cp03622j] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
The Markov State Model (MSM) is a powerful tool for modeling long timescale dynamics based on numerous short molecular dynamics (MD) simulation trajectories, which makes it a useful tool for elucidating the conformational changes of biological macromolecules. By partitioning the phase space into discretized states and estimating the probabilities of inter-state transitions based on short MD trajectories, one can construct a kinetic network model that could be used to extrapolate long-timescale kinetics if the Markovian condition is met. However, meeting the Markovian condition often requires hundreds or even thousands of states (microstates), which greatly hinders the comprehension of the conformational dynamics of complex biomolecules. Kinetic lumping algorithms can coarse grain numerous microstates into a handful of metastable states (macrostates), which would greatly facilitate the elucidation of biological mechanisms. In this work, we have developed a reverse-projection-based neural network (RPnet) to lump microstates into macrostates, by making use of a physics-based loss function that is based on the projection operator framework of conformational dynamics. By recognizing that microstate and macrostate transition modes can be related through a projection process, we have developed a reverse-projection scheme to directly compare the microstate and macrostate dynamics. Based on this reverse-projection scheme, we designed a loss function that allows the effective assessment of the quality of a given kinetic lumping. We then make use of a neural network to efficiently minimize this loss function to obtain an optimized set of macrostates. We have demonstrated the power of our RPnet in analyzing the dynamics of a numerical 2D potential, alanine dipeptide, and the clamp opening of an RNA polymerase. In all these systems, we have illustrated that our method could yield comparable or better results than competing methods in terms of state partitioning and reproduction of slow dynamics. We expect that our RPnet holds promise in analyzing the conformational dynamics of biological macromolecules.
Collapse
Affiliation(s)
- Hanlin Gu
- Department of Mathematics, Hong Kong University of Science and Technology, Kowloon, Hong Kong
| | - Wei Wang
- Department of Chemistry, Hong Kong University of Science and Technology, Kowloon, Hong Kong.
| | - Siqin Cao
- Department of Chemistry, Hong Kong University of Science and Technology, Kowloon, Hong Kong.
| | - Ilona Christy Unarta
- Department of Chemical and Biological Engineering, Hong Kong University of Science and Technology, Kowloon, Hong Kong
| | - Yuan Yao
- Department of Mathematics, Hong Kong University of Science and Technology, Kowloon, Hong Kong
| | - Fu Kit Sheong
- Department of Chemistry, Hong Kong University of Science and Technology, Kowloon, Hong Kong. .,Institute for Advanced Study, Hong Kong University of Science and Technology, Kowloon, Hong Kong
| | - Xuhui Huang
- Department of Chemistry, Hong Kong University of Science and Technology, Kowloon, Hong Kong. .,Department of Chemical and Biological Engineering, Hong Kong University of Science and Technology, Kowloon, Hong Kong
| |
Collapse
|
19
|
Novel Big Data-Driven Machine Learning Models for Drug Discovery Application. Molecules 2022; 27:molecules27030594. [PMID: 35163865 PMCID: PMC8840520 DOI: 10.3390/molecules27030594] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2021] [Revised: 01/14/2022] [Accepted: 01/14/2022] [Indexed: 11/16/2022] Open
Abstract
Most contemporary drug discovery projects start with a ‘hit discovery’ phase where small chemicals are identified that have the capacity to interact, in a chemical sense, with a protein target involved in a given disease. To assist and accelerate this initial drug discovery process, ’virtual docking calculations’ are routinely performed, where computational models of proteins and computational models of small chemicals are evaluated for their capacities to bind together. In cutting-edge, contemporary implementations of this process, several conformations of protein targets are independently assayed in parallel ‘ensemble docking’ calculations. Some of these protein conformations, a minority of them, will be capable of binding many chemicals, while other protein conformations, the majority of them, will not be able to do so. This fact that only some of the conformations accessible to a protein will be ’selected’ by chemicals is known as ’conformational selection’ process in biology. This work describes a machine learning approach to characterize and identify the properties of protein conformations that will be selected (i.e., bind to) chemicals, and classified as potential binding drug candidates, unlike the remaining non-binding drug candidate protein conformations. This work also addresses the class imbalance problem through advanced machine learning techniques that maximize the prediction rate of potential protein molecular conformations for the test case proteins ADORA2A (Adenosine A2a Receptor) and OPRK1 (Opioid Receptor Kappa 1), and subsequently reduces the failure rates and hastens the drug discovery process.
Collapse
|
20
|
Konovalov K, Unarta IC, Cao S, Goonetilleke EC, Huang X. Markov State Models to Study the Functional Dynamics of Proteins in the Wake of Machine Learning. JACS AU 2021; 1:1330-1341. [PMID: 34604842 PMCID: PMC8479766 DOI: 10.1021/jacsau.1c00254] [Citation(s) in RCA: 51] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/07/2021] [Indexed: 05/19/2023]
Abstract
Markov state models (MSMs) based on molecular dynamics (MD) simulations are routinely employed to study protein folding, however, their application to functional conformational changes of biomolecules is still limited. In the past few years, the field of computational chemistry has experienced a surge of advancements stemming from machine learning algorithms, and MSMs have not been left out. Unlike global processes, such as protein folding, the application of MSMs to functional conformational changes is challenging because they mostly consist of localized structural transitions. Therefore, it is critical to properly select a subset of structural features that can describe the slowest dynamics of these functional conformational changes. To address this challenge, we recommend several automatic feature selection methods such as Spectral-OASIS. To identify states in MSMs, the chosen features can be subject to dimensionality reduction methods such as TICA or deep learning based VAMPNets to project MD conformations onto a few collective variables for subsequent clustering. Another challenge for the application of MSMs to the study of functional conformational changes is the ability to comprehend their biophysical mechanisms, as MSMs built for these processes often require a large number of states. We recommend the recently developed quasi-MSMs (qMSMs) to address this issue. Compared to MSMs, qMSMs encode the non-Markovian dynamics via the generalized master equation and can significantly reduce the number of states. As a result, qMSMs can be built with a handful of states to facilitate the interpretation of functional conformational changes. In the wake of machine learning, we believe that the rapid advancement in the MSM methodology will lead to their wider application in studying functional conformational changes of biomolecules.
Collapse
Affiliation(s)
- Kirill
A. Konovalov
- Department
of Chemistry, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
- Hong
Kong Center for Neurodegenerative Diseases, Hong Kong Science Park, Hong Kong
| | - Ilona Christy Unarta
- Department
of Chemical and Biological Engineering, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
- Hong
Kong Center for Neurodegenerative Diseases, Hong Kong Science Park, Hong Kong
| | - Siqin Cao
- Department
of Chemistry, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
- Hong
Kong Center for Neurodegenerative Diseases, Hong Kong Science Park, Hong Kong
| | - Eshani C. Goonetilleke
- Department
of Chemistry, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
- Hong
Kong Center for Neurodegenerative Diseases, Hong Kong Science Park, Hong Kong
| | - Xuhui Huang
- Department
of Chemistry, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
- Department
of Chemical and Biological Engineering, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
- Hong
Kong Center for Neurodegenerative Diseases, Hong Kong Science Park, Hong Kong
| |
Collapse
|
21
|
Li C, Liu Z, Goonetilleke EC, Huang X. Temperature-dependent kinetic pathways of heterogeneous ice nucleation competing between classical and non-classical nucleation. Nat Commun 2021; 12:4954. [PMID: 34400646 PMCID: PMC8367957 DOI: 10.1038/s41467-021-25267-2] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2021] [Accepted: 07/26/2021] [Indexed: 12/04/2022] Open
Abstract
Ice nucleation on the surface plays a vital role in diverse areas, ranging from physics and cryobiology to atmospheric science. Compared to ice nucleation in the bulk, the water-surface interactions present in heterogeneous ice nucleation complicate the nucleation process, making heterogeneous ice nucleation less comprehended, especially the relationship between the kinetics and the structures of the critical ice nucleus. Here we combine Markov State Models and transition path theory to elucidate the ensemble pathways of heterogeneous ice nucleation. Our Markov State Models reveal that the classical one-step and non-classical two-step nucleation pathways can surprisingly co-exist with comparable fluxes at T = 230 K. Interestingly, we find that the disordered mixing of rhombic and hexagonal ice leads to a favorable configurational entropy that stabilizes the critical nucleus, facilitating the non-classical pathway. In contrast, the favorable energetics promotes the formation of hexagonal ice, resulting in the classical pathway. Furthermore, we discover that, at elevated temperatures, the nucleation process prefers to proceed via the classical pathway, as opposed to the non-classical pathway, since the potential energy contributions override the configurational entropy compensation. This study provides insights into the mechanisms of heterogeneous ice nucleation and sheds light on the rational designs to control crystallization processes.
Collapse
Affiliation(s)
- Chu Li
- Department of Chemistry, Center of Systems Biology and Human Health, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
| | - Zhuo Liu
- Department of Chemistry, Center of Systems Biology and Human Health, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
- Institute for Advanced Study, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
| | - Eshani C Goonetilleke
- Department of Chemistry, Center of Systems Biology and Human Health, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
| | - Xuhui Huang
- Department of Chemistry, Center of Systems Biology and Human Health, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Kowloon, Hong Kong.
| |
Collapse
|
22
|
Xi K, Hu Z, Wu Q, Wei M, Qian R, Zhu L. Assessing the Performance of Traveling-salesman based Automated Path Searching (TAPS) on Complex Biomolecular Systems. J Chem Theory Comput 2021; 17:5301-5311. [PMID: 34270241 DOI: 10.1021/acs.jctc.1c00182] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Though crucial for understanding the function of large biomolecular systems, locating the minimum free energy paths (MFEPs) between their key conformational states is far from trivial due to their high-dimensional nature. Most existing path-searching methods require a static collective variable space as input, encoding intuition or prior knowledge of the transition mechanism. Such information is, however, hardly available a priori and expensive to validate. To alleviate this issue, we have previously introduced a Traveling-salesman based Automated Path Searching method (TAPS) and demonstrated its efficiency on simple peptide systems. Having implemented a parallel version of this method, here we assess the performance of TAPS on three realistic systems (tens to hundreds of residues) in explicit solvents. We show that TAPS successfully located the MFEP for the ground/excited state transition of the T4 lysozyme L99A variant, consistent with previous findings. TAPS also helped identifying the important role of the two polar contacts in directing the loop-in/loop-out transition of the mitogen-activated protein kinase kinase (MEK1), which explained previous mutant experiments. Remarkably, at a minimal cost of 126 ns sampling, TAPS revealed that the Ltn40/Ltn10 transition of lymphotactin needs no complete unfolding/refolding of its β-sheets and that five polar contacts are sufficient to stabilize the various partially unfolded intermediates along the MFEP. These results present TAPS as a general and promising tool for studying the functional dynamics of complex biomolecular systems.
Collapse
Affiliation(s)
- Kun Xi
- Warshel Institute for Computational Biology, School of Life and Health Sciences, The Chinese University of Hong Kong (Shenzhen), Shenzhen, Guangdong 518172, P. R. China.,School of Chemistry and Materials Science, University of Science and Technology of China, Hefei, Anhui 230026, P. R. China
| | - Zhenquan Hu
- Warshel Institute for Computational Biology, School of Life and Health Sciences, The Chinese University of Hong Kong (Shenzhen), Shenzhen, Guangdong 518172, P. R. China.,School of Chemistry and Materials Science, University of Science and Technology of China, Hefei, Anhui 230026, P. R. China
| | - Qiang Wu
- School of Science and Engineering, The Chinese University of Hong Kong (Shenzhen), Shenzhen, Guangdong 518172, P. R. China
| | - Meihan Wei
- Warshel Institute for Computational Biology, School of Life and Health Sciences, The Chinese University of Hong Kong (Shenzhen), Shenzhen, Guangdong 518172, P. R. China
| | - Runtong Qian
- Warshel Institute for Computational Biology, School of Life and Health Sciences, The Chinese University of Hong Kong (Shenzhen), Shenzhen, Guangdong 518172, P. R. China
| | - Lizhe Zhu
- Warshel Institute for Computational Biology, School of Life and Health Sciences, The Chinese University of Hong Kong (Shenzhen), Shenzhen, Guangdong 518172, P. R. China
| |
Collapse
|
23
|
A comprehensive mechanism for 5-carboxylcytosine-induced transcriptional pausing revealed by Markov state models. J Biol Chem 2021; 296:100735. [PMID: 33991521 PMCID: PMC8191312 DOI: 10.1016/j.jbc.2021.100735] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2020] [Revised: 04/27/2021] [Accepted: 04/28/2021] [Indexed: 11/23/2022] Open
Abstract
RNA polymerase II (Pol II) surveils the genome, pausing as it encounters DNA lesions and base modifications and initiating signals for DNA repair among other important regulatory events. Recent work suggests that Pol II pauses at 5-carboxycytosine (5caC), an epigenetic modification of cytosine, because of a specific hydrogen bond between the carboxyl group of 5caC and a specific residue in fork loop 3 of Pol II. This hydrogen bond compromises productive NTP binding and slows down elongation. Apart from this specific interaction, the carboxyl group of 5caC can potentially interact with numerous charged residues in the cleft of Pol II. However, it is not clear how other interactions between Pol II and 5caC contribute to pausing. In this study, we use Markov state models (a type of kinetic network models) built from extensive molecular dynamics simulations to comprehensively study the impact of 5caC on Pol II translocation. We describe two translocation intermediates with specific interactions that prevent the template base from loading into the Pol II active site. In addition to the previously observed state with 5caC constrained by fork loop 3, we discovered a new intermediate state with a hydrogen bond between 5caC and fork loop 2. Surprisingly, we find that 5caC may curb translocation by suppressing kinking of the helix bordering the active site (the bridge helix) because its high flexibility is critical to translocation. Our work provides new insights into how epigenetic modifications of genomic DNA can modulate Pol II translocation, inducing pauses in transcription.
Collapse
|
24
|
Marinova V, Dodd L, Lee SJ, Wood GPF, Marziano I, Salvalaglio M. Identifying Conformational Isomers of Organic Molecules in Solution via Unsupervised Clustering. J Chem Inf Model 2021; 61:2263-2273. [PMID: 33913713 PMCID: PMC8278389 DOI: 10.1021/acs.jcim.0c01387] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
We present a systematic approach for the identification of statistically relevant conformational macrostates of organic molecules from molecular dynamics trajectories. The approach applies to molecules characterized by an arbitrary number of torsional degrees of freedom and enables the transferability of the macrostates definition across different environments. We formulate a dissimilarity measure between molecular configurations that incorporates information on the characteristic energetic cost associated with transitions along all relevant torsional degrees of freedom. Such metric is employed to perform unsupervised clustering of molecular configurations based on the Fast Search and Find of Density Peaks algorithm. We apply this method to investigate the equilibrium conformational ensemble of Sildenafil, a conformationally complex pharmaceutical compound, in different environments including the crystal bulk, the gas phase, and three different solvents (acetonitrile, 1-butanol, and toluene). We demonstrate that while Sildenafil can adopt more than 100 metastable conformational configurations, only 12 are significantly populated across all of the environments investigated. Despite the complexity of the conformational space, we find that the most abundant conformers in solution are the closest to the conformers found in the most common Sildenafil crystal phase.
Collapse
Affiliation(s)
- Veselina Marinova
- Thomas Young Centre and Department of Chemical Engineering, University College London, London WC1E 7JE, U.K.,Department of Materials Science and Engineering, The University of Sheffield, Sheffield S1 3JD, U.K
| | - Laurence Dodd
- Thomas Young Centre and Department of Chemical Engineering, University College London, London WC1E 7JE, U.K
| | - Song-Jun Lee
- Thomas Young Centre and Department of Chemical Engineering, University College London, London WC1E 7JE, U.K
| | - Geoffrey P F Wood
- Pfizer Worldwide Research and Development, Groton Laboratories, Groton, Connecticut 06340, United States
| | - Ivan Marziano
- Pfizer Worldwide Research and Development, Sandwich CT13 9NJ, Kent, U.K
| | - Matteo Salvalaglio
- Thomas Young Centre and Department of Chemical Engineering, University College London, London WC1E 7JE, U.K
| |
Collapse
|
25
|
Role of bacterial RNA polymerase gate opening dynamics in DNA loading and antibiotics inhibition elucidated by quasi-Markov State Model. Proc Natl Acad Sci U S A 2021; 118:2024324118. [PMID: 33883282 DOI: 10.1073/pnas.2024324118] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
To initiate transcription, the holoenzyme (RNA polymerase [RNAP] in complex with σ factor) loads the promoter DNA via the flexible loading gate created by the clamp and β-lobe, yet their roles in DNA loading have not been characterized. We used a quasi-Markov State Model (qMSM) built from extensive molecular dynamics simulations to elucidate the dynamics of Thermus aquaticus holoenzyme's gate opening. We showed that during gate opening, β-lobe oscillates four orders of magnitude faster than the clamp, whose opening depends on the Switch 2's structure. Myxopyronin, an antibiotic that binds to Switch 2, was shown to undergo a conformational selection mechanism to inhibit clamp opening. Importantly, we reveal a critical but undiscovered role of β-lobe, whose opening is sufficient for DNA loading even when the clamp is partially closed. These findings open the opportunity for the development of antibiotics targeting β-lobe of RNAP. Finally, we have shown that our qMSMs, which encode non-Markovian dynamics based on the generalized master equation formalism, hold great potential to be widely applied to study biomolecular dynamics.
Collapse
|
26
|
Cao S, Montoya-Castillo A, Wang W, Markland TE, Huang X. On the advantages of exploiting memory in Markov state models for biomolecular dynamics. J Chem Phys 2021; 153:014105. [PMID: 32640825 DOI: 10.1063/5.0010787] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023] Open
Abstract
Biomolecular dynamics play an important role in numerous biological processes. Markov State Models (MSMs) provide a powerful approach to study these dynamic processes by predicting long time scale dynamics based on many short molecular dynamics (MD) simulations. In an MSM, protein dynamics are modeled as a kinetic process consisting of a series of Markovian transitions between different conformational states at discrete time intervals (called "lag time"). To achieve this, a master equation must be constructed with a sufficiently long lag time to allow interstate transitions to become truly Markovian. This imposes a major challenge for MSM studies of proteins since the lag time is bound by the length of relatively short MD simulations available to estimate the frequency of transitions. Here, we show how one can employ the generalized master equation formalism to obtain an exact description of protein conformational dynamics both at short and long time scales without the time resolution restrictions imposed by the MSM lag time. Using a simple kinetic model, alanine dipeptide, and WW domain, we demonstrate that it is possible to construct these quasi-Markov State Models (qMSMs) using MD simulations that are 5-10 times shorter than those required by MSMs. These qMSMs only contain a handful of metastable states and, thus, can greatly facilitate the interpretation of mechanisms associated with protein dynamics. A qMSM opens the door to the study of conformational changes of complex biomolecules where a Markovian model with a few states is often difficult to construct due to the limited length of available MD simulations.
Collapse
Affiliation(s)
- Siqin Cao
- Department of Chemistry, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
| | | | - Wei Wang
- Department of Chemistry, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
| | - Thomas E Markland
- Department of Chemistry, Stanford University, Stanford, California 94305, USA
| | - Xuhui Huang
- Department of Chemistry, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
| |
Collapse
|
27
|
Wang X, Unarta IC, Cheung PPH, Huang X. Elucidating molecular mechanisms of functional conformational changes of proteins via Markov state models. Curr Opin Struct Biol 2020; 67:69-77. [PMID: 33126140 DOI: 10.1016/j.sbi.2020.10.005] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2020] [Revised: 09/28/2020] [Accepted: 10/07/2020] [Indexed: 01/01/2023]
Abstract
Functional conformational changes of proteins can facilitate numerous biological events in cells. The Markov state model (MSM) built from molecular dynamics simulations provide a powerful approach to study them. We here introduce a protocol that is tailor-made for constructing MSMs to study the functional conformational changes of proteins. In this protocol, one of the important steps is to select proper molecular features that can collectively describe the slowest timescales of conformational changes of interest. We recommend spectral oASIS, the modified version of oASIS, as a promising approach for automatic feature selection. Recently developed deep learning methods could also serve efficient approaches for selecting features and finding collective variables. Using DNA repair enzymes and RNA polymerases as examples, we review recent applications of MSMs to elucidate molecular mechanisms of functional conformational changes. Finally, we discuss remaining challenges and future perspectives for constructing MSMs to study functional conformational changes of proteins.
Collapse
Affiliation(s)
- Xiaowei Wang
- The Hong Kong University of Science and Technology-Shenzhen Research Institute, Hi-Tech Park, Nanshan, Shenzhen 518057, China; Department of Chemistry, Center of Systems Biology and Human Health, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
| | - Ilona Christy Unarta
- Bioengineering Graduate Program, The Hong Kong University of Science and Technology, Kowloon, 4Hong Kong Center for Neurodegenerative Diseases, Hong Kong
| | - Peter Pak-Hang Cheung
- The Hong Kong University of Science and Technology-Shenzhen Research Institute, Hi-Tech Park, Nanshan, Shenzhen 518057, China; Department of Chemistry, Center of Systems Biology and Human Health, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
| | - Xuhui Huang
- The Hong Kong University of Science and Technology-Shenzhen Research Institute, Hi-Tech Park, Nanshan, Shenzhen 518057, China; Department of Chemistry, Center of Systems Biology and Human Health, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Kowloon, Hong Kong; Bioengineering Graduate Program, The Hong Kong University of Science and Technology, Kowloon, 4Hong Kong Center for Neurodegenerative Diseases, Hong Kong.
| |
Collapse
|
28
|
Villani G. A Time-Dependent Quantum Approach to Allostery and a Comparison With Light-Harvesting in Photosynthetic Phenomenon. Front Mol Biosci 2020; 7:156. [PMID: 33005625 PMCID: PMC7483663 DOI: 10.3389/fmolb.2020.00156] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2020] [Accepted: 06/19/2020] [Indexed: 11/26/2022] Open
Abstract
The allosteric effect is one of the most important processes in regulating the function of proteins, and the elucidation of this phenomenon plays a significant role in understanding emergent behaviors in biological regulation. In this process, a perturbation, generated by a ligand in a part of the macromolecule (the allosteric site), moves along this system and reaches a specific (active) site, dozens of Ångströms away, with a great efficiency. The dynamics of this perturbation in the macromolecule can model precisely the allosteric process. In this article, we will be studying the general characteristics of allostery, using a time-dependent quantum approach to obtain rules that apply to this kind of process. Considering the perturbation as a wave that moves within the molecular system, we will characterize the allosteric process with three of the properties of this wave in the active site: (1) ta, the characteristic time for reaching that site, (2) Aa, the amplitude of the wave in this site, and (3) Ba, its corresponding spectral broadening. These three parameters, together with the process mechanism and the perturbation efficiency in the process, can describe the phenomenon. One of the main purposes of this paper is to link the parameters ta, Aa, and Ba and the perturbation efficiency to the characteristics of the system. There is another fundamental process for life that has some characteristics similar to allostery: the light-harvesting (LH) process in photosynthesis. Here, as in allostery, two distant macromolecular sites are involved—two sites dozens of Ångströms away. In both processes, it is particularly important that the perturbation is distributed efficiently without dissipating in the infinite degrees of freedom within the macromolecule. The importance of considering quantum effects in the LH process is well documented in literature, and the quantum coherences are experimentally proven by time-dependent spectroscopic techniques. Given the existing similarities between these two processes in macromolecules, in this work, we suggest using Quantum Mechanics (QM) to study allostery.
Collapse
Affiliation(s)
- Giovanni Villani
- Istituto di Chimica dei Composti OrganoMetallici (UOS Pisa) - CNR, Area della Ricerca di Pisa, Pisa, Italy
| |
Collapse
|
29
|
Target search and recognition mechanisms of glycosylase AlkD revealed by scanning FRET-FCS and Markov state models. Proc Natl Acad Sci U S A 2020; 117:21889-21895. [PMID: 32820079 PMCID: PMC7486748 DOI: 10.1073/pnas.2002971117] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
DNA glycosylase repairs DNA damage to maintain the genome integrity, and thus it is essential for the survival of all organisms. However, it remains a long-standing puzzle how glycosylase diffuses along the genomic DNA to locate the sparse and aberrant lesion sites efficiently and accurately in the genome containing numerous base pairs. Previously, only the high-speed–low-accuracy search mode has been characterized experimentally, while the low-speed–high-accuracy mode is undetectable. Here, we observed the low-speed mode of glycosylase AlkD translocating, and further dissected its molecular mechanisms. To achieve this, we developed an integrated platform by combining scanning FRET-FCS with Markov state model. We expect that this platform can be widely applied to investigate other glycosylases and DNA-binding proteins. DNA glycosylase is responsible for repairing DNA damage to maintain the genome stability and integrity. However, how glycosylase can efficiently and accurately recognize DNA lesions across the enormous DNA genome remains elusive. It has been hypothesized that glycosylase translocates along the DNA by alternating between a fast but low-accuracy diffusion mode and a slow but high-accuracy mode when searching for DNA lesions. However, the slow mode has not been successfully characterized due to the limitation in the spatial and temporal resolutions of current experimental techniques. Using a newly developed scanning fluorescence resonance energy transfer (FRET)–fluorescence correlation spectroscopy (FCS) platform, we were able to observe both slow and fast modes of glycosylase AlkD translocating on double-stranded DNA (dsDNA), reaching the temporal resolution of microsecond and spatial resolution of subnanometer. The underlying molecular mechanism of the slow mode was further elucidated by Markov state model built from extensive all-atom molecular dynamics simulations. We found that in the slow mode, AlkD follows an asymmetric diffusion pathway, i.e., rotation followed by translation. Furthermore, the essential role of Y27 in AlkD diffusion dynamics was identified both experimentally and computationally. Our results provided mechanistic insights on how conformational dynamics of AlkD–dsDNA complex coordinate different diffusion modes to accomplish the search for DNA lesions with high efficiency and accuracy. We anticipate that the mechanism adopted by AlkD to search for DNA lesions could be a general one utilized by other glycosylases and DNA binding proteins.
Collapse
|
30
|
Ligand-bound glutamine binding protein assumes multiple metastable binding sites with different binding affinities. Commun Biol 2020; 3:419. [PMID: 32747735 PMCID: PMC7400645 DOI: 10.1038/s42003-020-01149-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2020] [Accepted: 07/14/2020] [Indexed: 11/08/2022] Open
Abstract
Protein dynamics plays key roles in ligand binding. However, the microscopic description of conformational dynamics-coupled ligand binding remains a challenge. In this study, we integrate molecular dynamics simulations, Markov state model (MSM) analysis and experimental methods to characterize the conformational dynamics of ligand-bound glutamine binding protein (GlnBP). We show that ligand-bound GlnBP has high conformational flexibility and additional metastable binding sites, presenting a more complex energy landscape than the scenario in the absence of ligand. The diverse conformations of GlnBP demonstrate different binding affinities and entail complex transition kinetics, implicating a concerted ligand binding mechanism. Single molecule fluorescence resonance energy transfer measurements and mutagenesis experiments are performed to validate our MSM-derived structure ensemble as well as the binding mechanism. Collectively, our study provides deeper insights into the protein dynamics-coupled ligand binding, revealing an intricate regulatory network underlying the apparent binding affinity. Zhang, Wu, Feng et al. show that ligand-bound glutamine binding protein assumes multiple metastable binding sites, presenting a more dynamic energy landscape than its ligand-free form. This study provides insights into the ligand-binding mechanisms coupled with protein dynamics that underly the apparent binding affinity.
Collapse
|
31
|
George A, Purnaprajna M, Athri P. Laplacian score and genetic algorithm based automatic feature selection for Markov State Models in adaptive sampling based molecular dynamics. PEERJ PHYSICAL CHEMISTRY 2020. [DOI: 10.7717/peerj-pchem.9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
Adaptive sampling molecular dynamics based on Markov State Models use short parallel MD simulations to accelerate simulations, and are proven to identify hidden conformers. The accuracy of the predictions provided by it depends on the features extracted from the simulated data that is used to construct it. The identification of the most important features in the trajectories of the simulated system has a considerable effect on the results.
Methods
In this study, we use a combination of Laplacian scoring and genetic algorithms to obtain an optimized feature subset for the construction of the MSM. The approach is validated on simulations of three protein folding complexes, and two protein ligand binding complexes.
Results
Our experiments show that this approach produces better results when the number of samples is significantly lesser than the number of features extracted. We also observed that this method mitigates over fitting that occurs due to high dimensionality of large biosystems with shorter simulation times.
Collapse
Affiliation(s)
- Anu George
- Department of Computer Science & Engineering, Amrita School of Engineering, Bengaluru, Amrita Vishwa Vidyapeetham, India
| | - Madhura Purnaprajna
- Department of Computer Science & Engineering, Amrita School of Engineering, Bengaluru, Amrita Vishwa Vidyapeetham, India
| | - Prashanth Athri
- Department of Computer Science & Engineering, Amrita School of Engineering, Bengaluru, Amrita Vishwa Vidyapeetham, India
| |
Collapse
|
32
|
Kabir KL, Akhter N, Shehu A. From molecular energy landscapes to equilibrium dynamics via landscape analysis and markov state models. J Bioinform Comput Biol 2020; 17:1940014. [DOI: 10.1142/s0219720019400146] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Molecular dynamics (MD) simulation software allows probing the equilibrium structural dynamics of a molecule of interest, revealing how a molecule navigates its structure space one structure at a time. To obtain a broader view of dynamics, typically one needs to launch many such simulations, obtaining many trajectories. A summarization of the equilibrium dynamics requires integrating the information in the various trajectories, and Markov State Models (MSM) are increasingly being used for this task. At its core, the task involves organizing the structures accessed in simulation into structural states, and then constructing a transition probability matrix revealing the transitions between states. While now considered a mature technology and widely used to summarize equilibrium dynamics, the underlying computational process in the construction of an MSM ignores energetics even though the transition of a molecule between two nearby structures in an MD trajectory is governed by the corresponding energies. In this paper, we connect theory with simulation and analysis of equilibrium dynamics. A molecule navigates the energy landscape underlying the structure space. The structural states that are identified via off-the-shelf clustering algorithms need to be connected to thermodynamically-stable and semi-stable (macro)states among which transitions can then be quantified. Leveraging recent developments in the analysis of energy landscapes that identify basins in the landscape, we evaluate the hypothesis that basins, directly tied to stable and semi-stable states, lead to better models of dynamics. Our analysis indicates that basins lead to MSMs of better quality and thus can be useful to further advance this widely-used technology for summarization of molecular equilibrium dynamics.
Collapse
Affiliation(s)
- Kazi Lutful Kabir
- Department of Computer Science, George Mason University, 4400 University Drive, Fairfax, VA 22030, USA
| | - Nasrin Akhter
- Department of Computer Science, George Mason University, 4400 University Drive, Fairfax, VA 22030, USA
| | - Amarda Shehu
- Department of Computer Science, Department of Bioengineering, School of Systems Biology, George Mason University, 4400 University Drive, Fairfax, VA 22030, USA
| |
Collapse
|
33
|
Juárez-Jiménez J, Gupta AA, Karunanithy G, Mey ASJS, Georgiou C, Ioannidis H, De Simone A, Barlow PN, Hulme AN, Walkinshaw MD, Baldwin AJ, Michel J. Dynamic design: manipulation of millisecond timescale motions on the energy landscape of cyclophilin A. Chem Sci 2020; 11:2670-2680. [PMID: 34084326 PMCID: PMC8157532 DOI: 10.1039/c9sc04696h] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2019] [Accepted: 01/14/2020] [Indexed: 12/21/2022] Open
Abstract
Proteins need to interconvert between many conformations in order to function, many of which are formed transiently, and sparsely populated. Particularly when the lifetimes of these states approach the millisecond timescale, identifying the relevant structures and the mechanism by which they interconvert remains a tremendous challenge. Here we introduce a novel combination of accelerated MD (aMD) simulations and Markov state modelling (MSM) to explore these 'excited' conformational states. Applying this to the highly dynamic protein CypA, a protein involved in immune response and associated with HIV infection, we identify five principally populated conformational states and the atomistic mechanism by which they interconvert. A rational design strategy predicted that the mutant D66A should stabilise the minor conformations and substantially alter the dynamics, whereas the similar mutant H70A should leave the landscape broadly unchanged. These predictions are confirmed using CPMG and R1ρ solution state NMR measurements. By efficiently exploring functionally relevant, but sparsely populated conformations with millisecond lifetimes in silico, our aMD/MSM method has tremendous promise for the design of dynamic protein free energy landscapes for both protein engineering and drug discovery.
Collapse
Affiliation(s)
- Jordi Juárez-Jiménez
- EaStCHEM School of Chemistry, University of Edinburgh David Brewster Road Edinburgh EH9 3FJ UK
| | - Arun A Gupta
- EaStCHEM School of Chemistry, University of Edinburgh David Brewster Road Edinburgh EH9 3FJ UK
| | - Gogulan Karunanithy
- Department of Chemistry, Physical and Theoretical Chemistry Laboratory, University of Oxford South Parks Road Oxford OX1 3QZ UK
| | - Antonia S J S Mey
- EaStCHEM School of Chemistry, University of Edinburgh David Brewster Road Edinburgh EH9 3FJ UK
| | - Charis Georgiou
- EaStCHEM School of Chemistry, University of Edinburgh David Brewster Road Edinburgh EH9 3FJ UK
| | - Harris Ioannidis
- EaStCHEM School of Chemistry, University of Edinburgh David Brewster Road Edinburgh EH9 3FJ UK
| | - Alessio De Simone
- EaStCHEM School of Chemistry, University of Edinburgh David Brewster Road Edinburgh EH9 3FJ UK
| | - Paul N Barlow
- EaStCHEM School of Chemistry, University of Edinburgh David Brewster Road Edinburgh EH9 3FJ UK
| | - Alison N Hulme
- EaStCHEM School of Chemistry, University of Edinburgh David Brewster Road Edinburgh EH9 3FJ UK
| | - Malcolm D Walkinshaw
- School of Biological Sciences Michael Swann Building, Max Born Crescent Edinburgh EH9 3BF UK
| | - Andrew J Baldwin
- Department of Chemistry, Physical and Theoretical Chemistry Laboratory, University of Oxford South Parks Road Oxford OX1 3QZ UK
| | - Julien Michel
- EaStCHEM School of Chemistry, University of Edinburgh David Brewster Road Edinburgh EH9 3FJ UK
| |
Collapse
|
34
|
Liu H, Zhong H, Xu Z, Zhang Q, Shah SJA, Liu H, Yao X. The misfolding mechanism of the key fragment R3 of tau protein: a combined molecular dynamics simulation and Markov state model study. Phys Chem Chem Phys 2020; 22:10968-10980. [DOI: 10.1039/c9cp06954b] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
All-atom molecular dynamics (MD) simulation combined with Markov state model (MSM) were used to uncover the structural characteristics and misfolding mechanism of the key R3 fragment of tau protein at the atomic level.
Collapse
Affiliation(s)
- Hongli Liu
- School of Pharmacy
- Lanzhou University
- Lanzhou 730000
- China
- Jiangsu Key Laboratory of New Drug Research and Clinical Pharmacy
| | - Haiyang Zhong
- State Key Laboratory of Applied Organic Chemistry and Department of Chemistry
- Lanzhou University
- Lanzhou 730000
- China
| | - Zerong Xu
- School of Pharmacy
- Lanzhou University
- Lanzhou 730000
- China
| | | | | | - Huanxiang Liu
- School of Pharmacy
- Lanzhou University
- Lanzhou 730000
- China
| | - Xiaojun Yao
- State Key Laboratory of Applied Organic Chemistry and Department of Chemistry
- Lanzhou University
- Lanzhou 730000
- China
- State Key Laboratory of Quality Research in Chinese Medicine
| |
Collapse
|
35
|
Exploration of the Misfolding Mechanism of Transthyretin Monomer: Insights from Hybrid-Resolution Simulations and Markov State Model Analysis. Biomolecules 2019; 9:biom9120889. [PMID: 31861226 PMCID: PMC6995605 DOI: 10.3390/biom9120889] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2019] [Revised: 12/13/2019] [Accepted: 12/15/2019] [Indexed: 01/08/2023] Open
Abstract
Misfolding and aggregation of transthyretin (TTR) is widely known to be responsible for a progressive systemic disorder called amyloid transthyretin (ATTR) amyloidosis. Studies suggest that TTR aggregation is initiated by a rate-limiting dissociation of the homo-tetramer into its monomers, which can rapidly misfold and self-assemble into amyloid fibril. Thus, exploring conformational change involved in TTR monomer misfolding is of vital importance for understanding the pathogenesis of ATTR amyloidosis. In this work, microsecond timescale hybrid-resolution molecular dynamics (MD) simulations combined with Markov state model (MSM) analysis were performed to investigate the misfolding mechanism of the TTR monomer. The results indicate that a macrostate with partially unfolded conformations may serve as the misfolded state of the TTR monomer. This misfolded state was extremely stable with a very large equilibrium probability of about 85.28%. With secondary structure analysis, we found the DAGH sheet in this state to be significantly destroyed. The CBEF sheet was relatively stable and sheet structure was maintained. However, the F-strand in this sheet was likely to move away from E-strand and reform a new β-sheet with the H-strand. This observation is consistent with experimental finding that F and H strands in the outer edge drive the misfolding of TTR. Finally, transition pathways from a near native state to this misfolded macrostate showed that the conformational transition can occur either through a native-like β-sheet intermediates or through partially unfolded intermediates, while the later appears to be the main pathway. As a whole, we identified a potential misfolded state of the TTR monomer and elucidated the misfolding pathway for its conformational transition. This work can provide a valuable theoretical basis for understanding of TTR aggregation and the pathogenesis of ATTR amyloidosis at the atomic level.
Collapse
|
36
|
Beyerle ER, Guenza MG. Kinetics analysis of ubiquitin local fluctuations with Markov state modeling of the LE4PD normal modes. J Chem Phys 2019; 151:164119. [PMID: 31675886 DOI: 10.1063/1.5123513] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Local fluctuations are important for protein binding and molecular recognition because they provide conformational states that can be trapped through a selection mechanism of binding. Thus, an accurate characterization of local fluctuations may be important for modeling the kinetic mechanism that leads to the biological activity of a protein. In this paper, we study the fluctuation dynamics of the regulatory protein ubiquitin and propose a novel theoretical approach to model its fluctuations. A coarse-grained, diffusive, mode-dependent description of fluctuations is accomplished using the Langevin Equation for Protein Dynamics (LE4PD). This equation decomposes the dynamics of a protein, simulated by molecular dynamics, into dynamical pathways that explore mode-dependent free energy surfaces. We calculate the time scales of the slow, high-amplitude fluctuations by modeling the kinetics of barrier crossing in the two-dimensional free energy surfaces using Markov state modeling. We find that the LE4PD predicts slow fluctuations in three important binding regions in ubiquitin: the C-terminal tail, the Lys11 loop, and the 50 s loop. These results suggest that the LE4PD can provide useful information on the role of fluctuations in the process of molecular recognition regulating the biological activity of ubiquitin.
Collapse
Affiliation(s)
- Eric R Beyerle
- Department of Chemistry and Biochemistry and Institute of Theoretical Science, University of Oregon, Eugene, Oregon 97403, USA
| | - Marina G Guenza
- Department of Chemistry and Biochemistry and Institute of Theoretical Science, University of Oregon, Eugene, Oregon 97403, USA
| |
Collapse
|
37
|
Sun B, Vaughan D, Tikunova S, Creamer TP, Davis JP, Kekenes-Huskey PM. Calmodulin-Calcineurin Interaction beyond the Calmodulin-Binding Region Contributes to Calcineurin Activation. Biochemistry 2019; 58:4070-4085. [PMID: 31483613 DOI: 10.1021/acs.biochem.9b00626] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
Calcineurin (CaN) is a calcium-dependent phosphatase involved in numerous signaling pathways. Its activation is in part driven by the binding of calmodulin (CaM) to a CaM recognition region (CaMBR) within CaN's regulatory domain (RD). However, secondary interactions between CaM and the CaN RD may be necessary to fully activate CaN. Specifically, it is established that the CaN RD folds upon CaM binding and a region C-terminal to CaMBR, the "distal helix", assumes an α-helix fold and contributes to activation [Dunlap, T. B., et al. (2013) Biochemistry 52, 8643-8651]. We hypothesized in that previous study that this distal helix can bind CaM in a region distinct from the canonical CaMBR. To test this hypothesis, we utilized molecular simulations, including replica-exchange molecular dynamics, protein-protein docking, and computational mutagenesis, to determine potential distal helix-binding sites on CaM's surface. We isolated a potential binding site on CaM (site D) that facilitates moderate-affinity interprotein interactions and predicted that mutation of site D residues K30 and G40 on CaM would weaken CaN distal helix binding. We experimentally confirmed that two variants (K30E and G40D) indicate weaker binding of a phosphate substrate p-nitrophenyl phosphate to the CaN catalytic site by a phosphatase assay. This weakened substrate affinity is consistent with competitive binding of the CaN autoinhibition domain to the catalytic site, which we suggest is due to the weakened distal helix-CaM interactions. This study therefore suggests a novel mechanism for CaM regulation of CaN that may extend to other CaM targets.
Collapse
Affiliation(s)
- Bin Sun
- Department of Chemistry , University of Kentucky , Lexington , Kentucky 40506 , United States
| | - Darin Vaughan
- Department of Chemistry , University of Kentucky , Lexington , Kentucky 40506 , United States
| | - Svetlana Tikunova
- Department of Physiology and Cell Biology , The Ohio State University , Columbus , Ohio 43210 , United States
| | - Trevor P Creamer
- Center for Structural Biology and Department of Molecular & Cellular Biochemistry , University of Kentucky , Lexington , Kentucky 40536 , United States
| | - Jonathan P Davis
- Department of Physiology and Cell Biology , The Ohio State University , Columbus , Ohio 43210 , United States
| | - P M Kekenes-Huskey
- Department of Chemical and Materials Engineering , University of Kentucky , Lexington , Kentucky 40506 , United States.,Department of Cell and Molecular Physiology , Loyola University Chicago , Maywood , Illinois 60153 , United States
| |
Collapse
|
38
|
Tan Q, Duan M, Li M, Han L, Huo S. Approximating dynamic proximity with a hybrid geometry energy-based kernel for diffusion maps. J Chem Phys 2019; 151:105101. [PMID: 31521094 DOI: 10.1063/1.5100968] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The diffusion map is a dimensionality reduction method. The reduction coordinates are associated with the leading eigenfunctions of the backward Fokker-Planck operator, providing a dynamic meaning for these coordinates. One of the key factors that affect the accuracy of diffusion map embedding is the dynamic measure implemented in the Gaussian kernel. A common practice in diffusion map study of molecular systems is to approximate dynamic proximity with RMSD (root-mean-square deviation). In this paper, we present a hybrid geometry-energy based kernel. Since high energy-barriers may exist between geometrically similar conformations, taking both RMSD and energy difference into account in the kernel can better describe conformational transitions between neighboring conformations and lead to accurate embedding. We applied our diffusion map method to the β-hairpin of the B1 domain of streptococcal protein G and to Trp-cage. Our results in β-hairpin show that the diffusion map embedding achieves better results with the hybrid kernel than that with the RMSD-based kernel in terms of free energy landscape characterization and a new correlation measure between the cluster center Euclidean distances in the reduced-dimension space and the reciprocals of the total net flow between these clusters. In addition, our diffusion map analysis of the ultralong molecular dynamics trajectory of Trp-cage has provided a unified view of its folding mechanism. These promising results demonstrate the effectiveness of our diffusion map approach in the analysis of the dynamics and thermodynamics of molecular systems. The hybrid geometry-energy criterion could be also useful as a general dynamic measure for other purposes.
Collapse
Affiliation(s)
- Qingzhe Tan
- Gustaf H. Carlson School of Chemistry and Biochemistry, Clark University, 950 Main Street, Worcester, Massachusetts 01610, USA
| | - Mojie Duan
- Gustaf H. Carlson School of Chemistry and Biochemistry, Clark University, 950 Main Street, Worcester, Massachusetts 01610, USA
| | - Minghai Li
- Gustaf H. Carlson School of Chemistry and Biochemistry, Clark University, 950 Main Street, Worcester, Massachusetts 01610, USA
| | - Li Han
- Department of Math and Computer Science, Clark University, Worcester, Massachusetts 01610, USA
| | - Shuanghong Huo
- Gustaf H. Carlson School of Chemistry and Biochemistry, Clark University, 950 Main Street, Worcester, Massachusetts 01610, USA
| |
Collapse
|
39
|
Dämgen MA, Biggin PC. Computational methods to examine conformational changes and ligand-binding properties: Examples in neurobiology. Neurosci Lett 2019. [DOI: 10.1016/j.neulet.2018.03.004] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
|
40
|
Structural basis for ligand modulation of the CCR2 conformational landscape. Proc Natl Acad Sci U S A 2019; 116:8131-8136. [PMID: 30975755 DOI: 10.1073/pnas.1814131116] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
CC chemokine receptor 2 (CCR2) is a part of the chemokine receptor family, an important class of therapeutic targets. These class A G-protein coupled receptors (GPCRs) are involved in mammalian signaling pathways and control cell migration toward endogenous CC chemokine ligands, named for the adjacent cysteine motif on their N terminus. Chemokine receptors and their associated ligands are involved in a wide range of diseases and thus have become important drug targets. CCR2, in particular, promotes the metastasis of cancer cells and is also implicated in autoimmunity-driven type-1 diabetes, diabetic nephropathy, multiple sclerosis, asthma, atherosclerosis, neuropathic pain, and rheumatoid arthritis. Although promising, CCR2 antagonists have been largely unsuccessful to date. Here, we investigate the effect of an orthosteric and an allosteric antagonist on CCR2 dynamics by coupling long-timescale molecular dynamics simulations with Markov-state model theory. We find that the antagonists shift CCR2 into several stable inactive conformations that are distinct from the crystal structure conformation and disrupt a continuous internal water and sodium ion pathway, preventing transitions to an active-like state. Several metastable conformations present a cryptic drug-binding pocket near the allosteric site that may be amenable to targeting with small molecules. Without antagonists, the apo dynamics reveal intermediate conformations along the activation pathway that provide insight into the basal dynamics of CCR2 and may also be useful for future drug design.
Collapse
|
41
|
Zhu L, Sheong FK, Cao S, Liu S, Unarta IC, Huang X. TAPS: A traveling-salesman based automated path searching method for functional conformational changes of biological macromolecules. J Chem Phys 2019; 150:124105. [DOI: 10.1063/1.5082633] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open
Affiliation(s)
- Lizhe Zhu
- Department of Chemistry, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
- Warshel Institute for Computational Biology, School of Life and Health Sciences, The Chinese University of Hong Kong (Shenzhen), Shenzhen, Guangdong 518172, China
| | - Fu Kit Sheong
- Department of Chemistry, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
| | - Siqin Cao
- Department of Chemistry, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
| | - Song Liu
- Department of Chemistry, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
| | - Ilona C. Unarta
- Department of Chemistry, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
| | - Xuhui Huang
- Department of Chemistry, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
- Bioengineering Program, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
- HKUST-Shenzhen Research Institute, Hi-Tech Park, Nanshan, Shenzhen 518057, China
| |
Collapse
|
42
|
Leong KY, Wang F. On approximating a weak Markovian process as Markovian: Are we justified when discarding longtime correlations. J Chem Phys 2019; 150:085101. [PMID: 30823762 DOI: 10.1063/1.5056242] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The effect for removing weak longtime correlation is studied using a model system that contains a driven atom at liquid density under strong thermal fluctuations. The force that drives the tagged particle is about 1% of the average random force experienced by the particle. The tagged particle is allowed to assume a range of masses from 1/8 to 80 times that of a surrounding particle to study the effects of inertia. The driving force is indefinitely correlated but much weaker than "random" fluctuations from the environment. From this study, it is shown that the environmental influence is not fully random leading to the force autocorrelation function being a poor metric for detecting the correlated driving force. Although the velocity autocorrelation function shows stronger correlation for systems with higher inertia, the velocity autocorrelation function decays to a very small value of 2.5×10-3 even for the most massive driven particle. For systems with small inertia, our study reveals that discarding longtime correlation has negligible influence on the first passage time (FPT) estimate, whereas for particles with large inertia, the deviation can indeed be appreciable. It is interesting that the Markov State Model (MSM) still produces reasonable estimates on the FPT even when a very short lag time that clearly violates the Markovianity assumption is used. This is likely a result of favorable error cancellations when the MSM transition probability matrices were constructed using trajectories that are much longer than the lag time.
Collapse
Affiliation(s)
- Kai-Yang Leong
- Department of Chemistry and Biochemistry, University of Arkansas, Fayetteville, Arkansas 72701, USA
| | - Feng Wang
- Department of Chemistry and Biochemistry, University of Arkansas, Fayetteville, Arkansas 72701, USA
| |
Collapse
|
43
|
|
44
|
Huggins DJ, Biggin PC, Dämgen MA, Essex JW, Harris SA, Henchman RH, Khalid S, Kuzmanic A, Laughton CA, Michel J, Mulholland AJ, Rosta E, Sansom MSP, van der Kamp MW. Biomolecular simulations: From dynamics and mechanisms to computational assays of biological activity. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2018. [DOI: 10.1002/wcms.1393] [Citation(s) in RCA: 71] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Affiliation(s)
- David J. Huggins
- TCM Group, Cavendish Laboratory University of Cambridge Cambridge UK
- Unilever Centre, Department of Chemistry University of Cambridge Cambridge UK
- Department of Physiology and Biophysics Weill Cornell Medical College New York NY
| | | | - Marc A. Dämgen
- Department of Biochemistry University of Oxford Oxford UK
| | - Jonathan W. Essex
- School of Chemistry University of Southampton Southampton UK
- Institute for Life Sciences University of Southampton Southampton UK
| | - Sarah A. Harris
- School of Physics and Astronomy University of Leeds Leeds UK
- Astbury Centre for Structural and Molecular Biology University of Leeds Leeds UK
| | - Richard H. Henchman
- Manchester Institute of Biotechnology The University of Manchester Manchester UK
- School of Chemistry The University of Manchester Oxford UK
| | - Syma Khalid
- School of Chemistry University of Southampton Southampton UK
- Institute for Life Sciences University of Southampton Southampton UK
| | | | - Charles A. Laughton
- School of Pharmacy University of Nottingham Nottingham UK
- Centre for Biomolecular Sciences University of Nottingham Nottingham UK
| | - Julien Michel
- EaStCHEM school of Chemistry University of Edinburgh Edinburgh UK
| | - Adrian J. Mulholland
- Centre of Computational Chemistry, School of Chemistry University of Bristol Bristol UK
| | - Edina Rosta
- Department of Chemistry King's College London London UK
| | | | - Marc W. van der Kamp
- Centre of Computational Chemistry, School of Chemistry University of Bristol Bristol UK
- School of Biochemistry, Biomedical Sciences Building University of Bristol Bristol UK
| |
Collapse
|
45
|
Wang W, Liang T, Sheong FK, Fan X, Huang X. An efficient Bayesian kinetic lumping algorithm to identify metastable conformational states via Gibbs sampling. J Chem Phys 2018; 149:072337. [PMID: 30134698 DOI: 10.1063/1.5027001] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
Markov State Model (MSM) has become a popular approach to study the conformational dynamics of complex biological systems in recent years. Built upon a large number of short molecular dynamics simulation trajectories, MSM is able to predict the long time scale dynamics of complex systems. However, to achieve Markovianity, an MSM often contains hundreds or thousands of states (microstates), hindering human interpretation of the underlying system mechanism. One way to reduce the number of states is to lump kinetically similar states together and thus coarse-grain the microstates into macrostates. In this work, we introduce a probabilistic lumping algorithm, the Gibbs lumping algorithm, to assign a probability to any given kinetic lumping using the Bayesian inference. In our algorithm, the transitions among kinetically distinct macrostates are modeled by Poisson processes, which will well reflect the separation of time scales in the underlying free energy landscape of biomolecules. Furthermore, to facilitate the search for the optimal kinetic lumping (i.e., the lumped model with the highest probability), a Gibbs sampling algorithm is introduced. To demonstrate the power of our new method, we apply it to three systems: a 2D potential, alanine dipeptide, and a WW protein domain. In comparison with six other popular lumping algorithms, we show that our method can persistently produce the lumped macrostate model with the highest probability as well as the largest metastability. We anticipate that our Gibbs lumping algorithm holds great promise to be widely applied to investigate conformational changes in biological macromolecules.
Collapse
Affiliation(s)
- Wei Wang
- HKUST-Shenzhen Research Institute, Hi-Tech Park, Nanshan, Shenzhen 518057, China
| | - Tong Liang
- Department of Statistics, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong
| | - Fu Kit Sheong
- Department of Chemistry, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
| | - Xiaodan Fan
- Department of Statistics, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong
| | - Xuhui Huang
- HKUST-Shenzhen Research Institute, Hi-Tech Park, Nanshan, Shenzhen 518057, China
| |
Collapse
|
46
|
Amaro RE, Baudry J, Chodera J, Demir Ö, McCammon JA, Miao Y, Smith JC. Ensemble Docking in Drug Discovery. Biophys J 2018; 114:2271-2278. [PMID: 29606412 DOI: 10.1016/j.bpj.2018.02.038] [Citation(s) in RCA: 263] [Impact Index Per Article: 43.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2017] [Revised: 02/13/2018] [Accepted: 02/20/2018] [Indexed: 12/11/2022] Open
Abstract
Ensemble docking corresponds to the generation of an "ensemble" of drug target conformations in computational structure-based drug discovery, often obtained by using molecular dynamics simulation, that is used in docking candidate ligands. This approach is now well established in the field of early-stage drug discovery. This review gives a historical account of the development of ensemble docking and discusses some pertinent methodological advances in conformational sampling.
Collapse
Affiliation(s)
- Rommie E Amaro
- Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, California
| | - Jerome Baudry
- University of Alabama at Huntsville, Huntsville, Alabama
| | - John Chodera
- University of California, Berkeley, Berkeley, California
| | - Özlem Demir
- Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, California
| | - J Andrew McCammon
- Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, California
| | - Yinglong Miao
- Department of Computational Biology and Molecular Biosciences, University of Kansas, Lawrence, Kansas
| | - Jeremy C Smith
- UT/ORNL Center for Molecular Biophysics, Oak Ridge National Laboratory, Oak Ridge, Tennessee; Department of Biochemistry and Cellular and Molecular Biology, University of Tennessee, Knoxville, Tennessee.
| |
Collapse
|
47
|
Zeng X, Zhu L, Zheng X, Cecchini M, Huang X. Harnessing complexity in molecular self-assembly using computer simulations. Phys Chem Chem Phys 2018; 20:6767-6776. [PMID: 29479585 DOI: 10.1039/c7cp06181a] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
In molecular self-assembly, hundreds of thousands of freely-diffusing molecules associate to form ordered and functional architectures in the absence of an actuator. This intriguing phenomenon plays a critical role in biology and has become a powerful tool for the fabrication of advanced nanomaterials. Due to the limited spatial and temporal resolutions of current experimental techniques, computer simulations offer a complementary strategy to explore self-assembly with atomic resolution. Here, we review recent computational studies focusing on both thermodynamic and kinetic aspects. As we shall see, thermodynamic approaches based on modeling and statistical mechanics offer initial guidelines to design nanostructures with modest computational effort. Computationally more intensive analyses based on molecular dynamics simulations and kinetic network models (KNMs) reach beyond it, opening the door to the rational design of self-assembly pathways. Current limitations of these methodologies are discussed. We anticipate that the synergistic use of thermodynamic and kinetic analyses based on computer simulations will provide an important contribution to the de novo design of self-assembly.
Collapse
Affiliation(s)
- Xiangze Zeng
- Department of Chemistry, Center of Systems Biology and Human Health, State Key Laboratory of Molecular Neuroscience, Hong Kong Branch of Chinese National Engineering Research Center for Tissue Restoration & Reconstruction, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong.
| | | | | | | | | |
Collapse
|
48
|
Zhu L, Sheong FK, Zeng X, Huang X. Elucidation of the conformational dynamics of multi-body systems by construction of Markov state models. Phys Chem Chem Phys 2018; 18:30228-30235. [PMID: 27314275 DOI: 10.1039/c6cp02545e] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
Constructing Markov State Models (MSMs) based on short molecular dynamics simulations is a powerful computational technique to complement experiments in predicting long-time kinetics of biomolecular processes at atomic resolution. Even though the MSM approach has been widely applied to study one-body processes such as protein folding and enzyme conformational changes, the majority of biological processes, e.g. protein-ligand recognition, signal transduction, and protein aggregation, essentially involve multiple entities. Here we review the attempts at constructing MSMs for multi-body systems, point out the challenges therein and discuss recent algorithmic progresses that alleviate these challenges. In particular, we describe an automatic kinetics based partitioning method that achieves optimal definition of the conformational states in a multi-body system, and discuss a novel maximum-likelihood approach that efficiently estimates the slow uphill kinetics utilizing pre-computed equilibrium populations of all states. We expect that these new algorithms and their combinations may boost investigations of important multi-body biological processes via the efficient construction of MSMs.
Collapse
Affiliation(s)
- Lizhe Zhu
- Department of Chemistry, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong, China. and Centre of Systems Biology and Human Health, School of Science and Institute for Advance Study, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong, China
| | - Fu Kit Sheong
- Department of Chemistry, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong, China.
| | - Xiangze Zeng
- Department of Chemistry, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong, China. and Centre of Systems Biology and Human Health, School of Science and Institute for Advance Study, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong, China
| | - Xuhui Huang
- Department of Chemistry, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong, China. and Centre of Systems Biology and Human Health, School of Science and Institute for Advance Study, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong, China
| |
Collapse
|
49
|
Da LT, E C, Shuai Y, Wu S, Su XD, Yu J. T7 RNA polymerase translocation is facilitated by a helix opening on the fingers domain that may also prevent backtracking. Nucleic Acids Res 2017; 45:7909-7921. [PMID: 28575393 PMCID: PMC5737862 DOI: 10.1093/nar/gkx495] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2017] [Accepted: 05/24/2017] [Indexed: 12/04/2022] Open
Abstract
Here, we studied the complete process of a viral T7 RNA polymerase (RNAP) translocation on DNA during transcription elongation by implementing extensive all-atom molecular dynamics (MD) simulations to construct a Markov state model (MSM). Our studies show that translocation proceeds in a Brownian motion, and the RNAP thermally transits among multiple metastable states. We observed non-synchronized backbone movements of the nucleic acid (NA) chains with the RNA translocation accomplished first, while the template DNA lagged. Notably, both the O-helix and Y-helix on the fingers domain play key roles in facilitating NA translocation through the helix opening. The helix opening allows a key residue Tyr639 to become inserted into the active site, which pushes the RNA–DNA hybrid forward. Another key residue, Phe644, coordinates the downstream template DNA motions by stacking and un-stacking with a transition nucleotide (TN) and its adjacent nucleotide. Moreover, the O-helix opening at pre-translocation (pre-trans) likely resists backtracking. To test this hypothesis, we computationally designed mutants of T7 RNAP by replacing the amino acids on the O-helix with counterpart residues from a mitochondrial RNAP that is capable of backtracking. The current experimental results support the hypothesis.
Collapse
Affiliation(s)
- Lin-Tai Da
- Beijing Computational Science Research Center, Beijing 100193, China.,Key Laboratory of Systems Biomedicine (Ministry of Education), Shanghai Center for Systems Biomedicine, Shanghai JiaoTong University, 800 Dongchuan Road, Shanghai 200240, China
| | - Chao E
- Beijing Computational Science Research Center, Beijing 100193, China
| | - Yao Shuai
- School of Life Sciences, Peking University, Beijing 100871, China
| | - Shaogui Wu
- Beijing Computational Science Research Center, Beijing 100193, China
| | - Xiao-Dong Su
- School of Life Sciences, Peking University, Beijing 100871, China
| | - Jin Yu
- Beijing Computational Science Research Center, Beijing 100193, China
| |
Collapse
|
50
|
McKiernan KA, Husic BE, Pande VS. Modeling the mechanism of CLN025 beta-hairpin formation. J Chem Phys 2017; 147:104107. [PMID: 28915754 PMCID: PMC5597441 DOI: 10.1063/1.4993207] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2017] [Accepted: 08/24/2017] [Indexed: 01/26/2023] Open
Abstract
Beta-hairpins are substructures found in proteins that can lend insight into more complex systems. Furthermore, the folding of beta-hairpins is a valuable test case for benchmarking experimental and theoretical methods. Here, we simulate the folding of CLN025, a miniprotein with a beta-hairpin structure, at its experimental melting temperature using a range of state-of-the-art protein force fields. We construct Markov state models in order to examine the thermodynamics, kinetics, mechanism, and rate-determining step of folding. Mechanistically, we find the folding process is rate-limited by the formation of the turn region hydrogen bonds, which occurs following the downhill hydrophobic collapse of the extended denatured protein. These results are presented in the context of established and contradictory theories of the beta-hairpin folding process. Furthermore, our analysis suggests that the AMBER-FB15 force field, at this temperature, best describes the characteristics of the full experimental CLN025 conformational ensemble, while the AMBER ff99SB-ILDN and CHARMM22* force fields display a tendency to overstabilize the native state.
Collapse
Affiliation(s)
- Keri A McKiernan
- Department of Chemistry, Stanford University, Stanford, California 94305, USA
| | - Brooke E Husic
- Department of Chemistry, Stanford University, Stanford, California 94305, USA
| | - Vijay S Pande
- Department of Chemistry, Stanford University, Stanford, California 94305, USA
| |
Collapse
|