1
|
Ge Y, Voelz VA. Estimation of binding rates and affinities from multiensemble Markov models and ligand decoupling. J Chem Phys 2022; 156:134115. [PMID: 35395889 PMCID: PMC8993428 DOI: 10.1063/5.0088024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Accurate and efficient simulation of the thermodynamics and kinetics of protein-ligand interactions is crucial for computational drug discovery. Multiensemble Markov Model (MEMM) estimators can provide estimates of both binding rates and affinities from collections of short trajectories but have not been systematically explored for situations when a ligand is decoupled through scaling of non-bonded interactions. In this work, we compare the performance of two MEMM approaches for estimating ligand binding affinities and rates: (1) the transition-based reweighting analysis method (TRAM) and (2) a Maximum Caliber (MaxCal) based method. As a test system, we construct a small host-guest system where the ligand is a single uncharged Lennard-Jones (LJ) particle, and the receptor is an 11-particle icosahedral pocket made from the same atom type. To realistically mimic a protein-ligand binding system, the LJ ϵ parameter was tuned, and the system was placed in a periodic box with 860 TIP3P water molecules. A benchmark was performed using over 80 µs of unbiased simulation, and an 18-state Markov state model was used to estimate reference binding affinities and rates. We then tested the performance of TRAM and MaxCal when challenged with limited data. Both TRAM and MaxCal approaches perform better than conventional Markov state models, with TRAM showing better convergence and accuracy. We find that subsampling of trajectories to remove time correlation improves the accuracy of both TRAM and MaxCal and that in most cases, only a single biased ensemble to enhance sampled transitions is required to make accurate estimates.
Collapse
Affiliation(s)
- Yunhui Ge
- Department of Pharmaceutical Sciences, University of California, Irvine, California 92697, USA
| | - Vincent A Voelz
- Department of Chemistry, Temple University, Philadelphia, Pennsylvania 19122, USA
| |
Collapse
|
2
|
Brotzakis ZF, Vendruscolo M, Bolhuis PG. A method of incorporating rate constants as kinetic constraints in molecular dynamics simulations. Proc Natl Acad Sci U S A 2021; 118:e2012423118. [PMID: 33376207 PMCID: PMC7812743 DOI: 10.1073/pnas.2012423118] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
From the point of view of statistical mechanics, a full characterization of a molecular system requires an accurate determination of its possible states, their populations, and the respective interconversion rates. Toward this goal, well-established methods increase the accuracy of molecular dynamics simulations by incorporating experimental information about states using structural restraints and about populations using thermodynamics restraints. However, it is still unclear how to include experimental knowledge about interconversion rates. Here, we introduce a method of imposing known rate constants as constraints in molecular dynamics simulations, which is based on a combination of the maximum-entropy and maximum-caliber principles. Starting from an existing ensemble of trajectories, obtained from either molecular dynamics or enhanced trajectory sampling, this method provides a minimally perturbed path distribution consistent with the kinetic constraints, as well as modified free energy and committor landscapes. We illustrate the application of the method to a series of model systems, including all-atom molecular simulations of protein folding. Our results show that by combining experimental rate constants and molecular dynamics simulations, this approach enables the determination of transition states, reaction mechanisms, and free energies. We anticipate that this method will extend the applicability of molecular simulations to kinetic studies in structural biology and that it will assist the development of force fields to reproduce kinetic and thermodynamic observables. Furthermore, this approach is generally applicable to a wide range of systems in biology, physics, chemistry, and material science.
Collapse
Affiliation(s)
- Z Faidon Brotzakis
- Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, United Kingdom
| | - Michele Vendruscolo
- Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, United Kingdom
| | - Peter G Bolhuis
- van't Hoff Institute for Molecular Sciences, University of Amsterdam, 1090 GD Amsterdam, The Netherlands
| |
Collapse
|
3
|
Pant S, Smith Z, Wang Y, Tajkhorshid E, Tiwary P. Confronting pitfalls of AI-augmented molecular dynamics using statistical physics. J Chem Phys 2020; 153:234118. [PMID: 33353347 PMCID: PMC7863682 DOI: 10.1063/5.0030931] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2020] [Accepted: 11/29/2020] [Indexed: 12/31/2022] Open
Abstract
Artificial intelligence (AI)-based approaches have had indubitable impact across the sciences through the ability to extract relevant information from raw data. Recently, AI has also found use in enhancing the efficiency of molecular simulations, wherein AI derived slow modes are used to accelerate the simulation in targeted ways. However, while typical fields where AI is used are characterized by a plethora of data, molecular simulations, per construction, suffer from limited sampling and thus limited data. As such, the use of AI in molecular simulations can suffer from a dangerous situation where the AI-optimization could get stuck in spurious regimes, leading to incorrect characterization of the reaction coordinate (RC) for the problem at hand. When such an incorrect RC is then used to perform additional simulations, one could start to deviate progressively from the ground truth. To deal with this problem of spurious AI-solutions, here, we report a novel and automated algorithm using ideas from statistical mechanics. It is based on the notion that a more reliable AI-solution will be one that maximizes the timescale separation between slow and fast processes. To learn this timescale separation even from limited data, we use a maximum caliber-based framework. We show the applicability of this automatic protocol for three classic benchmark problems, namely, the conformational dynamics of a model peptide, ligand-unbinding from a protein, and folding/unfolding energy landscape of the C-terminal domain of protein G. We believe that our work will lead to increased and robust use of trustworthy AI in molecular simulations of complex systems.
Collapse
Affiliation(s)
- Shashank Pant
- NIH Center for Macromolecular Modeling and Bioinformatics, Beckman Institute for Advanced Science and Technology, Department of Biochemistry, Center for Biophysics and Quantitative Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
| | | | | | - Emad Tajkhorshid
- NIH Center for Macromolecular Modeling and Bioinformatics, Beckman Institute for Advanced Science and Technology, Department of Biochemistry, Center for Biophysics and Quantitative Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
| | | |
Collapse
|
4
|
Abstract
Ever since Clausius in 1865 and Boltzmann in 1877, the concepts of entropy and of its maximization have been the foundations for predicting how material equilibria derive from microscopic properties. But, despite much work, there has been no equally satisfactory general variational principle for nonequilibrium situations. However, in 1980, a new avenue was opened by E.T. Jaynes and by Shore and Johnson. We review here maximum caliber, which is a maximum-entropy-like principle that can infer distributions of flows over pathways, given dynamical constraints. This approach is providing new insights, particularly into few-particle complex systems, such as gene circuits, protein conformational reaction coordinates, network traffic, bird flocking, cell motility, and neuronal firing.
Collapse
Affiliation(s)
- Kingshuk Ghosh
- Department of Physics and Astronomy, University of Denver, Denver, Colorado 80209, USA
| | - Purushottam D. Dixit
- Department of Systems Biology, Columbia University, New York, NY 10032, USA,Department of Physics, University of Florida, Gainesville, Florida 32611, USA
| | - Luca Agozzino
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York 11794, USA
| | - Ken A. Dill
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York 11794, USA
| |
Collapse
|
5
|
Matsunaga Y, Sugita Y. Use of single-molecule time-series data for refining conformational dynamics in molecular simulations. Curr Opin Struct Biol 2020; 61:153-159. [DOI: 10.1016/j.sbi.2019.12.022] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2019] [Revised: 12/24/2019] [Accepted: 12/27/2019] [Indexed: 12/18/2022]
|
6
|
Orioli S, Larsen AH, Bottaro S, Lindorff-Larsen K. How to learn from inconsistencies: Integrating molecular simulations with experimental data. PROGRESS IN MOLECULAR BIOLOGY AND TRANSLATIONAL SCIENCE 2020; 170:123-176. [PMID: 32145944 DOI: 10.1016/bs.pmbts.2019.12.006] [Citation(s) in RCA: 48] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
Molecular simulations and biophysical experiments can be used to provide independent and complementary insights into the molecular origin of biological processes. A particularly useful strategy is to use molecular simulations as a modeling tool to interpret experimental measurements, and to use experimental data to refine our biophysical models. Thus, explicit integration and synergy between molecular simulations and experiments is fundamental for furthering our understanding of biological processes. This is especially true in the case where discrepancies between measured and simulated observables emerge. In this chapter, we provide an overview of some of the core ideas behind methods that were developed to improve the consistency between experimental information and numerical predictions. We distinguish between situations where experiments are used to refine our understanding and models of specific systems, and situations where experiments are used more generally to refine transferable models. We discuss different philosophies and attempt to unify them in a single framework. Until now, such integration between experiments and simulations have mostly been applied to equilibrium data, and we discuss more recent developments aimed to analyze time-dependent or time-resolved data.
Collapse
Affiliation(s)
- Simone Orioli
- Structural Biology and NMR Laboratory & Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark; Structural Biophysics, Niels Bohr Institute, Faculty of Science, University of Copenhagen, Copenhagen, Denmark
| | - Andreas Haahr Larsen
- Structural Biology and NMR Laboratory & Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark; Structural Biophysics, Niels Bohr Institute, Faculty of Science, University of Copenhagen, Copenhagen, Denmark
| | - Sandro Bottaro
- Structural Biology and NMR Laboratory & Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark; Atomistic Simulations Laboratory, Istituto Italiano di Tecnologia, Genova, Italy
| | - Kresten Lindorff-Larsen
- Structural Biology and NMR Laboratory & Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark.
| |
Collapse
|
7
|
Noé F. Machine Learning for Molecular Dynamics on Long Timescales. MACHINE LEARNING MEETS QUANTUM PHYSICS 2020. [DOI: 10.1007/978-3-030-40245-7_16] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
|
8
|
Bause M, Wittenstein T, Kremer K, Bereau T. Microscopic reweighting for nonequilibrium steady-state dynamics. Phys Rev E 2019; 100:060103. [PMID: 31962494 DOI: 10.1103/physreve.100.060103] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2019] [Indexed: 06/10/2023]
Abstract
Computer simulations generate trajectories at a single, well-defined thermodynamic state point. Statistical reweighting offers the means to reweight static and dynamical properties to different equilibrium state points by means of analytic relations. We extend these ideas to nonequilibrium steady states by relying on a maximum path entropy formalism subject to physical constraints. Stochastic thermodynamics analytically relates the forward and backward probabilities of any pathway through the external nonconservative force, enabling reweighting both in and out of equilibrium. We avoid the combinatorial explosion of microtrajectories by systematically constructing pathways through Markovian transitions. We further identify a quantity that is invariant to dynamical reweighting, analogous to the density of states in equilibrium reweighting.
Collapse
Affiliation(s)
- Marius Bause
- Max Planck Institute for Polymer Research, 55128 Mainz, Germany
| | | | - Kurt Kremer
- Max Planck Institute for Polymer Research, 55128 Mainz, Germany
| | - Tristan Bereau
- Max Planck Institute for Polymer Research, 55128 Mainz, Germany
| |
Collapse
|
9
|
Provasi D. Ligand-Binding Calculations with Metadynamics. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2019; 2022:233-253. [PMID: 31396906 DOI: 10.1007/978-1-4939-9608-7_10] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
All-atom molecular dynamics simulations can capture the dynamic degrees of freedom that characterize molecular recognition, the knowledge of which constitutes the cornerstone of rational approaches to drug design and optimization. In particular, enhanced sampling algorithms, such as metadynamics, are powerful tools to dramatically reduce the computational cost required for a mechanistic description of the binding process. Here, we describe the essential details characterizing these simulation strategies, focusing on the critical step of identifying suitable reaction coordinates, as well as on the different analysis algorithms to estimate binding affinity and residence times. We conclude with a survey of published applications that provides explicit examples of successful simulations for several targets.
Collapse
Affiliation(s)
- Davide Provasi
- Department of Pharmacological Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
| |
Collapse
|
10
|
Chen S, Wiewiora RP, Meng F, Babault N, Ma A, Yu W, Qian K, Hu H, Zou H, Wang J, Fan S, Blum G, Pittella-Silva F, Beauchamp KA, Tempel W, Jiang H, Chen K, Skene RJ, Zheng YG, Brown PJ, Jin J, Luo C, Chodera JD, Luo M. The dynamic conformational landscape of the protein methyltransferase SETD8. eLife 2019; 8:45403. [PMID: 31081496 PMCID: PMC6579520 DOI: 10.7554/elife.45403] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2019] [Accepted: 05/08/2019] [Indexed: 12/27/2022] Open
Abstract
Elucidating the conformational heterogeneity of proteins is essential for understanding protein function and developing exogenous ligands. With the rapid development of experimental and computational methods, it is of great interest to integrate these approaches to illuminate the conformational landscapes of target proteins. SETD8 is a protein lysine methyltransferase (PKMT), which functions in vivo via the methylation of histone and nonhistone targets. Utilizing covalent inhibitors and depleting native ligands to trap hidden conformational states, we obtained diverse X-ray structures of SETD8. These structures were used to seed distributed atomistic molecular dynamics simulations that generated a total of six milliseconds of trajectory data. Markov state models, built via an automated machine learning approach and corroborated experimentally, reveal how slow conformational motions and conformational states are relevant to catalysis. These findings provide molecular insight on enzymatic catalysis and allosteric mechanisms of a PKMT via its detailed conformational landscape. Our cells contain thousands of proteins that perform many different tasks. Such tasks often involve significant changes in the shape of a protein that allow it to interact with other proteins or ligands. Understanding these shape changes can be an essential step for predicting and manipulating how proteins work or designing new drugs. Some changes in protein shape happen quickly, whereas others take longer. Existing experimental approaches generally only capture some, but not all, of the different shapes an individual protein adopts. A family of proteins known as protein lysine methyltransferases (PKMTs) help to regulate the activities of other proteins by adding small tags called methyl groups to specific positions on their target proteins. PKMTs play important roles in many life processes including in activating genes, maintaining stem cells and controlling how organs develop. It is important for cells to properly control the activity of PKMTs because too much, or too little, activity can promote cancers and neurological diseases. For example, genetic mutations that increase the levels of a PKMT known as SETD8 appear to promote the progression of some breast cancers and childhood leukemia. There is a pressing need to develop new drugs that can inhibit SETD8 and other PKMTs in human patients. However, these efforts are hindered by the lack of understanding of exactly how the shape of PKMT proteins change as they operate in cells. Chen, Wiewiora et al. used a technique called X-ray crystallography to generate structural models of the human SETD8 protein in the presence or absence of native or foreign ligands. These models were used to develop computer simulations of how the shape of SETD8 changes as it operates. Further computational analysis and laboratory experiments revealed how slow changes in the shape of SETD8 contribute to the ability of the protein to attach methyl groups to other proteins. This work is a significant stepping-stone to developing a complete model of how the SETD8 protein works, as well as understanding how genetic mutations may affect the protein’s role in the body. The next step is to refine the model by integrating data from other approaches including biophysical models and mathematical calculations of the energy associated with the shape changes, with a long-term goal to better understand and then manipulate the function of SETD8.
Collapse
Affiliation(s)
- Shi Chen
- Tri-Institutional PhD Program in Chemical Biology, Memorial Sloan Kettering Cancer Center, New York, United States.,Chemical Biology Program, Memorial Sloan Kettering Cancer Center, New York, United States
| | - Rafal P Wiewiora
- Tri-Institutional PhD Program in Chemical Biology, Memorial Sloan Kettering Cancer Center, New York, United States.,Computational and Systems Biology Program, Memorial Sloan Kettering Cancer Center, New York, United States
| | - Fanwang Meng
- Drug Discovery and Design Center, CAS Key Laboratory of Receptor Research, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China
| | - Nicolas Babault
- Mount Sinai Center for Therapeutics Discovery, Icahn School of Medicine at Mount Sinai, New York, United States.,Department of Pharmacological Sciences, Icahn School of Medicine at Mount Sinai, New York, United States.,Department of Oncological Sciences, Icahn School of Medicine at Mount Sinai, New York, United States.,Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai, New York, United States
| | - Anqi Ma
- Mount Sinai Center for Therapeutics Discovery, Icahn School of Medicine at Mount Sinai, New York, United States.,Department of Pharmacological Sciences, Icahn School of Medicine at Mount Sinai, New York, United States.,Department of Oncological Sciences, Icahn School of Medicine at Mount Sinai, New York, United States.,Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai, New York, United States
| | - Wenyu Yu
- Structural Genomics Consortium, University of Toronto, Toronto, Canada
| | - Kun Qian
- Department of Pharmaceutical and Biomedical Sciences, University of Georgia, Athens, United States
| | - Hao Hu
- Department of Pharmaceutical and Biomedical Sciences, University of Georgia, Athens, United States
| | - Hua Zou
- Takeda California, Science Center Drive, San Diego, United States
| | - Junyi Wang
- Chemical Biology Program, Memorial Sloan Kettering Cancer Center, New York, United States
| | - Shijie Fan
- Drug Discovery and Design Center, CAS Key Laboratory of Receptor Research, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Gil Blum
- Chemical Biology Program, Memorial Sloan Kettering Cancer Center, New York, United States
| | - Fabio Pittella-Silva
- Chemical Biology Program, Memorial Sloan Kettering Cancer Center, New York, United States
| | - Kyle A Beauchamp
- Computational and Systems Biology Program, Memorial Sloan Kettering Cancer Center, New York, United States
| | - Wolfram Tempel
- Structural Genomics Consortium, University of Toronto, Toronto, Canada
| | - Hualiang Jiang
- Drug Discovery and Design Center, CAS Key Laboratory of Receptor Research, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Kaixian Chen
- Drug Discovery and Design Center, CAS Key Laboratory of Receptor Research, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Robert J Skene
- Takeda California, Science Center Drive, San Diego, United States
| | - Yujun George Zheng
- Department of Pharmaceutical and Biomedical Sciences, University of Georgia, Athens, United States
| | - Peter J Brown
- Structural Genomics Consortium, University of Toronto, Toronto, Canada
| | - Jian Jin
- Mount Sinai Center for Therapeutics Discovery, Icahn School of Medicine at Mount Sinai, New York, United States.,Department of Pharmacological Sciences, Icahn School of Medicine at Mount Sinai, New York, United States.,Department of Oncological Sciences, Icahn School of Medicine at Mount Sinai, New York, United States.,Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai, New York, United States
| | - Cheng Luo
- Drug Discovery and Design Center, CAS Key Laboratory of Receptor Research, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China.,University of Chinese Academy of Sciences, Beijing, China
| | - John D Chodera
- Computational and Systems Biology Program, Memorial Sloan Kettering Cancer Center, New York, United States
| | - Minkui Luo
- Chemical Biology Program, Memorial Sloan Kettering Cancer Center, New York, United States.,Program of Pharmacology, Weill Cornell Medical College of Cornell University, New York, United States
| |
Collapse
|
11
|
Dixit PD. Introducing User-Prescribed Constraints in Markov Chains for Nonlinear Dimensionality Reduction. Neural Comput 2019; 31:980-997. [PMID: 30883279 DOI: 10.1162/neco_a_01184] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Stochastic kernel-based dimensionality-reduction approaches have become popular in the past decade. The central component of many of these methods is a symmetric kernel that quantifies the vicinity between pairs of data points and a kernel-induced Markov chain on the data. Typically, the Markov chain is fully specified by the kernel through row normalization. However, in many cases, it is desirable to impose user-specified stationary-state and dynamical constraints on the Markov chain. Unfortunately, no systematic framework exists to impose such user-defined constraints. Here, based on our previous work on inference of Markov models, we introduce a path entropy maximization based approach to derive the transition probabilities of Markov chains using a kernel and additional user-specified constraints. We illustrate the usefulness of these Markov chains with examples.
Collapse
Affiliation(s)
- Purushottam D Dixit
- Department of Systems Biology, Columbia University, New York, NY 10032, U.S.A.
| |
Collapse
|
12
|
Dixit PD, Dill KA. Building Markov state models using optimal transport theory. J Chem Phys 2019; 150:054105. [DOI: 10.1063/1.5086681] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Affiliation(s)
- Purushottam D. Dixit
- Department of Systems Biology, Columbia University, New York, New York 10032, USA
| | - Ken A. Dill
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York 11794, USA
- Department of Chemistry, Stony Brook University, Stony Brook, New York 11794, USA
- Department of Physics and Astronomy, Stony Brook University, Stony Brook, New York 11794, USA
| |
Collapse
|
13
|
Bottaro S, Lindorff-Larsen K. Biophysical experiments and biomolecular simulations: A perfect match? Science 2018; 361:355-360. [DOI: 10.1126/science.aat4010] [Citation(s) in RCA: 135] [Impact Index Per Article: 22.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
A fundamental challenge in biological research is achieving an atomic-level description and mechanistic understanding of the function of biomolecules. Techniques for biomolecular simulations have undergone substantial developments, and their accuracy and scope have expanded considerably. Progress has been made through an increasingly tight integration of experiments and simulations, with experiments being used to refine simulations and simulations used to interpret experiments. Here we review the underpinnings of this progress, including methods for more efficient conformational sampling, accuracy of the physical models used, and theoretical approaches to integrate experiments and simulations. These developments are enabling detailed studies of complex biomolecular assemblies.
Collapse
|
14
|
Capelli R, Tiana G, Camilloni C. An implementation of the maximum-caliber principle by replica-averaged time-resolved restrained simulations. J Chem Phys 2018; 148:184114. [PMID: 29764124 DOI: 10.1063/1.5030339] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Inferential methods can be used to integrate experimental informations and molecular simulations. The maximum entropy principle provides a framework for using equilibrium experimental data, and it has been shown that replica-averaged simulations, restrained using a static potential, are a practical and powerful implementation of such a principle. Here we show that replica-averaged simulations restrained using a time-dependent potential are equivalent to the principle of maximum caliber, the dynamic version of the principle of maximum entropy, and thus may allow us to integrate time-resolved data in molecular dynamics simulations. We provide an analytical proof of the equivalence as well as a computational validation making use of simple models and synthetic data. Some limitations and possible solutions are also discussed.
Collapse
Affiliation(s)
- Riccardo Capelli
- Center for Complexity and Biosystems and Department of Physics, Università degli Studi di Milano and INFN, Via Celoria 16, I-20133 Milano, Italy
| | - Guido Tiana
- Center for Complexity and Biosystems and Department of Physics, Università degli Studi di Milano and INFN, Via Celoria 16, I-20133 Milano, Italy
| | - Carlo Camilloni
- Dipartimento di Bioscienze, Università degli Studi di Milano, Via Celoria 26, I-20133 Milano, Italy
| |
Collapse
|
15
|
Matsunaga Y, Sugita Y. Linking time-series of single-molecule experiments with molecular dynamics simulations by machine learning. eLife 2018; 7:32668. [PMID: 29723137 PMCID: PMC5933924 DOI: 10.7554/elife.32668] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2017] [Accepted: 04/23/2018] [Indexed: 12/27/2022] Open
Abstract
Single-molecule experiments and molecular dynamics (MD) simulations are indispensable tools for investigating protein conformational dynamics. The former provide time-series data, such as donor-acceptor distances, whereas the latter give atomistic information, although this information is often biased by model parameters. Here, we devise a machine-learning method to combine the complementary information from the two approaches and construct a consistent model of conformational dynamics. It is applied to the folding dynamics of the formin-binding protein WW domain. MD simulations over 400 μs led to an initial Markov state model (MSM), which was then "refined" using single-molecule Förster resonance energy transfer (FRET) data through hidden Markov modeling. The refined or data-assimilated MSM reproduces the FRET data and features hairpin one in the transition-state ensemble, consistent with mutation experiments. The folding pathway in the data-assimilated MSM suggests interplay between hydrophobic contacts and turn formation. Our method provides a general framework for investigating conformational transitions in other proteins.
Collapse
Affiliation(s)
- Yasuhiro Matsunaga
- Computational Biophysics Research Team, RIKEN Center for Computational Science, Kobe, Japan.,JST PRESTO, Kawaguchi, Japan
| | - Yuji Sugita
- Computational Biophysics Research Team, RIKEN Center for Computational Science, Kobe, Japan.,Theoretical Molecular Science Laboratory, RIKEN Cluster for Pioneering Research, Wako, Japan.,Laboratory for Biomolecular Function Simulation, RIKEN Center for Biosystems Dynamics Research, Kobe, Japan
| |
Collapse
|
16
|
Amaro RE, Baudry J, Chodera J, Demir Ö, McCammon JA, Miao Y, Smith JC. Ensemble Docking in Drug Discovery. Biophys J 2018; 114:2271-2278. [PMID: 29606412 DOI: 10.1016/j.bpj.2018.02.038] [Citation(s) in RCA: 255] [Impact Index Per Article: 42.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2017] [Revised: 02/13/2018] [Accepted: 02/20/2018] [Indexed: 12/11/2022] Open
Abstract
Ensemble docking corresponds to the generation of an "ensemble" of drug target conformations in computational structure-based drug discovery, often obtained by using molecular dynamics simulation, that is used in docking candidate ligands. This approach is now well established in the field of early-stage drug discovery. This review gives a historical account of the development of ensemble docking and discusses some pertinent methodological advances in conformational sampling.
Collapse
Affiliation(s)
- Rommie E Amaro
- Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, California
| | - Jerome Baudry
- University of Alabama at Huntsville, Huntsville, Alabama
| | - John Chodera
- University of California, Berkeley, Berkeley, California
| | - Özlem Demir
- Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, California
| | - J Andrew McCammon
- Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, California
| | - Yinglong Miao
- Department of Computational Biology and Molecular Biosciences, University of Kansas, Lawrence, Kansas
| | - Jeremy C Smith
- UT/ORNL Center for Molecular Biophysics, Oak Ridge National Laboratory, Oak Ridge, Tennessee; Department of Biochemistry and Cellular and Molecular Biology, University of Tennessee, Knoxville, Tennessee.
| |
Collapse
|
17
|
Dixit PD. Communication: Introducing prescribed biases in out-of-equilibrium Markov models. J Chem Phys 2018. [DOI: 10.1063/1.5023232] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Affiliation(s)
- Purushottam D. Dixit
- Department of Systems Biology, Columbia University, New York, New York 10032, USA
| |
Collapse
|
18
|
Affiliation(s)
- Brooke E. Husic
- Department of Chemistry, Stanford University, Stanford, California 94305, United States
| | - Vijay S. Pande
- Department of Chemistry, Stanford University, Stanford, California 94305, United States
| |
Collapse
|
19
|
Husic BE, McKiernan KA, Wayment-Steele HK, Sultan MM, Pande VS. A Minimum Variance Clustering Approach Produces Robust and Interpretable Coarse-Grained Models. J Chem Theory Comput 2018; 14:1071-1082. [PMID: 29253336 DOI: 10.1021/acs.jctc.7b01004] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Markov state models (MSMs) are a powerful framework for the analysis of molecular dynamics data sets, such as protein folding simulations, because of their straightforward construction and statistical rigor. The coarse-graining of MSMs into an interpretable number of macrostates is a crucial step for connecting theoretical results with experimental observables. Here we present the minimum variance clustering approach (MVCA) for the coarse-graining of MSMs into macrostate models. The method utilizes agglomerative clustering with Ward's minimum variance objective function, and the similarity of the microstate dynamics is determined using the Jensen-Shannon divergence between the corresponding rows in the MSM transition probability matrix. We first show that MVCA produces intuitive results for a simple tripeptide system and is robust toward long-duration statistical artifacts. MVCA is then applied to two protein folding simulations of the same protein in different force fields to demonstrate that a different number of macrostates is appropriate for each model, revealing a misfolded state present in only one of the simulations. Finally, we show that the same method can be used to analyze a data set containing many MSMs from simulations in different force fields by aggregating them into groups and quantifying their dynamical similarity in the context of force field parameter choices. The minimum variance clustering approach with the Jensen-Shannon divergence provides a powerful tool to group dynamics by similarity, both among model states and among dynamical models themselves.
Collapse
Affiliation(s)
- Brooke E Husic
- Department of Chemistry, Stanford University , Stanford, California 94305, United States
| | - Keri A McKiernan
- Department of Chemistry, Stanford University , Stanford, California 94305, United States
| | | | - Mohammad M Sultan
- Department of Chemistry, Stanford University , Stanford, California 94305, United States
| | - Vijay S Pande
- Department of Chemistry, Stanford University , Stanford, California 94305, United States
| |
Collapse
|