1
|
Yasuda T, Morita R, Shigeta Y, Harada R. Protein Structure Validation Derives a Smart Conformational Search in a Physically Relevant Configurational Subspace. J Chem Inf Model 2022; 62:6217-6227. [PMID: 36449380 DOI: 10.1021/acs.jcim.2c01173] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022]
Abstract
Since proteins perform biological functions through their dynamic properties, molecular dynamics (MD) simulation is a sophisticated strategy for investigating their functions. Analyses of trajectories provide statistical information about a specific protein as a free-energy landscape (FEL). However, the timescale of normal MD is shorter than that of biological functions, resulting in statistically insufficient conformational sampling, finally leading to unreliable FEL calculation. To search for a broad configurational subspace, an external bias is imposed on a target protein as biased sampling. However, its regulation is challenging because the optimal strength of the perturbation is unknown. Furthermore, a physically irrelevant configurational subspace was searched when imposing an inappropriate external bias. To address this issue, we newly proposed an external biased regulation scheme known as the G-factor external bias limiter (GERBIL). In GERBIL, protein configurations generated by external bias are structurally validated by an indicator (G-factor), enabling the search for a physically relevant subspace. In addition to biased sampling, nonbiased sampling might search for a physically irrelevant configurational subspace because repeating multiple MD simulations from several initial structures tends to search for an overly broad configurational subspace. For this issue, the structural qualities of configurations generated by nonbiased sampling have not been investigated. Therefore, we confirmed whether the G-factor screened the collapsed (low-quality) configurations generated by nonbiased sampling. To address this issue, the outlier flooding method (OFLOOD) was adopted in GERBIL as a nonbiased sampling method, which is referred to as OFLOOD-GERBIL. OFLOOD rapidly expands a configurational subspace by resampling the rarely occurring states of a given protein and tends to search an overly broad subspace. Thus, we considered that GERBIL might improve the excessive conformational search of OFLOOD for a physically irrelevant configurational subspace. As a demonstration, OFLOOD and OFLOOD-GERBIL were applied to a globular protein (T4 lysozyme) and their conformational search qualities were assessed. Based on our assessment, normal OFLOOD without the outlier validation frequently sampled low-quality configurations, whereas OFLOOD-GERBIL with the outlier validation intensively sampled high-quality configurations. In conclusion, OFLOOD-GERBIL derives a smart conformational search in a physically relevant configurational subspace, indicating that protein structure validation works in both nonbiased and biased sampling methods.
Collapse
Affiliation(s)
- Takunori Yasuda
- College of Biological Sciences, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, Ibaraki305-0821, Japan
| | - Rikuri Morita
- Center for Computational Sciences, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, Ibaraki305-8577, Japan
| | - Yasuteru Shigeta
- Center for Computational Sciences, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, Ibaraki305-8577, Japan
| | - Ryuhei Harada
- Center for Computational Sciences, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, Ibaraki305-8577, Japan
| |
Collapse
|
2
|
Krivov SV. Additive eigenvectors as optimal reaction coordinates, conditioned trajectories, and time-reversible description of stochastic processes. J Chem Phys 2022; 157:014108. [DOI: 10.1063/5.0088061] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
A fundamental way to analyze complex multidimensional stochastic dynamics is to describe it as diffusion on a free energy landscape—free energy as a function of reaction coordinates (RCs). For such a description to be quantitatively accurate, the RC should be chosen in an optimal way. The committor function is a primary example of an optimal RC for the description of equilibrium reaction dynamics between two states. Here, additive eigenvectors (addevs) are considered as optimal RCs to address the limitations of the committor. An addev master equation for a Markov chain is derived. A stationary solution of the equation describes a sub-ensemble of trajectories conditioned on having the same optimal RC for the forward and time-reversed dynamics in the sub-ensemble. A collection of such sub-ensembles of trajectories, called stochastic eigenmodes, can be used to describe/approximate the stochastic dynamics. A non-stationary solution describes the evolution of the probability distribution. However, in contrast to the standard master equation, it provides a time-reversible description of stochastic dynamics. It can be integrated forward and backward in time. The developed framework is illustrated on two model systems—unidirectional random walk and diffusion.
Collapse
Affiliation(s)
- Sergei V. Krivov
- University of Leeds, Astbury Center for Structural Molecular Biology, Faculty of Biological Sciences, University of Leeds, Leeds LS2 9JT, United Kingdom
| |
Collapse
|
3
|
Modeling Catalysis in Allosteric Enzymes: Capturing Conformational Consequences. Top Catal 2021; 65:165-186. [DOI: 10.1007/s11244-021-01521-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
4
|
Abstract
We extend the nonparametric framework of reaction coordinate optimization to nonequilibrium ensembles of (short) trajectories. For example, we show how, starting from such an ensemble, one can obtain an equilibrium free-energy profile along the committor, which can be used to determine important properties of the dynamics exactly. A new adaptive sampling approach, the transition-state ensemble enrichment, is suggested, which samples the configuration space by "growing" committor segments toward each other starting from the boundary states. This framework is suggested as a general tool, alternative to the Markov state models, for a rigorous and accurate analysis of simulations of large biomolecular systems, as it has the following attractive properties. It is immune to the curse of dimensionality, does not require system-specific information, can approximate arbitrary reaction coordinates with high accuracy, and has sensitive and rigorous criteria to test optimality and convergence. The approaches are illustrated on a 50-dimensional model system and a realistic protein folding trajectory.
Collapse
Affiliation(s)
- Sergei V Krivov
- Astbury Center for Structural Molecular Biology, Faculty of Biological Sciences, University of Leeds, Leeds LS2 9JT, U.K
| |
Collapse
|
5
|
Pérez A, Herrera-Nieto P, Doerr S, De Fabritiis G. AdaptiveBandit: A Multi-armed Bandit Framework for Adaptive Sampling in Molecular Simulations. J Chem Theory Comput 2020; 16:4685-4693. [DOI: 10.1021/acs.jctc.0c00205] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Adrià Pérez
- Computational Science Laboratory, Universitat Pompeu Fabra, 08003 Barcelona, Spain
| | - Pablo Herrera-Nieto
- Computational Science Laboratory, Universitat Pompeu Fabra, 08003 Barcelona, Spain
| | - Stefan Doerr
- Computational Science Laboratory, Universitat Pompeu Fabra, 08003 Barcelona, Spain
- Acellera Labs, 08005 Barcelona, Spain
| | - Gianni De Fabritiis
- Computational Science Laboratory, Universitat Pompeu Fabra, 08003 Barcelona, Spain
- Acellera Labs, 08005 Barcelona, Spain
- Institució Catalana de Recerca i Estudis Avançats, 08010 Barcelona, Spain
| |
Collapse
|
6
|
Betz RM, Dror RO. How Effectively Can Adaptive Sampling Methods Capture Spontaneous Ligand Binding? J Chem Theory Comput 2019; 15:2053-2063. [PMID: 30645108 PMCID: PMC6795214 DOI: 10.1021/acs.jctc.8b00913] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Abstract
![]()
Molecular dynamics (MD) simulations
that capture the spontaneous
binding of drugs and other ligands to their target proteins can reveal
a great deal of useful information, but most drug-like ligands bind
on time scales longer than those accessible to individual MD simulations.
Adaptive sampling methods―in which one performs multiple rounds
of simulation, with the initial conditions of each round based on
the results of previous rounds―offer a promising potential
solution to this problem. No comprehensive analysis of the performance
gains from adaptive sampling is available for ligand binding, however,
particularly for protein–ligand systems typical of those encountered
in drug discovery. Moreover, most previous work presupposes knowledge
of the ligand’s bound pose. Here we outline existing methods
for adaptive sampling of the ligand-binding process and introduce
several improvements, with a focus on methods that do not require
prior knowledge of the binding site or bound pose. We then evaluate
these methods by comparing them to traditional, long MD simulations
for realistic protein–ligand systems. We find that adaptive
sampling simulations typically fail to reach the bound pose more efficiently
than traditional MD. However, adaptive sampling identifies multiple
potential binding sites more efficiently than traditional MD and also
provides better characterization of binding pathways. We explain these
results by showing that protein–ligand binding is an example
of an exploration–exploitation dilemma. Existing adaptive sampling
methods for ligand binding in the absence of a known bound pose vastly
favor the broad exploration of protein–ligand space, sometimes
failing to sufficiently exploit intermediate states as they are discovered.
We suggest potential avenues for future research to address this shortcoming.
Collapse
Affiliation(s)
- Robin M Betz
- Biophysics Program , Stanford University , Stanford , California 94305 , United States.,Department of Computer Science , Stanford University , Stanford, California 94305 , United States.,Department of Molecular and Cellular Physiology , Stanford University , Stanford , California 94305 , United States.,Department of Structural Biology , Stanford University , Stanford , California 94305 , United States.,Institute for Computational and Mathematical Engineering , Stanford University , Stanford , California 94305 , United States
| | - Ron O Dror
- Biophysics Program , Stanford University , Stanford , California 94305 , United States.,Department of Computer Science , Stanford University , Stanford, California 94305 , United States.,Department of Molecular and Cellular Physiology , Stanford University , Stanford , California 94305 , United States.,Department of Structural Biology , Stanford University , Stanford , California 94305 , United States.,Institute for Computational and Mathematical Engineering , Stanford University , Stanford , California 94305 , United States
| |
Collapse
|
7
|
Bhoutekar A, Ghosh S, Bhattacharya S, Chatterjee A. A new class of enhanced kinetic sampling methods for building Markov state models. J Chem Phys 2018; 147:152702. [PMID: 29055344 DOI: 10.1063/1.4984932] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
Markov state models (MSMs) and other related kinetic network models are frequently used to study the long-timescale dynamical behavior of biomolecular and materials systems. MSMs are often constructed bottom-up using brute-force molecular dynamics (MD) simulations when the model contains a large number of states and kinetic pathways that are not known a priori. However, the resulting network generally encompasses only parts of the configurational space, and regardless of any additional MD performed, several states and pathways will still remain missing. This implies that the duration for which the MSM can faithfully capture the true dynamics, which we term as the validity time for the MSM, is always finite and unfortunately much shorter than the MD time invested to construct the model. A general framework that relates the kinetic uncertainty in the model to the validity time, missing states and pathways, network topology, and statistical sampling is presented. Performing additional calculations for frequently-sampled states/pathways may not alter the MSM validity time. A new class of enhanced kinetic sampling techniques is introduced that aims at targeting rare states/pathways that contribute most to the uncertainty so that the validity time is boosted in an effective manner. Examples including straightforward 1D energy landscapes, lattice models, and biomolecular systems are provided to illustrate the application of the method. Developments presented here will be of interest to the kinetic Monte Carlo community as well.
Collapse
Affiliation(s)
- Arti Bhoutekar
- Department of Chemical Engineering, Indian Institute of Technology Bombay, Mumbai 400076, India
| | - Susmita Ghosh
- Department of Physics, Indian Institute of Technology Guwahati, Guwahati 781039, India
| | - Swati Bhattacharya
- Department of Chemical Engineering, Indian Institute of Technology Bombay, Mumbai 400076, India
| | - Abhijit Chatterjee
- Department of Chemical Engineering, Indian Institute of Technology Bombay, Mumbai 400076, India
| |
Collapse
|
8
|
Bacci M, Langini C, Vymětal J, Caflisch A, Vitalis A. Focused conformational sampling in proteins. J Chem Phys 2018; 147:195102. [PMID: 29166086 DOI: 10.1063/1.4996879] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023] Open
Abstract
A detailed understanding of the conformational dynamics of biological molecules is difficult to obtain by experimental techniques due to resolution limitations in both time and space. Computer simulations avoid these in theory but are often too short to sample rare events reliably. Here we show that the progress index-guided sampling (PIGS) protocol can be used to enhance the sampling of rare events in selected parts of biomolecules without perturbing the remainder of the system. The method is very easy to use as it only requires as essential input a set of several features representing the parts of interest sufficiently. In this feature space, new states are discovered by spontaneous fluctuations alone and in unsupervised fashion. Because there are no energetic biases acting on phase space variables or projections thereof, the trajectories PIGS generates can be analyzed directly in the framework of transition networks. We demonstrate the possibility and usefulness of such focused explorations of biomolecules with two loops that are part of the binding sites of bromodomains, a family of epigenetic "reader" modules. This real-life application uncovers states that are structurally and kinetically far away from the initial crystallographic structures and are also metastable. Representative conformations are intended to be used in future high-throughput virtual screening campaigns.
Collapse
Affiliation(s)
- Marco Bacci
- Department of Biochemistry, University of Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland
| | - Cassiano Langini
- Department of Biochemistry, University of Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland
| | - Jiří Vymětal
- Department of Biochemistry, University of Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland
| | - Amedeo Caflisch
- Department of Biochemistry, University of Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland
| | - Andreas Vitalis
- Department of Biochemistry, University of Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland
| |
Collapse
|
9
|
Kukharenko O, Sawade K, Steuer J, Peter C. Using Dimensionality Reduction to Systematically Expand Conformational Sampling of Intrinsically Disordered Peptides. J Chem Theory Comput 2016; 12:4726-4734. [DOI: 10.1021/acs.jctc.6b00503] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Affiliation(s)
| | - Kevin Sawade
- Theoretical
Chemistry, University of Konstanz, 78547 Konstanz, Germany
| | - Jakob Steuer
- Theoretical
Chemistry, University of Konstanz, 78547 Konstanz, Germany
| | - Christine Peter
- Theoretical
Chemistry, University of Konstanz, 78547 Konstanz, Germany
| |
Collapse
|
10
|
Qiao Q, Zhang HD, Huang X. Enhancing pairwise state-transition weights: A new weighting scheme in simulated tempering that can minimize transition time between a pair of conformational states. J Chem Phys 2016; 144:154107. [PMID: 27389209 DOI: 10.1063/1.4946793] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
Abstract
Simulated tempering (ST) is a widely used enhancing sampling method for Molecular Dynamics simulations. As one expanded ensemble method, ST is a combination of canonical ensembles at different temperatures and the acceptance probability of cross-temperature transitions is determined by both the temperature difference and the weights of each temperature. One popular way to obtain the weights is to adopt the free energy of each canonical ensemble, which achieves uniform sampling among temperature space. However, this uniform distribution in temperature space may not be optimal since high temperatures do not always speed up the conformational transitions of interest, as anti-Arrhenius kinetics are prevalent in protein and RNA folding. Here, we propose a new method: Enhancing Pairwise State-transition Weights (EPSW), to obtain the optimal weights by minimizing the round-trip time for transitions among different metastable states at the temperature of interest in ST. The novelty of the EPSW algorithm lies in explicitly considering the kinetics of conformation transitions when optimizing the weights of different temperatures. We further demonstrate the power of EPSW in three different systems: a simple two-temperature model, a two-dimensional model for protein folding with anti-Arrhenius kinetics, and the alanine dipeptide. The results from these three systems showed that the new algorithm can substantially accelerate the transitions between conformational states of interest in the ST expanded ensemble and further facilitate the convergence of thermodynamics compared to the widely used free energy weights. We anticipate that this algorithm is particularly useful for studying functional conformational changes of biological systems where the initial and final states are often known from structural biology experiments.
Collapse
Affiliation(s)
- Qin Qiao
- Department of Chemistry, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
| | - Hou-Dao Zhang
- Department of Chemistry, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
| | - Xuhui Huang
- Department of Chemistry, Division of Biomedical Engineering, Center of Systems Biology and Human Health, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
| |
Collapse
|
11
|
Morriss-Andrews A, Shea JE. Computational Studies of Protein Aggregation: Methods and Applications. Annu Rev Phys Chem 2015; 66:643-66. [DOI: 10.1146/annurev-physchem-040513-103738] [Citation(s) in RCA: 132] [Impact Index Per Article: 14.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
| | - Joan-Emma Shea
- Department of Physics and
- Department of Chemistry, University of California, Santa Barbara, California 93106;
| |
Collapse
|
12
|
Hédin F, Plattner N, Doll JD, Meuwly M. Spatial Averaging: Sampling Enhancement for Exploring Configurational Space of Atomic Clusters and Biomolecules. J Chem Theory Comput 2014; 10:4284-96. [DOI: 10.1021/ct500529w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Affiliation(s)
- Florent Hédin
- Department
of Chemistry, University of Basel, Klingelbergstrasse 80, CH-4056 Basel, Switzerland
| | - Nuria Plattner
- Department
of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 6, D-14195 Berlin, Germany
| | - J. D. Doll
- Department
of Chemistry, Brown University, Providence, Rhode Island 02912, United States
| | - Markus Meuwly
- Department
of Chemistry, University of Basel, Klingelbergstrasse 80, CH-4056 Basel, Switzerland
- Department
of Chemistry, Brown University, Providence, Rhode Island 02912, United States
| |
Collapse
|
13
|
Bacci M, Vitalis A, Caflisch A. A molecular simulation protocol to avoid sampling redundancy and discover new states. Biochim Biophys Acta Gen Subj 2014; 1850:889-902. [PMID: 25193737 DOI: 10.1016/j.bbagen.2014.08.013] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2014] [Revised: 08/21/2014] [Accepted: 08/25/2014] [Indexed: 10/24/2022]
Abstract
BACKGROUND For biomacromolecules or their assemblies, experimental knowledge is often restricted to specific states. Ambiguity pervades simulations of these complex systems because there is no prior knowledge of relevant phase space domains, and sampling recurrence is difficult to achieve. In molecular dynamics methods, ruggedness of the free energy surface exacerbates this problem by slowing down the unbiased exploration of phase space. Sampling is inefficient if dwell times in metastable states are large. METHODS We suggest a heuristic algorithm to terminate and reseed trajectories run in multiple copies in parallel. It uses a recent method to order snapshots, which provides notions of "interesting" and "unique" for individual simulations. We define criteria to guide the reseeding of runs from more "interesting" points if they sample overlapping regions of phase space. RESULTS Using a pedagogical example and an α-helical peptide, the approach is demonstrated to amplify the rate of exploration of phase space and to discover metastable states not found by conventional sampling schemes. Evidence is provided that accurate kinetics and pathways can be extracted from the simulations. CONCLUSIONS The method, termed PIGS for Progress Index Guided Sampling, proceeds in unsupervised fashion, is scalable, and benefits synergistically from larger numbers of replicas. Results confirm that the underlying ideas are appropriate and sufficient to enhance sampling. GENERAL SIGNIFICANCE In molecular simulations, errors caused by not exploring relevant domains in phase space are always unquantifiable and can be arbitrarily large. Our protocol adds to the toolkit available to researchers in reducing these types of errors. This article is part of a Special Issue entitled "Recent developments of molecular dynamics".
Collapse
Affiliation(s)
- Marco Bacci
- University of Zurich, Department of Biochemistry, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland
| | - Andreas Vitalis
- University of Zurich, Department of Biochemistry, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland.
| | - Amedeo Caflisch
- University of Zurich, Department of Biochemistry, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland.
| |
Collapse
|
14
|
Molecular Dynamics Simulations of Bromodomains Reveal Binding-Site Flexibility and Multiple Binding Modes of the Natural Ligand Acetyl-Lysine. Isr J Chem 2014. [DOI: 10.1002/ijch.201400009] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
|
15
|
Doerr S, De Fabritiis G. On-the-Fly Learning and Sampling of Ligand Binding by High-Throughput Molecular Simulations. J Chem Theory Comput 2014; 10:2064-9. [DOI: 10.1021/ct400919u] [Citation(s) in RCA: 124] [Impact Index Per Article: 12.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Affiliation(s)
- S. Doerr
- Computational Biophysics Laboratory (GRIB-IMIM), Universitat Pompeu Fabra, Barcelona Biomedical Research
Park (PRBB), C/Doctor Aiguader 88, 08003 Barcelona, Spain
| | - G. De Fabritiis
- Computational Biophysics Laboratory (GRIB-IMIM), Universitat Pompeu Fabra, Barcelona Biomedical Research
Park (PRBB), C/Doctor Aiguader 88, 08003 Barcelona, Spain
| |
Collapse
|
16
|
Zheng W, Rohrdanz MA, Clementi C. Rapid exploration of configuration space with diffusion-map-directed molecular dynamics. J Phys Chem B 2013; 117:12769-76. [PMID: 23865517 PMCID: PMC3808479 DOI: 10.1021/jp401911h] [Citation(s) in RCA: 62] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
The gap between the time scale of interesting behavior in macromolecular systems and that which our computational resources can afford often limits molecular dynamics (MD) from understanding experimental results and predicting what is inaccessible in experiments. In this paper, we introduce a new sampling scheme, named diffusion-map-directed MD (DM-d-MD), to rapidly explore molecular configuration space. The method uses a diffusion map to guide MD on the fly. DM-d-MD can be combined with other methods to reconstruct the equilibrium free energy, and here, we used umbrella sampling as an example. We present results from two systems: alanine dipeptide and alanine-12. In both systems, we gain tremendous speedup with respect to standard MD both in exploring the configuration space and reconstructing the equilibrium distribution. In particular, we obtain 3 orders of magnitude of speedup over standard MD in the exploration of the configurational space of alanine-12 at 300 K with DM-d-MD. The method is reaction coordinate free and minimally dependent on a priori knowledge of the system. We expect wide applications of DM-d-MD to other macromolecular systems in which equilibrium sampling is not affordable by standard MD.
Collapse
Affiliation(s)
- Wenwei Zheng
- Department of Chemistry, Rice University, Houston TX 77005
| | | | | |
Collapse
|
17
|
Kalgin IV, Caflisch A, Chekmarev SF, Karplus M. New insights into the folding of a β-sheet miniprotein in a reduced space of collective hydrogen bond variables: application to a hydrodynamic analysis of the folding flow. J Phys Chem B 2013; 117:6092-105. [PMID: 23621790 DOI: 10.1021/jp401742y] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
A new analysis of the 20 μs equilibrium folding/unfolding molecular dynamics simulations of the three-stranded antiparallel β-sheet miniprotein (beta3s) in implicit solvent is presented. The conformation space is reduced in dimensionality by introduction of linear combinations of hydrogen bond distances as the collective variables making use of a specially adapted principal component analysis (PCA); i.e., to make structured conformations more pronounced, only the formed bonds are included in determining the principal components. It is shown that a three-dimensional (3D) subspace gives a meaningful representation of the folding behavior. The first component, to which eight native hydrogen bonds make the major contribution (four in each beta hairpin), is found to play the role of the reaction coordinate for the overall folding process, while the second and third components distinguish the structured conformations. The representative points of the trajectory in the 3D space are grouped into conformational clusters that correspond to locally stable conformations of beta3s identified in earlier work. A simplified kinetic network based on the three components is constructed, and it is complemented by a hydrodynamic analysis. The latter, making use of "passive tracers" in 3D space, indicates that the folding flow is much more complex than suggested by the kinetic network. A 2D representation of streamlines shows there are vortices which correspond to repeated local rearrangement, not only around minima of the free energy surface but also in flat regions between minima. The vortices revealed by the hydrodynamic analysis are apparently not evident in folding pathways generated by transition-path sampling. Making use of the fact that the values of the collective hydrogen bond variables are linearly related to the Cartesian coordinate space, the RMSD between clusters is determined. Interestingly, the transition rates show an approximate exponential correlation with distance in the hydrogen bond subspace. Comparison with the many published studies shows good agreement with the present analysis for the parts that can be compared, supporting the robust character of our understanding of this "hydrogen atom" of protein folding.
Collapse
Affiliation(s)
- Igor V Kalgin
- Department of Physics, Novosibirsk State University, 630090 Novosibirsk, Russia
| | | | | | | |
Collapse
|
18
|
Zhou T, Caflisch A. Distribution of Reciprocal of Interatomic Distances: A Fast Structural Metric. J Chem Theory Comput 2012; 8:2930-7. [PMID: 26592131 DOI: 10.1021/ct3003145] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
We present a structural metric based on the Distribution of Reciprocal of Interatomic Distances (DRID) for evaluating geometrical similarity between two conformations of a molecule. A molecular conformation is described by a vector of 3N orientation-independent DRID descriptors where N is the number of molecular centroids, for example, the non-hydrogen atoms in all nonsymmetric groups of a peptide. For two real-world applications (pairwise comparison of snapshots from an explicit solvent simulation of a protease/peptide substrate complex and implicit solvent simulations of reversible folding of a 20-residue β-sheet peptide), the DRID-based metric is shown to be about 5 and 15 times faster than coordinate root-mean-square deviation (cRMSD) and distance root-mean-square deviation (dRMSD), respectively. A single core of a mainstream processor can perform about 10(8) DRID comparisons in one-half a minute. Importantly, the DRID metric has closer similarity to kinetic distance than does either cRMSD or dRMSD. Therefore, DRID is suitable for clustering molecular dynamics trajectories for kinetic analysis, for example, by Markov state models. Moreover, conformational space networks and free energy profiles derived by DRID-based clustering preserve the kinetic information.
Collapse
Affiliation(s)
- Ting Zhou
- Department of Biochemistry, University of Zurich , CH-8057 Zurich, Switzerland
| | - Amedeo Caflisch
- Department of Biochemistry, University of Zurich , CH-8057 Zurich, Switzerland
| |
Collapse
|