1
|
Chatterjee S, Ray D. Acceleration with Interpretability: A Surrogate Model-Based Collective Variable for Enhanced Sampling. J Chem Theory Comput 2025; 21:1561-1571. [PMID: 39905595 DOI: 10.1021/acs.jctc.4c01603] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2025]
Abstract
Most enhanced sampling methods facilitate the exploration of molecular free energy landscapes by applying a bias potential along a reduced dimensional collective variable (CV) space. The success of these methods depends on the ability of the CVs to follow the relevant slow modes of the system. Intuitive CVs, such as distances or contacts, often prove inadequate, particularly in biological systems involving many coupled degrees of freedom. Machine learning algorithms, especially neural networks (NN), can automate the process of CV discovery by combining a large number of molecular descriptors and often outperform intuitive CVs in sampling efficiency. However, their lack of interpretability and high cost of evaluation during trajectory propagation make NN-CVs difficult to apply to large biomolecular processes. Here, we introduce a surrogate model approach using lasso regression to express the output of a neural network as a linear combination of an automatically chosen subset of the input descriptors. We demonstrate successful applications of our surrogate model CVs in the enhanced sampling simulation of the conformational landscape of alanine dipeptide and chignolin mini-protein. In addition to providing mechanistic insights due to their explainable nature, the surrogate model CVs showed a negligible loss in efficiency and accuracy, compared to the NN-CVs, in reconstructing the underlying free energy surface. Moreover, due to their simplified functional forms, these CVs are better at extrapolating to unseen regions of the conformational space, e.g., saddle points. Surrogate model CVs are also less expensive to evaluate compared to their NN counterparts, making them suitable for enhanced sampling simulation of large and complex biomolecular processes.
Collapse
Affiliation(s)
- Sompriya Chatterjee
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon 97403, United States
- Materials Science Institute, University of Oregon, Eugene, Oregon 97403, United States
| | - Dhiman Ray
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon 97403, United States
- Materials Science Institute, University of Oregon, Eugene, Oregon 97403, United States
| |
Collapse
|
2
|
Church J, Blumer O, Keidar TD, Ploutno L, Reuveni S, Hirshberg B. Accelerating Molecular Dynamics through Informed Resetting. J Chem Theory Comput 2025; 21:605-613. [PMID: 39772645 PMCID: PMC11781593 DOI: 10.1021/acs.jctc.4c01238] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2024] [Revised: 12/20/2024] [Accepted: 12/23/2024] [Indexed: 01/11/2025]
Abstract
We present a procedure for enhanced sampling of molecular dynamics simulations through informed stochastic resetting. Many phenomena, such as protein folding and crystal nucleation, occur over time scales inaccessible in standard simulations. We recently showed that stochastic resetting can accelerate molecular simulations that exhibit broad transition time distributions. However, standard stochastic resetting does not exploit any information about the reaction progress. For a model system and chignolin in explicit water, we demonstrate that an informed resetting protocol leads to greater accelerations than standard stochastic resetting in molecular dynamics and Metadynamics simulations. This is achieved by resetting only when a certain condition is met, e.g., when the distance from the target along the reaction coordinate is larger than some threshold. We use these accelerated simulations to infer important kinetic observables such as the unbiased mean first-passage time and direct transit time. For the latter, Metadynamics with informed resetting leads to speedups of 2-3 orders of magnitude over unbiased simulations with relative errors of only ∼35-70%. Our work significantly extends the applicability of stochastic resetting for enhanced sampling of molecular simulations.
Collapse
Affiliation(s)
| | - Ofir Blumer
- School
of Chemistry, Tel Aviv University, Tel Aviv 6997801, Israel
| | - Tommer D. Keidar
- School
of Chemistry, Tel Aviv University, Tel Aviv 6997801, Israel
| | - Leo Ploutno
- School
of Chemistry, Tel Aviv University, Tel Aviv 6997801, Israel
| | - Shlomi Reuveni
- School
of Chemistry, Tel Aviv University, Tel Aviv 6997801, Israel
- The
Center for Computational Molecular and Materials Science, Tel Aviv University, Tel Aviv 6997801, Israel
- The
Center for Physics and Chemistry of Living Systems, Tel Aviv University, Tel Aviv 6997801, Israel
| | - Barak Hirshberg
- School
of Chemistry, Tel Aviv University, Tel Aviv 6997801, Israel
- The
Center for Computational Molecular and Materials Science, Tel Aviv University, Tel Aviv 6997801, Israel
- The
Center for Physics and Chemistry of Living Systems, Tel Aviv University, Tel Aviv 6997801, Israel
| |
Collapse
|
3
|
Li C, Chan GKL. Accurate QM/MM Molecular Dynamics for Periodic Systems in GPU4PySCF with Applications to Enzyme Catalysis. J Chem Theory Comput 2025; 21:803-816. [PMID: 39813105 DOI: 10.1021/acs.jctc.4c01698] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2025]
Abstract
We present an implementation of the quantum mechanics/molecular mechanics (QM/MM) method for periodic systems using GPU accelerated QM methods, a distributed multipole formulation of the electrostatics, and a pseudobond treatment of the QM/MM boundary. We demonstrate that our method has well-controlled errors, stable self-consistent QM convergence, and energy-conserving dynamics. We further describe an application to the catalytic kinetics of chorismate mutase. Using an accurate hybrid functional reparametrized to coupled cluster energetics, our QM/MM simulations highlight the sensitivity in the calculated rate to the choice of quantum method, quantum region selection, and local protein conformation. Our work is provided through the open-source PySCF package using acceleration from the GPU4PySCF module.
Collapse
Affiliation(s)
- Chenghan Li
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, United States
| | - Garnet Kin-Lic Chan
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, United States
| |
Collapse
|
4
|
Mitra S, Biswas R, Chakrabarty S. WeTICA: A directed search weighted ensemble based enhanced sampling method to estimate rare event kinetics in a reduced dimensional space. J Chem Phys 2025; 162:034106. [PMID: 39812249 DOI: 10.1063/5.0239713] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2024] [Accepted: 12/30/2024] [Indexed: 01/16/2025] Open
Abstract
Estimating rare event kinetics from molecular dynamics simulations is a non-trivial task despite the great advances in enhanced sampling methods. Weighted Ensemble (WE) simulation, a special class of enhanced sampling techniques, offers a way to directly calculate kinetic rate constants from biased trajectories without the need to modify the underlying energy landscape using bias potentials. Conventional WE algorithms use different binning schemes to partition the collective variable (CV) space separating the two metastable states of interest. In this work, we have developed a new "binless" WE simulation algorithm to bypass the hurdles of optimizing binning procedures. Our proposed protocol (WeTICA) uses a low-dimensional CV space to drive the WE simulation toward the specified target state. We have applied this new algorithm to recover the unfolding kinetics of three proteins: (A) TC5b Trp-cage mutant, (B) TC10b Trp-cage mutant, and (C) Protein G, with unfolding times spanning the range between 3 and 40 μs using projections along predefined fixed Time-lagged Independent Component Analysis (TICA) eigenvectors as CVs. Calculated unfolding times converge to the reported values with good accuracy with more than one order of magnitude less cumulative WE simulation time than the unfolding time scales with or without a priori knowledge of the CVs that can capture unfolding. Our algorithm can be used with other linear CVs, not limited to TICA. Moreover, the new walker selection criteria for resampling employed in this algorithm can be used on more sophisticated nonlinear CV space for further improvements of binless WE methods.
Collapse
Affiliation(s)
- Sudipta Mitra
- Department of Chemical and Biological Sciences, S. N. Bose National Centre for Basic Sciences, Block-JD, Sector-III, Salt Lake, Kolkata 700106, India
| | - Ranjit Biswas
- Department of Chemical and Biological Sciences, S. N. Bose National Centre for Basic Sciences, Block-JD, Sector-III, Salt Lake, Kolkata 700106, India
| | - Suman Chakrabarty
- Department of Chemical and Biological Sciences, S. N. Bose National Centre for Basic Sciences, Block-JD, Sector-III, Salt Lake, Kolkata 700106, India
| |
Collapse
|
5
|
Ray D, Rizzi V. Enhanced Sampling with Suboptimal Collective Variables: Reconciling Accuracy and Convergence Speed. J Chem Theory Comput 2025; 21:58-69. [PMID: 39729052 DOI: 10.1021/acs.jctc.4c01231] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2024]
Abstract
We introduce an enhanced sampling algorithm to obtain converged free energy landscapes of molecular rare events, even when the collective variable (CV) used for biasing is not optimal. Our approach samples a time-dependent target distribution by combining the on-the-fly probability enhanced sampling and its exploratory variant, OPES Explore. This promotes more transitions between the relevant metastable states and accelerates the convergence speed of the free energy estimate. We demonstrate the successful application of this combined algorithm on the two-dimensional Wolfe-Quapp potential, millisecond time-scale ligand-receptor binding in the trypsin-benzamidine complex, and folding-unfolding transitions in chignolin mini-protein. Our proposed algorithm can compute accurate free energies at an affordable computational cost and is robust in terms of the choice of CVs, making it particularly promising for the simulation of complex biomolecular systems.
Collapse
Affiliation(s)
- Dhiman Ray
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon 97403, United States
| | - Valerio Rizzi
- School of Pharmaceutical Sciences and Institute of Pharmaceutical Sciences of Western Switzerland (ISPSO), University of Geneva, Rue Michel Servet 1, 1206 Genève, Switzerland
| |
Collapse
|
6
|
Jones MS, Shmilovich K, Ferguson AL. Tutorial on Molecular Latent Space Simulators (LSSs): Spatially and Temporally Continuous Data-Driven Surrogate Dynamical Models of Molecular Systems. J Phys Chem A 2024; 128:10299-10317. [PMID: 39540914 DOI: 10.1021/acs.jpca.4c05389] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2024]
Abstract
The inherently serial nature and requirement for short integration time steps in the numerical integration of molecular dynamics (MD) calculations place strong limitations on the accessible simulation time scales and statistical uncertainties in sampling slowly relaxing dynamical modes and rare events. Molecular latent space simulators (LSSs) are a data-driven approach to learning a surrogate dynamical model of the molecular system from modest MD training trajectories that can generate synthetic trajectories at a fraction of the computational cost. The training data may comprise single long trajectories or multiple short, discontinuous trajectories collected over, for example, distributed computing resources. Provided the training data provide sufficient sampling of the relevant thermodynamic states and dynamical transitions to robustly learn the underlying microscopic propagator, an LSS furnishes a global model of the dynamics capable of producing temporally and spatially continuous molecular trajectories. Trained LSS models have produced simulation trajectories at up to 6 orders of magnitude lower cost than standard MD to enable dense sampling of molecular phase space and large reduction of the statistical errors in structural, thermodynamic, and kinetic observables. The LSS employs three deep learning architectures to solve three independent learning problems over the training data: (i) an encoding of the high-dimensional MD into a low-dimensional slow latent space using state-free reversible VAMPnets (SRVs), (ii) a propagator of the microscopic dynamics within the low-dimensional latent space using mixture density networks (MDNs), and (iii) a generative decoding of the low-dimensional latent coordinates back to the original high-dimensional molecular configuration space using conditional Wasserstein generative adversarial networks (cWGANs) or denoising diffusion probability models (DDPMs). In this software tutorial, we introduce the mathematical and numerical background and theory of LSS and present example applications of a user-friendly Python package software implementation to alanine dipeptide and a 28-residue beta-beta-alpha (BBA) protein within simple Google Colab notebooks.
Collapse
Affiliation(s)
- Michael S Jones
- Pritzker School of Molecular Engineering, The University of Chicago, 5640 South Ellis Avenue, Chicago, Illinois 60637, United States
| | - Kirill Shmilovich
- Pritzker School of Molecular Engineering, The University of Chicago, 5640 South Ellis Avenue, Chicago, Illinois 60637, United States
| | - Andrew L Ferguson
- Pritzker School of Molecular Engineering, The University of Chicago, 5640 South Ellis Avenue, Chicago, Illinois 60637, United States
| |
Collapse
|
7
|
Cummins D, Longstreth C, McCarty J. Analysis of transition rates from variational flooding using analytical theory. J Chem Phys 2024; 161:194104. [PMID: 39555758 DOI: 10.1063/5.0238289] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2024] [Accepted: 10/31/2024] [Indexed: 11/19/2024] Open
Abstract
Variational flooding is an enhanced sampling method for obtaining kinetic rates from molecular dynamics simulations. This method is inspired by the idea of conformational flooding that employs a boost potential acting along a chosen reaction coordinate to accelerate rare events. In this work, we show how the empirical distribution of crossing times from variational flooding simulations can be modeled with analytical Kramers' time-dependent rate (KTR) theory. An optimized bias potential that fills metastable free energy basins is constructed from the variationally enhanced sampling (VES) method. This VES-derived flooding potential is then augmented by a switching function that determines the fill level of the boost. Having a prescribed time-dependent fill rate of the flooding potential gives an analytical expression for the distribution of crossing times from KTR theory that is used to extract unbiased rates. In the case of a static boost potential, the distribution of barrier crossing times follows an expected exponential distribution, and unbiased rates are extracted from a series of boosted simulations at discrete fill levels. Introducing a time-dependent boost that increases the fill level gradually over the simulation time leads to a simplified procedure for fitting the biased distribution of crossing times to analytical theory. We demonstrate the approach for the paradigmatic cases of alanine dipeptide in vacuum, the asymmetric SN2 reaction, and the folding of chignolin in explicit solvent.
Collapse
Affiliation(s)
- David Cummins
- Department of Chemistry, Western Washington University, Bellingham, Washington 98225, USA
| | - Carter Longstreth
- Department of Chemistry, Western Washington University, Bellingham, Washington 98225, USA
| | - James McCarty
- Department of Chemistry, Western Washington University, Bellingham, Washington 98225, USA
| |
Collapse
|
8
|
Vergalli J, Réfrégiers M, Ruggerone P, Winterhalter M, Pagès JM. Advances in methods and concepts provide new insight into antibiotic fluxes across the bacterial membrane. Commun Biol 2024; 7:1508. [PMID: 39543341 PMCID: PMC11564671 DOI: 10.1038/s42003-024-07168-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2024] [Accepted: 10/29/2024] [Indexed: 11/17/2024] Open
Abstract
The sophisticated envelope of Gram-negative bacteria modulates the uptake of small molecules in a side-chain-sensitive manner. Despite intensive theoretical and experimental investigations, a general set of pathways underpinning antibiotic uptake has not been identified. This manuscript discusses the passive influx versus active efflux of antibiotics, considering the responsible membrane proteins and the transported molecules. Recent methods have analyzed drug transport across the bacterial membrane in order to understand their activity. The combination of in vitro, in cellulo and in silico methods shed light on the key, mainly electrostatic, interactions between the molecule surface, porins and transporters during permeation. A key factor is the relationship between the dose of an active compound near its target and its antibacterial activity during the critical early window. Today, methodology breakthroughs provide fruitful tools to precisely dissect drug transport, identify key steps in drug resistance associated with membrane impermeability and efflux, and highlight key parameters to generate more effective drugs.
Collapse
Affiliation(s)
| | | | - Paolo Ruggerone
- Department of Physics, University of Cagliari, 09042, Monserrato, CA, Italy
| | - Mathias Winterhalter
- Department of Life Sciences and Chemistry, Constructor University, 28719, Bremen, Germany
- Center for Hybrid Nanostructures (CHyN), Universität Hamburg, Luruper Chaussee 149, 22761, Hamburg, Germany
| | | |
Collapse
|
9
|
Zhang M, Wu H, Wang Y. Enhanced Sampling of Biomolecular Slow Conformational Transitions Using Adaptive Sampling and Machine Learning. J Chem Theory Comput 2024; 20:8569-8582. [PMID: 39301626 DOI: 10.1021/acs.jctc.4c00764] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/22/2024]
Abstract
Biomolecular simulations often suffer from the "time scale problem", hindering the study of rare events occurring over extended time scales. Enhanced sampling techniques aim to alleviate this issue by accelerating conformational transitions, yet they typically necessitate well-defined collective variables (CVs), posing a significant challenge. Machine learning offers promising solutions but typically requires rich training data encompassing the entire free energy surface (FES). In this work, we introduce an automated iterative pipeline designed to mitigate these limitations. Our protocol first utilizes a CV-free count-based adaptive sampling method to generate a data set rich in rare events. From this data set, slow modes are identified using Koopman-reweighted time-lagged independent component analysis (KTICA), which are subsequently leveraged by on-the-fly probability enhanced sampling (OPES) to efficiently explore the FES. The effectiveness of our pipeline is demonstrated and further compared with the common Markov State Model (MSM) approach on two model systems with increasing complexity: alanine dipeptide (Ala2) and deca-alanine (Ala10), underscoring its applicability across diverse biomolecular simulations.
Collapse
Affiliation(s)
- Mingyuan Zhang
- College of Life Sciences, Zhejiang University, Hangzhou 310027, China
| | - Hao Wu
- School of Mathematical Sciences, Institute of Natural Sciences, and MOE-LSC, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Yong Wang
- College of Life Sciences, Zhejiang University, Hangzhou 310027, China
| |
Collapse
|
10
|
Das M, Venkatramani R. A Mode Evolution Metric to Extract Reaction Coordinates for Biomolecular Conformational Transitions. J Chem Theory Comput 2024; 20:8422-8436. [PMID: 39287954 DOI: 10.1021/acs.jctc.4c00744] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/19/2024]
Abstract
The complex, multidimensional energy landscape of biomolecules makes the extraction of suitable, nonintuitive collective variables (CVs) that describe their conformational transitions challenging. At present, dimensionality reduction approaches and machine learning (ML) schemes are employed to obtain CVs from molecular dynamics (MD)/Monte Carlo (MC) trajectories or structural databanks for biomolecules. However, minimum sampling conditions to generate reliable CVs that accurately describe the underlying energy landscape remain unclear. Here, we address this issue by developing a Mode evolution Metric (MeM) to extract CVs that can pinpoint new states and describe local transitions in the vicinity of a reference minimum from nonequilibrated MD/MC trajectories. We present a general mathematical formulation of MeM for both statistical dimensionality reduction and machine learning approaches. Application of MeM to MC trajectories of model potential energy landscapes and MD trajectories of solvated alanine dipeptide reveals that the principal components which locate new states in the vicinity of a reference minimum emerge well before the trajectories locally equilibrate between the associated states. Finally, we demonstrate a possible application of MeM in designing efficient biased sampling schemes to construct accurate energy landscape slices that link transitions between states. MeM can help speed up the search for new minima around a biomolecular conformational state and enable the accurate estimation of thermodynamics for states lying on the energy landscape and the description of associated transitions.
Collapse
Affiliation(s)
- Mitradip Das
- Department of Chemical Sciences, Tata Institue of Fundamental Research, Colaba, Mumbai 400005, India
| | - Ravindra Venkatramani
- Department of Chemical Sciences, Tata Institue of Fundamental Research, Colaba, Mumbai 400005, India
| |
Collapse
|
11
|
Yang S, Nam J, Dietschreit JCB, Gómez-Bombarelli R. Learning Collective Variables with Synthetic Data Augmentation through Physics-Inspired Geodesic Interpolation. J Chem Theory Comput 2024. [PMID: 39073442 DOI: 10.1021/acs.jctc.4c00435] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/30/2024]
Abstract
In molecular dynamics simulations, rare events, such as protein folding, are typically studied using enhanced sampling techniques, most of which are based on the definition of a collective variable (CV) along which acceleration occurs. Obtaining an expressive CV is crucial, but often hindered by the lack of information about the particular event, e.g., the transition from unfolded to folded conformation. We propose a simulation-free data augmentation strategy using physics-inspired metrics to generate geodesic interpolations resembling protein folding transitions, thereby improving sampling efficiency without true transition state samples. This new data can be used to improve the accuracy of classifier-based methods. Alternatively, a regression-based learning scheme for CV models can be adopted by leveraging the interpolation progress parameter.
Collapse
Affiliation(s)
- Soojung Yang
- Computational and Systems Biology Program, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Juno Nam
- Department of Materials Science and Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Johannes C B Dietschreit
- Department of Materials Science and Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
- Institute of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, Währinger Straße 17, 1090 Vienna, Austria
| | - Rafael Gómez-Bombarelli
- Department of Materials Science and Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| |
Collapse
|
12
|
Lee S, Wang D, Seeliger MA, Tiwary P. Calculating Protein-Ligand Residence Times through State Predictive Information Bottleneck Based Enhanced Sampling. J Chem Theory Comput 2024; 20:6341-6349. [PMID: 38991145 DOI: 10.1021/acs.jctc.4c00503] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/13/2024]
Abstract
Understanding drug residence times in target proteins is key to improving drug efficacy and understanding target recognition in biochemistry. While drug residence time is just as important as binding affinity, atomic-level understanding of drug residence times through molecular dynamics (MD) simulations has been difficult primarily due to the extremely long time scales. Recent advances in rare event sampling have allowed us to reach these time scales, yet predicting protein-ligand residence times remains a significant challenge. Here we present a semi-automated protocol to calculate the ligand residence times across 12 orders of magnitude of time scales. In our proposed framework, we integrate a deep learning-based method, the state predictive information bottleneck (SPIB), to learn an approximate reaction coordinate (RC) and use it to guide the enhanced sampling method metadynamics. We demonstrate the performance of our algorithm by applying it to six different protein-ligand complexes with available benchmark residence times, including the dissociation of the widely studied anticancer drug Imatinib (Gleevec) from both wild-type Abl kinase and drug-resistant mutants. We show how our protocol can recover quantitatively accurate residence times, potentially opening avenues for deeper insights into drug development possibilities and ligand recognition mechanisms.
Collapse
Affiliation(s)
- Suemin Lee
- Biophysics Program and Institute for Physical Science and Technology, University of Maryland, College Park 20742, United States
| | - Dedi Wang
- Biophysics Program and Institute for Physical Science and Technology, University of Maryland, College Park 20742, United States
| | - Markus A Seeliger
- Department of Pharmacological Sciences, Stony Brook University, Stony Brook, New York 11794-8651, United States
| | - Pratyush Tiwary
- Biophysics Program and Institute for Physical Science and Technology, University of Maryland, College Park 20742, United States
- Department of Chemistry and Biochemistry and Institute for Physical Science and Technology, University of Maryland, College Park 20742, United States
- University of Maryland Institute for Health Computing, Bethesda, Maryland 20852, United States
| |
Collapse
|
13
|
Mazzaferro N, Sasmal S, Cossio P, Hocky GM. Good Rates From Bad Coordinates: The Exponential Average Time-dependent Rate Approach. J Chem Theory Comput 2024; 20:5901-5912. [PMID: 38954555 PMCID: PMC11270837 DOI: 10.1021/acs.jctc.4c00425] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2024] [Revised: 06/11/2024] [Accepted: 06/12/2024] [Indexed: 07/04/2024]
Abstract
Our ability to calculate rate constants of biochemical processes using molecular dynamics simulations is severely limited by the fact that the time scales for reactions, or changes in conformational state, scale exponentially with the relevant free-energy barrier heights. In this work, we improve upon a recently proposed rate estimator that allows us to predict transition times with molecular dynamics simulations biased to rapidly explore one or several collective variables (CVs). This approach relies on the idea that not all bias goes into promoting transitions, and along with the rate, it estimates a concomitant scale factor for the bias termed the "CV biasing efficiency" γ. First, we demonstrate mathematically that our new formulation allows us to derive the commonly used Infrequent Metadynamics (iMetaD) estimator when using a perfect CV, where γ = 1. After testing it on a model potential, we then study the unfolding behavior of a previously well characterized coarse-grained protein, which is sufficiently complex that we can choose many different CVs to bias, but which is sufficiently simple that we are able to compute the unbiased rate directly. For this system, we demonstrate that predictions from our new Exponential Average Time-Dependent Rate (EATR) estimator converge to the true rate constant more rapidly as a function of bias deposition time than does the previous iMetaD approach, even for bias deposition times that are short. We also show that the γ parameter can serve as a good metric for assessing the quality of the biasing coordinate. We demonstrate that these results hold when applying the methods to an atomistic protein folding example. Finally, we demonstrate that our approach works when combining multiple less-than-optimal bias coordinates, and adapt our method to the related "OPES flooding" approach. Overall, our time-dependent rate approach offers a powerful framework for predicting rate constants from biased simulations.
Collapse
Affiliation(s)
- Nicodemo Mazzaferro
- Department
of Chemistry, New York University, New York, New York 10003, United States
| | - Subarna Sasmal
- Department
of Chemistry, New York University, New York, New York 10003, United States
| | - Pilar Cossio
- Center
for Computational Mathematics, Flatiron
Institute, New York, New York 10010, United States
- Center
for Computational Biology, Flatiron Institute, New York, New York 10010, United States
| | - Glen M. Hocky
- Department
of Chemistry, New York University, New York, New York 10003, United States
- Simons
Center for Computational Physical Chemistry, New York University, New York, New York 10003, United States
| |
Collapse
|
14
|
Faran M, Ray D, Nag S, Raucci U, Parrinello M, Bisker G. A Stochastic Landscape Approach for Protein Folding State Classification. J Chem Theory Comput 2024; 20:5428-5438. [PMID: 38924770 PMCID: PMC11238538 DOI: 10.1021/acs.jctc.4c00464] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2024] [Revised: 06/12/2024] [Accepted: 06/12/2024] [Indexed: 06/28/2024]
Abstract
Protein folding is a critical process that determines the functional state of proteins. Proper folding is essential for proteins to acquire their functional three-dimensional structures and execute their biological role, whereas misfolded proteins can lead to various diseases, including neurodegenerative disorders like Alzheimer's and Parkinson's. Therefore, a deeper understanding of protein folding is vital for understanding disease mechanisms and developing therapeutic strategies. This study introduces the Stochastic Landscape Classification (SLC), an innovative, automated, nonlearning algorithm that quantitatively analyzes protein folding dynamics. Focusing on collective variables (CVs) - low-dimensional representations of complex dynamical systems like molecular dynamics (MD) of macromolecules - the SLC approach segments the CVs into distinct macrostates, revealing the protein folding pathway explored by MD simulations. The segmentation is achieved by analyzing changes in CV trends and clustering these segments using a standard density-based spatial clustering of applications with noise (DBSCAN) scheme. Applied to the MD-based CV trajectories of Chignolin and Trp-Cage proteins, the SLC demonstrates apposite accuracy, validated by comparing standard classification metrics against ground-truth data. These metrics affirm the efficacy of the SLC in capturing intricate protein dynamics and offer a method to evaluate and select the most informative CVs. The practical application of this technique lies in its ability to provide a detailed, quantitative description of protein folding processes, with significant implications for understanding and manipulating protein behavior in industrial and pharmaceutical contexts.
Collapse
Affiliation(s)
- Michael Faran
- Department
of Biomedical Engineering, Faculty of Engineering, Tel Aviv University, Tel Aviv 69978, Israel
| | - Dhiman Ray
- Atomistic
Simulations, Italian Institute of Technology, Via Enrico Melen 83, 16152 Genova, Italy
| | - Shubhadeep Nag
- Department
of Biomedical Engineering, Faculty of Engineering, Tel Aviv University, Tel Aviv 69978, Israel
| | - Umberto Raucci
- Atomistic
Simulations, Italian Institute of Technology, Via Enrico Melen 83, 16152 Genova, Italy
| | - Michele Parrinello
- Atomistic
Simulations, Italian Institute of Technology, Via Enrico Melen 83, 16152 Genova, Italy
| | - Gili Bisker
- Department
of Biomedical Engineering, Faculty of Engineering, Tel Aviv University, Tel Aviv 69978, Israel
- The
Center for Physics and Chemistry of Living Systems, Tel Aviv University, Tel Aviv 6997801, Israel
- The
Center for Nanoscience and Nanotechnology, Tel Aviv University, Tel Aviv 6997801, Israel
- The
Center for Light-Matter Interaction, Tel
Aviv University, Tel Aviv 6997801, Israel
| |
Collapse
|
15
|
Benayad Z, David R, Stirnemann G. Prebiotic chemical reactivity in solution with quantum accuracy and microsecond sampling using neural network potentials. Proc Natl Acad Sci U S A 2024; 121:e2322040121. [PMID: 38809704 PMCID: PMC11161780 DOI: 10.1073/pnas.2322040121] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Accepted: 04/26/2024] [Indexed: 05/31/2024] Open
Abstract
While RNA appears as a good candidate for the first autocatalytic systems preceding the emergence of modern life, the synthesis of RNA oligonucleotides without enzymes remains challenging. Because the uncatalyzed reaction is extremely slow, experimental studies bring limited and indirect information on the reaction mechanism, the nature of which remains debated. Here, we develop neural network potentials (NNPs) to study the phosphoester bond formation in water. While NNPs are becoming routinely applied to nonreactive systems or simple reactions, we demonstrate how they can systematically be trained to explore the reaction phase space for complex reactions involving several proton transfers and exchanges of heavy atoms. We then propagate at moderate computational cost hundreds of nanoseconds of a variety of enhanced sampling simulations with quantum accuracy in explicit solvent conditions. The thermodynamically preferred reaction pathway is a concerted, dissociative mechanism, with the transient formation of a metaphosphate transition state and direct participation of water solvent molecules that facilitate the exchange of protons through the nonbridging phosphate oxygens. Associative-dissociative pathways, characterized by a much tighter pentacoordinated phosphate, are higher in free energy. Our simulations also suggest that diprotonated phosphate, whose reactivity is never directly assessed in the experiments, is significantly less reactive than the monoprotonated species, suggesting that it is probably never the reactive species in normal pH conditions. These observations rationalize unexplained experimental results and the temperature dependence of the reaction rate, and they pave the way for the design of more efficient abiotic catalysts and activating groups.
Collapse
Affiliation(s)
- Zakarya Benayad
- CNRS Laboratoire de Biochimie Théorique, Institut de Biologie Physico-Chimique, Paris Sciences et Lettres University, Université Paris-Cité, 75005Paris, France
- PASTEUR, Département de Chimie, École Normale Supérieure, Paris Sciences et Lettres University, Sorbonne University, CNRS, 75005Paris, France
| | - Rolf David
- CNRS Laboratoire de Biochimie Théorique, Institut de Biologie Physico-Chimique, Paris Sciences et Lettres University, Université Paris-Cité, 75005Paris, France
- PASTEUR, Département de Chimie, École Normale Supérieure, Paris Sciences et Lettres University, Sorbonne University, CNRS, 75005Paris, France
| | - Guillaume Stirnemann
- CNRS Laboratoire de Biochimie Théorique, Institut de Biologie Physico-Chimique, Paris Sciences et Lettres University, Université Paris-Cité, 75005Paris, France
- PASTEUR, Département de Chimie, École Normale Supérieure, Paris Sciences et Lettres University, Sorbonne University, CNRS, 75005Paris, France
| |
Collapse
|
16
|
Keller BG, Bolhuis PG. Dynamical Reweighting for Biased Rare Event Simulations. Annu Rev Phys Chem 2024; 75:137-162. [PMID: 38941527 DOI: 10.1146/annurev-physchem-083122-124538] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/30/2024]
Abstract
Dynamical reweighting techniques aim to recover the correct molecular dynamics from a simulation at a modified potential energy surface. They are important for unbiasing enhanced sampling simulations of molecular rare events. Here, we review the theoretical frameworks of dynamical reweighting for modified potentials. Based on an overview of kinetic models with increasing level of detail, we discuss techniques to reweight two-state dynamics, multistate dynamics, and path integrals. We explore the natural link to transition path sampling and how the effect of nonequilibrium forces can be reweighted. We end by providing an outlook on how dynamical reweighting integrates with techniques for optimizing collective variables and with modern potential energy surfaces.
Collapse
Affiliation(s)
- Bettina G Keller
- Department of Biology, Chemistry and Pharmacy, Freie Universität Berlin, Berlin, Germany;
| | - Peter G Bolhuis
- Van 't Hoff Institute for Molecular Sciences, University of Amsterdam, Amsterdam, The Netherlands
| |
Collapse
|
17
|
Spiriti J, Wong CF. Quantitative Prediction of Dissociation Rates of PYK2 Ligands Using Umbrella Sampling and Milestoning. J Chem Theory Comput 2024; 20:4029-4044. [PMID: 38640609 DOI: 10.1021/acs.jctc.4c00192] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/21/2024]
Abstract
We used umbrella sampling and the milestoning simulation method to study the dissociation of multiple ligands from protein kinase PYK2. The activation barriers obtained from the potential of mean force of the umbrella sampling simulations correlated well with the experimental dissociation rates. Using the zero-temperature string method, we obtained optimized paths along the free-energy surfaces for milestoning simulations of three ligands with a similar structure. The milestoning simulations gave an absolute dissociation rate within 2 orders of magnitude of the experimental value for two ligands but at least 3 orders of magnitude too high for the third. Despite the similarity in their structures, the ligands took different pathways to exit from the binding site of PYK2, making contact with different sets of residues. In addition, the protein experienced different conformational changes for dissociation of the three ligands.
Collapse
Affiliation(s)
- Justin Spiriti
- Department of Chemistry and Biochemistry, University of Missouri-St. Louis, St. Louis, Missouri 63121, United States
| | - Chung F Wong
- Department of Chemistry and Biochemistry, University of Missouri-St. Louis, St. Louis, Missouri 63121, United States
| |
Collapse
|
18
|
Blumer O, Reuveni S, Hirshberg B. Short-Time Infrequent Metadynamics for Improved Kinetics Inference. J Chem Theory Comput 2024; 20:3484-3491. [PMID: 38668722 PMCID: PMC11099961 DOI: 10.1021/acs.jctc.4c00170] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2024] [Revised: 04/02/2024] [Accepted: 04/02/2024] [Indexed: 05/15/2024]
Abstract
Infrequent Metadynamics is a popular method to obtain the rates of long time-scale processes from accelerated simulations. The inference procedure is based on rescaling the first-passage times of the Metadynamics trajectories using a bias-dependent acceleration factor. While useful in many cases, it is limited to Poisson kinetics, and a reliable estimation of the unbiased rate requires slow bias deposition and prior knowledge of efficient collective variables. Here, we propose an improved inference scheme, which is based on two key observations: (1) the time-independent rate of Poisson processes can be estimated using short trajectories only. (2) Short trajectories experience minimal bias, and their rescaled first-passage times follow the unbiased distribution even for relatively high deposition rates and suboptimal collective variables. Therefore, by basing the inference procedure on short time scales, we obtain an improved trade-off between speedup and accuracy at no additional computational cost, especially when employing suboptimal collective variables. We demonstrate the improved inference scheme for a model system and two molecular systems.
Collapse
Affiliation(s)
- Ofir Blumer
- School
of Chemistry, Tel Aviv University, Tel Aviv 6997801, Israel
| | - Shlomi Reuveni
- School
of Chemistry, Tel Aviv University, Tel Aviv 6997801, Israel
- The
Center for Computational Molecular and Materials Science, Tel Aviv University, Tel Aviv 6997801, Israel
- The
Center for Physics and Chemistry of Living Systems, Tel Aviv University, Tel Aviv 6997801, Israel
| | - Barak Hirshberg
- School
of Chemistry, Tel Aviv University, Tel Aviv 6997801, Israel
- The
Center for Computational Molecular and Materials Science, Tel Aviv University, Tel Aviv 6997801, Israel
- The
Center for Physics and Chemistry of Living Systems, Tel Aviv University, Tel Aviv 6997801, Israel
| |
Collapse
|
19
|
Ray D, Das S, Raucci U. Kinetic View of Enzyme Catalysis from Enhanced Sampling QM/MM Simulations. J Chem Inf Model 2024; 64:3953-3958. [PMID: 38607669 DOI: 10.1021/acs.jcim.4c00475] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/13/2024]
Abstract
The rate constants of enzyme-catalyzed reactions (kcat) are often approximated from the barrier height of the reactive step. We introduce an enhanced sampling QM/MM approach that directly calculates the kinetics of enzymatic reactions, without introducing the transition-state theory assumptions, and takes into account the dynamical equilibrium between the reactive and non-reactive conformations of the enzyme/substrate complex. Our computed kcat values are in order-of-magnitude agreement with the experimental data for two representative enzymatic reactions.
Collapse
Affiliation(s)
- Dhiman Ray
- Atomistic Simulations, Italian Institute of Technology, Via Enrico Melen 83, Genova GE 16152, Italy
| | - Sudip Das
- Atomistic Simulations, Italian Institute of Technology, Via Enrico Melen 83, Genova GE 16152, Italy
| | - Umberto Raucci
- Atomistic Simulations, Italian Institute of Technology, Via Enrico Melen 83, Genova GE 16152, Italy
| |
Collapse
|
20
|
Zhang P, Gardini AT, Xu X, Parrinello M. Intramolecular and Water Mediated Tautomerism of Solvated Glycine. J Chem Inf Model 2024; 64:3599-3604. [PMID: 38620066 DOI: 10.1021/acs.jcim.4c00273] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/17/2024]
Abstract
Understanding tautomerism and characterizing solvent effects on the dynamic processes pose significant challenges. Using enhanced-sampling molecular dynamics based on state-of-the-art deep learning potentials, we investigated the tautomeric equilibria of glycine in water. We observed that the tautomerism between neutral and zwitterionic glycine can occur through both intramolecular and intermolecular proton transfers. The latter proceeds involving a contact anionic-glycine-hydronium ion pair or separate cationic-glycine-hydroxide ion pair. These pathways with comparable barriers contribute almost equally to the reaction flux.
Collapse
Affiliation(s)
- Pengchao Zhang
- Center for Combustion Energy, Department of Energy and Power Engineering, and Key Laboratory for Thermal Science and Power Engineering of Ministry of Education, Tsinghua University, Beijing 100084, China
- Atomistic Simulations, Italian Institute of Technology, Genova 16152, Italy
| | - Axel Tosello Gardini
- Atomistic Simulations, Italian Institute of Technology, Genova 16152, Italy
- Department of Materials Science, Università di Milano-Bicocca, 20126 Milano, Italy
| | - Xuefei Xu
- Center for Combustion Energy, Department of Energy and Power Engineering, and Key Laboratory for Thermal Science and Power Engineering of Ministry of Education, Tsinghua University, Beijing 100084, China
| | - Michele Parrinello
- Atomistic Simulations, Italian Institute of Technology, Genova 16152, Italy
| |
Collapse
|
21
|
Lee S, Wang D, Seeliger MA, Tiwary P. Calculating Protein-Ligand Residence Times Through State Predictive Information Bottleneck based Enhanced Sampling. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.16.589710. [PMID: 38659748 PMCID: PMC11042289 DOI: 10.1101/2024.04.16.589710] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/26/2024]
Abstract
Understanding drug residence times in target proteins is key to improving drug efficacy and understanding target recognition in biochemistry. While drug residence time is just as important as binding affinity, atomic-level understanding of drug residence times through molecular dynamics (MD) simulations has been difficult primarily due to the extremely long timescales. Recent advances in rare event sampling have allowed us to reach these timescales, yet predicting protein-ligand residence times remains a significant challenge. Here we present a semi-automated protocol to calculate the ligand residence times across 12 orders of magnitudes of timescales. In our proposed framework, we integrate a deep learning-based method, the state predictive information bottleneck (SPIB), to learn an approximate reaction coordinate (RC) and use it to guide the enhanced sampling method metadynamics. We demonstrate the performance of our algorithm by applying it to six different protein-ligand complexes with available benchmark residence times, including the dissociation of the widely studied anti-cancer drug Imatinib (Gleevec) from both wild-type Abl kinase and drug-resistant mutants. We show how our protocol can recover quantitatively accurate residence times, potentially opening avenues for deeper insights into drug development possibilities and ligand recognition mechanisms.
Collapse
Affiliation(s)
- Suemin Lee
- Biophysics Program and Institute for Physical Science and Technology, University of Maryland, College Park 20742, USA
| | - Dedi Wang
- Biophysics Program and Institute for Physical Science and Technology, University of Maryland, College Park 20742, USA
| | - Markus A. Seeliger
- Department of Pharmacological Sciences, Stony Brook University, Stony Brook, NY 11794-8651, USA
| | - Pratyush Tiwary
- Biophysics Program and Institute for Physical Science and Technology, University of Maryland, College Park 20742, USA
- Department of Chemistry and Biochemistry and Institute for Physical Science and Technology, University of Maryland, College Park 20742, USA
- University of Maryland Institute for Health Computing, Rockville, United States
| |
Collapse
|
22
|
Ray D, Parrinello M. Data-driven classification of ligand unbinding pathways. Proc Natl Acad Sci U S A 2024; 121:e2313542121. [PMID: 38412121 PMCID: PMC10927508 DOI: 10.1073/pnas.2313542121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2023] [Accepted: 01/26/2024] [Indexed: 02/29/2024] Open
Abstract
Studying the pathways of ligand-receptor binding is essential to understand the mechanism of target recognition by small molecules. The binding free energy and kinetics of protein-ligand complexes can be computed using molecular dynamics (MD) simulations, often in quantitative agreement with experiments. However, only a qualitative picture of the ligand binding/unbinding paths can be obtained through a conventional analysis of the MD trajectories. Besides, the higher degree of manual effort involved in analyzing pathways limits its applicability in large-scale drug discovery. Here, we address this limitation by introducing an automated approach for analyzing molecular transition paths with a particular focus on protein-ligand dissociation. Our method is based on the dynamic time-warping algorithm, originally designed for speech recognition. We accurately classified molecular trajectories using a very generic descriptor set of contacts or distances. Our approach outperforms manual classification by distinguishing between parallel dissociation channels, within the pathways identified by visual inspection. Most notably, we could compute exit-path-specific ligand-dissociation kinetics. The unbinding timescale along the fastest path agrees with the experimental residence time, providing a physical interpretation to our entirely data-driven protocol. In combination with appropriate enhanced sampling algorithms, this technique can be used for the initial exploration of ligand-dissociation pathways as well as for calculating path-specific thermodynamic and kinetic properties.
Collapse
Affiliation(s)
- Dhiman Ray
- Simulations Research Line, Italian Institute of Technology, Via Enrico Melen 83, GenovaGE16152, Italy
| | - Michele Parrinello
- Simulations Research Line, Italian Institute of Technology, Via Enrico Melen 83, GenovaGE16152, Italy
| |
Collapse
|
23
|
Zhang DT, Baldauf L, Roet S, Lervik A, van Erp TS. Highly parallelizable path sampling with minimal rejections using asynchronous replica exchange and infinite swaps. Proc Natl Acad Sci U S A 2024; 121:e2318731121. [PMID: 38315841 PMCID: PMC10873605 DOI: 10.1073/pnas.2318731121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2023] [Accepted: 01/09/2024] [Indexed: 02/07/2024] Open
Abstract
Capturing rare yet pivotal events poses a significant challenge for molecular simulations. Path sampling provides a unique approach to tackle this issue without altering the potential energy landscape or dynamics, enabling recovery of both thermodynamic and kinetic information. However, despite its exponential acceleration compared to standard molecular dynamics, generating numerous trajectories can still require a long time. By harnessing our recent algorithmic innovations-particularly subtrajectory moves with high acceptance, coupled with asynchronous replica exchange featuring infinite swaps-we establish a highly parallelizable and rapidly converging path sampling protocol, compatible with diverse high-performance computing architectures. We demonstrate our approach on the liquid-vapor phase transition in superheated water, the unfolding of the chignolin protein, and water dissociation. The latter, performed at the ab initio level, achieves comparable statistical accuracy within days, in contrast to a previous study requiring over a year.
Collapse
Affiliation(s)
- Daniel T. Zhang
- Department of Chemistry, Norwegian University of Science and Technology, TrondheimN-7491, Norway
| | - Lukas Baldauf
- Department of Chemistry, Norwegian University of Science and Technology, TrondheimN-7491, Norway
| | - Sander Roet
- Department of Chemistry, Utrecht University, Utrecht3584 CH, Netherlands
| | - Anders Lervik
- Department of Chemistry, Norwegian University of Science and Technology, TrondheimN-7491, Norway
| | - Titus S. van Erp
- Department of Chemistry, Norwegian University of Science and Technology, TrondheimN-7491, Norway
| |
Collapse
|
24
|
Blumer O, Reuveni S, Hirshberg B. Combining stochastic resetting with Metadynamics to speed-up molecular dynamics simulations. Nat Commun 2024; 15:240. [PMID: 38172126 PMCID: PMC10764788 DOI: 10.1038/s41467-023-44528-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2023] [Accepted: 12/18/2023] [Indexed: 01/05/2024] Open
Abstract
Metadynamics is a powerful method to accelerate molecular dynamics simulations, but its efficiency critically depends on the identification of collective variables that capture the slow modes of the process. Unfortunately, collective variables are usually not known a priori and finding them can be very challenging. We recently presented a collective variables-free approach to enhanced sampling using stochastic resetting. Here, we combine the two methods, showing that it can lead to greater acceleration than either of them separately. We also demonstrate that resetting Metadynamics simulations performed with suboptimal collective variables can lead to speedups comparable with those obtained with optimal collective variables. Therefore, applying stochastic resetting can be an alternative to the challenging task of improving suboptimal collective variables, at almost no additional computational cost. Finally, we propose a method to extract unbiased mean first-passage times from Metadynamics simulations with resetting, resulting in an improved tradeoff between speedup and accuracy. This work enables combining stochastic resetting with other enhanced sampling methods to accelerate a broad range of molecular simulations.
Collapse
Affiliation(s)
- Ofir Blumer
- School of Chemistry, Tel Aviv University, Tel Aviv, 6997801, Israel
| | - Shlomi Reuveni
- School of Chemistry, Tel Aviv University, Tel Aviv, 6997801, Israel
- The Center for Computational Molecular and Materials Science, Tel Aviv University, Tel Aviv, 6997801, Israel
- The Center for Physics and Chemistry of Living Systems, Tel Aviv University, Tel Aviv, 6997801, Israel
| | - Barak Hirshberg
- School of Chemistry, Tel Aviv University, Tel Aviv, 6997801, Israel.
- The Center for Computational Molecular and Materials Science, Tel Aviv University, Tel Aviv, 6997801, Israel.
- The Center for Physics and Chemistry of Living Systems, Tel Aviv University, Tel Aviv, 6997801, Israel.
| |
Collapse
|
25
|
Kim H, Kim E, Pak Y. Computational Probing of the Folding Mechanism of Human Telomeric G-Quadruplex DNA. J Chem Inf Model 2023; 63:6366-6375. [PMID: 37782649 DOI: 10.1021/acs.jcim.3c01257] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/04/2023]
Abstract
The human telomeric (htel) sequences in the terminal regions of human telomeres form diverse G-quadruplex (GQ) structures. Despite much experimental efforts to elucidate the folding pathways of htel GQ, no comprehensive model of htel GQ folding has been presented. Here, we describe folding pathways of the htel GQ determined by state-of-the-art enhanced sampling molecular dynamics simulation at the all-atom level. Briefly, GQ folding is initiated by the formation of a single-hairpin and then followed by the formation of double-hairpins, which then branch via distinct folding pathways to produce different GQ topologies (antiparallel chair, antiparallel basket, hybrids 1 and 2, and parallel propeller). In addition to these double-hairpin states, three-triad and two-tetrad structures in antiparallel backbone alignment serve as key intermediates that connect the GQ folding and transition between two different GQs.
Collapse
Affiliation(s)
- Hyeonjun Kim
- Department of Chemistry and Institute of Functional Materials, Pusan National University, Busan 46241, South Korea
| | - Eunae Kim
- College of Pharmacy, Chosun University, Gwangju 61452, South Korea
| | - Youngshang Pak
- Department of Chemistry and Institute of Functional Materials, Pusan National University, Busan 46241, South Korea
| |
Collapse
|
26
|
Conflitti P, Raniolo S, Limongelli V. Perspectives on Ligand/Protein Binding Kinetics Simulations: Force Fields, Machine Learning, Sampling, and User-Friendliness. J Chem Theory Comput 2023; 19:6047-6061. [PMID: 37656199 PMCID: PMC10536999 DOI: 10.1021/acs.jctc.3c00641] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Indexed: 09/02/2023]
Abstract
Computational techniques applied to drug discovery have gained considerable popularity for their ability to filter potentially active drugs from inactive ones, reducing the time scale and costs of preclinical investigations. The main focus of these studies has historically been the search for compounds endowed with high affinity for a specific molecular target to ensure the formation of stable and long-lasting complexes. Recent evidence has also correlated the in vivo drug efficacy with its binding kinetics, thus opening new fascinating scenarios for ligand/protein binding kinetic simulations in drug discovery. The present article examines the state of the art in the field, providing a brief summary of the most popular and advanced ligand/protein binding kinetics techniques and evaluating their current limitations and the potential solutions to reach more accurate kinetic models. Particular emphasis is put on the need for a paradigm change in the present methodologies toward ligand and protein parametrization, the force field problem, characterization of the transition states, the sampling issue, and algorithms' performance, user-friendliness, and data openness.
Collapse
Affiliation(s)
- Paolo Conflitti
- Faculty
of Biomedical Sciences, Euler Institute, Universitá della Svizzera italiana (USI), 6900 Lugano, Switzerland
| | - Stefano Raniolo
- Faculty
of Biomedical Sciences, Euler Institute, Universitá della Svizzera italiana (USI), 6900 Lugano, Switzerland
| | - Vittorio Limongelli
- Faculty
of Biomedical Sciences, Euler Institute, Universitá della Svizzera italiana (USI), 6900 Lugano, Switzerland
- Department
of Pharmacy, University of Naples “Federico
II”, 80131 Naples, Italy
| |
Collapse
|
27
|
Ray D, Parrinello M. Kinetics from Metadynamics: Principles, Applications, and Outlook. J Chem Theory Comput 2023; 19:5649-5670. [PMID: 37585703 DOI: 10.1021/acs.jctc.3c00660] [Citation(s) in RCA: 32] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/18/2023]
Abstract
Metadynamics is a popular enhanced sampling algorithm for computing the free energy landscape of rare events by using molecular dynamics simulation. Ten years ago, Tiwary and Parrinello introduced the infrequent metadynamics approach for calculating the kinetics of transitions across free energy barriers. Since then, metadynamics-based methods for obtaining rate constants have attracted significant attention in computational molecular science. Such methods have been applied to study a wide range of problems, including protein-ligand binding, protein folding, conformational transitions, chemical reactions, catalysis, and nucleation. Here, we review the principles of elucidating kinetics from metadynamics-like approaches, subsequent methodological developments in this area, and successful applications on chemical, biological, and material systems. We also highlight the challenges of reconstructing accurate kinetics from enhanced sampling simulations and the scope of future developments.
Collapse
Affiliation(s)
- Dhiman Ray
- Atomistic Simulations, Italian Institute of Technology, Via Enrico Melen 83, 16152 Genova, Italy
| | - Michele Parrinello
- Atomistic Simulations, Italian Institute of Technology, Via Enrico Melen 83, 16152 Genova, Italy
| |
Collapse
|
28
|
Blazhynska M, Goulard Coderc de Lacam E, Chen H, Chipot C. Improving Speed and Affordability without Compromising Accuracy: Standard Binding Free-Energy Calculations Using an Enhanced Sampling Algorithm, Multiple-Time Stepping, and Hydrogen Mass Repartitioning. J Chem Theory Comput 2023. [PMID: 37196198 DOI: 10.1021/acs.jctc.3c00141] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/19/2023]
Abstract
Accurate evaluation of protein-ligand binding free energies in silico is of paramount importance for understanding the mechanisms of biological regulation and providing a theoretical basis for drug design and discovery. Based on a series of atomistic molecular dynamics simulations in an explicit solvent, using well-tempered metadynamics extended adaptive biasing force (WTM-eABF) as an enhanced sampling algorithm, the so-called "geometrical route" offers a rigorous theoretical framework for binding affinity calculations that match experimental values. However, although robust, this strategy remains expensive, requiring substantial computational time to achieve convergence of the simulations. Improving the efficiency of the geometrical route, while preserving its reliability through improved ergodic sampling, is, therefore, highly desirable. In this contribution, having identified the computational bottleneck of the geometrical route, to accelerate the calculations we combine (i) a longer time step for the integration of the equations of motion with hydrogen-mass repartitioning (HMR), and (ii) multiple time-stepping (MTS) for collective-variable and biasing-force evaluation. Altogether, we performed 50 independent WTM-eABF simulations in triplicate for the "physical" separation of the Abl kinase-SH3 domain:p41 complex, following different HMR and MTS schemes, while tuning, in distinct protocols, the parameters of the enhanced-sampling algorithm. To demonstrate the consistency and reliability of the results obtained with the best-performing setups, we carried out quintuple simulations. Furthermore, we demonstrated the transferability of our method to other complexes by triplicating a 200 ns separation simulation of nine chosen protocols for the MDM2-p53:NVP-CGM097 complex. [Holzer et al. J. Med. Chem. 2015, 58, 6348-6358.] Our results, based on an aggregate simulation time of 14.4 μs, allowed an optimal set of parameters to be identified, able to accelerate convergence by a factor of three without any noticeable loss of accuracy.
Collapse
Affiliation(s)
- Marharyta Blazhynska
- Laboratoire International Associé Centre National de la Recherche Scientifique et University of Illinois at Urbana-Champaign, Unité Mixte de Recherche n°7019, Université de Lorraine, B.P. 70239, 54506 Vandœuvre-lès-Nancy cedex, France
| | - Emma Goulard Coderc de Lacam
- Laboratoire International Associé Centre National de la Recherche Scientifique et University of Illinois at Urbana-Champaign, Unité Mixte de Recherche n°7019, Université de Lorraine, B.P. 70239, 54506 Vandœuvre-lès-Nancy cedex, France
| | - Haochuan Chen
- Laboratoire International Associé Centre National de la Recherche Scientifique et University of Illinois at Urbana-Champaign, Unité Mixte de Recherche n°7019, Université de Lorraine, B.P. 70239, 54506 Vandœuvre-lès-Nancy cedex, France
| | - Christophe Chipot
- Laboratoire International Associé Centre National de la Recherche Scientifique et University of Illinois at Urbana-Champaign, Unité Mixte de Recherche n°7019, Université de Lorraine, B.P. 70239, 54506 Vandœuvre-lès-Nancy cedex, France
- Theoretical and Computational Biophysics Group, Beckman Institute, and Department of Physics, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
- Department of Biochemistry and Molecular Biology, The University of Chicago, 929 E. 57th Street W225, Chicago, Illinois 60637, United States
| |
Collapse
|
29
|
Blumer O, Reuveni S, Hirshberg B. Stochastic Resetting for Enhanced Sampling. J Phys Chem Lett 2022; 13:11230-11236. [PMID: 36446130 PMCID: PMC9743203 DOI: 10.1021/acs.jpclett.2c03055] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2022] [Accepted: 11/23/2022] [Indexed: 06/16/2023]
Abstract
We present a method for enhanced sampling of molecular dynamics simulations using stochastic resetting. Various phenomena, ranging from crystal nucleation to protein folding, occur on time scales that are unreachable in standard simulations. They are often characterized by broad transition time distributions, in which extremely slow events have a non-negligible probability. Stochastic resetting, i.e., restarting simulations at random times, was recently shown to significantly expedite processes that follow such distributions. Here, we employ resetting for enhanced sampling of molecular simulations for the first time. We show that it accelerates long time scale processes by up to an order of magnitude in examples ranging from simple models to a molecular system. Most importantly, we recover the mean transition time without resetting, which is typically too long to be sampled directly, from accelerated simulations at a single restart rate. Stochastic resetting can be used as a standalone method or combined with other sampling algorithms to further accelerate simulations.
Collapse
Affiliation(s)
- Ofir Blumer
- School
of Chemistry, Tel Aviv University, Tel Aviv6997801, Israel
| | - Shlomi Reuveni
- School
of Chemistry, Tel Aviv University, Tel Aviv6997801, Israel
- The
Center for Computational Molecular and Materials Science, Tel Aviv University, Tel Aviv6997801, Israel
- The
Center for Physics and Chemistry of Living Systems, Tel Aviv University, Tel Aviv6997801, Israel
| | - Barak Hirshberg
- School
of Chemistry, Tel Aviv University, Tel Aviv6997801, Israel
- The
Center for Computational Molecular and Materials Science, Tel Aviv University, Tel Aviv6997801, Israel
| |
Collapse
|
30
|
Kinetics of Drug Release from Clay Using Enhanced Sampling Methods. Pharmaceutics 2022; 14:pharmaceutics14122586. [PMID: 36559081 PMCID: PMC9781022 DOI: 10.3390/pharmaceutics14122586] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2022] [Revised: 11/21/2022] [Accepted: 11/22/2022] [Indexed: 11/27/2022] Open
Abstract
A key step in the development of a new drug, is the design of drug-excipient complexes that lead to optimal drug release kinetics. Computational chemistry and specifically enhanced sampling molecular dynamics methods can play a key role in this context, by minimizing the need for expensive experiments, and reducing cost and time. Here we show that recent advances in enhanced sampling methodologies can be brought to fruition in this area. We demonstrate the potential of these methodologies by simulating the drug release kinetics of the complex praziquantel-montmorillonite in water. Praziquantel finds promising applications in the treatment of schistosomiasis, but its biopharmaceutical profile needs to be improved, and a cheap material such as the montmorillonite clay would be a very convenient excipient. We simulate the drug release both from surface and interlayer space, and find that the diffusion of the praziquantel inside the interlayer space is the process that limits the rate of drug release.
Collapse
|