1
|
Zhang M, Wu H, Wang Y. Enhanced Sampling of Biomolecular Slow Conformational Transitions Using Adaptive Sampling and Machine Learning. J Chem Theory Comput 2024; 20:8569-8582. [PMID: 39301626 DOI: 10.1021/acs.jctc.4c00764] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/22/2024]
Abstract
Biomolecular simulations often suffer from the "time scale problem", hindering the study of rare events occurring over extended time scales. Enhanced sampling techniques aim to alleviate this issue by accelerating conformational transitions, yet they typically necessitate well-defined collective variables (CVs), posing a significant challenge. Machine learning offers promising solutions but typically requires rich training data encompassing the entire free energy surface (FES). In this work, we introduce an automated iterative pipeline designed to mitigate these limitations. Our protocol first utilizes a CV-free count-based adaptive sampling method to generate a data set rich in rare events. From this data set, slow modes are identified using Koopman-reweighted time-lagged independent component analysis (KTICA), which are subsequently leveraged by on-the-fly probability enhanced sampling (OPES) to efficiently explore the FES. The effectiveness of our pipeline is demonstrated and further compared with the common Markov State Model (MSM) approach on two model systems with increasing complexity: alanine dipeptide (Ala2) and deca-alanine (Ala10), underscoring its applicability across diverse biomolecular simulations.
Collapse
Affiliation(s)
- Mingyuan Zhang
- College of Life Sciences, Zhejiang University, Hangzhou 310027, China
| | - Hao Wu
- School of Mathematical Sciences, Institute of Natural Sciences, and MOE-LSC, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Yong Wang
- College of Life Sciences, Zhejiang University, Hangzhou 310027, China
| |
Collapse
|
2
|
Das M, Venkatramani R. A Mode Evolution Metric to Extract Reaction Coordinates for Biomolecular Conformational Transitions. J Chem Theory Comput 2024; 20:8422-8436. [PMID: 39287954 DOI: 10.1021/acs.jctc.4c00744] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/19/2024]
Abstract
The complex, multidimensional energy landscape of biomolecules makes the extraction of suitable, nonintuitive collective variables (CVs) that describe their conformational transitions challenging. At present, dimensionality reduction approaches and machine learning (ML) schemes are employed to obtain CVs from molecular dynamics (MD)/Monte Carlo (MC) trajectories or structural databanks for biomolecules. However, minimum sampling conditions to generate reliable CVs that accurately describe the underlying energy landscape remain unclear. Here, we address this issue by developing a Mode evolution Metric (MeM) to extract CVs that can pinpoint new states and describe local transitions in the vicinity of a reference minimum from nonequilibrated MD/MC trajectories. We present a general mathematical formulation of MeM for both statistical dimensionality reduction and machine learning approaches. Application of MeM to MC trajectories of model potential energy landscapes and MD trajectories of solvated alanine dipeptide reveals that the principal components which locate new states in the vicinity of a reference minimum emerge well before the trajectories locally equilibrate between the associated states. Finally, we demonstrate a possible application of MeM in designing efficient biased sampling schemes to construct accurate energy landscape slices that link transitions between states. MeM can help speed up the search for new minima around a biomolecular conformational state and enable the accurate estimation of thermodynamics for states lying on the energy landscape and the description of associated transitions.
Collapse
Affiliation(s)
- Mitradip Das
- Department of Chemical Sciences, Tata Institue of Fundamental Research, Colaba, Mumbai 400005, India
| | - Ravindra Venkatramani
- Department of Chemical Sciences, Tata Institue of Fundamental Research, Colaba, Mumbai 400005, India
| |
Collapse
|
3
|
Yang S, Nam J, Dietschreit JCB, Gómez-Bombarelli R. Learning Collective Variables with Synthetic Data Augmentation through Physics-Inspired Geodesic Interpolation. J Chem Theory Comput 2024. [PMID: 39073442 DOI: 10.1021/acs.jctc.4c00435] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/30/2024]
Abstract
In molecular dynamics simulations, rare events, such as protein folding, are typically studied using enhanced sampling techniques, most of which are based on the definition of a collective variable (CV) along which acceleration occurs. Obtaining an expressive CV is crucial, but often hindered by the lack of information about the particular event, e.g., the transition from unfolded to folded conformation. We propose a simulation-free data augmentation strategy using physics-inspired metrics to generate geodesic interpolations resembling protein folding transitions, thereby improving sampling efficiency without true transition state samples. This new data can be used to improve the accuracy of classifier-based methods. Alternatively, a regression-based learning scheme for CV models can be adopted by leveraging the interpolation progress parameter.
Collapse
Affiliation(s)
- Soojung Yang
- Computational and Systems Biology Program, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Juno Nam
- Department of Materials Science and Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Johannes C B Dietschreit
- Department of Materials Science and Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
- Institute of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, Währinger Straße 17, 1090 Vienna, Austria
| | - Rafael Gómez-Bombarelli
- Department of Materials Science and Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| |
Collapse
|
4
|
Lee S, Wang D, Seeliger MA, Tiwary P. Calculating Protein-Ligand Residence Times through State Predictive Information Bottleneck Based Enhanced Sampling. J Chem Theory Comput 2024; 20:6341-6349. [PMID: 38991145 DOI: 10.1021/acs.jctc.4c00503] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/13/2024]
Abstract
Understanding drug residence times in target proteins is key to improving drug efficacy and understanding target recognition in biochemistry. While drug residence time is just as important as binding affinity, atomic-level understanding of drug residence times through molecular dynamics (MD) simulations has been difficult primarily due to the extremely long time scales. Recent advances in rare event sampling have allowed us to reach these time scales, yet predicting protein-ligand residence times remains a significant challenge. Here we present a semi-automated protocol to calculate the ligand residence times across 12 orders of magnitude of time scales. In our proposed framework, we integrate a deep learning-based method, the state predictive information bottleneck (SPIB), to learn an approximate reaction coordinate (RC) and use it to guide the enhanced sampling method metadynamics. We demonstrate the performance of our algorithm by applying it to six different protein-ligand complexes with available benchmark residence times, including the dissociation of the widely studied anticancer drug Imatinib (Gleevec) from both wild-type Abl kinase and drug-resistant mutants. We show how our protocol can recover quantitatively accurate residence times, potentially opening avenues for deeper insights into drug development possibilities and ligand recognition mechanisms.
Collapse
Affiliation(s)
- Suemin Lee
- Biophysics Program and Institute for Physical Science and Technology, University of Maryland, College Park 20742, United States
| | - Dedi Wang
- Biophysics Program and Institute for Physical Science and Technology, University of Maryland, College Park 20742, United States
| | - Markus A Seeliger
- Department of Pharmacological Sciences, Stony Brook University, Stony Brook, New York 11794-8651, United States
| | - Pratyush Tiwary
- Biophysics Program and Institute for Physical Science and Technology, University of Maryland, College Park 20742, United States
- Department of Chemistry and Biochemistry and Institute for Physical Science and Technology, University of Maryland, College Park 20742, United States
- University of Maryland Institute for Health Computing, Bethesda, Maryland 20852, United States
| |
Collapse
|
5
|
Mazzaferro N, Sasmal S, Cossio P, Hocky GM. Good Rates From Bad Coordinates: The Exponential Average Time-dependent Rate Approach. J Chem Theory Comput 2024; 20:5901-5912. [PMID: 38954555 PMCID: PMC11270837 DOI: 10.1021/acs.jctc.4c00425] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2024] [Revised: 06/11/2024] [Accepted: 06/12/2024] [Indexed: 07/04/2024]
Abstract
Our ability to calculate rate constants of biochemical processes using molecular dynamics simulations is severely limited by the fact that the time scales for reactions, or changes in conformational state, scale exponentially with the relevant free-energy barrier heights. In this work, we improve upon a recently proposed rate estimator that allows us to predict transition times with molecular dynamics simulations biased to rapidly explore one or several collective variables (CVs). This approach relies on the idea that not all bias goes into promoting transitions, and along with the rate, it estimates a concomitant scale factor for the bias termed the "CV biasing efficiency" γ. First, we demonstrate mathematically that our new formulation allows us to derive the commonly used Infrequent Metadynamics (iMetaD) estimator when using a perfect CV, where γ = 1. After testing it on a model potential, we then study the unfolding behavior of a previously well characterized coarse-grained protein, which is sufficiently complex that we can choose many different CVs to bias, but which is sufficiently simple that we are able to compute the unbiased rate directly. For this system, we demonstrate that predictions from our new Exponential Average Time-Dependent Rate (EATR) estimator converge to the true rate constant more rapidly as a function of bias deposition time than does the previous iMetaD approach, even for bias deposition times that are short. We also show that the γ parameter can serve as a good metric for assessing the quality of the biasing coordinate. We demonstrate that these results hold when applying the methods to an atomistic protein folding example. Finally, we demonstrate that our approach works when combining multiple less-than-optimal bias coordinates, and adapt our method to the related "OPES flooding" approach. Overall, our time-dependent rate approach offers a powerful framework for predicting rate constants from biased simulations.
Collapse
Affiliation(s)
- Nicodemo Mazzaferro
- Department
of Chemistry, New York University, New York, New York 10003, United States
| | - Subarna Sasmal
- Department
of Chemistry, New York University, New York, New York 10003, United States
| | - Pilar Cossio
- Center
for Computational Mathematics, Flatiron
Institute, New York, New York 10010, United States
- Center
for Computational Biology, Flatiron Institute, New York, New York 10010, United States
| | - Glen M. Hocky
- Department
of Chemistry, New York University, New York, New York 10003, United States
- Simons
Center for Computational Physical Chemistry, New York University, New York, New York 10003, United States
| |
Collapse
|
6
|
Faran M, Ray D, Nag S, Raucci U, Parrinello M, Bisker G. A Stochastic Landscape Approach for Protein Folding State Classification. J Chem Theory Comput 2024; 20:5428-5438. [PMID: 38924770 PMCID: PMC11238538 DOI: 10.1021/acs.jctc.4c00464] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2024] [Revised: 06/12/2024] [Accepted: 06/12/2024] [Indexed: 06/28/2024]
Abstract
Protein folding is a critical process that determines the functional state of proteins. Proper folding is essential for proteins to acquire their functional three-dimensional structures and execute their biological role, whereas misfolded proteins can lead to various diseases, including neurodegenerative disorders like Alzheimer's and Parkinson's. Therefore, a deeper understanding of protein folding is vital for understanding disease mechanisms and developing therapeutic strategies. This study introduces the Stochastic Landscape Classification (SLC), an innovative, automated, nonlearning algorithm that quantitatively analyzes protein folding dynamics. Focusing on collective variables (CVs) - low-dimensional representations of complex dynamical systems like molecular dynamics (MD) of macromolecules - the SLC approach segments the CVs into distinct macrostates, revealing the protein folding pathway explored by MD simulations. The segmentation is achieved by analyzing changes in CV trends and clustering these segments using a standard density-based spatial clustering of applications with noise (DBSCAN) scheme. Applied to the MD-based CV trajectories of Chignolin and Trp-Cage proteins, the SLC demonstrates apposite accuracy, validated by comparing standard classification metrics against ground-truth data. These metrics affirm the efficacy of the SLC in capturing intricate protein dynamics and offer a method to evaluate and select the most informative CVs. The practical application of this technique lies in its ability to provide a detailed, quantitative description of protein folding processes, with significant implications for understanding and manipulating protein behavior in industrial and pharmaceutical contexts.
Collapse
Affiliation(s)
- Michael Faran
- Department
of Biomedical Engineering, Faculty of Engineering, Tel Aviv University, Tel Aviv 69978, Israel
| | - Dhiman Ray
- Atomistic
Simulations, Italian Institute of Technology, Via Enrico Melen 83, 16152 Genova, Italy
| | - Shubhadeep Nag
- Department
of Biomedical Engineering, Faculty of Engineering, Tel Aviv University, Tel Aviv 69978, Israel
| | - Umberto Raucci
- Atomistic
Simulations, Italian Institute of Technology, Via Enrico Melen 83, 16152 Genova, Italy
| | - Michele Parrinello
- Atomistic
Simulations, Italian Institute of Technology, Via Enrico Melen 83, 16152 Genova, Italy
| | - Gili Bisker
- Department
of Biomedical Engineering, Faculty of Engineering, Tel Aviv University, Tel Aviv 69978, Israel
- The
Center for Physics and Chemistry of Living Systems, Tel Aviv University, Tel Aviv 6997801, Israel
- The
Center for Nanoscience and Nanotechnology, Tel Aviv University, Tel Aviv 6997801, Israel
- The
Center for Light-Matter Interaction, Tel
Aviv University, Tel Aviv 6997801, Israel
| |
Collapse
|
7
|
Benayad Z, David R, Stirnemann G. Prebiotic chemical reactivity in solution with quantum accuracy and microsecond sampling using neural network potentials. Proc Natl Acad Sci U S A 2024; 121:e2322040121. [PMID: 38809704 PMCID: PMC11161780 DOI: 10.1073/pnas.2322040121] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Accepted: 04/26/2024] [Indexed: 05/31/2024] Open
Abstract
While RNA appears as a good candidate for the first autocatalytic systems preceding the emergence of modern life, the synthesis of RNA oligonucleotides without enzymes remains challenging. Because the uncatalyzed reaction is extremely slow, experimental studies bring limited and indirect information on the reaction mechanism, the nature of which remains debated. Here, we develop neural network potentials (NNPs) to study the phosphoester bond formation in water. While NNPs are becoming routinely applied to nonreactive systems or simple reactions, we demonstrate how they can systematically be trained to explore the reaction phase space for complex reactions involving several proton transfers and exchanges of heavy atoms. We then propagate at moderate computational cost hundreds of nanoseconds of a variety of enhanced sampling simulations with quantum accuracy in explicit solvent conditions. The thermodynamically preferred reaction pathway is a concerted, dissociative mechanism, with the transient formation of a metaphosphate transition state and direct participation of water solvent molecules that facilitate the exchange of protons through the nonbridging phosphate oxygens. Associative-dissociative pathways, characterized by a much tighter pentacoordinated phosphate, are higher in free energy. Our simulations also suggest that diprotonated phosphate, whose reactivity is never directly assessed in the experiments, is significantly less reactive than the monoprotonated species, suggesting that it is probably never the reactive species in normal pH conditions. These observations rationalize unexplained experimental results and the temperature dependence of the reaction rate, and they pave the way for the design of more efficient abiotic catalysts and activating groups.
Collapse
Affiliation(s)
- Zakarya Benayad
- CNRS Laboratoire de Biochimie Théorique, Institut de Biologie Physico-Chimique, Paris Sciences et Lettres University, Université Paris-Cité, 75005Paris, France
- PASTEUR, Département de Chimie, École Normale Supérieure, Paris Sciences et Lettres University, Sorbonne University, CNRS, 75005Paris, France
| | - Rolf David
- CNRS Laboratoire de Biochimie Théorique, Institut de Biologie Physico-Chimique, Paris Sciences et Lettres University, Université Paris-Cité, 75005Paris, France
- PASTEUR, Département de Chimie, École Normale Supérieure, Paris Sciences et Lettres University, Sorbonne University, CNRS, 75005Paris, France
| | - Guillaume Stirnemann
- CNRS Laboratoire de Biochimie Théorique, Institut de Biologie Physico-Chimique, Paris Sciences et Lettres University, Université Paris-Cité, 75005Paris, France
- PASTEUR, Département de Chimie, École Normale Supérieure, Paris Sciences et Lettres University, Sorbonne University, CNRS, 75005Paris, France
| |
Collapse
|
8
|
Keller BG, Bolhuis PG. Dynamical Reweighting for Biased Rare Event Simulations. Annu Rev Phys Chem 2024; 75:137-162. [PMID: 38941527 DOI: 10.1146/annurev-physchem-083122-124538] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/30/2024]
Abstract
Dynamical reweighting techniques aim to recover the correct molecular dynamics from a simulation at a modified potential energy surface. They are important for unbiasing enhanced sampling simulations of molecular rare events. Here, we review the theoretical frameworks of dynamical reweighting for modified potentials. Based on an overview of kinetic models with increasing level of detail, we discuss techniques to reweight two-state dynamics, multistate dynamics, and path integrals. We explore the natural link to transition path sampling and how the effect of nonequilibrium forces can be reweighted. We end by providing an outlook on how dynamical reweighting integrates with techniques for optimizing collective variables and with modern potential energy surfaces.
Collapse
Affiliation(s)
- Bettina G Keller
- Department of Biology, Chemistry and Pharmacy, Freie Universität Berlin, Berlin, Germany;
| | - Peter G Bolhuis
- Van 't Hoff Institute for Molecular Sciences, University of Amsterdam, Amsterdam, The Netherlands
| |
Collapse
|
9
|
Spiriti J, Wong CF. Quantitative Prediction of Dissociation Rates of PYK2 Ligands Using Umbrella Sampling and Milestoning. J Chem Theory Comput 2024; 20:4029-4044. [PMID: 38640609 DOI: 10.1021/acs.jctc.4c00192] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/21/2024]
Abstract
We used umbrella sampling and the milestoning simulation method to study the dissociation of multiple ligands from protein kinase PYK2. The activation barriers obtained from the potential of mean force of the umbrella sampling simulations correlated well with the experimental dissociation rates. Using the zero-temperature string method, we obtained optimized paths along the free-energy surfaces for milestoning simulations of three ligands with a similar structure. The milestoning simulations gave an absolute dissociation rate within 2 orders of magnitude of the experimental value for two ligands but at least 3 orders of magnitude too high for the third. Despite the similarity in their structures, the ligands took different pathways to exit from the binding site of PYK2, making contact with different sets of residues. In addition, the protein experienced different conformational changes for dissociation of the three ligands.
Collapse
Affiliation(s)
- Justin Spiriti
- Department of Chemistry and Biochemistry, University of Missouri-St. Louis, St. Louis, Missouri 63121, United States
| | - Chung F Wong
- Department of Chemistry and Biochemistry, University of Missouri-St. Louis, St. Louis, Missouri 63121, United States
| |
Collapse
|
10
|
Blumer O, Reuveni S, Hirshberg B. Short-Time Infrequent Metadynamics for Improved Kinetics Inference. J Chem Theory Comput 2024; 20:3484-3491. [PMID: 38668722 PMCID: PMC11099961 DOI: 10.1021/acs.jctc.4c00170] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2024] [Revised: 04/02/2024] [Accepted: 04/02/2024] [Indexed: 05/15/2024]
Abstract
Infrequent Metadynamics is a popular method to obtain the rates of long time-scale processes from accelerated simulations. The inference procedure is based on rescaling the first-passage times of the Metadynamics trajectories using a bias-dependent acceleration factor. While useful in many cases, it is limited to Poisson kinetics, and a reliable estimation of the unbiased rate requires slow bias deposition and prior knowledge of efficient collective variables. Here, we propose an improved inference scheme, which is based on two key observations: (1) the time-independent rate of Poisson processes can be estimated using short trajectories only. (2) Short trajectories experience minimal bias, and their rescaled first-passage times follow the unbiased distribution even for relatively high deposition rates and suboptimal collective variables. Therefore, by basing the inference procedure on short time scales, we obtain an improved trade-off between speedup and accuracy at no additional computational cost, especially when employing suboptimal collective variables. We demonstrate the improved inference scheme for a model system and two molecular systems.
Collapse
Affiliation(s)
- Ofir Blumer
- School
of Chemistry, Tel Aviv University, Tel Aviv 6997801, Israel
| | - Shlomi Reuveni
- School
of Chemistry, Tel Aviv University, Tel Aviv 6997801, Israel
- The
Center for Computational Molecular and Materials Science, Tel Aviv University, Tel Aviv 6997801, Israel
- The
Center for Physics and Chemistry of Living Systems, Tel Aviv University, Tel Aviv 6997801, Israel
| | - Barak Hirshberg
- School
of Chemistry, Tel Aviv University, Tel Aviv 6997801, Israel
- The
Center for Computational Molecular and Materials Science, Tel Aviv University, Tel Aviv 6997801, Israel
- The
Center for Physics and Chemistry of Living Systems, Tel Aviv University, Tel Aviv 6997801, Israel
| |
Collapse
|
11
|
Ray D, Das S, Raucci U. Kinetic View of Enzyme Catalysis from Enhanced Sampling QM/MM Simulations. J Chem Inf Model 2024; 64:3953-3958. [PMID: 38607669 DOI: 10.1021/acs.jcim.4c00475] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/13/2024]
Abstract
The rate constants of enzyme-catalyzed reactions (kcat) are often approximated from the barrier height of the reactive step. We introduce an enhanced sampling QM/MM approach that directly calculates the kinetics of enzymatic reactions, without introducing the transition-state theory assumptions, and takes into account the dynamical equilibrium between the reactive and non-reactive conformations of the enzyme/substrate complex. Our computed kcat values are in order-of-magnitude agreement with the experimental data for two representative enzymatic reactions.
Collapse
Affiliation(s)
- Dhiman Ray
- Atomistic Simulations, Italian Institute of Technology, Via Enrico Melen 83, Genova GE 16152, Italy
| | - Sudip Das
- Atomistic Simulations, Italian Institute of Technology, Via Enrico Melen 83, Genova GE 16152, Italy
| | - Umberto Raucci
- Atomistic Simulations, Italian Institute of Technology, Via Enrico Melen 83, Genova GE 16152, Italy
| |
Collapse
|
12
|
Zhang P, Gardini AT, Xu X, Parrinello M. Intramolecular and Water Mediated Tautomerism of Solvated Glycine. J Chem Inf Model 2024; 64:3599-3604. [PMID: 38620066 DOI: 10.1021/acs.jcim.4c00273] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/17/2024]
Abstract
Understanding tautomerism and characterizing solvent effects on the dynamic processes pose significant challenges. Using enhanced-sampling molecular dynamics based on state-of-the-art deep learning potentials, we investigated the tautomeric equilibria of glycine in water. We observed that the tautomerism between neutral and zwitterionic glycine can occur through both intramolecular and intermolecular proton transfers. The latter proceeds involving a contact anionic-glycine-hydronium ion pair or separate cationic-glycine-hydroxide ion pair. These pathways with comparable barriers contribute almost equally to the reaction flux.
Collapse
Affiliation(s)
- Pengchao Zhang
- Center for Combustion Energy, Department of Energy and Power Engineering, and Key Laboratory for Thermal Science and Power Engineering of Ministry of Education, Tsinghua University, Beijing 100084, China
- Atomistic Simulations, Italian Institute of Technology, Genova 16152, Italy
| | - Axel Tosello Gardini
- Atomistic Simulations, Italian Institute of Technology, Genova 16152, Italy
- Department of Materials Science, Università di Milano-Bicocca, 20126 Milano, Italy
| | - Xuefei Xu
- Center for Combustion Energy, Department of Energy and Power Engineering, and Key Laboratory for Thermal Science and Power Engineering of Ministry of Education, Tsinghua University, Beijing 100084, China
| | - Michele Parrinello
- Atomistic Simulations, Italian Institute of Technology, Genova 16152, Italy
| |
Collapse
|
13
|
Lee S, Wang D, Seeliger MA, Tiwary P. Calculating Protein-Ligand Residence Times Through State Predictive Information Bottleneck based Enhanced Sampling. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.16.589710. [PMID: 38659748 PMCID: PMC11042289 DOI: 10.1101/2024.04.16.589710] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/26/2024]
Abstract
Understanding drug residence times in target proteins is key to improving drug efficacy and understanding target recognition in biochemistry. While drug residence time is just as important as binding affinity, atomic-level understanding of drug residence times through molecular dynamics (MD) simulations has been difficult primarily due to the extremely long timescales. Recent advances in rare event sampling have allowed us to reach these timescales, yet predicting protein-ligand residence times remains a significant challenge. Here we present a semi-automated protocol to calculate the ligand residence times across 12 orders of magnitudes of timescales. In our proposed framework, we integrate a deep learning-based method, the state predictive information bottleneck (SPIB), to learn an approximate reaction coordinate (RC) and use it to guide the enhanced sampling method metadynamics. We demonstrate the performance of our algorithm by applying it to six different protein-ligand complexes with available benchmark residence times, including the dissociation of the widely studied anti-cancer drug Imatinib (Gleevec) from both wild-type Abl kinase and drug-resistant mutants. We show how our protocol can recover quantitatively accurate residence times, potentially opening avenues for deeper insights into drug development possibilities and ligand recognition mechanisms.
Collapse
Affiliation(s)
- Suemin Lee
- Biophysics Program and Institute for Physical Science and Technology, University of Maryland, College Park 20742, USA
| | - Dedi Wang
- Biophysics Program and Institute for Physical Science and Technology, University of Maryland, College Park 20742, USA
| | - Markus A. Seeliger
- Department of Pharmacological Sciences, Stony Brook University, Stony Brook, NY 11794-8651, USA
| | - Pratyush Tiwary
- Biophysics Program and Institute for Physical Science and Technology, University of Maryland, College Park 20742, USA
- Department of Chemistry and Biochemistry and Institute for Physical Science and Technology, University of Maryland, College Park 20742, USA
- University of Maryland Institute for Health Computing, Rockville, United States
| |
Collapse
|
14
|
Ray D, Parrinello M. Data-driven classification of ligand unbinding pathways. Proc Natl Acad Sci U S A 2024; 121:e2313542121. [PMID: 38412121 PMCID: PMC10927508 DOI: 10.1073/pnas.2313542121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2023] [Accepted: 01/26/2024] [Indexed: 02/29/2024] Open
Abstract
Studying the pathways of ligand-receptor binding is essential to understand the mechanism of target recognition by small molecules. The binding free energy and kinetics of protein-ligand complexes can be computed using molecular dynamics (MD) simulations, often in quantitative agreement with experiments. However, only a qualitative picture of the ligand binding/unbinding paths can be obtained through a conventional analysis of the MD trajectories. Besides, the higher degree of manual effort involved in analyzing pathways limits its applicability in large-scale drug discovery. Here, we address this limitation by introducing an automated approach for analyzing molecular transition paths with a particular focus on protein-ligand dissociation. Our method is based on the dynamic time-warping algorithm, originally designed for speech recognition. We accurately classified molecular trajectories using a very generic descriptor set of contacts or distances. Our approach outperforms manual classification by distinguishing between parallel dissociation channels, within the pathways identified by visual inspection. Most notably, we could compute exit-path-specific ligand-dissociation kinetics. The unbinding timescale along the fastest path agrees with the experimental residence time, providing a physical interpretation to our entirely data-driven protocol. In combination with appropriate enhanced sampling algorithms, this technique can be used for the initial exploration of ligand-dissociation pathways as well as for calculating path-specific thermodynamic and kinetic properties.
Collapse
Affiliation(s)
- Dhiman Ray
- Simulations Research Line, Italian Institute of Technology, Via Enrico Melen 83, GenovaGE16152, Italy
| | - Michele Parrinello
- Simulations Research Line, Italian Institute of Technology, Via Enrico Melen 83, GenovaGE16152, Italy
| |
Collapse
|
15
|
Zhang DT, Baldauf L, Roet S, Lervik A, van Erp TS. Highly parallelizable path sampling with minimal rejections using asynchronous replica exchange and infinite swaps. Proc Natl Acad Sci U S A 2024; 121:e2318731121. [PMID: 38315841 PMCID: PMC10873605 DOI: 10.1073/pnas.2318731121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2023] [Accepted: 01/09/2024] [Indexed: 02/07/2024] Open
Abstract
Capturing rare yet pivotal events poses a significant challenge for molecular simulations. Path sampling provides a unique approach to tackle this issue without altering the potential energy landscape or dynamics, enabling recovery of both thermodynamic and kinetic information. However, despite its exponential acceleration compared to standard molecular dynamics, generating numerous trajectories can still require a long time. By harnessing our recent algorithmic innovations-particularly subtrajectory moves with high acceptance, coupled with asynchronous replica exchange featuring infinite swaps-we establish a highly parallelizable and rapidly converging path sampling protocol, compatible with diverse high-performance computing architectures. We demonstrate our approach on the liquid-vapor phase transition in superheated water, the unfolding of the chignolin protein, and water dissociation. The latter, performed at the ab initio level, achieves comparable statistical accuracy within days, in contrast to a previous study requiring over a year.
Collapse
Affiliation(s)
- Daniel T. Zhang
- Department of Chemistry, Norwegian University of Science and Technology, TrondheimN-7491, Norway
| | - Lukas Baldauf
- Department of Chemistry, Norwegian University of Science and Technology, TrondheimN-7491, Norway
| | - Sander Roet
- Department of Chemistry, Utrecht University, Utrecht3584 CH, Netherlands
| | - Anders Lervik
- Department of Chemistry, Norwegian University of Science and Technology, TrondheimN-7491, Norway
| | - Titus S. van Erp
- Department of Chemistry, Norwegian University of Science and Technology, TrondheimN-7491, Norway
| |
Collapse
|
16
|
Blumer O, Reuveni S, Hirshberg B. Combining stochastic resetting with Metadynamics to speed-up molecular dynamics simulations. Nat Commun 2024; 15:240. [PMID: 38172126 PMCID: PMC10764788 DOI: 10.1038/s41467-023-44528-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2023] [Accepted: 12/18/2023] [Indexed: 01/05/2024] Open
Abstract
Metadynamics is a powerful method to accelerate molecular dynamics simulations, but its efficiency critically depends on the identification of collective variables that capture the slow modes of the process. Unfortunately, collective variables are usually not known a priori and finding them can be very challenging. We recently presented a collective variables-free approach to enhanced sampling using stochastic resetting. Here, we combine the two methods, showing that it can lead to greater acceleration than either of them separately. We also demonstrate that resetting Metadynamics simulations performed with suboptimal collective variables can lead to speedups comparable with those obtained with optimal collective variables. Therefore, applying stochastic resetting can be an alternative to the challenging task of improving suboptimal collective variables, at almost no additional computational cost. Finally, we propose a method to extract unbiased mean first-passage times from Metadynamics simulations with resetting, resulting in an improved tradeoff between speedup and accuracy. This work enables combining stochastic resetting with other enhanced sampling methods to accelerate a broad range of molecular simulations.
Collapse
Affiliation(s)
- Ofir Blumer
- School of Chemistry, Tel Aviv University, Tel Aviv, 6997801, Israel
| | - Shlomi Reuveni
- School of Chemistry, Tel Aviv University, Tel Aviv, 6997801, Israel
- The Center for Computational Molecular and Materials Science, Tel Aviv University, Tel Aviv, 6997801, Israel
- The Center for Physics and Chemistry of Living Systems, Tel Aviv University, Tel Aviv, 6997801, Israel
| | - Barak Hirshberg
- School of Chemistry, Tel Aviv University, Tel Aviv, 6997801, Israel.
- The Center for Computational Molecular and Materials Science, Tel Aviv University, Tel Aviv, 6997801, Israel.
- The Center for Physics and Chemistry of Living Systems, Tel Aviv University, Tel Aviv, 6997801, Israel.
| |
Collapse
|
17
|
Kim H, Kim E, Pak Y. Computational Probing of the Folding Mechanism of Human Telomeric G-Quadruplex DNA. J Chem Inf Model 2023; 63:6366-6375. [PMID: 37782649 DOI: 10.1021/acs.jcim.3c01257] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/04/2023]
Abstract
The human telomeric (htel) sequences in the terminal regions of human telomeres form diverse G-quadruplex (GQ) structures. Despite much experimental efforts to elucidate the folding pathways of htel GQ, no comprehensive model of htel GQ folding has been presented. Here, we describe folding pathways of the htel GQ determined by state-of-the-art enhanced sampling molecular dynamics simulation at the all-atom level. Briefly, GQ folding is initiated by the formation of a single-hairpin and then followed by the formation of double-hairpins, which then branch via distinct folding pathways to produce different GQ topologies (antiparallel chair, antiparallel basket, hybrids 1 and 2, and parallel propeller). In addition to these double-hairpin states, three-triad and two-tetrad structures in antiparallel backbone alignment serve as key intermediates that connect the GQ folding and transition between two different GQs.
Collapse
Affiliation(s)
- Hyeonjun Kim
- Department of Chemistry and Institute of Functional Materials, Pusan National University, Busan 46241, South Korea
| | - Eunae Kim
- College of Pharmacy, Chosun University, Gwangju 61452, South Korea
| | - Youngshang Pak
- Department of Chemistry and Institute of Functional Materials, Pusan National University, Busan 46241, South Korea
| |
Collapse
|
18
|
Conflitti P, Raniolo S, Limongelli V. Perspectives on Ligand/Protein Binding Kinetics Simulations: Force Fields, Machine Learning, Sampling, and User-Friendliness. J Chem Theory Comput 2023; 19:6047-6061. [PMID: 37656199 PMCID: PMC10536999 DOI: 10.1021/acs.jctc.3c00641] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Indexed: 09/02/2023]
Abstract
Computational techniques applied to drug discovery have gained considerable popularity for their ability to filter potentially active drugs from inactive ones, reducing the time scale and costs of preclinical investigations. The main focus of these studies has historically been the search for compounds endowed with high affinity for a specific molecular target to ensure the formation of stable and long-lasting complexes. Recent evidence has also correlated the in vivo drug efficacy with its binding kinetics, thus opening new fascinating scenarios for ligand/protein binding kinetic simulations in drug discovery. The present article examines the state of the art in the field, providing a brief summary of the most popular and advanced ligand/protein binding kinetics techniques and evaluating their current limitations and the potential solutions to reach more accurate kinetic models. Particular emphasis is put on the need for a paradigm change in the present methodologies toward ligand and protein parametrization, the force field problem, characterization of the transition states, the sampling issue, and algorithms' performance, user-friendliness, and data openness.
Collapse
Affiliation(s)
- Paolo Conflitti
- Faculty
of Biomedical Sciences, Euler Institute, Universitá della Svizzera italiana (USI), 6900 Lugano, Switzerland
| | - Stefano Raniolo
- Faculty
of Biomedical Sciences, Euler Institute, Universitá della Svizzera italiana (USI), 6900 Lugano, Switzerland
| | - Vittorio Limongelli
- Faculty
of Biomedical Sciences, Euler Institute, Universitá della Svizzera italiana (USI), 6900 Lugano, Switzerland
- Department
of Pharmacy, University of Naples “Federico
II”, 80131 Naples, Italy
| |
Collapse
|
19
|
Ray D, Parrinello M. Kinetics from Metadynamics: Principles, Applications, and Outlook. J Chem Theory Comput 2023; 19:5649-5670. [PMID: 37585703 DOI: 10.1021/acs.jctc.3c00660] [Citation(s) in RCA: 21] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/18/2023]
Abstract
Metadynamics is a popular enhanced sampling algorithm for computing the free energy landscape of rare events by using molecular dynamics simulation. Ten years ago, Tiwary and Parrinello introduced the infrequent metadynamics approach for calculating the kinetics of transitions across free energy barriers. Since then, metadynamics-based methods for obtaining rate constants have attracted significant attention in computational molecular science. Such methods have been applied to study a wide range of problems, including protein-ligand binding, protein folding, conformational transitions, chemical reactions, catalysis, and nucleation. Here, we review the principles of elucidating kinetics from metadynamics-like approaches, subsequent methodological developments in this area, and successful applications on chemical, biological, and material systems. We also highlight the challenges of reconstructing accurate kinetics from enhanced sampling simulations and the scope of future developments.
Collapse
Affiliation(s)
- Dhiman Ray
- Atomistic Simulations, Italian Institute of Technology, Via Enrico Melen 83, 16152 Genova, Italy
| | - Michele Parrinello
- Atomistic Simulations, Italian Institute of Technology, Via Enrico Melen 83, 16152 Genova, Italy
| |
Collapse
|
20
|
Blazhynska M, Goulard Coderc de Lacam E, Chen H, Chipot C. Improving Speed and Affordability without Compromising Accuracy: Standard Binding Free-Energy Calculations Using an Enhanced Sampling Algorithm, Multiple-Time Stepping, and Hydrogen Mass Repartitioning. J Chem Theory Comput 2023. [PMID: 37196198 DOI: 10.1021/acs.jctc.3c00141] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/19/2023]
Abstract
Accurate evaluation of protein-ligand binding free energies in silico is of paramount importance for understanding the mechanisms of biological regulation and providing a theoretical basis for drug design and discovery. Based on a series of atomistic molecular dynamics simulations in an explicit solvent, using well-tempered metadynamics extended adaptive biasing force (WTM-eABF) as an enhanced sampling algorithm, the so-called "geometrical route" offers a rigorous theoretical framework for binding affinity calculations that match experimental values. However, although robust, this strategy remains expensive, requiring substantial computational time to achieve convergence of the simulations. Improving the efficiency of the geometrical route, while preserving its reliability through improved ergodic sampling, is, therefore, highly desirable. In this contribution, having identified the computational bottleneck of the geometrical route, to accelerate the calculations we combine (i) a longer time step for the integration of the equations of motion with hydrogen-mass repartitioning (HMR), and (ii) multiple time-stepping (MTS) for collective-variable and biasing-force evaluation. Altogether, we performed 50 independent WTM-eABF simulations in triplicate for the "physical" separation of the Abl kinase-SH3 domain:p41 complex, following different HMR and MTS schemes, while tuning, in distinct protocols, the parameters of the enhanced-sampling algorithm. To demonstrate the consistency and reliability of the results obtained with the best-performing setups, we carried out quintuple simulations. Furthermore, we demonstrated the transferability of our method to other complexes by triplicating a 200 ns separation simulation of nine chosen protocols for the MDM2-p53:NVP-CGM097 complex. [Holzer et al. J. Med. Chem. 2015, 58, 6348-6358.] Our results, based on an aggregate simulation time of 14.4 μs, allowed an optimal set of parameters to be identified, able to accelerate convergence by a factor of three without any noticeable loss of accuracy.
Collapse
Affiliation(s)
- Marharyta Blazhynska
- Laboratoire International Associé Centre National de la Recherche Scientifique et University of Illinois at Urbana-Champaign, Unité Mixte de Recherche n°7019, Université de Lorraine, B.P. 70239, 54506 Vandœuvre-lès-Nancy cedex, France
| | - Emma Goulard Coderc de Lacam
- Laboratoire International Associé Centre National de la Recherche Scientifique et University of Illinois at Urbana-Champaign, Unité Mixte de Recherche n°7019, Université de Lorraine, B.P. 70239, 54506 Vandœuvre-lès-Nancy cedex, France
| | - Haochuan Chen
- Laboratoire International Associé Centre National de la Recherche Scientifique et University of Illinois at Urbana-Champaign, Unité Mixte de Recherche n°7019, Université de Lorraine, B.P. 70239, 54506 Vandœuvre-lès-Nancy cedex, France
| | - Christophe Chipot
- Laboratoire International Associé Centre National de la Recherche Scientifique et University of Illinois at Urbana-Champaign, Unité Mixte de Recherche n°7019, Université de Lorraine, B.P. 70239, 54506 Vandœuvre-lès-Nancy cedex, France
- Theoretical and Computational Biophysics Group, Beckman Institute, and Department of Physics, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
- Department of Biochemistry and Molecular Biology, The University of Chicago, 929 E. 57th Street W225, Chicago, Illinois 60637, United States
| |
Collapse
|
21
|
Blumer O, Reuveni S, Hirshberg B. Stochastic Resetting for Enhanced Sampling. J Phys Chem Lett 2022; 13:11230-11236. [PMID: 36446130 PMCID: PMC9743203 DOI: 10.1021/acs.jpclett.2c03055] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2022] [Accepted: 11/23/2022] [Indexed: 06/16/2023]
Abstract
We present a method for enhanced sampling of molecular dynamics simulations using stochastic resetting. Various phenomena, ranging from crystal nucleation to protein folding, occur on time scales that are unreachable in standard simulations. They are often characterized by broad transition time distributions, in which extremely slow events have a non-negligible probability. Stochastic resetting, i.e., restarting simulations at random times, was recently shown to significantly expedite processes that follow such distributions. Here, we employ resetting for enhanced sampling of molecular simulations for the first time. We show that it accelerates long time scale processes by up to an order of magnitude in examples ranging from simple models to a molecular system. Most importantly, we recover the mean transition time without resetting, which is typically too long to be sampled directly, from accelerated simulations at a single restart rate. Stochastic resetting can be used as a standalone method or combined with other sampling algorithms to further accelerate simulations.
Collapse
Affiliation(s)
- Ofir Blumer
- School
of Chemistry, Tel Aviv University, Tel Aviv6997801, Israel
| | - Shlomi Reuveni
- School
of Chemistry, Tel Aviv University, Tel Aviv6997801, Israel
- The
Center for Computational Molecular and Materials Science, Tel Aviv University, Tel Aviv6997801, Israel
- The
Center for Physics and Chemistry of Living Systems, Tel Aviv University, Tel Aviv6997801, Israel
| | - Barak Hirshberg
- School
of Chemistry, Tel Aviv University, Tel Aviv6997801, Israel
- The
Center for Computational Molecular and Materials Science, Tel Aviv University, Tel Aviv6997801, Israel
| |
Collapse
|
22
|
Kinetics of Drug Release from Clay Using Enhanced Sampling Methods. Pharmaceutics 2022; 14:pharmaceutics14122586. [PMID: 36559081 PMCID: PMC9781022 DOI: 10.3390/pharmaceutics14122586] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2022] [Revised: 11/21/2022] [Accepted: 11/22/2022] [Indexed: 11/27/2022] Open
Abstract
A key step in the development of a new drug, is the design of drug-excipient complexes that lead to optimal drug release kinetics. Computational chemistry and specifically enhanced sampling molecular dynamics methods can play a key role in this context, by minimizing the need for expensive experiments, and reducing cost and time. Here we show that recent advances in enhanced sampling methodologies can be brought to fruition in this area. We demonstrate the potential of these methodologies by simulating the drug release kinetics of the complex praziquantel-montmorillonite in water. Praziquantel finds promising applications in the treatment of schistosomiasis, but its biopharmaceutical profile needs to be improved, and a cheap material such as the montmorillonite clay would be a very convenient excipient. We simulate the drug release both from surface and interlayer space, and find that the diffusion of the praziquantel inside the interlayer space is the process that limits the rate of drug release.
Collapse
|