1
|
Clark F, Robb GR, Cole DJ, Michel J. Automated Adaptive Absolute Binding Free Energy Calculations. J Chem Theory Comput 2024. [PMID: 39254715 DOI: 10.1021/acs.jctc.4c00806] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/11/2024]
Abstract
Alchemical absolute binding free energy (ABFE) calculations have substantial potential in drug discovery, but are often prohibitively computationally expensive. To unlock their potential, efficient automated ABFE workflows are required to reduce both computational cost and human intervention. We present a fully automated ABFE workflow based on the automated selection of λ windows, the ensemble-based detection of equilibration, and the adaptive allocation of sampling time based on inter-replicate statistics. We find that the automated selection of intermediate states with consistent overlap is rapid, robust, and simple to implement. Robust detection of equilibration is achieved with a paired t-test between the free energy estimates at initial and final portions of a an ensemble of runs. We determine reasonable default parameters for all algorithms and show that the full workflow produces equivalent results to a nonadaptive scheme over a variety of test systems, while often accelerating equilibration. Our complete workflow is implemented in the open-source package A3FE (https://github.com/michellab/a3fe).
Collapse
Affiliation(s)
- Finlay Clark
- EaStCHEM School of Chemistry, University of Edinburgh, David Brewster Road, Edinburgh EH9 3FJ, United Kingdom
| | - Graeme R Robb
- Oncology R&D, AstraZeneca, Cambridge CB4 0WG, United Kingdom
| | - Daniel J Cole
- School of Natural and Environmental Sciences, Newcastle University, Newcastle upon Tyne NE1 7RU, United Kingdom
| | - Julien Michel
- EaStCHEM School of Chemistry, University of Edinburgh, David Brewster Road, Edinburgh EH9 3FJ, United Kingdom
| |
Collapse
|
2
|
Sampson JM, Cannon DA, Duan J, Epstein JCK, Sergeeva AP, Katsamba PS, Mannepalli SM, Bahna FA, Adihou H, Guéret SM, Gopalakrishnan R, Geschwindner S, Rees DG, Sigurdardottir A, Wilkinson T, Dodd RB, De Maria L, Mobarec JC, Shapiro L, Honig B, Buchanan A, Friesner RA, Wang L. Robust Prediction of Relative Binding Energies for Protein-Protein Complex Mutations Using Free Energy Perturbation Calculations. J Mol Biol 2024; 436:168640. [PMID: 38844044 PMCID: PMC11339910 DOI: 10.1016/j.jmb.2024.168640] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2024] [Accepted: 05/31/2024] [Indexed: 06/18/2024]
Abstract
Computational free energy-based methods have the potential to significantly improve throughput and decrease costs of protein design efforts. Such methods must reach a high level of reliability, accuracy, and automation to be effectively deployed in practical industrial settings in a way that impacts protein design projects. Here, we present a benchmark study for the calculation of relative changes in protein-protein binding affinity for single point mutations across a variety of systems from the literature, using free energy perturbation (FEP+) calculations. We describe a method for robust treatment of alternate protonation states for titratable amino acids, which yields improved correlation with and reduced error compared to experimental binding free energies. Following careful analysis of the largest outlier cases in our dataset, we assess limitations of the default FEP+ protocols and introduce an automated script which identifies probable outlier cases that may require additional scrutiny and calculates an empirical correction for a subset of charge-related outliers. Through a series of three additional case study systems, we discuss how Protein FEP+ can be applied to real-world protein design projects, and suggest areas of further study.
Collapse
Affiliation(s)
- Jared M Sampson
- Schrödinger, Inc., Life Sciences Software, New York, NY, USA
| | - Daniel A Cannon
- Schrödinger, GmbH, Life Sciences Software, Mannheim, Germany
| | - Jianxin Duan
- Schrödinger, GmbH, Life Sciences Software, Mannheim, Germany
| | | | - Alina P Sergeeva
- Columbia University, Department of Systems Biology, New York, NY, USA
| | | | - Seetha M Mannepalli
- Columbia University, Zuckerman Mind Brain Behavior Institute, New York, NY, USA
| | - Fabiana A Bahna
- Columbia University, Zuckerman Mind Brain Behavior Institute, New York, NY, USA
| | - Hélène Adihou
- AstraZeneca, Medicinal Chemistry, Research and Early Development, Cardiovascular, Renal and Metabolism, BioPharmaceuticals R&D, Gothenburg, Sweden; Max Planck Institute of Molecular Physiology, AstraZeneca-MPI Satellite Unit, Dortmund, Germany
| | - Stéphanie M Guéret
- AstraZeneca, Medicinal Chemistry, Research and Early Development, Cardiovascular, Renal and Metabolism, BioPharmaceuticals R&D, Gothenburg, Sweden; Max Planck Institute of Molecular Physiology, AstraZeneca-MPI Satellite Unit, Dortmund, Germany
| | - Ranganath Gopalakrishnan
- AstraZeneca, Medicinal Chemistry, Research and Early Development, Cardiovascular, Renal and Metabolism, BioPharmaceuticals R&D, Gothenburg, Sweden; Max Planck Institute of Molecular Physiology, AstraZeneca-MPI Satellite Unit, Dortmund, Germany
| | - Stefan Geschwindner
- AstraZeneca, Mechanistic and Structural Biology, Discovery Sciences, R&D, Gothenburg, Sweden
| | - D Gareth Rees
- AstraZeneca, Biologics Engineering, R&D, Cambridge, UK
| | | | | | - Roger B Dodd
- AstraZeneca, Biologics Engineering, R&D, Cambridge, UK
| | - Leonardo De Maria
- AstraZeneca, Medicinal Chemistry, Research and Early Development, Respiratory and Immunology, BioPharmaceuticals R&D, Gothenburg, Sweden
| | - Juan Carlos Mobarec
- AstraZeneca, Mechanistic and Structural Biology, Discovery Sciences, R&D, Cambridge, UK
| | - Lawrence Shapiro
- Columbia University, Zuckerman Mind Brain Behavior Institute, New York, NY, USA; Columbia University, Department of Biochemistry and Molecular Biophysics, New York, NY, USA
| | - Barry Honig
- Columbia University, Department of Systems Biology, New York, NY, USA; Columbia University, Zuckerman Mind Brain Behavior Institute, New York, NY, USA; Columbia University, Department of Biochemistry and Molecular Biophysics, New York, NY, USA; Columbia University, Department of Medicine, New York, NY, USA
| | | | | | - Lingle Wang
- Schrödinger, Inc., Life Sciences Software, New York, NY, USA.
| |
Collapse
|
3
|
Hahn DF, Gapsys V, de Groot BL, Mobley DL, Tresadern G. Current State of Open Source Force Fields in Protein-Ligand Binding Affinity Predictions. J Chem Inf Model 2024; 64:5063-5076. [PMID: 38895959 PMCID: PMC11234369 DOI: 10.1021/acs.jcim.4c00417] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2024] [Revised: 04/23/2024] [Accepted: 04/25/2024] [Indexed: 06/21/2024]
Abstract
In drug discovery, the in silico prediction of binding affinity is one of the major means to prioritize compounds for synthesis. Alchemical relative binding free energy (RBFE) calculations based on molecular dynamics (MD) simulations are nowadays a popular approach for the accurate affinity ranking of compounds. MD simulations rely on empirical force field parameters, which strongly influence the accuracy of the predicted affinities. Here, we evaluate the ability of six different small-molecule force fields to predict experimental protein-ligand binding affinities in RBFE calculations on a set of 598 ligands and 22 protein targets. The public force fields OpenFF Parsley and Sage, GAFF, and CGenFF show comparable accuracy, while OPLS3e is significantly more accurate. However, a consensus approach using Sage, GAFF, and CGenFF leads to accuracy comparable to OPLS3e. While Parsley and Sage are performing comparably based on aggregated statistics across the whole dataset, there are differences in terms of outliers. Analysis of the force field reveals that improved parameters lead to significant improvement in the accuracy of affinity predictions on subsets of the dataset involving those parameters. Lower accuracy can not only be attributed to the force field parameters but is also dependent on input preparation and sampling convergence of the calculations. Especially large perturbations and nonconverged simulations lead to less accurate predictions. The input structures, Gromacs force field files, as well as the analysis Python notebooks are available on GitHub.
Collapse
Affiliation(s)
- David F. Hahn
- Computational
Chemistry, Janssen Research & Development, Turnhoutseweg 30, Beerse 2340, Belgium
| | - Vytautas Gapsys
- Computational
Chemistry, Janssen Research & Development, Turnhoutseweg 30, Beerse 2340, Belgium
- Computational
Biomolecular Dynamics Group, Max Planck
Institute for Multidisciplinary Sciences, Am Fassberg 11, Göttingen 37077, Germany
| | - Bert L. de Groot
- Computational
Biomolecular Dynamics Group, Max Planck
Institute for Multidisciplinary Sciences, Am Fassberg 11, Göttingen 37077, Germany
| | - David L. Mobley
- Department
of Chemistry, University of California, Irvine, California 92697, United States
- Department
of Pharmaceutical Sciences, University of
California, Irvine, California 92697, United States
| | - Gary Tresadern
- Computational
Chemistry, Janssen Research & Development, Turnhoutseweg 30, Beerse 2340, Belgium
| |
Collapse
|
4
|
Mittal A, Martin MF, Levin EJ, Adams C, Yang M, Provins L, Hall A, Procter M, Ledecq M, Hillisch A, Wolff C, Gillard M, Horanyi PS, Coleman JA. Structures of synaptic vesicle protein 2A and 2B bound to anticonvulsants. Nat Struct Mol Biol 2024:10.1038/s41594-024-01335-1. [PMID: 38898101 DOI: 10.1038/s41594-024-01335-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Accepted: 05/14/2024] [Indexed: 06/21/2024]
Abstract
Epilepsy is a common neurological disorder characterized by abnormal activity of neuronal networks, leading to seizures. The racetam class of anti-seizure medications bind specifically to a membrane protein found in the synaptic vesicles of neurons called synaptic vesicle protein 2 (SV2) A (SV2A). SV2A belongs to an orphan subfamily of the solute carrier 22 organic ion transporter family that also includes SV2B and SV2C. The molecular basis for how anti-seizure medications act on SV2s remains unknown. Here we report cryo-electron microscopy structures of SV2A and SV2B captured in a luminal-occluded conformation complexed with anticonvulsant ligands. The conformation bound by anticonvulsants resembles an inhibited transporter with closed luminal and intracellular gates. Anticonvulsants bind to a highly conserved central site in SV2s. These structures provide blueprints for future drug design and will facilitate future investigations into the biological function of SV2s.
Collapse
Affiliation(s)
- Anshumali Mittal
- Department of Structural Biology, University of Pittsburgh, Pittsburgh, PA, USA
| | - Matthew F Martin
- Department of Structural Biology, University of Pittsburgh, Pittsburgh, PA, USA
| | | | | | | | | | | | | | | | | | | | | | | | - Jonathan A Coleman
- Department of Structural Biology, University of Pittsburgh, Pittsburgh, PA, USA.
| |
Collapse
|
5
|
Zeng J, Qian Y. Adaptive lambda schemes for efficient relative binding free energy calculation. J Comput Chem 2024; 45:855-862. [PMID: 38153254 DOI: 10.1002/jcc.27287] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2023] [Revised: 11/13/2023] [Accepted: 12/03/2023] [Indexed: 12/29/2023]
Abstract
The relative free energy perturbation (RFEP) calculation is one of the most theoretically sound computational chemistry approaches for the binding affinity prediction. However, its application is often hindered by the complexity of the calculation choices and the requirement of expertise in the field. Improper lambda scheme of RFEP may result in deviations from an accurate description of the perturbation process and is prone to erroneous affinity predictions. To address such challenges, an automated adaptive lambda method is proposed where the adaptive lambda schemes are obtained through a split-and-merge algorithm based on the pilot runs. The newly established workflow along with a series of improvements to the perturbation settings increases the consistency of the RFEP calculation results. Comparing the pilot and adaptive lambda schemes, the latter demonstrated improvements in convergence and reproducibility and lowered the mean unsigned error and the root-mean-square error. Overall, the adaptive lambda method is a reliable and robust choice to predict small molecule relative binding free energy and can be capitalized to benefit routine RFEP calculations for drug discovery projects.
Collapse
Affiliation(s)
- Jin Zeng
- AIxplorerBio Biotech Co., Ltd., Jiaxing, Zhejiang Province, China
| | - Yue Qian
- Viva Biotech (Shanghai) Ltd., Shanghai, China
| |
Collapse
|
6
|
Sampson JM, Cannon DA, Duan J, Epstein JCK, Sergeeva AP, Katsamba PS, Mannepalli SM, Bahna FA, Adihou H, Guéret SM, Gopalakrishnan R, Geschwindner S, Rees DG, Sigurdardottir A, Wilkinson T, Dodd RB, De Maria L, Mobarec JC, Shapiro L, Honig B, Buchanan A, Friesner RA, Wang L. Robust prediction of relative binding energies for protein-protein complex mutations using free energy perturbation calculations. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.22.590325. [PMID: 38712280 PMCID: PMC11071377 DOI: 10.1101/2024.04.22.590325] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2024]
Abstract
Computational free energy-based methods have the potential to significantly improve throughput and decrease costs of protein design efforts. Such methods must reach a high level of reliability, accuracy, and automation to be effectively deployed in practical industrial settings in a way that impacts protein design projects. Here, we present a benchmark study for the calculation of relative changes in protein-protein binding affinity for single point mutations across a variety of systems from the literature, using free energy perturbation (FEP+) calculations. We describe a method for robust treatment of alternate protonation states for titratable amino acids, which yields improved correlation with and reduced error compared to experimental binding free energies. Following careful analysis of the largest outlier cases in our dataset, we assess limitations of the default FEP+ protocols and introduce an automated script which identifies probable outlier cases that may require additional scrutiny and calculates an empirical correction for a subset of charge-related outliers. Through a series of three additional case study systems, we discuss how protein FEP+ can be applied to real-world protein design projects, and suggest areas of further study.
Collapse
Affiliation(s)
| | | | - Jianxin Duan
- Schrödinger, GmbH, Life Sciences Software, Mannheim, Germany
| | | | - Alina P. Sergeeva
- Columbia University, Department of Systems Biology, New York, NY, USA
| | | | - Seetha M. Mannepalli
- Columbia University, Zuckerman Mind Brain Behavior Institute, New York, NY, USA, 10027
| | - Fabiana A. Bahna
- Columbia University, Zuckerman Mind Brain Behavior Institute, New York, NY, USA, 10027
| | - Hélène Adihou
- AstraZeneca, Medicinal Chemistry, Research and Early Development, Cardiovascular, Renal and Metabolism, BioPharmaceuticals R&D, Gothenburg, Sweden
- Max Planck Institute of Molecular Physiology, AstraZeneca-MPI Satellite Unit, Dortmund, Germany
| | - Stéphanie M. Guéret
- AstraZeneca, Medicinal Chemistry, Research and Early Development, Cardiovascular, Renal and Metabolism, BioPharmaceuticals R&D, Gothenburg, Sweden
- Max Planck Institute of Molecular Physiology, AstraZeneca-MPI Satellite Unit, Dortmund, Germany
| | - Ranganath Gopalakrishnan
- AstraZeneca, Medicinal Chemistry, Research and Early Development, Cardiovascular, Renal and Metabolism, BioPharmaceuticals R&D, Gothenburg, Sweden
- Max Planck Institute of Molecular Physiology, AstraZeneca-MPI Satellite Unit, Dortmund, Germany
| | - Stefan Geschwindner
- AstraZeneca, Mechanistic and Structural Biology, Discovery Sciences, R&D, Cambridge, UK
| | | | | | | | - Roger B. Dodd
- AstraZeneca, Biologics Engineering, R&D, Cambridge, UK
| | - Leonardo De Maria
- AstraZeneca, Medicinal Chemistry, Research and Early Development, Respiratory and Immunology, BioPharmaceuticals R&D, Gothenburg, Sweden
| | - Juan Carlos Mobarec
- AstraZeneca, Mechanistic and Structural Biology, Discovery Sciences, R&D, Cambridge, UK
| | - Lawrence Shapiro
- Columbia University, Zuckerman Mind Brain Behavior Institute, New York, NY, USA, 10027
- Columbia University, Department of Biochemistry and Molecular Biophysics, New York, NY, USA
| | - Barry Honig
- Columbia University, Department of Systems Biology, New York, NY, USA
- Columbia University, Zuckerman Mind Brain Behavior Institute, New York, NY, USA, 10027
- Columbia University, Department of Biochemistry and Molecular Biophysics, New York, NY, USA
- Columbia University, Department of Medicine, New York, NY, USA
| | | | | | - Lingle Wang
- Schrödinger, Inc., Life Sciences Software, New York, NY, USA
| |
Collapse
|
7
|
Thakur A, Gizzio J, Levy RM. Potts Hamiltonian Models and Molecular Dynamics Free Energy Simulations for Predicting the Impact of Mutations on Protein Kinase Stability. J Phys Chem B 2024; 128:1656-1667. [PMID: 38350894 PMCID: PMC10939730 DOI: 10.1021/acs.jpcb.3c08097] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/15/2024]
Abstract
Single-point mutations in kinase proteins can affect their stability and fitness, and computational analysis of these effects can provide insights into the relationships among protein sequence, structure, and function for this enzyme family. To assess the impact of mutations on protein stability, we used a sequence-based Potts Hamiltonian model trained on a kinase family multiple-sequence alignment (MSA) to calculate the statistical energy (fitness) effects of mutations and compared these against relative folding free energies (ΔΔGs) calculated from all-atom molecular dynamics free energy perturbation (FEP) simulations in explicit solvent. The fitness effects of mutations in the Potts model (ΔEs) showed good agreement with experimental thermostability data (Pearson r = 0.68), similar to the correlation we observed with ΔΔGs predicted from structure-based relative FEP simulations. Recognizing the possible advantages of using Potts models to rapidly estimate protein stability effects of kinase mutations seen in cancer genomics data, we used the Potts statistical energy model to estimate the stability effects of 65 conservative and nonconservative mutations across three distinct kinases (Wee1, Abl1, and Cdc7) with somatic mutations reported in the Genomic Data Commons (GDC) database. The ΔEs of these mutations calculated from the Potts model are consistent with the corresponding ΔΔGs from FEP simulations (Pearson ratio of 0.72). The agreement between these methods suggests that the Potts model may be used as a sequence-based tool for high-throughput screening of mutational effects as part of a computational pipeline for predicting the stability effects of mutations. We also demonstrate how the scalability of the fitness-based Potts model calculations permits analyses that are not easily accessed using FEP simulations. To this end, we employed site-saturation mutagenesis in the Potts model in order to investigate the relative stability effects of mutations seen in different cancer evolutionary scenarios. We used this approach to analyze the effects of drug pressure in Abl kinase by contrasting the relative fitness penalties of somatic mutations seen in miscellaneous cancer types with those calculated for mutations associated with cancer drug resistance. We observed that, in contrast to somatic mutations of Abl seen in various tumors that appear to have evolved neutrally, cancer mutations that evolved under drug pressure in Abl-targeted therapies tend to preserve enzyme stability.
Collapse
Affiliation(s)
- Abhishek Thakur
- Center for Biophysics and Computational Biology, Temple University, Philadelphia, Pennsylvania 19122, United States
- Department of Chemistry, Temple University, Philadelphia, Pennsylvania 19122, United States
| | - Joan Gizzio
- Center for Biophysics and Computational Biology, Temple University, Philadelphia, Pennsylvania 19122, United States
- Department of Chemistry, Temple University, Philadelphia, Pennsylvania 19122, United States
| | - Ronald M Levy
- Center for Biophysics and Computational Biology, Temple University, Philadelphia, Pennsylvania 19122, United States
- Department of Chemistry, Temple University, Philadelphia, Pennsylvania 19122, United States
- Department of Physics, Temple University, Philadelphia, Pennsylvania 19122, United States
| |
Collapse
|
8
|
Khuttan S, Gallicchio E. What to Make of Zero: Resolving the Statistical Noise from Conformational Reorganization in Alchemical Binding Free Energy Estimates with Metadynamics Sampling. J Chem Theory Comput 2024; 20:1489-1501. [PMID: 38252868 PMCID: PMC10867849 DOI: 10.1021/acs.jctc.3c01250] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2023] [Revised: 01/03/2024] [Accepted: 01/03/2024] [Indexed: 01/24/2024]
Abstract
We introduce the self-relative binding free energy (self-RBFE) approach to evaluate the intrinsic statistical variance of dual-topology alchemical binding free energy estimators. The self-RBFE is the relative binding free energy between a ligand and a copy of the same ligand, and its true value is zero. Nevertheless, because the two copies of the ligand move independently, the self-RBFE value produced by a finite-length simulation fluctuates and can be used to measure the variance of the model. The results of this validation provide evidence that a significant fraction of the errors observed in benchmark studies reflect the statistical fluctuations of unconverged estimates rather than the models' accuracy. Furthermore, we find that ligand reorganization is a significant contributing factor to the statistical variance of binding free energy estimates and that metadynamics-accelerated conformational sampling of the torsional degrees of freedom of the ligand can drastically reduce the time to convergence.
Collapse
Affiliation(s)
- Sheenam Khuttan
- Department
of Chemistry and Biochemistry, Brooklyn
College of the City University of New York, New York, New York 11210, United States
- Ph.D.
Program in Biochemistry, The Graduate Center
of the City University of New York, New York, New York 10016, United States
| | - Emilio Gallicchio
- Department
of Chemistry and Biochemistry, Brooklyn
College of the City University of New York, New York, New York 11210, United States
- Ph.D.
Program in Biochemistry, The Graduate Center
of the City University of New York, New York, New York 10016, United States
- Ph.D.
Program in Chemistry, The Graduate Center
of the City University of New York, New York, New York 10016, United States
| |
Collapse
|
9
|
Xue B, Yang Q, Zhang Q, Wan X, Fang D, Lin X, Sun G, Gobbo G, Cao F, Mathiowetz AM, Burke BJ, Kumpf RA, Rai BK, Wood GPF, Pickard FC, Wang J, Zhang P, Ma J, Jiang YA, Wen S, Hou X, Zou J, Yang M. Development and Comprehensive Benchmark of a High-Quality AMBER-Consistent Small Molecule Force Field with Broad Chemical Space Coverage for Molecular Modeling and Free Energy Calculation. J Chem Theory Comput 2024; 20:799-818. [PMID: 38157475 DOI: 10.1021/acs.jctc.3c00920] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2024]
Abstract
Biomolecular simulations have become an essential tool in contemporary drug discovery, and molecular mechanics force fields (FFs) constitute its cornerstone. Developing a high quality and broad coverage general FF is a significant undertaking that requires substantial expert knowledge and computing resources, which is beyond the scope of general practitioners. Existing FFs originate from only a limited number of groups and organizations, and they either suffer from limited numbers of training sets, lower than desired quality because of oversimplified representations, or are costly for the molecular modeling community to access. To address these issues, in this work, we developed an AMBER-consistent small molecule FF with extensive chemical space coverage, and we provide Open Access parameters for the entire modeling community. To validate our FF, we carried out benchmarks of quantum mechanics (QM)/molecular mechanics conformer comparison and free energy perturbation calculations on several benchmark data sets. Our FF achieves a higher level of performance at reproducing QM energies and geometries than two popular open-source FFs, OpenFF2 and GAFF2. In relative binding free energy calculations for 31 protein-ligand data sets, comprising 1079 pairs of ligands, the new FF achieves an overall root-mean-square error of 1.19 kcal/mol for ΔΔG and 0.92 kcal/mol for ΔG on a subset of 463 ligands without bespoke fitting to the data sets. The results are on par with those of the leading commercial series of OPLS FFs.
Collapse
Affiliation(s)
- Bai Xue
- Shenzhen Jingtai Technology Co., Ltd. (XtalPi), Floor 3, Sf Industrial Plant, No. 2 Hongliu Road, Fubao Community, Fubao Street, Futian District, Shenzhen 518045, China
| | - Qingyi Yang
- Medicine Design, Pfizer Inc., 1 Portland Street, Cambridge, Massachusetts 02139, United States
| | - Qiaochu Zhang
- Shenzhen Jingtai Technology Co., Ltd. (XtalPi), Floor 3, Sf Industrial Plant, No. 2 Hongliu Road, Fubao Community, Fubao Street, Futian District, Shenzhen 518045, China
| | - Xiao Wan
- Shenzhen Jingtai Technology Co., Ltd. (XtalPi), Floor 3, Sf Industrial Plant, No. 2 Hongliu Road, Fubao Community, Fubao Street, Futian District, Shenzhen 518045, China
| | - Dong Fang
- Shenzhen Jingtai Technology Co., Ltd. (XtalPi), Floor 3, Sf Industrial Plant, No. 2 Hongliu Road, Fubao Community, Fubao Street, Futian District, Shenzhen 518045, China
| | - Xiaolu Lin
- Shenzhen Jingtai Technology Co., Ltd. (XtalPi), Floor 3, Sf Industrial Plant, No. 2 Hongliu Road, Fubao Community, Fubao Street, Futian District, Shenzhen 518045, China
| | - Guangxu Sun
- Shenzhen Jingtai Technology Co., Ltd. (XtalPi), Floor 3, Sf Industrial Plant, No. 2 Hongliu Road, Fubao Community, Fubao Street, Futian District, Shenzhen 518045, China
| | - Gianpaolo Gobbo
- XtalPi Inc., 245 Main Street, Cambridge, Massachusetts 02142, United States
| | - Fenglei Cao
- Shenzhen Jingtai Technology Co., Ltd. (XtalPi), Floor 3, Sf Industrial Plant, No. 2 Hongliu Road, Fubao Community, Fubao Street, Futian District, Shenzhen 518045, China
| | - Alan M Mathiowetz
- Medicine Design, Pfizer Inc., 1 Portland Street, Cambridge, Massachusetts 02139, United States
| | - Benjamin J Burke
- Medicine Design, Pfizer Inc., 10777 Science Center Drive, San Diego, California 92121, United States
| | - Robert A Kumpf
- Medicine Design, Pfizer Inc., 10777 Science Center Drive, San Diego, California 92121, United States
| | - Brajesh K Rai
- Machine Learning and Computational Sciences, Pfizer Inc., 610 Main Street, Cambridge, Massachusetts 02139, United States
| | - Geoffrey P F Wood
- Pharmaceutical Science Small Molecule, Pfizer Inc., Eastern Point Road, Groton, Connecticut 06340, United States
| | - Frank C Pickard
- Pharmaceutical Science Small Molecule, Pfizer Inc., Eastern Point Road, Groton, Connecticut 06340, United States
| | - Junmei Wang
- Department of Pharmaceutical Sciences and Computational Chemical Genomics Screening Center, University of Pittsburgh, Pittsburgh, Pennsylvania 15261, United States
| | - Peiyu Zhang
- Shenzhen Jingtai Technology Co., Ltd. (XtalPi), Floor 3, Sf Industrial Plant, No. 2 Hongliu Road, Fubao Community, Fubao Street, Futian District, Shenzhen 518045, China
| | - Jian Ma
- Shenzhen Jingtai Technology Co., Ltd. (XtalPi), Floor 3, Sf Industrial Plant, No. 2 Hongliu Road, Fubao Community, Fubao Street, Futian District, Shenzhen 518045, China
| | - Yide Alan Jiang
- XtalPi Inc., 245 Main Street, Cambridge, Massachusetts 02142, United States
| | - Shuhao Wen
- Shenzhen Jingtai Technology Co., Ltd. (XtalPi), Floor 3, Sf Industrial Plant, No. 2 Hongliu Road, Fubao Community, Fubao Street, Futian District, Shenzhen 518045, China
| | - Xinjun Hou
- Medicine Design, Pfizer Inc., 1 Portland Street, Cambridge, Massachusetts 02139, United States
| | - Junjie Zou
- Shenzhen Jingtai Technology Co., Ltd. (XtalPi), Floor 3, Sf Industrial Plant, No. 2 Hongliu Road, Fubao Community, Fubao Street, Futian District, Shenzhen 518045, China
| | - Mingjun Yang
- Shenzhen Jingtai Technology Co., Ltd. (XtalPi), Floor 3, Sf Industrial Plant, No. 2 Hongliu Road, Fubao Community, Fubao Street, Futian District, Shenzhen 518045, China
| |
Collapse
|
10
|
Mihalovits LM, Kollár L, Bajusz D, Knez D, Bozovičar K, Imre T, Ferenczy GG, Gobec S, Keserű GM. Molecular Mechanism of Labelling Functional Cysteines by Heterocyclic Thiones. Chemphyschem 2024; 25:e202300596. [PMID: 37888491 DOI: 10.1002/cphc.202300596] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2023] [Revised: 10/24/2023] [Accepted: 10/26/2023] [Indexed: 10/28/2023]
Abstract
Heterocyclic thiones have recently been identified as reversible covalent warheads, consistent with their mild electrophilic nature. Little is known so far about their mechanism of action in labelling nucleophilic sidechains, especially cysteines. The vast number of tractable cysteines promotes a wide range of target proteins to examine; however, our focus was put on functional cysteines. We chose the main protease of SARS-CoV-2 harboring Cys145 at the active site that is a structurally characterized and clinically validated target of covalent inhibitors. We screened an in-house, cysteine-targeting covalent inhibitor library which resulted in several covalent fragment hits with benzoxazole, benzothiazole and benzimidazole cores. Thione derivatives and Michael acceptors were selected for further investigations with the objective of exploring the mechanism of inhibition of the thiones and using the thoroughly characterized Michael acceptors for benchmarking our studies. Classical and hybrid quantum mechanical/molecular mechanical (QM/MM) molecular dynamics simulations were carried out that revealed a new mechanism of covalent cysteine labelling by thione derivatives, which was supported by QM and free energy calculations and by a wide range of experimental results. Our study shows that the molecular recognition step plays a crucial role in the overall binding of both sets of molecules.
Collapse
Affiliation(s)
- Levente M Mihalovits
- Medicinal Chemistry Research Group, HUN-REN Research Centre for Natural Sciences, Magyar tudósok krt. 2, 1117, Budapest, Hungary
| | - Levente Kollár
- Medicinal Chemistry Research Group, HUN-REN Research Centre for Natural Sciences, Magyar tudósok krt. 2, 1117, Budapest, Hungary
- Department of Organic Chemistry and Technology, Faculty of Chemical Technology and Biotechnology, Budapest University of Technology and Economics, Műegyetem rkp. 3., 1111, Budapest, Hungary
| | - Dávid Bajusz
- Medicinal Chemistry Research Group, HUN-REN Research Centre for Natural Sciences, Magyar tudósok krt. 2, 1117, Budapest, Hungary
| | - Damijan Knez
- Department of Medicinal Chemistry, Faculty of Pharmacy, University of Ljubljana, Aškerčeva cesta 7, 1000, Ljubljana, Slovenia
| | - Krištof Bozovičar
- Department of Pharmaceutical Biology, Faculty of Pharmacy, University of Ljubljana, Aškerčeva cesta 7, 1000, Ljubljana, Slovenia
| | - Tímea Imre
- Medicinal Chemistry Research Group, HUN-REN Research Centre for Natural Sciences, Magyar tudósok krt. 2, 1117, Budapest, Hungary
- MS Metabolomics Research Group, HUN-REN Research Centre for Natural Sciences, Magyar tudósok krt. 2, 1117, Budapest, Hungary
| | - György G Ferenczy
- Medicinal Chemistry Research Group, HUN-REN Research Centre for Natural Sciences, Magyar tudósok krt. 2, 1117, Budapest, Hungary
| | - Stanislav Gobec
- Department of Medicinal Chemistry, Faculty of Pharmacy, University of Ljubljana, Aškerčeva cesta 7, 1000, Ljubljana, Slovenia
| | - György M Keserű
- Medicinal Chemistry Research Group, HUN-REN Research Centre for Natural Sciences, Magyar tudósok krt. 2, 1117, Budapest, Hungary
- Department of Organic Chemistry and Technology, Faculty of Chemical Technology and Biotechnology, Budapest University of Technology and Economics, Műegyetem rkp. 3., 1111, Budapest, Hungary
| |
Collapse
|
11
|
Herz AM, Kellici T, Morao I, Michel J. Alchemical Free Energy Workflows for the Computation of Protein-Ligand Binding Affinities. Methods Mol Biol 2024; 2716:241-264. [PMID: 37702943 DOI: 10.1007/978-1-0716-3449-3_11] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/14/2023]
Abstract
Alchemical free energy methods can be used for the efficient computation of relative binding free energies during preclinical drug discovery stages. In recent years, this has been facilitated further by the implementation of workflows that enable non-experts to quickly and consistently set up the required simulations. Given the correct input structures, workflows handle the difficult aspects of setting up perturbations, including consistently defining the perturbable molecule, its atom mapping and topology generation, perturbation network generation, running of the simulations via different sampling methods, and analysis of the results. Different academic and commercial workflows are discussed, including FEW, FESetup, FEPrepare, CHARMM-GUI, Transformato, PMX, QLigFEP, TIES, ProFESSA, PyAutoFEP, BioSimSpace, FEP+, Flare, and Orion. These workflows differ in various aspects, such as mapping algorithms or enhanced sampling methods. Some workflows can accommodate more than one molecular dynamics (MD) engine and use external libraries for tasks. Differences between workflows can present advantages for different use cases, however a lack of interoperability of the workflows' components hinders systematic comparisons.
Collapse
Affiliation(s)
- Anna M Herz
- EaStChem School of Chemistry, Joseph Black Building, University of Edinburgh, Edinburgh, UK
| | - Tahsin Kellici
- Evotec (UK) Ltd., In Silico Research and Development, Abingdon, Oxfordshire, UK
- Merck & Co., Inc., Modelling and Informatics, West Point, PA, USA
| | - Inaki Morao
- Evotec (UK) Ltd., In Silico Research and Development, Abingdon, Oxfordshire, UK
| | - Julien Michel
- EaStChem School of Chemistry, Joseph Black Building, University of Edinburgh, Edinburgh, UK.
| |
Collapse
|
12
|
Robo MT, Hayes RL, Ding X, Pulawski B, Vilseck JZ. Fast free energy estimates from λ-dynamics with bias-updated Gibbs sampling. Nat Commun 2023; 14:8515. [PMID: 38129400 PMCID: PMC10740020 DOI: 10.1038/s41467-023-44208-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2022] [Accepted: 12/04/2023] [Indexed: 12/23/2023] Open
Abstract
Relative binding free energy calculations have become an integral computational tool for lead optimization in structure-based drug design. Classical alchemical methods, including free energy perturbation or thermodynamic integration, compute relative free energy differences by transforming one molecule into another. However, these methods have high operational costs due to the need to perform many pairwise perturbations independently. To reduce costs and accelerate molecular design workflows, we present a method called λ-dynamics with bias-updated Gibbs sampling. This method uses dynamic biases to continuously sample between multiple ligand analogues collectively within a single simulation. We show that many relative binding free energies can be determined quickly with this approach without compromising accuracy. For five benchmark systems, agreement to experiment is high, with root mean square errors near or below 1.0 kcal mol-1. Free energy results are consistent with other computational approaches and within statistical noise of both methods (0.4 kcal mol-1 or less). Notably, large efficiency gains over thermodynamic integration of 18-66-fold for small perturbations and 100-200-fold for whole aromatic ring substitutions are observed. The rapid determination of relative binding free energies will enable larger chemical spaces to be more readily explored and structure-based drug design to be accelerated.
Collapse
Affiliation(s)
- Michael T Robo
- Department of Biochemistry and Molecular Biology, Indiana University School of Medicine, Indianapolis, IN, 46202, USA
- Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, Indianapolis, IN, 46202, USA
- Indiana Biosciences Research Institute, 1210 Waterway Blvd Ste. 2000, Indianapolis, IN, 46202, USA
| | - Ryan L Hayes
- Chemical and Biomolecular Engineering, University of California, Irvine, California, 92617, USA
- Pharmaceutical Sciences, University of California, Irvine, CA, 92617, USA
| | - Xinqiang Ding
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
- Department of Chemistry, Tufts University, Medford, MA, 02144, USA
| | - Brian Pulawski
- Department of Biochemistry and Molecular Biology, Indiana University School of Medicine, Indianapolis, IN, 46202, USA
| | - Jonah Z Vilseck
- Department of Biochemistry and Molecular Biology, Indiana University School of Medicine, Indianapolis, IN, 46202, USA.
- Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, Indianapolis, IN, 46202, USA.
| |
Collapse
|
13
|
Procacci P. Dealing with Induced Fit, Conformational Selection, and Secondary Poses in Molecular Dynamics Simulations for Reliable Free Energy Predictions. J Chem Theory Comput 2023; 19:8942-8954. [PMID: 38037326 PMCID: PMC10720345 DOI: 10.1021/acs.jctc.3c00867] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2023] [Revised: 11/08/2023] [Accepted: 11/09/2023] [Indexed: 12/02/2023]
Abstract
In this study, we have tested the performance of standard molecular dynamics (MD) simulations, replicates of shorter standard MD simulations, and Hamiltonian Replica Exchange (HREM) simulations for the sampling of two macrocyclic hosts for guest delivery, characterized by induced fit (phenyl-based host) and conformation selection (naphthyl-based host) and of the ODR-BRD4(I) drug-receptor system where the ligand can assume two main poses. For the optimization of the HREM simulation, we have proposed and tested an on-the-fly iterative scheme for equalizing the acceptance ratio along the replica progression at a constant replica number resulting in a moderate impact of the sampling efficiency. Concerning standard MD, we have found that, while splitting the total allocated simulation time in short MD replicates can reproduce the sampling efficiency of HREM in the phenyl-based host and in the ODR-BRD4(I) complex, in the naphthyl-based macrocycle, characterized by long-lived metastable states, enhanced sampling techniques are the only viable alternative for a reliable canonical sampling of the rugged conformational landscape.
Collapse
Affiliation(s)
- Piero Procacci
- Dipartimento di Chimica “Ugo
Schiff”, Università degli
Studi di Firenze, Via
della Lastruccia 3, 50019 Sesto Fiorentino, Italy
| |
Collapse
|
14
|
Amblard F, Chen Z, Wiseman J, Zhou S, Liu P, Salman M, Verma K, Azadi N, Downs-Bowen J, Tao S, Kumari A, Zhang Q, Smith DB, Patel D, Bassit L, Schinazi RF. Synthesis and evaluation of highly potent HBV capsid assembly modulators (CAMs). Bioorg Chem 2023; 141:106923. [PMID: 37871391 PMCID: PMC10765384 DOI: 10.1016/j.bioorg.2023.106923] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2023] [Revised: 10/10/2023] [Accepted: 10/16/2023] [Indexed: 10/25/2023]
Abstract
Chronic hepatitis B virus (HBV) infection remains a major global health burden. It affects more than 290 million individuals worldwide and is responsible for approximately 900,000 deaths annually. Anti-HBV treatment with a nucleoside analog in combination with pegylated interferon are considered first-line therapy for patients with chronic HBV infection and liver inflammation. However, because cure rates are low, most patients will require lifetime treatment. HBV Capsid Assembly Modulators (CAMs) have emerged as a promising new class of compounds as they can affect levels of HBV covalently closed-circular DNA (cccDNA) associated with viral persistence. SAR studies around the core structure of lead HBV CAM GLP-26 (Fig. 1B) was performed and led to the discovery of non-toxic compound 10a displaying sub-nanomolar anti-HBV activity. Advanced toxicity and cellular pharmacology profiles of compounds 10a were also established and the results are discussed herein.
Collapse
Affiliation(s)
- Franck Amblard
- Center for ViroScience and Cure, Laboratory of Biochemical Pharmacology, Department of Pediatrics, Emory University School of Medicine, and Children's Healthcare of Atlanta, Atlanta, GA 30322, USA.
| | - Zhe Chen
- Center for ViroScience and Cure, Laboratory of Biochemical Pharmacology, Department of Pediatrics, Emory University School of Medicine, and Children's Healthcare of Atlanta, Atlanta, GA 30322, USA
| | - John Wiseman
- Center for ViroScience and Cure, Laboratory of Biochemical Pharmacology, Department of Pediatrics, Emory University School of Medicine, and Children's Healthcare of Atlanta, Atlanta, GA 30322, USA
| | - Shaoman Zhou
- Center for ViroScience and Cure, Laboratory of Biochemical Pharmacology, Department of Pediatrics, Emory University School of Medicine, and Children's Healthcare of Atlanta, Atlanta, GA 30322, USA
| | - Peng Liu
- Center for ViroScience and Cure, Laboratory of Biochemical Pharmacology, Department of Pediatrics, Emory University School of Medicine, and Children's Healthcare of Atlanta, Atlanta, GA 30322, USA
| | - Mohammad Salman
- Center for ViroScience and Cure, Laboratory of Biochemical Pharmacology, Department of Pediatrics, Emory University School of Medicine, and Children's Healthcare of Atlanta, Atlanta, GA 30322, USA
| | - Kiran Verma
- Center for ViroScience and Cure, Laboratory of Biochemical Pharmacology, Department of Pediatrics, Emory University School of Medicine, and Children's Healthcare of Atlanta, Atlanta, GA 30322, USA
| | - Niloufar Azadi
- Center for ViroScience and Cure, Laboratory of Biochemical Pharmacology, Department of Pediatrics, Emory University School of Medicine, and Children's Healthcare of Atlanta, Atlanta, GA 30322, USA
| | - Jessica Downs-Bowen
- Center for ViroScience and Cure, Laboratory of Biochemical Pharmacology, Department of Pediatrics, Emory University School of Medicine, and Children's Healthcare of Atlanta, Atlanta, GA 30322, USA
| | - Sijia Tao
- Center for ViroScience and Cure, Laboratory of Biochemical Pharmacology, Department of Pediatrics, Emory University School of Medicine, and Children's Healthcare of Atlanta, Atlanta, GA 30322, USA
| | - Amita Kumari
- Center for ViroScience and Cure, Laboratory of Biochemical Pharmacology, Department of Pediatrics, Emory University School of Medicine, and Children's Healthcare of Atlanta, Atlanta, GA 30322, USA
| | - Qingling Zhang
- Aligos Therapeutics, Inc., 1 Corporate Drive, South San Francisco, CA 94080, USA
| | - David B Smith
- Aligos Therapeutics, Inc., 1 Corporate Drive, South San Francisco, CA 94080, USA
| | - Dharmeshkumar Patel
- Center for ViroScience and Cure, Laboratory of Biochemical Pharmacology, Department of Pediatrics, Emory University School of Medicine, and Children's Healthcare of Atlanta, Atlanta, GA 30322, USA
| | - Leda Bassit
- Center for ViroScience and Cure, Laboratory of Biochemical Pharmacology, Department of Pediatrics, Emory University School of Medicine, and Children's Healthcare of Atlanta, Atlanta, GA 30322, USA
| | - Raymond F Schinazi
- Center for ViroScience and Cure, Laboratory of Biochemical Pharmacology, Department of Pediatrics, Emory University School of Medicine, and Children's Healthcare of Atlanta, Atlanta, GA 30322, USA.
| |
Collapse
|
15
|
Hasan MN, Ray M, Saha A. Landscape of In Silico Tools for Modeling Covalent Modification of Proteins: A Review on Computational Covalent Drug Discovery. J Phys Chem B 2023; 127:9663-9684. [PMID: 37921534 DOI: 10.1021/acs.jpcb.3c04710] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2023]
Abstract
Covalent drug discovery has been a challenging research area given the struggle of finding a sweet balance between selectivity and reactivity for these drugs, the lack of which often leads to off-target activities and hence undesirable side effects. However, there has been a resurgence in covalent drug design following the success of several covalent drugs such as boceprevir (2011), ibrutinib (2013), neratinib (2017), dacomitinib (2018), zanubrutinib (2019), and many others. Design of covalent drugs includes many crucial factors, where "evaluation of the binding affinity" and "a detailed mechanistic understanding on covalent inhibition" are at the top of the list. Well-defined experimental techniques are available to elucidate these factors; however, often they are expensive and/or time-consuming and hence not suitable for high throughput screens. Recent developments in in silico methods provide promise in this direction. In this report, we review a set of recent publications that focused on developing and/or implementing novel in silico techniques in "Computational Covalent Drug Discovery (CCDD)". We also discuss the advantages and disadvantages of these approaches along with what improvements are required to make it a great tool in medicinal chemistry in the near future.
Collapse
Affiliation(s)
- Md Nazmul Hasan
- Department of Chemistry and Biochemistry, University of Wisconsin─Milwaukee, Milwaukee, Wisconsin 53211, United States
| | - Manisha Ray
- Department of Chemistry and Biochemistry, Loyola University Chicago, Chicago, Illinois 60660, United States
| | - Arjun Saha
- Department of Chemistry and Biochemistry, University of Wisconsin─Milwaukee, Milwaukee, Wisconsin 53211, United States
| |
Collapse
|
16
|
Ross GA, Lu C, Scarabelli G, Albanese SK, Houang E, Abel R, Harder ED, Wang L. The maximal and current accuracy of rigorous protein-ligand binding free energy calculations. Commun Chem 2023; 6:222. [PMID: 37838760 PMCID: PMC10576784 DOI: 10.1038/s42004-023-01019-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2022] [Accepted: 10/02/2023] [Indexed: 10/16/2023] Open
Abstract
Computational techniques can speed up the identification of hits and accelerate the development of candidate molecules for drug discovery. Among techniques for predicting relative binding affinities, the most consistently accurate is free energy perturbation (FEP), a class of rigorous physics-based methods. However, uncertainty remains about how accurate FEP is and can ever be. Here, we present what we believe to be the largest publicly available dataset of proteins and congeneric series of small molecules, and assess the accuracy of the leading FEP workflow. To ascertain the limit of achievable accuracy, we also survey the reproducibility of experimental relative affinity measurements. We find a wide variability in experimental accuracy and a correspondence between binding and functional assays. When careful preparation of protein and ligand structures is undertaken, FEP can achieve accuracy comparable to experimental reproducibility. Throughout, we highlight reliable protocols that can help maximize the accuracy of FEP in prospective studies.
Collapse
Affiliation(s)
- Gregory A Ross
- Schrödinger Inc, New York, NY, USA.
- Isomorphic Labs, London, UK.
| | - Chao Lu
- Schrödinger Inc, New York, NY, USA
| | | | | | | | | | | | | |
Collapse
|
17
|
Salomon K, Abramyan AM, Plenge P, Wang L, Bundgaard C, Bang-Andersen B, Loland CJ, Shi L. Dynamic extracellular vestibule of human SERT: Unveiling druggable potential with high-affinity allosteric inhibitors. Proc Natl Acad Sci U S A 2023; 120:e2304089120. [PMID: 37792512 PMCID: PMC10576121 DOI: 10.1073/pnas.2304089120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Accepted: 08/15/2023] [Indexed: 10/06/2023] Open
Abstract
The serotonin transporter (SERT) tightly regulates synaptic serotonin levels and has been the primary target of antidepressants. Binding of inhibitors to the allosteric site of human SERT (hSERT) impedes the dissociation of antidepressants bound at the central site and may enhance the efficacy of such antidepressants to potentially reduce their dosage and side effects. Here, we report the identification of a series of high-affinity allosteric inhibitors of hSERT in a unique scaffold, with the lead compound, Lu AF88273 (3-(1-(2-(1H-indol-3-yl)ethyl)piperidin-4-yl)-6-chloro-1H-indole), having 2.1 nM allosteric potency in inhibiting imipramine dissociation. In addition, we find that Lu AF88273 also inhibits serotonin transport in a noncompetitive manner. The binding pose of Lu AF88273 in the allosteric site of hSERT is determined with extensive molecular dynamics simulations and rigorous absolute binding free energy perturbation (FEP) calculations, which show that a part of the compound occupies a dynamically formed small cavity. The predicted binding location and pose are validated by site-directed mutagenesis and can explain much of the structure-activity relationship of these inhibitors using the relative binding FEP calculations. Together, our findings provide a promising lead compound and the structural basis for the development of allosteric drugs targeting hSERT. Further, they demonstrate that the divergent allosteric sites of neurotransmitter transporters can be selectively targeted.
Collapse
Affiliation(s)
- Kristine Salomon
- Laboratory for Membrane Protein Dynamics, Department of Neuroscience, Faculty of Health and Medical Sciences, University of Copenhagen, DK-2200Copenhagen N, Denmark
| | - Ara M. Abramyan
- Computational Chemistry and Molecular Biophysics Section, Molecular Targets and Medications Discovery Branch, National Institute on Drug Abuse - Intramural Research Program, NIH, Baltimore, MD21224
- Schrödinger, Inc., San Diego, CA92121
| | - Per Plenge
- Laboratory for Membrane Protein Dynamics, Department of Neuroscience, Faculty of Health and Medical Sciences, University of Copenhagen, DK-2200Copenhagen N, Denmark
| | | | - Christoffer Bundgaard
- Medicinal Chemistry and Translational DMPK, H. Lundbeck A/S, DK-2500Copenhagen-Valby, Denmark
| | - Benny Bang-Andersen
- Medicinal Chemistry and Translational DMPK, H. Lundbeck A/S, DK-2500Copenhagen-Valby, Denmark
| | - Claus J. Loland
- Laboratory for Membrane Protein Dynamics, Department of Neuroscience, Faculty of Health and Medical Sciences, University of Copenhagen, DK-2200Copenhagen N, Denmark
| | - Lei Shi
- Computational Chemistry and Molecular Biophysics Section, Molecular Targets and Medications Discovery Branch, National Institute on Drug Abuse - Intramural Research Program, NIH, Baltimore, MD21224
| |
Collapse
|
18
|
Bowers SR, Lockhart C, Klimov DK. Replica Exchange with Hybrid Tempering Efficiently Samples PGLa Peptide Binding to Anionic Bilayer. J Chem Theory Comput 2023; 19:6532-6550. [PMID: 37676235 DOI: 10.1021/acs.jctc.3c00787] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/08/2023]
Abstract
We evaluated the utility of a variant of the replica exchange method, a replica exchange with hybrid tempering (REHT), for all-atom explicit water biomolecular simulations and compared it with a more traditional replica exchange with the solute tempering (REST) algorithm. As a test system, we selected a 21-mer antimicrobial peptide PGLa binding to an anionic DMPC/DMPG lipid bilayer. Application of REHT revealed the following binding mechanism. Due to the strong hydrophobic moment, the bound PGLa adopts an extensive helical structure. The binding free energy landscape identifies two major bound states, a metastable surface bound state and a dominant inserted state. In both states, positively charged PGLa amino acids maintain electrostatic interactions with anionic phosphate groups by rotating the PGLa helix around its axis. PGLa binding causes an influx of anionic DMPG and an efflux of zwitterionic DMPC lipids from the peptide proximity. PGLa thins the bilayer and disorders the adjacent fatty acid tails. Deep invasion of water wires into the bilayer hydrophobic core is detected in the inserted peptide state. The analysis of charge density distributions indicated that peptide positive charges are nearly compensated for by lipid negative charges and water dipole ordering, whereas ions play no role in peptide binding. Thus, electrostatic interactions are the key energetic factor in binding cationic PGLa to an anionic DMPC/DMPG bilayer. Comparison of REHT and REST shows that due to exclusion of lipids from tempered partition, REST lags behind REHT in peptide equilibration, particularly, with respect to peptide insertion and helix acquisition. As a result, REST struggles to provide accurate details of PGLa binding, although it still qualitatively maps the bimodal binding mechanism. Importantly, REHT not only equilibrates PGLa in the bilayer faster than REST, but also with less computational effort. We conclude that REHT is a preferable choice for studying interfacial biomolecular systems.
Collapse
Affiliation(s)
- Steven R Bowers
- School of Systems Biology, George Mason University, Manassas, Virginia 20110, United States
| | - Christopher Lockhart
- School of Systems Biology, George Mason University, Manassas, Virginia 20110, United States
| | - Dmitri K Klimov
- School of Systems Biology, George Mason University, Manassas, Virginia 20110, United States
| |
Collapse
|
19
|
Kenney IM, Beckstein O. Thermodynamically consistent determination of free energies and rates in kinetic cycle models. BIOPHYSICAL REPORTS 2023; 3:100120. [PMID: 37638349 PMCID: PMC10450860 DOI: 10.1016/j.bpr.2023.100120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/17/2023] [Accepted: 07/28/2023] [Indexed: 08/29/2023]
Abstract
Kinetic and thermodynamic models of biological systems are commonly used to connect microscopic features to system function in a bottom-up multiscale approach. The parameters of such models-free energy differences for equilibrium properties and in general rates for equilibrium and out-of-equilibrium observables-have to be measured by different experiments or calculated from multiple computer simulations. All such parameters necessarily come with uncertainties so that when they are naively combined in a full model of the process of interest, they will generally violate fundamental statistical mechanical equalities, namely detailed balance and an equality of forward/backward rate products in cycles due to Hill. If left uncorrected, such models can produce arbitrary outputs that are physically inconsistent. Here, we develop a maximum likelihood approach (named multibind) based on the so-called potential graph to combine kinetic or thermodynamic measurements to yield state-resolved models that are thermodynamically consistent while being most consistent with the provided data and their uncertainties. We demonstrate the approach with two theoretical models, a generic two-proton binding site and a simplified model of a sodium/proton antiporter. We also describe an algorithm to use the multibind approach to solve the inverse problem of determining microscopic quantities from macroscopic measurements and, as an example, we predict the microscopic p K a values and protonation states of a small organic molecule from 1D NMR data. The multibind approach is applicable to any thermodynamic or kinetic model that describes a system as transitions between well-defined states with associated free energy differences or rates between these states. A Python package multibind, which implements the approach described here, is made publicly available under the MIT Open Source license.
Collapse
Affiliation(s)
- Ian M. Kenney
- Arizona State University, Department of Physics, Tempe, Arizona
| | - Oliver Beckstein
- Arizona State University, Department of Physics, Tempe, Arizona
- Arizona State University, Center for Biological Physics, Tempe, Arizona
| |
Collapse
|
20
|
de Oliveira C, Leswing K, Feng S, Kanters R, Abel R, Bhat S. FEP Protocol Builder: Optimization of Free Energy Perturbation Protocols Using Active Learning. J Chem Inf Model 2023; 63:5592-5603. [PMID: 37594480 DOI: 10.1021/acs.jcim.3c00681] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/19/2023]
Abstract
Significant improvements have been made in the past decade to methods that rapidly and accurately predict binding affinity through free energy perturbation (FEP) calculations. This has been driven by recent advances in small-molecule force fields and sampling algorithms combined with the availability of low-cost parallel computing. Predictive accuracies of ∼1 kcal mol-1 have been regularly achieved, which are sufficient to drive potency optimization in modern drug discovery campaigns. Despite the robustness of these FEP approaches across multiple target classes, there are invariably target systems that do not display expected performance with default FEP settings. Traditionally, these systems required labor-intensive manual protocol development to arrive at parameter settings that produce a predictive FEP model. Due to the (a) relatively large parameter space to be explored, (b) significant compute requirements, and (c) limited understanding of how combinations of parameters can affect FEP performance, manual FEP protocol optimization can take weeks to months to complete, and often does not involve rigorous train-test set splits, resulting in potential overfitting. These manual FEP protocol development timelines do not coincide with tight drug discovery project timelines, essentially preventing the use of FEP calculations for these target systems. Here, we describe an automated workflow termed FEP Protocol Builder (FEP-PB) to rapidly generate accurate FEP protocols for systems that do not perform well with default settings. FEP-PB uses an active-learning workflow to iteratively search the protocol parameter space to develop accurate FEP protocols. To validate this approach, we applied it to pharmaceutically relevant systems where default FEP settings could not produce predictive models. We demonstrate that FEP-PB can rapidly generate accurate FEP protocols for the previously challenging MCL1 system with limited human intervention. We also apply FEP-PB in a real-world drug discovery setting to generate an accurate FEP protocol for the p97 system. FEP-PB is able to generate a more accurate protocol than the expert user, rapidly validating p97 as amenable to free energy calculations. Additionally, through the active-learning workflow, we are able to gain insight into which parameters are most important for a given system. These results suggest that FEP-PB is a robust tool that can aid in rapidly developing accurate FEP protocols and increasing the number of targets that are amenable to the technology.
Collapse
Affiliation(s)
- César de Oliveira
- Schrodinger, Inc., 9868 Scranton Road, Suite 3200, San Diego, California 92121, United States
| | - Karl Leswing
- Schrodinger, Inc., 1540 Broadway, 24th Floor, New York, New York 10036, United States
| | - Shulu Feng
- Schrodinger, Inc., 1540 Broadway, 24th Floor, New York, New York 10036, United States
| | - René Kanters
- Schrodinger, Inc., 1540 Broadway, 24th Floor, New York, New York 10036, United States
| | - Robert Abel
- Schrodinger, Inc., 1540 Broadway, 24th Floor, New York, New York 10036, United States
| | - Sathesh Bhat
- Schrodinger, Inc., 1540 Broadway, 24th Floor, New York, New York 10036, United States
| |
Collapse
|
21
|
Zhang I, Rufa DA, Pulido I, Henry MM, Rosen LE, Hauser K, Singh S, Chodera JD. Identifying and Overcoming the Sampling Challenges in Relative Binding Free Energy Calculations of a Model Protein:Protein Complex. J Chem Theory Comput 2023; 19:4863-4882. [PMID: 37450482 PMCID: PMC11219094 DOI: 10.1021/acs.jctc.3c00333] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/18/2023]
Abstract
Relative alchemical binding free energy calculations are routinely used in drug discovery projects to optimize the affinity of small molecules for their drug targets. Alchemical methods can also be used to estimate the impact of amino acid mutations on protein:protein binding affinities, but these calculations can involve sampling challenges due to the complex networks of protein and water interactions frequently present in protein:protein interfaces. We investigate these challenges by extending a graphics processing unit (GPU)-accelerated open-source relative free energy calculation package (Perses) to predict the impact of amino acid mutations on protein:protein binding. Using the well-characterized model system barnase:barstar, we describe analyses for identifying and characterizing sampling problems in protein:protein relative free energy calculations. We find that mutations with sampling problems often involve charge-changes, and inadequate sampling can be attributed to slow degrees of freedom that are mutation-specific. We also explore the accuracy and efficiency of current state-of-the-art approaches─alchemical replica exchange and alchemical replica exchange with solute tempering─for overcoming relevant sampling problems. By employing sufficiently long simulations, we achieve accurate predictions (RMSE 1.61, 95% CI: [1.12, 2.11] kcal/mol), with 86% of estimates within 1 kcal/mol of the experimentally determined relative binding free energies and 100% of predictions correctly classifying the sign of the changes in binding free energies. Ultimately, we provide a model workflow for applying protein mutation free energy calculations to protein:protein complexes, and importantly, catalog the sampling challenges associated with these types of alchemical transformations. Our free open-source package (Perses) is based on OpenMM and is available at https://github.com/choderalab/perses.
Collapse
Affiliation(s)
- Ivy Zhang
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY 10065
- Tri-Institutional PhD Program in Computational Biology and Medicine, Weill Cornell Medical College, Cornell University, New York, NY 10065
| | - Dominic A. Rufa
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY 10065
- Tri-Institutional PhD Program in Chemical Biology, Weill Cornell Medical College, Cornell University, New York, NY 10065
| | - Iván Pulido
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY 10065
| | - Michael M. Henry
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY 10065
| | | | | | - Sukrit Singh
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY 10065
| | - John D. Chodera
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY 10065
| |
Collapse
|
22
|
Kenney IM, Beckstein O. Thermodynamically consistent determination of free energies and rates in kinetic cycle models. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.08.536126. [PMID: 37066357 PMCID: PMC10104237 DOI: 10.1101/2023.04.08.536126] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/18/2023]
Abstract
Kinetic and thermodynamic models of biological systems are commonly used to connect microscopic features to system function in a bottom-up multiscale approach. The parameters of such models-free energy differences for equilibrium properties and in general rates for equilibrium and out-of-equilibrium observables-have to be measured by different experiments or calculated from multiple computer simulations. All such parameters necessarily come with uncertainties so that when they are naively combined in a full model of the process of interest, they will generally violate fundamental statistical mechanical equalities, namely detailed balance and an equality of forward/backward rate products in cycles due to T. Hill. If left uncorrected, such models can produce arbitrary outputs that are physically inconsistent. Here we develop a maximum likelihood approach (named multibind ) based on the so-called potential graph to combine kinetic or thermodynamic measurements to yield state resolved models that are thermodynamically consistent while being most consistent with the provided data and their uncertainties. We demonstrate the approach with two theoretical models, a generic two-proton binding site and a simplified model of a sodium/proton antiporter. We also describe an algorithm to use the multibind approach to solve the inverse problem of determining microscopic quantities from macroscopic measurements and as an example we predict the microscopic p K a s and protonation states of a small organic molecule from 1D NMR data. The multibind approach is applicable to any thermodynamic or kinetic model that describes a system as transitions between well-defined states with associated free energy differences or rates between these states. A Python package multibind , which implements the approach described here, is made publicly available under the MIT Open Source license. WHY IT MATTERS The increase in computational efficiency and rapid advances in methodology for quantitative free energy and rate calculations has allowed for the construction of increasingly complex thermodynamic or kinetic "bottom-up" models of chemical and biological processes. These multi-scale models serve as a framework for analyzing aspects of cellular function in terms of microscopic, molecular properties and provide an opportunity to connect molecular mechanisms to cellular function. The underlying model parameters-free energy differences or rates-are constrained by thermodynamic identities over cycles of states but these identities are not necessarily obeyed during model construction, thus potentially leading to inconsistent models. We address these inconsistencies through the use of a maximum likelihood approach for free energies and rates to adjust the model parameters in such a way that they are maximally consistent with the input parameters and exactly fulfill the thermodynamic cycle constraints. This approach enables formulation of thermodynamically consistent multi-scale models from simulated or experimental measurements.
Collapse
Affiliation(s)
- Ian M. Kenney
- Arizona State University, Department of Physics, Tempe AZ, USA
| | - Oliver Beckstein
- Arizona State University, Department of Physics, Tempe AZ, USA
- Arizona State University, Center for Biological Physics. Tempe AZ, USA
| |
Collapse
|
23
|
Chen W, Cui D, Jerome SV, Michino M, Lenselink EB, Huggins DJ, Beautrait A, Vendome J, Abel R, Friesner RA, Wang L. Enhancing Hit Discovery in Virtual Screening through Absolute Protein-Ligand Binding Free-Energy Calculations. J Chem Inf Model 2023; 63:3171-3185. [PMID: 37167486 DOI: 10.1021/acs.jcim.3c00013] [Citation(s) in RCA: 35] [Impact Index Per Article: 35.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]
Abstract
In the hit identification stage of drug discovery, a diverse chemical space needs to be explored to identify initial hits. Contrary to empirical scoring functions, absolute protein-ligand binding free-energy perturbation (ABFEP) provides a theoretically more rigorous and accurate description of protein-ligand binding thermodynamics and could, in principle, greatly improve the hit rates in virtual screening. In this work, we describe an implementation of an accurate and reliable ABFEP method in FEP+. We validated the ABFEP method on eight congeneric compound series binding to eight protein receptors including both neutral and charged ligands. For ligands with net charges, the alchemical ion approach is adopted to avoid artifacts in electrostatic potential energy calculations. The calculated binding free energies correlate with experimental results with a weighted average of R2 = 0.55 for the entire dataset. We also observe an overall root-mean-square error (RMSE) of 1.1 kcal/mol after shifting the zero-point of the simulation data to match the average experimental values. Through ABFEP calculations using apo versus holo protein structures, we demonstrated that the protein conformational and protonation state changes between the apo and holo proteins are the main physical factors contributing to the protein reorganization free energy manifested by the overestimation of raw ABFEP calculated binding free energies using the holo structures of the proteins. Furthermore, we performed ABFEP calculations in three virtual screening applications for hit enrichment. ABFEP greatly improves the hit rates as compared to docking scores or other methods like metadynamics. The good performance of ABFEP in rank ordering compounds demonstrated in this work confirms it as a useful tool to improve the hit rates in virtual screening, thus facilitating hit discovery.
Collapse
Affiliation(s)
- Wei Chen
- Schrödinger, Inc., 1540 Broadway, 24th Floor, New York, New York 10036, United States
| | - Di Cui
- Schrödinger, Inc., 1540 Broadway, 24th Floor, New York, New York 10036, United States
| | - Steven V Jerome
- Schrödinger, Inc., 10201 Wateridge Circle, Suite 220, San Diego, California 92121, United States
| | - Mayako Michino
- Tri-Institutional Therapeutics Discovery Institute, 413 E. 69th Street, New York, New York 10065, United States
| | | | - David J Huggins
- Tri-Institutional Therapeutics Discovery Institute, 413 E. 69th Street, New York, New York 10065, United States
- Department of Physiology and Biophysics, Weill Cornell Medical College of Cornell University, New York, New York 10065, United States
| | - Alexandre Beautrait
- Schrödinger, Inc., 1540 Broadway, 24th Floor, New York, New York 10036, United States
| | - Jeremie Vendome
- Schrödinger, Inc., 1540 Broadway, 24th Floor, New York, New York 10036, United States
| | - Robert Abel
- Schrödinger, Inc., 1540 Broadway, 24th Floor, New York, New York 10036, United States
| | - Richard A Friesner
- Department of Chemistry, Columbia University, New York, New York 10027, United States
| | - Lingle Wang
- Schrödinger, Inc., 1540 Broadway, 24th Floor, New York, New York 10036, United States
| |
Collapse
|
24
|
Moore JH, Margreitter C, Janet JP, Engkvist O, de Groot BL, Gapsys V. Automated relative binding free energy calculations from SMILES to ΔΔG. Commun Chem 2023; 6:82. [PMID: 37106032 PMCID: PMC10140266 DOI: 10.1038/s42004-023-00859-9] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Accepted: 03/22/2023] [Indexed: 04/29/2023] Open
Abstract
In drug discovery, computational methods are a key part of making informed design decisions and prioritising experiments. In particular, optimizing compound affinity is a central concern during the early stages of development. In the last 10 years, alchemical free energy (FE) calculations have transformed our ability to incorporate accurate in silico potency predictions in design decisions, and represent the 'gold standard' for augmenting experiment-driven drug discovery. However, relative FE calculations are complex to set up, require significant expert intervention to prepare the calculation and analyse the results or are provided only as closed-source software, not allowing for fine-grained control over the underlying settings. In this work, we introduce an end-to-end relative FE workflow based on the non-equilibrium switching approach that facilitates calculation of binding free energies starting from SMILES strings. The workflow is implemented using fully modular steps, allowing various components to be exchanged depending on licence availability. We further investigate the dependence of the calculated free energy accuracy on the initial ligand pose generated by various docking algorithms. We show that both commercial and open-source docking engines can be used to generate poses that lead to good correlation of free energies with experimental reference data.
Collapse
Affiliation(s)
- J Harry Moore
- Molecular AI, Discovery Sciences, R&D, AstraZeneca, Gothenburg, Sweden
| | | | - Jon Paul Janet
- Molecular AI, Discovery Sciences, R&D, AstraZeneca, Gothenburg, Sweden
| | - Ola Engkvist
- Molecular AI, Discovery Sciences, R&D, AstraZeneca, Gothenburg, Sweden.
| | - Bert L de Groot
- Computational Biomolecular Dynamics Group, Department of Theoretical and Computational Biophysics, Max Planck Institute for Multidisciplinary Sciences, Am Fassberg 11, D-37077, Göttingen, Germany.
| | - Vytautas Gapsys
- Computational Biomolecular Dynamics Group, Department of Theoretical and Computational Biophysics, Max Planck Institute for Multidisciplinary Sciences, Am Fassberg 11, D-37077, Göttingen, Germany.
| |
Collapse
|
25
|
Pitman M, Hahn DF, Tresadern G, Mobley DL. To Design Scalable Free Energy Perturbation Networks, Optimal Is Not Enough. J Chem Inf Model 2023; 63:1776-1793. [PMID: 36878475 PMCID: PMC10547263 DOI: 10.1021/acs.jcim.2c01579] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/08/2023]
Abstract
Drug discovery is accelerated with computational methods such as alchemical simulations to estimate ligand affinities. In particular, relative binding free energy (RBFE) simulations are beneficial for lead optimization. To use RBFE simulations to compare prospective ligands in silico, researchers first plan the simulation experiment, using graphs where nodes represent ligands and graph edges represent alchemical transformations between ligands. Recent work demonstrated that optimizing the statistical architecture of these perturbation graphs improves the accuracy of the predicted changes in the free energy of ligand binding. Therefore, to improve the success rate of computational drug discovery, we present the open-source software package High Information Mapper (HiMap)─a new take on its predecessor, Lead Optimization Mapper (LOMAP). HiMap removes heuristics decisions from design selection and instead finds statistically optimal graphs over ligands clustered with machine learning. Beyond optimal design generation, we present theoretical insights for designing alchemical perturbation maps. Some of these results include that for n number of nodes, the precision of perturbation maps is stable at n·ln(n) edges. This result indicates that even an "optimal" graph can result in unexpectedly high errors if a plan includes too few alchemical transformations for the given number of ligands and edges. And, as a study compares more ligands, the performance of even optimal graphs will deteriorate with linear scaling of the edge count. In this sense, ensuring an A- or D-optimal topology is not enough to produce robust errors. We additionally find that optimal designs will converge more rapidly than radial and LOMAP designs. Moreover, we derive bounds for how clustering reduces cost for designs with a constant expected relative error per cluster, invariant of the size of the design. These results inform how to best design perturbation maps for computational drug discovery and have broader implications for experimental design.
Collapse
Affiliation(s)
- Mary Pitman
- Department of Pharmacy & Pharmaceutical Sciences, University of California, Irvine, CA 92697, USA
| | - David F. Hahn
- Computational Chemistry, Janssen Research & Development, Turnhoutseweg 30, Beerse B-2340, Belgium
| | - Gary Tresadern
- Computational Chemistry, Janssen Research & Development, Turnhoutseweg 30, Beerse B-2340, Belgium
| | - David L. Mobley
- Department of Pharmacy & Pharmaceutical Sciences, University of California, Irvine, CA 92697, USA
- Department of Chemistry, University of California, Irvine, CA 92697, USA
| |
Collapse
|
26
|
Zhang I, Rufa DA, Pulido I, Henry MM, Rosen LE, Hauser K, Singh S, Chodera JD. Identifying and overcoming the sampling challenges in relative binding free energy calculations of a model protein:protein complex. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.07.530278. [PMID: 36945557 PMCID: PMC10028896 DOI: 10.1101/2023.03.07.530278] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
Relative alchemical binding free energy calculations are routinely used in drug discovery projects to optimize the affinity of small molecules for their drug targets. Alchemical methods can also be used to estimate the impact of amino acid mutations on protein:protein binding affinities, but these calculations can involve sampling challenges due to the complex networks of protein and water interactions frequently present in protein:protein interfaces. We investigate these challenges by extending a GPU-accelerated opensource relative free energy calculation package (Perses) to predict the impact of amino acid mutations on protein:protein binding. Using the well-characterized model system barnase:barstar, we describe analyses for identifying and characterizing sampling problems in protein:protein relative free energy calculations. We find that mutations with sampling problems often involve charge-changes, and inadequate sampling can be attributed to slow degrees of freedom that are mutation-specific. We also explore the accuracy and efficiency of current state-of-the-art approaches-alchemical replica exchange and alchemical replica exchange with solute tempering-for overcoming relevant sampling problems. By employing sufficiently long simulations, we achieve accurate predictions (RMSE 1.61, 95% CI: [1.12, 2.11] kcal/mol), with 86% of estimates within 1 kcal/mol of the experimentally-determined relative binding free energies and 100% of predictions correctly classifying the sign of the changes in binding free energies. Ultimately, we provide a model workflow for applying protein mutation free energy calculations to protein:protein complexes, and importantly, catalog the sampling challenges associated with these types of alchemical transformations. Our free open-source package (Perses) is based on OpenMM and available at https://github.com/choderalab/perses .
Collapse
|
27
|
Xu H. The slow but steady rise of binding free energy calculations in drug discovery. J Comput Aided Mol Des 2023; 37:67-74. [PMID: 36469232 DOI: 10.1007/s10822-022-00494-x] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2022] [Accepted: 11/28/2022] [Indexed: 12/12/2022]
Abstract
Binding free energy calculations are increasingly used in drug discovery research to predict protein-ligand binding affinities and to prioritize candidate drug molecules accordingly. It has taken decades of collective effort to transform this academic concept into a technology adopted by the pharmaceutical and biotech industry. Having personally witnessed and taken part in this transformation, here I recount the (incomplete) list of problems that had to be solved to make this computational tool practical and suggest areas of future development.
Collapse
Affiliation(s)
- Huafeng Xu
- Roivant Discovery, 151 West 42nd Street, New York, NY, 10036, USA.
| |
Collapse
|
28
|
Lihan M, Lupyan D, Oehme D. Target-template relationships in protein structure prediction and their effect on the accuracy of thermostability calculations. Protein Sci 2023; 32:e4557. [PMID: 36573828 PMCID: PMC9878467 DOI: 10.1002/pro.4557] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2022] [Revised: 12/22/2022] [Accepted: 12/23/2022] [Indexed: 12/28/2022]
Abstract
Improving protein thermostability has been a labor- and time-consuming process in industrial applications of protein engineering. Advances in computational approaches have facilitated the development of more efficient strategies to allow the prioritization of stabilizing mutants. Among these is FEP+, a free energy perturbation implementation that uses a thoroughly tested physics-based method to achieve unparalleled accuracy in predicting changes in protein thermostability. To gauge the applicability of FEP+ to situations where crystal structures are unavailable, here we have applied the FEP+ approach to homology models of 12 different proteins covering 316 mutations. By comparing predictions obtained with homology models to those obtained using crystal structures, we have identified that local rather than global sequence conservation between target and template sequence is a determining factor in the accuracy of predictions. By excluding mutation sites with low local sequence identity (<40%) to a template structure, we have obtained predictions with comparable performance to crystal structures (R2 of 0.67 and 0.63 and an RMSE of 1.20 and 1.16 kcal/mol for crystal structure and homology model predictions, respectively) for identifying stabilizing mutations when incorporating residue scanning into a cascade screening strategy. Additionally, we identify and discuss inherent limitations in sequence alignments and homology modeling protocols that translate into the poor FEP+ performance of a few select examples. Overall, our retrospective study provides detailed guidelines for the application of the FEP+ approach using homology models for protein thermostability predictions, which will greatly extend this approach to studies that were previously limited by structure availability.
Collapse
Affiliation(s)
- Muyun Lihan
- NIH Center for Macromolecular Modeling and Bioinformatics, Beckman Institute for Advanced Science and Technology, and Center for Biophysics and Quantitative BiologyUniversity of Illinois Urbana‐ChampaignUrbanaIllinoisUSA
- Schrödinger Inc.CambridgeMassachusettsUSA
| | | | | |
Collapse
|
29
|
Li Y, Liu R, Liu J, Luo H, Wu C, Li Z. An Open Source Graph-Based Weighted Cycle Closure Method for Relative Binding Free Energy Calculations. J Chem Inf Model 2023; 63:561-570. [PMID: 36583975 DOI: 10.1021/acs.jcim.2c01076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Free energy perturbation-relative binding free energy (FEP-RBFE) prediction has shown its reliability and accuracy in the prediction of protein-ligand binding affinities, which plays a fundamental role in structure-based drug design. In FEP-RBFE predictions, the calculation of each mutation path is associated with a statistical error, and cycle closure (cc) has proven to be an effective method in improving the calculation accuracy by correcting the hysteresis (summation of errors) of each closed cycle to the theoretical value 0. However, a primary hypothesis was made in the current cycle closure method that the hysteresis is evenly distributed to all paths, which is unlikely to be true in practice and may limit the further improvement of the calculation accuracy when better error estimation methods are available. Moreover, being a closed source software makes the current cycle closure method unachievable in many studies. In this paper, a newly implemented open source graph-based weighted cycle closure (wcc) algorithm was developed and introduced, not only including functions from the original cc method but also containing a new wcc method which can consider different error contributions from different paths and further improve the calculation accuracy. The wcc program also provides a new path-independent molecular error calculation method, which can be quite useful in many studies (like structure-activity relationship (SAR)) compared with the path-dependent method of the original cc program.
Collapse
Affiliation(s)
- Yishui Li
- Science and Technology on Parallel and Distributed Processing Laboratory, National University of Defense Technology, Changsha410073, Hunan, P.R. China.,Laboratory of Software Engineering for Complex System, National University of Defense Technology, Changsha410073, Hunan, P.R. China
| | - Runduo Liu
- School of Pharmaceutical Sciences, Sun Yat-Sen University, Guangzhou510275, Guangdong, P.R. China
| | - Jie Liu
- Science and Technology on Parallel and Distributed Processing Laboratory, National University of Defense Technology, Changsha410073, Hunan, P.R. China.,Laboratory of Software Engineering for Complex System, National University of Defense Technology, Changsha410073, Hunan, P.R. China
| | - Haibin Luo
- Key Laboratory of Tropical Biological Resources of Ministry of Education, School of Pharmaceutical Sciences, Hainan University, Haikou570228, Hainan, P.R. China
| | - Chengkun Wu
- State Key Laboratory of High-Performance Computing, National University of Defense Technology, Changsha410073, Hunan, P.R. China
| | - Zhe Li
- School of Pharmaceutical Sciences, Sun Yat-Sen University, Guangzhou510275, Guangdong, P.R. China
| |
Collapse
|
30
|
Lee TS, Tsai HC, Ganguly A, York DM. ACES: Optimized Alchemically Enhanced Sampling. J Chem Theory Comput 2023; 19:10.1021/acs.jctc.2c00697. [PMID: 36630672 PMCID: PMC10333454 DOI: 10.1021/acs.jctc.2c00697] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
We present an alchemical enhanced sampling (ACES) method implemented in the GPU-accelerated AMBER free energy MD engine. The methods hinges on the creation of an "enhanced sampling state" by reducing or eliminating selected potential energy terms and interactions that lead to kinetic traps and conformational barriers while maintaining those terms that curtail the need to otherwise sample large volumes of phase space. For example, the enhanced sampling state might involve transforming regions of a ligand and/or protein side chain into a noninteracting "dummy state" with internal electrostatics and torsion angle terms turned off. The enhanced sampling state is connected to a real-state end point through a Hamiltonian replica exchange (HREMD) framework that is facilitated by newly developed alchemical transformation pathways and smoothstep softcore potentials. This creates a counterdiffusion of real and enhanced-sampling states along the HREMD network. The effect of a differential response of the environment to the real and enhanced-sampling states is minimized by leveraging the dual topology framework in AMBER to construct a counterbalancing HREMD network in the opposite alchemical direction with the same (or similar) real and enhanced sampling states at inverted end points. The method has been demonstrated in a series of test cases of increasing complexity where traditional MD, and in several cases alternative REST2-like enhanced sampling methods, are shown to fail. The hydration free energy for acetic acid was shown to be independent of the starting conformation, and the values for four additional edge case molecules from the FreeSolv database were shown to have a significantly closer agreement with experiment using ACES. The method was further able to handle different rotamer states in a Cdk2 ligand identified as fractionally occupied in crystal structures. Finally, ACES was applied to T4-lysozyme and demonstrated that the side chain distribution of V111χ1 could be reliably reproduced for the apo state, bound to p-xylene, and in p-xylene→ benzene transformations. In these cases, the ACES method is shown to be highly robust and superior to a REST2-like enhanced sampling implementation alone.
Collapse
Affiliation(s)
- Tai-Sung Lee
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854, USA
| | - Hsu-Chun Tsai
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854, USA
| | - Abir Ganguly
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854, USA
| | - Darrin M. York
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854, USA
| |
Collapse
|
31
|
Wang L, Lu D, Wang Y, Xu X, Zhong P, Yang Z. Binding selectivity-dependent molecular mechanism of inhibitors towards CDK2 and CDK6 investigated by multiple short molecular dynamics and free energy landscapes. J Enzyme Inhib Med Chem 2023; 38:84-99. [DOI: 10.1080/14756366.2022.2135511] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Affiliation(s)
- Lifei Wang
- School of Science, Shandong Jiaotong University, Jinan, PR China
| | - Dan Lu
- Department of Physics, Jiangxi Agricultural University, Nanchang, PR China
| | - Yan Wang
- School of Science, Shandong Jiaotong University, Jinan, PR China
| | - Xiaoyan Xu
- School of Science, Shandong Jiaotong University, Jinan, PR China
| | - Peihua Zhong
- College of Computer Information and Engineering, Jiangxi Agriculture University, Nanchang, PR China
| | - Zhiyong Yang
- Department of Physics, Jiangxi Agricultural University, Nanchang, PR China
| |
Collapse
|
32
|
Prieto-Díaz R, González-Gómez M, Fojo-Carballo H, Azuaje J, El Maatougui A, Majellaro M, Loza MI, Brea J, Fernández-Dueñas V, Paleo MR, Díaz-Holguín A, Garcia-Pinel B, Mallo-Abreu A, Estévez JC, Andújar-Arias A, García-Mera X, Gomez-Tourino I, Ciruela F, Salas CO, Gutiérrez-de-Terán H, Sotelo E. Exploring the Effect of Halogenation in a Series of Potent and Selective A 2B Adenosine Receptor Antagonists. J Med Chem 2022; 66:890-912. [PMID: 36517209 PMCID: PMC9841532 DOI: 10.1021/acs.jmedchem.2c01768] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
The modulation of the A2B adenosine receptor is a promising strategy in cancer (immuno) therapy, with A2BAR antagonists emerging as immune checkpoint inhibitors. Herein, we report a systematic assessment of the impact of (di- and mono-)halogenation at positions 7 and/or 8 on both A2BAR affinity and pharmacokinetic properties of a collection of A2BAR antagonists and its study with structure-based free energy perturbation simulations. Monohalogenation at position 8 produced potent A2BAR ligands irrespective of the nature of the halogen. In contrast, halogenation at position 7 and dihalogenation produced a halogen-size-dependent decay in affinity. Eight novel A2BAR ligands exhibited remarkable affinity (Ki < 10 nM), exquisite subtype selectivity, and enantioselective recognition, with some eutomers eliciting sub-nanomolar affinity. The pharmacokinetic profile of representative derivatives showed enhanced solubility and microsomal stability. Finally, two compounds showed the capacity of reversing the antiproliferative effect of adenosine in activated primary human peripheral blood mononuclear cells.
Collapse
Affiliation(s)
- Rubén Prieto-Díaz
- Center
for Research in Biological Chemistry and Molecular Materials (CIQUS), University of Santiago de Compostela, 15782Santiago de
Compostela, Spain,Department
of Organic Chemistry, Faculty of Pharmacy, University of Santiago de Compostela, 15782Santiago de Compostela, Spain,Department
of Cell and Molecular Biology, Uppsala University, Biomedical Center, 75124Uppsala, Sweden
| | - Manuel González-Gómez
- Center
for Research in Biological Chemistry and Molecular Materials (CIQUS), University of Santiago de Compostela, 15782Santiago de
Compostela, Spain,Department
of Organic Chemistry, Faculty of Pharmacy, University of Santiago de Compostela, 15782Santiago de Compostela, Spain
| | - Hugo Fojo-Carballo
- Center
for Research in Biological Chemistry and Molecular Materials (CIQUS), University of Santiago de Compostela, 15782Santiago de
Compostela, Spain,Department
of Organic Chemistry, Faculty of Pharmacy, University of Santiago de Compostela, 15782Santiago de Compostela, Spain
| | - Jhonny Azuaje
- Center
for Research in Biological Chemistry and Molecular Materials (CIQUS), University of Santiago de Compostela, 15782Santiago de
Compostela, Spain,Department
of Organic Chemistry, Faculty of Pharmacy, University of Santiago de Compostela, 15782Santiago de Compostela, Spain
| | - Abdelaziz El Maatougui
- Center
for Research in Biological Chemistry and Molecular Materials (CIQUS), University of Santiago de Compostela, 15782Santiago de
Compostela, Spain,Department
of Organic Chemistry, Faculty of Pharmacy, University of Santiago de Compostela, 15782Santiago de Compostela, Spain
| | - Maria Majellaro
- Center
for Research in Biological Chemistry and Molecular Materials (CIQUS), University of Santiago de Compostela, 15782Santiago de
Compostela, Spain,Department
of Organic Chemistry, Faculty of Pharmacy, University of Santiago de Compostela, 15782Santiago de Compostela, Spain
| | - María I. Loza
- Center
for Research in Molecular Medicine and Chronic Diseases (CiMUS), University of Santiago de Compostela, 15782Santiago de
Compostela, Spain,Department
of Pharmacology, Pharmacy and Pharmaceutical Technology, Faculty of
Pharmacy, University of Santiago de Compostela, 15782Santiago de
Compostela, Spain
| | - José Brea
- Center
for Research in Molecular Medicine and Chronic Diseases (CiMUS), University of Santiago de Compostela, 15782Santiago de
Compostela, Spain,Department
of Pharmacology, Pharmacy and Pharmaceutical Technology, Faculty of
Pharmacy, University of Santiago de Compostela, 15782Santiago de
Compostela, Spain,. Tel: +34 881815459. Fax: +34-8818115474
| | - Víctor Fernández-Dueñas
- Pharmacology
Unit, Department of Pathology and Experimental Therapeutics, Faculty
of Medicine and Health Sciences, Institute of Neuroscience, University of Barcelona, 08907L’Hospitalet de Llobregat, Spain,Neuropharmacology
and Pain Group, Neuroscience Program, Institut
d’Investigació Biomèdica de Bellvitge, IDIBELL, 08907L’Hospitalet
de Llobregat, Spain
| | - M. Rita Paleo
- Center
for Research in Biological Chemistry and Molecular Materials (CIQUS), University of Santiago de Compostela, 15782Santiago de
Compostela, Spain,Department
of Organic Chemistry, Faculty of Pharmacy, University of Santiago de Compostela, 15782Santiago de Compostela, Spain
| | - Alejandro Díaz-Holguín
- Department
of Cell and Molecular Biology, Uppsala University, Biomedical Center, 75124Uppsala, Sweden
| | - Beatriz Garcia-Pinel
- Center
for Research in Molecular Medicine and Chronic Diseases (CiMUS), University of Santiago de Compostela, 15782Santiago de
Compostela, Spain,Department
of Biochemistry and Molecular Biology, Faculty of Pharmacy, University of Santiago de Compostela, 15782Santiago de
Compostela, Spain
| | - Ana Mallo-Abreu
- Center
for Research in Biological Chemistry and Molecular Materials (CIQUS), University of Santiago de Compostela, 15782Santiago de
Compostela, Spain,Department
of Organic Chemistry, Faculty of Pharmacy, University of Santiago de Compostela, 15782Santiago de Compostela, Spain
| | - Juan C. Estévez
- Center
for Research in Biological Chemistry and Molecular Materials (CIQUS), University of Santiago de Compostela, 15782Santiago de
Compostela, Spain,Department
of Organic Chemistry, Faculty of Pharmacy, University of Santiago de Compostela, 15782Santiago de Compostela, Spain
| | - Antonio Andújar-Arias
- Center
for Research in Biological Chemistry and Molecular Materials (CIQUS), University of Santiago de Compostela, 15782Santiago de
Compostela, Spain,Department
of Organic Chemistry, Faculty of Pharmacy, University of Santiago de Compostela, 15782Santiago de Compostela, Spain
| | - Xerardo García-Mera
- Department
of Organic Chemistry, Faculty of Pharmacy, University of Santiago de Compostela, 15782Santiago de Compostela, Spain
| | - Iria Gomez-Tourino
- Center
for Research in Molecular Medicine and Chronic Diseases (CiMUS), University of Santiago de Compostela, 15782Santiago de
Compostela, Spain
| | - Francisco Ciruela
- Pharmacology
Unit, Department of Pathology and Experimental Therapeutics, Faculty
of Medicine and Health Sciences, Institute of Neuroscience, University of Barcelona, 08907L’Hospitalet de Llobregat, Spain,Neuropharmacology
and Pain Group, Neuroscience Program, Institut
d’Investigació Biomèdica de Bellvitge, IDIBELL, 08907L’Hospitalet
de Llobregat, Spain
| | - Cristian O. Salas
- Department
of Organic Chemistry, Faculty of Chemistry and Pharmacy, Pontificia Universidad Católica de Chile, Vicuña Mackenna 4860, Macul, Santiago7820436, Chile
| | - Hugo Gutiérrez-de-Terán
- Department
of Cell and Molecular Biology, Uppsala University, Biomedical Center, 75124Uppsala, Sweden,. Tel: +46 18
471 5056. Fax: +46 18 536971
| | - Eddy Sotelo
- Center
for Research in Biological Chemistry and Molecular Materials (CIQUS), University of Santiago de Compostela, 15782Santiago de
Compostela, Spain,Department
of Organic Chemistry, Faculty of Pharmacy, University of Santiago de Compostela, 15782Santiago de Compostela, Spain,. Tel: +34 881815732. Fax: +34-881815704
| |
Collapse
|
33
|
Gumede NJ. Pathfinder-Driven Chemical Space Exploration and Multiparameter Optimization in Tandem with Glide/IFD and QSAR-Based Active Learning Approach to Prioritize Design Ideas for FEP+ Calculations of SARS-CoV-2 PL pro Inhibitors. Molecules 2022; 27:8569. [PMID: 36500659 PMCID: PMC9741453 DOI: 10.3390/molecules27238569] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2022] [Revised: 11/25/2022] [Accepted: 11/30/2022] [Indexed: 12/12/2022] Open
Abstract
A global pandemic caused by the SARS-CoV-2 virus that started in 2020 and has wreaked havoc on humanity still ravages up until now. As a result, the negative impact of travel restrictions and lockdowns has underscored the importance of our preparedness for future pandemics. The main thrust of this work was based on addressing this need by traversing chemical space to design inhibitors that target the SARS-CoV-2 papain-like protease (PLpro). Pathfinder-based retrosynthesis analysis was used to generate analogs of GRL-0617 using commercially available building blocks by replacing the naphthalene moiety. A total of 10 models were built using active learning QSAR, which achieved good statistical results such as an R2 > 0.70, Q2 > 0.64, STD Dev < 0.30, and RMSE < 0.31, on average for all models. A total of 35 ideas were further prioritized for FEP+ calculations. The FEP+ results revealed that compound 45 was the most active compound in this series with a ΔG of −7.28 ± 0.96 kcal/mol. Compound 5 exhibited a ΔG of −6.78 ± 1.30 kcal/mol. The inactive compounds in this series were compound 91 and compound 23 with a ΔG of −5.74 ± 1.06 and −3.11 ± 1.45 kcal/mol. The combined strategy employed here is envisaged to be of great utility in multiparameter lead optimization efforts, to traverse chemical space, maintaining and/or improving the potency as well as the property space of synthetically aware design ideas.
Collapse
Affiliation(s)
- Njabulo Joyfull Gumede
- Department of Chemistry, Mangosuthu University of Technology, P.O. Box 12363, Jacobs 4026, South Africa
| |
Collapse
|
34
|
Abstract
Positional analogue scanning (PAS) is an accepted strategy for multiparameter lead optimization (MPO) in drug discovery. Small structural changes as introduced by PAS can lead to 10-fold changes in binding potency in ∼10-20% of cases, a significant parameter shift irrespective of other MPO objectives. Sometimes performing a complete PAS is challenging due to resource and time constraints, building block availability, or difficulty in synthesis. Calculating relative binding free energies (RBFEs) for all positions can contribute to prioritizing the most promising analogues for synthesis. We tested a well-established RBFE calculation method, Amber GPU-TI, for 20 positional analogue scans in 14 test systems (cyclin-dependent kinase 8 (CDK8), hepatitis C virus nonstructural protein 5B (HCV NS5B), tankyrase, RAC-α serine/threonine-protein kinase (Akt), phosphodiesterase 1B (PDE1B), orexin/hypocretin receptor type 1 (OX1R), orexin/hypocretin receptor type 2 (OX2R), histone acetyltransferase K (lysine) acetyltransferase 6A (KAT6A), peroxisome proliferator-activated receptor γ (PPARγ), extracellular signal-regulated kinases (ERK1/2), coactivator-associated arginine methyltransferase 1 (PRMT4), αvβ6, bromodomain 1 (BD1), human immunodeficiency virus-1 (HIV-1) entry) involving nitrogen, methyl, halogen, methoxy, and hydroxyl scans with at least four analogues per set. Among the 66 analogue positions explored, we found that in 18 cases Amber GPU-TI calculations predicted a more than 10-fold change in potency. In all of these cases, the experimentally observed direction of potency changes agreed with the predictions. In 16 cases, more than 10-fold changes in experimental potency were observed. Again, in all of these cases, Amber GPU-TI predicted the direction of the potency changes correctly. In none of these cases would a decision made for or against synthesis based on a 10-fold change in potency have resulted in missing an important analogue. Therefore, in silico RBFE calculations using Amber GPU-TI can meaningfully contribute to the prioritization of positional analogues before synthesis.
Collapse
Affiliation(s)
- Yuan Hu
- Alkermes, Inc., 852 Winter Street, Waltham, Massachusetts 02451-1420, United States
| | - Ingo Muegge
- Alkermes, Inc., 852 Winter Street, Waltham, Massachusetts 02451-1420, United States
| |
Collapse
|
35
|
High-level expression and improved pepsin activity by enhancing the conserved domain stability based on a scissor-like model. Lebensm Wiss Technol 2022. [DOI: 10.1016/j.lwt.2022.113877] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|
36
|
Zhu F, Bourguet FA, Bennett WFD, Lau EY, Arrildt KT, Segelke BW, Zemla AT, Desautels TA, Faissol DM. Large-scale application of free energy perturbation calculations for antibody design. Sci Rep 2022; 12:12489. [PMID: 35864134 PMCID: PMC9302960 DOI: 10.1038/s41598-022-14443-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2021] [Accepted: 06/07/2022] [Indexed: 01/02/2023] Open
Abstract
Alchemical free energy perturbation (FEP) is a rigorous and powerful technique to calculate the free energy difference between distinct chemical systems. Here we report our implementation of automated large-scale FEP calculations, using the Amber software package, to facilitate antibody design and evaluation. In combination with Hamiltonian replica exchange, our FEP simulations aim to predict the effect of mutations on both the binding affinity and the structural stability. Importantly, we incorporate multiple strategies to faithfully estimate the statistical uncertainties in the FEP results. As a case study, we apply our protocols to systematically evaluate variants of the m396 antibody for their conformational stability and their binding affinity to the spike proteins of SARS-CoV-1 and SARS-CoV-2. By properly adjusting relevant parameters, the particle collapse problems in the FEP simulations are avoided. Furthermore, large statistical errors in a small fraction of the FEP calculations are effectively reduced by extending the sampling, such that acceptable statistical uncertainties are achieved for the vast majority of the cases with a modest total computational cost. Finally, our predicted conformational stability for the m396 variants is qualitatively consistent with the experimentally measured melting temperatures. Our work thus demonstrates the applicability of FEP in computational antibody design.
Collapse
Affiliation(s)
- Fangqiang Zhu
- Biosciences and Biotechnology Division, Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, USA.
| | - Feliza A Bourguet
- Biosciences and Biotechnology Division, Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, USA
| | - William F D Bennett
- Biosciences and Biotechnology Division, Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, USA
| | - Edmond Y Lau
- Biosciences and Biotechnology Division, Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, USA
| | - Kathryn T Arrildt
- Biosciences and Biotechnology Division, Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, USA
| | - Brent W Segelke
- Biosciences and Biotechnology Division, Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, USA
| | - Adam T Zemla
- Global Security Computing Division, Computing Directorate, Lawrence Livermore National Laboratory, Livermore, USA
| | - Thomas A Desautels
- Computational Engineering Division, Engineering Directorate, Lawrence Livermore National Laboratory, Livermore, USA
| | - Daniel M Faissol
- Computational Engineering Division, Engineering Directorate, Lawrence Livermore National Laboratory, Livermore, USA.
| |
Collapse
|
37
|
Procacci P. Does Hamiltonian Replica Exchange via Lambda-Hopping Enhance the Sampling in Alchemical Free Energy Calculations? Molecules 2022; 27:4426. [PMID: 35889299 PMCID: PMC9316500 DOI: 10.3390/molecules27144426] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2022] [Revised: 07/06/2022] [Accepted: 07/07/2022] [Indexed: 02/06/2023] Open
Abstract
In the context of computational drug design, we examine the effectiveness of the enhanced sampling techniques in state-of-the-art free energy calculations based on alchemical molecular dynamics simulations. In a paradigmatic molecule with competition between conformationally restrained E and Z isomers whose probability ratio is strongly affected by the coupling with the environment, we compare the so-called λ-hopping technique to the Hamiltonian replica exchange methods assessing their convergence behavior as a function of the enhanced sampling protocols (number of replicas, scaling factors, simulation times). We found that the pure λ-hopping, commonly used in solvation and binding free energy calculations via alchemical free energy perturbation techniques, is ineffective in enhancing the sampling of the isomeric states, exhibiting a pathological dependence on the initial conditions. Correct sampling can be restored in λ-hopping simulation by the addition of a "hot-zone" scaling factor to the λ-stratification (FEP+ approach), provided that the additive hot-zone scaling factors are tuned and optimized using preliminary ordinary replica-exchange simulation of the end-states.
Collapse
Affiliation(s)
- Piero Procacci
- Chemistry Department, University of Florence, Via Lastruccia n.3, I-50019 Sesto Firentino, Italy
| |
Collapse
|
38
|
Procacci P. Relative Binding Free Energy between Chemically Distant Compounds Using a Bidirectional Nonequilibrium Approach. J Chem Theory Comput 2022; 18:4014-4026. [PMID: 35642423 PMCID: PMC9202353 DOI: 10.1021/acs.jctc.2c00295] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2022] [Indexed: 12/02/2022]
Abstract
In the context of advanced hit-to-lead drug design based on atomistic molecular dynamics simulations, we propose a dual topology alchemical approach for calculating the relative binding free energy (RBFE) between two chemically distant compounds. The method (termed NE-RBFE) relies on the enhanced sampling of the end-states in bulk and in the bound state via Hamiltonian Replica Exchange, alchemically connected by a series of independent and fast nonequilibrium (NE) simulations. The technique has been implemented in a bidirectional fashion, applying the Crooks theorem to the NE work distributions for RBFE predictions. The dissipation of the NE process, negatively affecting accuracy, has been minimized by introducing a smooth regularization based on shifted electrostatic and Lennard-Jones non bonded potentials. As a challenging testbed, we have applied our method to the calculation of the RBFEs in the recent host-guest SAMPL international contest, featuring a macrocyclic host with guests varying in the net charge, volume, and chemical fingerprints. Closure validation has been successfully verified in cycles involving compounds with disparate Tanimoto coefficients, volume, and net charge. NE-RBFE is specifically tailored for massively parallel facilities and can be used with little or no code modification on most of the popular software packages supporting nonequilibrium alchemical simulations, such as Gromacs, Amber, NAMD, or OpenMM. The proposed methodology bypasses most of the entanglements and limitations of the standard single topology RBFE approach for strictly congeneric series based on free-energy perturbation, such as slowly relaxing cavity water, sampling issues along the alchemical stratification, and the need for highly overlapping molecular fingerprints.
Collapse
Affiliation(s)
- Piero Procacci
- Dipartimento di Chimica “Ugo
Schiff”, Università degli
Studi di Firenze, Via
della Lastruccia 3, 50019 Sesto Fiorentino, Italy
| |
Collapse
|
39
|
Patel D, Cox BD, Kasthuri M, Mengshetti S, Bassit L, Verma K, Ollinger-Russell O, Amblard F, Schinazi RF. In silico design of a novel nucleotide antiviral agent by free energy perturbation. Chem Biol Drug Des 2022; 99:801-815. [PMID: 35313085 PMCID: PMC9175506 DOI: 10.1111/cbdd.14042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2021] [Revised: 02/03/2022] [Accepted: 03/05/2022] [Indexed: 11/30/2022]
Abstract
Nucleoside analogs are the backbone of antiviral therapies. Drugs from this class undergo processing by host or viral kinases to form the active nucleoside triphosphate species that selectively inhibits the viral polymerase. It is the central hypothesis that the nucleoside triphosphate analog must be a favorable substrate for the viral polymerase and the nucleoside precursor must be a satisfactory substrate for the host kinases to inhibit viral replication. Herein, free energy perturbation (FEP) was used to predict substrate affinity for both host and viral enzymes. Several uridine 5'-monophosphate prodrug analogs known to inhibit hepatitis C virus (HCV) were utilized in this study to validate the use of FEP. Binding free energies to the host monophosphate kinase and viral RNA-dependent RNA polymerase (RdRp) were calculated for methyl-substituted uridine analogs. The 2'-C-methyl-uridine and 4'-C-methyl-uridine scaffolds delivered favorable substrate binding to the host kinase and HCV RdRp that were consistent with results from cellular antiviral activity in support of our new approach. In a prospective evaluation, FEP results suggest that 2'-C-dimethyl-uridine scaffold delivered favorable monophosphate and triphosphate substrates for both host kinase and HCV RdRp, respectively. Novel 2'-C-dimethyl-uridine monophosphate prodrug was synthesized and exhibited sub-micromolar inhibition of HCV replication. Using this novel approach, we demonstrated for the first time that nucleoside analogs can be rationally designed that meet the multi-target requirements for antiviral activity.
Collapse
Affiliation(s)
- Dharmeshkumar Patel
- Center for AIDS Research, Laboratory of Biochemical Pharmacology, Department of Pediatrics, Emory University School of Medicine and Children’s Healthcare of Atlanta, 1760 Haygood Dr., Atlanta, GA, 30322, USA
| | - Bryan D. Cox
- Center for AIDS Research, Laboratory of Biochemical Pharmacology, Department of Pediatrics, Emory University School of Medicine and Children’s Healthcare of Atlanta, 1760 Haygood Dr., Atlanta, GA, 30322, USA
| | - Mahesh Kasthuri
- Center for AIDS Research, Laboratory of Biochemical Pharmacology, Department of Pediatrics, Emory University School of Medicine and Children’s Healthcare of Atlanta, 1760 Haygood Dr., Atlanta, GA, 30322, USA
| | - Seema Mengshetti
- Center for AIDS Research, Laboratory of Biochemical Pharmacology, Department of Pediatrics, Emory University School of Medicine and Children’s Healthcare of Atlanta, 1760 Haygood Dr., Atlanta, GA, 30322, USA
| | - Leda Bassit
- Center for AIDS Research, Laboratory of Biochemical Pharmacology, Department of Pediatrics, Emory University School of Medicine and Children’s Healthcare of Atlanta, 1760 Haygood Dr., Atlanta, GA, 30322, USA
| | - Kiran Verma
- Center for AIDS Research, Laboratory of Biochemical Pharmacology, Department of Pediatrics, Emory University School of Medicine and Children’s Healthcare of Atlanta, 1760 Haygood Dr., Atlanta, GA, 30322, USA
| | - Olivia Ollinger-Russell
- Center for AIDS Research, Laboratory of Biochemical Pharmacology, Department of Pediatrics, Emory University School of Medicine and Children’s Healthcare of Atlanta, 1760 Haygood Dr., Atlanta, GA, 30322, USA
| | - Franck Amblard
- Center for AIDS Research, Laboratory of Biochemical Pharmacology, Department of Pediatrics, Emory University School of Medicine and Children’s Healthcare of Atlanta, 1760 Haygood Dr., Atlanta, GA, 30322, USA
| | - Raymond F. Schinazi
- Center for AIDS Research, Laboratory of Biochemical Pharmacology, Department of Pediatrics, Emory University School of Medicine and Children’s Healthcare of Atlanta, 1760 Haygood Dr., Atlanta, GA, 30322, USA
| |
Collapse
|
40
|
McNutt AT, Koes DR. Improving ΔΔG Predictions with a Multitask Convolutional Siamese Network. J Chem Inf Model 2022; 62:1819-1829. [PMID: 35380443 PMCID: PMC9038699 DOI: 10.1021/acs.jcim.1c01497] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
The lead optimization phase of drug discovery refines an initial hit molecule for desired properties, especially potency. Synthesis and experimental testing of the small perturbations during this refinement can be quite costly and time-consuming. Relative binding free energy (RBFE, also referred to as ΔΔG) methods allow the estimation of binding free energy changes after small changes to a ligand scaffold. Here, we propose and evaluate a Siamese convolutional neural network (CNN) for the prediction of RBFE between two bound ligands. We show that our multitask loss is able to improve on a previous state-of-the-art Siamese network for RBFE prediction via increased regularization of the latent space. The Siamese network architecture is well suited to the prediction of RBFE in comparison to a standard CNN trained on the same data (Pearson's R of 0.553 and 0.5, respectively). When evaluated on a left-out protein family, our Siamese CNN shows variability in its RBFE predictive performance depending on the protein family being evaluated (Pearson's R ranging from -0.44 to 0.97). RBFE prediction performance can be improved during generalization by injecting only a few examples (few-shot learning) from the evaluation data set during model training.
Collapse
Affiliation(s)
- Andrew T McNutt
- Department of Computational and Systems Biology, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| | - David Ryan Koes
- Department of Computational and Systems Biology, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| |
Collapse
|
41
|
Markthaler D, Fleck M, Stankiewicz B, Hansen N. Exploring the Effect of Enhanced Sampling on Protein Stability Prediction. J Chem Theory Comput 2022; 18:2569-2583. [PMID: 35298174 DOI: 10.1021/acs.jctc.1c01012] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Changes in protein stability due to side-chain mutations are evaluated by alchemical free-energy calculations based on classical molecular dynamics (MD) simulations in explicit solvent using the GROMOS force field. Three proteins of different complexity with a total number of 93 single-point mutations are analyzed, and the relative free-energy differences are discussed with respect to configurational sampling and (dis)agreement with experimental data. For the smallest protein studied, a 34-residue WW domain, the starting structure dependence of the alchemical free-energy changes, is discussed in detail. Deviations from previous simulations for the two other proteins are shown to result from insufficient sampling in the earlier studies. Hamiltonian replica exchange in combination with multiple starting structures and sufficient sampling time of more than 100 ns per intermediate alchemical state is required in some cases to reach convergence.
Collapse
Affiliation(s)
- Daniel Markthaler
- Institute of Thermodynamics and Thermal Process Engineering, University of Stuttgart, Pfaffenwaldring 9, 70569 Stuttgart, Germany
| | - Maximilian Fleck
- Institute of Thermodynamics and Thermal Process Engineering, University of Stuttgart, Pfaffenwaldring 9, 70569 Stuttgart, Germany
| | - Bartosz Stankiewicz
- Institute of Thermodynamics and Thermal Process Engineering, University of Stuttgart, Pfaffenwaldring 9, 70569 Stuttgart, Germany
| | - Niels Hansen
- Institute of Thermodynamics and Thermal Process Engineering, University of Stuttgart, Pfaffenwaldring 9, 70569 Stuttgart, Germany
| |
Collapse
|
42
|
Machine Learning Applied to the Modeling of Pharmacological and ADMET Endpoints. Methods Mol Biol 2021. [PMID: 34731464 DOI: 10.1007/978-1-0716-1787-8_2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2023]
Abstract
The well-known concept of quantitative structure-activity relationships (QSAR) has been gaining significant interest in the recent years. Data, descriptors, and algorithms are the main pillars to build useful models that support more efficient drug discovery processes with in silico methods. Significant advances in all three areas are the reason for the regained interest in these models. In this book chapter we review various machine learning (ML) approaches that make use of measured in vitro/in vivo data of many compounds. We put these in context with other digital drug discovery methods and present some application examples.
Collapse
|
43
|
Zhang H, Kim S, Giese TJ, Lee TS, Lee J, York DM, Im W. CHARMM-GUI Free Energy Calculator for Practical Ligand Binding Free Energy Simulations with AMBER. J Chem Inf Model 2021; 61:4145-4151. [PMID: 34521199 DOI: 10.1021/acs.jcim.1c00747] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Alchemical free energy methods, such as free energy perturbation (FEP) and thermodynamic integration (TI), become increasingly popular and crucial for drug design and discovery. However, the system preparation of alchemical free energy simulation is an error-prone, time-consuming, and tedious process for a large number of ligands. To address this issue, we have recently presented CHARMM-GUI Free Energy Calculator that can provide input and postprocessing scripts for NAMD and GENESIS FEP molecular dynamics systems. In this work, we extended three submodules of Free Energy Calculator to work with the full suite of GPU-accelerated alchemical free energy methods and tools in AMBER, including input and postprocessing scripts. The BACE1 (β-secretase 1) benchmark set was used to validate the AMBER-TI simulation systems and scripts generated by Free Energy Calculator. The overall results of relatively large and diverse systems are almost equivalent with different protocols (unified and split) and with different timesteps (1, 2, and 4 fs), with R2 > 0.9. More importantly, the average free energy differences between two protocols are small and reliable with four independent runs, with a mean unsigned error (MUE) below 0.4 kcal/mol. Running at least four independent runs for each pair with AMBER20 (and FF19SB/GAFF2.1/OPC force fields), we obtained a MUE of 0.99 kcal/mol and root-mean-square error of 1.31 kcal/mol for 58 alchemical transformations in comparison with experimental data. In addition, a set of ligands for T4-lysozyme was used to further validate our free energy calculation protocol whose results are close to experimental data (within 1 kcal/mol). In summary, Free Energy Calculator provides a user-friendly web-based tool to generate the AMBER-TI system and input files for high-throughput binding free energy calculations with access to the full set of GPU-accelerated alchemical free energy, enhanced sampling, and analysis methods in AMBER.
Collapse
Affiliation(s)
- Han Zhang
- Department of Biological Sciences, Chemistry, Bioengineering, and Computer Science and Engineering, Lehigh University, Pennsylvania 18015, United States
| | - Seonghoon Kim
- School of Computational Sciences, Korea Institute for Advanced Study, Seoul 02455, Republic of Korea
| | - Timothy J Giese
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine, and Department of Chemistry and Chemical Biology, Rutgers, the State University of New Jersey, New Jersey 08854, United States
| | - Tai-Sung Lee
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine, and Department of Chemistry and Chemical Biology, Rutgers, the State University of New Jersey, New Jersey 08854, United States
| | - Jumin Lee
- Department of Biological Sciences, Chemistry, Bioengineering, and Computer Science and Engineering, Lehigh University, Pennsylvania 18015, United States
| | - Darrin M York
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine, and Department of Chemistry and Chemical Biology, Rutgers, the State University of New Jersey, New Jersey 08854, United States
| | - Wonpil Im
- Department of Biological Sciences, Chemistry, Bioengineering, and Computer Science and Engineering, Lehigh University, Pennsylvania 18015, United States
| |
Collapse
|
44
|
Carvalho Martins L, Cino EA, Ferreira RS. PyAutoFEP: An Automated Free Energy Perturbation Workflow for GROMACS Integrating Enhanced Sampling Methods. J Chem Theory Comput 2021; 17:4262-4273. [PMID: 34142828 DOI: 10.1021/acs.jctc.1c00194] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
Free energy perturbation (FEP) calculations are now routinely used in drug discovery to estimate the relative FEB (RFEB) of small molecules to a biomolecular target of interest. Using enhanced sampling can improve the correlation between predictions and experimental data, especially in systems with conformational changes. Due to the large number of perturbations required in drug discovery campaigns, the manual setup of FEP calculations is no longer viable. Here, we introduce PyAutoFEP, a flexible and open-source tool to aid the setup of RFEB FEP. PyAutoFEP is written in Python3, and automates the generation of perturbation maps, dual topologies, system building and molecular dynamics (MD), and analysis. PyAutoFEP supports multiple force fields, incorporates replica exchange with solute tempering (REST) and replica exchange with solute scaling (REST2) enhanced sampling methods, and allows flexible λ values along perturbation windows. To validate PyAutoFEP, it was applied to a set of 14 Farnesoid X receptor ligands, a system included in the drug design data resource grand challenge 2. An 88% mean correct sign prediction was achieved, and 75% of the predictions had an error below 1.5 kcal/mol. Results using Amber03/GAFF, CHARMM36m/CGenFF, and OPLS-AA/M/LigParGen had Pearson's r values of 0.71 ± 0.13, 0.30 ± 0.27, and 0.66 ± 0.20, respectively. The Amber03/GAFF and OPLS-AA/M/LigParGen results were on par with the top grand challenge 2 submissions. Applying REST2 improved the results using CHARMM36m/CGenFF (Pearson's r = 0.43 ± 0.21) but had little impact on the other force fields. CHARMM36-YF and CHARMM36-WYF modifications did not yield improved predictions compared to CHARMM36m. Finally, we estimated the probability of finding a molecule 1 pKi better than a lead when using PyAutoFEP to screen 10 or 100 analogues. The probabilities, when compared to random sampling, increased up to sevenfold when 100 molecules were to be screened, suggesting that PyAutoFEP would likely be useful for lead optimization. PyAutoFEP is available on GitHub at https://github.com/lmmpf/PyAutoFEP.
Collapse
Affiliation(s)
- Luan Carvalho Martins
- Graduate Program in Bioinformatics. Institute for Biological Sciences, Federal University of Minas Gerais, Belo Horizonte 31270-901, Brazil
| | - Elio A Cino
- Biochemistry and Immunology Department, Institute for Biological Sciences, Federal University of Minas Gerais, Belo Horizonte 31270-901, Brazil
| | - Rafaela Salgado Ferreira
- Biochemistry and Immunology Department, Institute for Biological Sciences, Federal University of Minas Gerais, Belo Horizonte 31270-901, Brazil
| |
Collapse
|
45
|
Goel H, Hazel A, Ustach VD, Jo S, Yu W, MacKerell AD. Rapid and accurate estimation of protein-ligand relative binding affinities using site-identification by ligand competitive saturation. Chem Sci 2021; 12:8844-8858. [PMID: 34257885 PMCID: PMC8246086 DOI: 10.1039/d1sc01781k] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2021] [Accepted: 05/24/2021] [Indexed: 01/18/2023] Open
Abstract
Predicting relative protein-ligand binding affinities is a central pillar of lead optimization efforts in structure-based drug design. The site identification by ligand competitive saturation (SILCS) methodology is based on functional group affinity patterns in the form of free energy maps that may be used to compute protein-ligand binding poses and affinities. Presented are results obtained from the SILCS methodology for a set of eight target proteins as reported originally in Wang et al. (J. Am. Chem. Soc., 2015, 137, 2695-2703) using free energy perturbation (FEP) methods in conjunction with enhanced sampling and cycle closure corrections. These eight targets have been subsequently studied by many other authors to compare the efficacy of their method while comparing with the outcomes of Wang et al. In this work, we present results for a total of 407 ligands on the eight targets and include specific analysis on the subset of 199 ligands considered previously. Using the SILCS methodology we can achieve an average accuracy of up to 77% and 74% when considering the eight targets with their 199 and 407 ligands, respectively, for rank-ordering ligand affinities as calculated by the percent correct metric. This accuracy increases to 82% and 80%, respectively, when the SILCS atomic free energy contributions are optimized using a Bayesian Markov-chain Monte Carlo approach. We also report other metrics including Pearson's correlation coefficient, Pearlman's predictive index, mean unsigned error, and root mean square error for both sets of ligands. The results obtained for the 199 ligands are compared with the outcomes of Wang et al. and other published works. Overall, the SILCS methodology yields similar or better-quality predictions without a priori need for known ligand orientations in terms of the different metrics when compared to current FEP approaches with significant computational savings while additionally offering quantitative estimates of individual atomic contributions to binding free energies. These results further validate the SILCS methodology as an accurate, computationally efficient tool to support lead optimization and drug discovery.
Collapse
Affiliation(s)
- Himanshu Goel
- Computer Aided Drug Design Center, Department of Pharmaceutical Sciences, University of Maryland School of Pharmacy 20, Penn St. Baltimore Maryland 21201 USA +1-410-706-5017 +1-410-706-7442
| | - Anthony Hazel
- Computer Aided Drug Design Center, Department of Pharmaceutical Sciences, University of Maryland School of Pharmacy 20, Penn St. Baltimore Maryland 21201 USA +1-410-706-5017 +1-410-706-7442
| | - Vincent D Ustach
- Computer Aided Drug Design Center, Department of Pharmaceutical Sciences, University of Maryland School of Pharmacy 20, Penn St. Baltimore Maryland 21201 USA +1-410-706-5017 +1-410-706-7442
| | - Sunhwan Jo
- SilcsBio LLC 8 Market Place, Suite 300 Baltimore Maryland 21201 USA
| | - Wenbo Yu
- Computer Aided Drug Design Center, Department of Pharmaceutical Sciences, University of Maryland School of Pharmacy 20, Penn St. Baltimore Maryland 21201 USA +1-410-706-5017 +1-410-706-7442
| | - Alexander D MacKerell
- Computer Aided Drug Design Center, Department of Pharmaceutical Sciences, University of Maryland School of Pharmacy 20, Penn St. Baltimore Maryland 21201 USA +1-410-706-5017 +1-410-706-7442
- SilcsBio LLC 8 Market Place, Suite 300 Baltimore Maryland 21201 USA
| |
Collapse
|
46
|
Cappel D, Mozziconacci JC, Braun T, Steinbrecher T. Performance of Relative Binding Free Energy Calculations on an Automatically Generated Dataset of Halogen-Deshalogen Matched Molecular Pairs. J Chem Inf Model 2021; 61:3421-3430. [PMID: 34170707 DOI: 10.1021/acs.jcim.1c00290] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
In this study, we generated a matched molecular pair dataset of halogen/deshalogen compounds with reliable binding affinity data and structural binding mode information from public databases. The workflow includes automated system preparation and setup of free energy perturbation relative binding free energy calculations. We demonstrate the suitability of these datasets to investigate the performance of molecular mechanics force fields and molecular simulation algorithms for the purpose of in silico affinity predictions in lead optimization. Our datasets of a total of 115 matched molecular pairs show highly accurate binding free energy predictions with an average error of <1 kcal/mol despite the semi-automated calculation scheme. We quantify the accuracy of the optimized potential for liquid simulations (OPLS) force field to predict the effect of halogen addition to compounds, a commonly employed chemical modification in the design of drug-like molecules.
Collapse
Affiliation(s)
- Daniel Cappel
- Schrödinger GmbH, Glücksteinallee 25, 68163 Mannheim, Germany
| | | | - Tatjana Braun
- Schrödinger GmbH, Thierschstraße 27, 80538 München, Germany
| | | |
Collapse
|
47
|
Liang SS, Liu XG, Cui YX, Zhang SL, Zhang QG, Chen JZ. Molecular mechanism concerning conformational changes of CDK2 mediated by binding of inhibitors using molecular dynamics simulations and principal component analysis. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2021; 32:1-22. [PMID: 34130570 DOI: 10.1080/1062936x.2021.1934896] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/10/2021] [Accepted: 05/23/2021] [Indexed: 06/12/2023]
Abstract
Cyclin-dependent kinase 2 (CDK2) has been regarded as a promising drug target for anti-tumour agents. In this study, molecular dynamics (MD) simulations and principal component (PC) analysis were used to explore binding mechanism of three inhibitors 1PU, CDK, 50Z to CDK2 and influences of their bindings on conformational changes of CDK2. The results show that bindings of inhibitors yield obvious impacts on internal dynamics, movement patterns and conformational changes of CDK2. In addition, molecular mechanics generalized Born surface area (MM-GBSA) was applied to calculate binding free energies between three inhibitors and CDK2 and evaluate their binding ability to CDK2. The results show that CDK has the strongest binding to CDK2 among the current three inhibitors. Residue-based free energy decomposition method was further utilized to decode the contributions of a single residue to binding of inhibitors, and it was found that three inhibitors not only produce hydrogen bonding interactions and hydrophobic interactions with key residues of CDK2, which promotes binding of three inhibitors to CDK2, but also share similar binding modes. This work is expected to be helpful for design of efficient drugs targeting CDK2.
Collapse
Affiliation(s)
- S S Liang
- School of Physics and Electronics, Shandong Normal University, Jinan, China
| | - X G Liu
- School of Physics and Electronics, Shandong Normal University, Jinan, China
| | - Y X Cui
- School of Physics and Electronics, Shandong Normal University, Jinan, China
| | - S L Zhang
- School of Physics and Electronics, Shandong Normal University, Jinan, China
| | - Q G Zhang
- School of Physics and Electronics, Shandong Normal University, Jinan, China
| | - J Z Chen
- School of Science, Shandong Jiaotong University, Jinan, China
| |
Collapse
|
48
|
Vilseck JZ, Ding X, Hayes RL, Brooks CL. Generalizing the Discrete Gibbs Sampler-Based λ-Dynamics Approach for Multisite Sampling of Many Ligands. J Chem Theory Comput 2021; 17:3895-3907. [PMID: 34101448 DOI: 10.1021/acs.jctc.1c00176] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
In this work, the discrete λ variant of the Gibbs sampler-based λ-dynamics (d-GSλD) method is developed to enable multiple functional group perturbations to be investigated at one or more sites of substitution off a common ligand core. The theoretical framework and special considerations for constructing discrete λ states for multisite d-GSλD are presented. The precision and accuracy of the d-GSλD method is evaluated with three test cases of increasing complexity. Specifically, methyl → methyl symmetric perturbations in water, 1,4-benzene hydration free energies and protein-ligand binding affinities for an example HIV-1 reverse transcriptase inhibitor series are computed with d-GSλD. Complementary MSλD calculations were also performed to compare with d-GSλD's performance. Excellent agreement between d-GSλD and MSλD is observed, with mean unsigned errors of 0.12 and 0.22 kcal/mol for computed hydration and binding free energy test cases, respectively. Good agreement with experiment is also observed, with errors of 0.5-0.7 kcal/mol. These findings support the applicability of the d-GSλD free energy method for a variety of molecular design problems, including structure-based drug design. Finally, a discussion of d-GSλD versus MSλD approaches is presented to compare and contrast features of both methods.
Collapse
Affiliation(s)
- Jonah Z Vilseck
- Department of Chemistry, University of Michigan, Ann Arbor, Michigan 48109, United States.,Department of Biochemistry and Molecular Biology, Indiana University School of Medicine, Indianapolis, Indiana 46202, United States.,Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, Indianapolis, Indiana 46202, United States
| | - Xinqiang Ding
- Department of Computational Medicine & Bioinformatics, University of Michigan, Ann Arbor, Michigan 48109, United States
| | - Ryan L Hayes
- Department of Chemistry, University of Michigan, Ann Arbor, Michigan 48109, United States
| | - Charles L Brooks
- Department of Chemistry, University of Michigan, Ann Arbor, Michigan 48109, United States.,Biophysics Program, University of Michigan, Ann Arbor, Michigan 48109, United States
| |
Collapse
|
49
|
Arasteh S, Zhang BW, Levy RM. Protein Loop Conformational Free Energy Changes via an Alchemical Path without Reaction Coordinates. J Phys Chem Lett 2021; 12:4368-4377. [PMID: 33938761 PMCID: PMC8170697 DOI: 10.1021/acs.jpclett.1c00778] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
We introduce a method called restrain-free energy perturbation-release 2.0 (R-FEP-R 2.0) to estimate conformational free energy changes of protein loops via an alchemical path. R-FEP-R 2.0 is a generalization of the method called restrain-free energy perturbation-release (R-FEP-R) that can only estimate conformational free energy changes of protein side chains but not loops. The reorganization of protein loops is a central feature of many biological processes. Unlike other advanced sampling algorithms such as umbrella sampling and metadynamics, R-FEP-R and R-FEP-R 2.0 do not require predetermined collective coordinates and transition pathways that connect the two endpoint conformational states. The R-FEP-R 2.0 method was applied to estimate the conformational free energy change of a β-turn flip in the protein ubiquitin. The result obtained by R-FEP-R 2.0 agrees with the benchmarks very well. We also comment on problems commonly encountered when applying umbrella sampling to calculate protein conformational free energy changes.
Collapse
Affiliation(s)
- Shima Arasteh
- Center for Biophysics and Computational Biology and Department of Chemistry, Temple University, Philadelphia, Pennsylvania 19122, United States
| | - Bin W Zhang
- Center for Biophysics and Computational Biology and Department of Chemistry, Temple University, Philadelphia, Pennsylvania 19122, United States
| | - Ronald M Levy
- Center for Biophysics and Computational Biology and Department of Chemistry, Temple University, Philadelphia, Pennsylvania 19122, United States
| |
Collapse
|
50
|
Giese TJ, York DM. Variational Method for Networkwide Analysis of Relative Ligand Binding Free Energies with Loop Closure and Experimental Constraints. J Chem Theory Comput 2021; 17:1326-1336. [PMID: 33528251 PMCID: PMC8011336 DOI: 10.1021/acs.jctc.0c01219] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
We describe an efficient method for the simultaneous solution of all free energies within a relative binding free-energy (RBFE) network with cycle closure and experimental/reference constraint conditions using Bennett Acceptance Ratio (BAR) and Multistate BAR (MBAR) analysis. Rather than solving the BAR or MBAR equations for each transformation independently, the simultaneous solution of all transformations are obtained by performing a constrained minimization of a global objective function. The nonlinear optimization of the objective function is subjected to affine linear constraints that couple the free energies between the network edges. The constraints are used to enforce the closure of thermodynamic cycles within the RBFE network, and to enforce an additional set of linear constraint conditions demonstrated here to be subsets of (1 or 2) experimental values. We describe details of the practical implementation of the network BAR/MBAR procedure, including use of generalized coordinates in the minimization of the free-energy objective function, propagation of bootstrap errors from those coordinates, and performance and memory optimization. In some cases it is found that use of restraints in the optimization is more practical than use of generalized coordinates for enforcing constraint conditions. The fast BARnet and MBARnet methods are used to analyze the RBFEs of six prototypical protein-ligand systems, and it is shown that enforcement of cycle closure conditions reduces the error in the predictions only modestly, and further reduction in errors can be achieved when one or two experimental RBFEs are included in the optimization procedure. These methods have been implemented into FE-ToolKit, a new free-energy analysis toolkit. The BARnet/MBARnet framework presented here opens the door to new, more efficient and robust free-energy analysis with enhanced predictive capability for drug discovery applications.
Collapse
Affiliation(s)
- Timothy J. Giese
- Laboratory for Biomolecular Simulation Research, Center for Integrative Proteomics Research and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854-8087 USA
| | - Darrin M. York
- Laboratory for Biomolecular Simulation Research, Center for Integrative Proteomics Research and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854-8087 USA
| |
Collapse
|