1
|
Fast calculation of hydrogen-bond strengths and free energy of hydration of small molecules. Sci Rep 2023; 13:4143. [PMID: 36914670 PMCID: PMC10011384 DOI: 10.1038/s41598-023-30089-x] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Accepted: 02/15/2023] [Indexed: 03/16/2023] Open
Abstract
Hydrogen bonding is an interaction of great importance in drug discovery and development as it may significantly affect chemical and biological processes including the interaction of small molecules with other molecules, proteins, and membranes. In particular, hydrogen bonding can impact drug-like properties such as target affinity and oral availability which are critical to developing effective pharmaceuticals, and therefore, numerous methods for the calculation of properties such as hydrogen-bond strengths, free energy of hydration, or water solubility have been proposed over time. However, the accessibility to efficient methods for the predictions of such properties is still limited. Here, we present the development of Jazzy, an open-source tool for the prediction of hydrogen-bond strengths and free energies of hydration of small molecules. Jazzy also allows the visualisation of hydrogen-bond strengths with atomistic resolution to support the design of compounds with desired properties and the interpretation of existing data. The tool is described in its implementation, parameter fitting, and validation against two data sets of experimental hydration free energies. Jazzy is also applied against two chemical series of bioactive compounds to show that hydrogen-bond strengths can be used to understand their structure-activity relationships. Results from the validations highlight the strengths and limitations of Jazzy, and suggest its suitability for interactive design, screening, and machine-learning featurisation.
Collapse
|
2
|
P. Oliveira M, Hünenberger PH. Systematic optimization of a fragment-based force field against experimental pure-liquid properties considering large compound families: application to oxygen and nitrogen compounds. Phys Chem Chem Phys 2021; 23:17774-17793. [PMID: 34350931 PMCID: PMC8386690 DOI: 10.1039/d1cp02001c] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2021] [Accepted: 06/30/2021] [Indexed: 12/04/2022]
Abstract
The CombiFF approach is a workflow for the automated refinement of force-field parameters against experimental condensed-phase data, considering entire classes of organic molecules constructed using a fragment library via combinatorial isomer enumeration. One peculiarity of this approach is that it relies on an electronegativity-equalization scheme to account for induction effects within molecules, with values of the atomic hardness and electronegativity as electrostatic parameters, rather than the partial charges themselves. In a previous article [M. P. Oliveira, M. Andrey, S. R. Rieder, L. Kern, D. F. Hahn, S. Riniker, B. A. C. Horta and P. H. Hünenberger, J. Chem. Theory. Comput. 2020, 16, 7525], CombiFF was introduced and applied to calibrate a GROMOS-compatible united-atom force field for the saturated acyclic (halo-)alkane family. Here, this scheme is employed for the construction of a corresponding force field for saturated acyclic compounds encompassing eight common chemical functional groups involving oxygen and/or nitrogen atoms, namely: ether, aldehyde, ketone, ester, alcohol, carboxylic acid, amine, and amide. Monofunctional as well as homo-polyfunctional compounds are considered. A total of 1712 experimental liquid densities ρliq and vaporization enthalpies ΔHvap concerning 1175 molecules are used for the calibration (339 molecules) and validation (836 molecules) of the 102 non-bonded interaction parameters of the force field. Using initial parameter values based on the GROMOS 2016H66 parameter set, convergence is reached after five iterations. Given access to one processor per simulated system, this operation only requires a few days of wall-clock computing time. After optimization, the root-mean-square deviations from experiment are 29.9 (22.4) kg m-3 for ρliq and 4.1 (5.5) kJ mol-1 for ΔHvap for the calibration (validation) set. Thus, a very good level of agreement with experiment is achieved in terms of these two properties, although the errors are inhomogeneously distributed across the different chemical functional groups.
Collapse
Affiliation(s)
- Marina P. Oliveira
- Laboratorium für Physikalische Chemie, ETH Zürich, ETH-Hönggerberg, HCICH-8093 ZürichSwitzerland+41 44 632 5503
| | - Philippe H. Hünenberger
- Laboratorium für Physikalische Chemie, ETH Zürich, ETH-Hönggerberg, HCICH-8093 ZürichSwitzerland+41 44 632 5503
| |
Collapse
|
3
|
Fan S, Nedev H, Vijayan R, Iorga BI, Beckstein O. Precise force-field-based calculations of octanol-water partition coefficients for the SAMPL7 molecules. J Comput Aided Mol Des 2021; 35:853-870. [PMID: 34232435 PMCID: PMC8397498 DOI: 10.1007/s10822-021-00407-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2021] [Accepted: 06/22/2021] [Indexed: 10/20/2022]
Abstract
We predicted water-octanol partition coefficients for the molecules in the SAMPL7 challenge with explicit solvent classical molecular dynamics (MD) simulations. Water hydration free energies and octanol solvation free energies were calculated with a windowed alchemical free energy approach. Three commonly used force fields (AMBER GAFF, CHARMM CGenFF, OPLS-AA) were tested. Special emphasis was placed on converging all simulations, using a criterion developed for the SAMPL6 challenge. In aggregate, over 1000 [Formula: see text]s of simulations were performed, with some free energy windows remaining not fully converged even after 1 [Formula: see text]s of simulation time. Nevertheless, the amount of sampling produced [Formula: see text] estimates with a precision of 0.1 log units or better for converged simulations. Despite being probably as fully sampled as can expected and is feasible, the agreement with experiment remained modest for all force fields, with no force field performing better than 1.6 in root mean squared error. Overall, our results indicate that a large amount of sampling is necessary to produce precise [Formula: see text] predictions for the SAMPL7 compounds and that high precision does not necessarily lead to high accuracy. Thus, fundamental problems remain to be solved for physics-based [Formula: see text] predictions.
Collapse
Affiliation(s)
- Shujie Fan
- Department of Physics, Arizona State University, P.O. Box 871504, Tempe, AZ, 85287-1504, USA
| | - Hristo Nedev
- Université Paris-Saclay, CNRS, Institut de Chimie des Substances Naturelles, UPR 2301, Labex LERMIT, 1 Avenue de la Terrasse, 91198, Gif-sur-Yvette, France
| | - Ranjit Vijayan
- Department of Biology, College of Science, United Arab Emirates University, PO Box 15551, Al Ain, UAE
| | - Bogdan I Iorga
- Université Paris-Saclay, CNRS, Institut de Chimie des Substances Naturelles, UPR 2301, Labex LERMIT, 1 Avenue de la Terrasse, 91198, Gif-sur-Yvette, France.
| | - Oliver Beckstein
- Department of Physics and Center for Biological Physics, Arizona State University, P.O. Box 871504, Tempe, AZ, 85287-1504, USA.
| |
Collapse
|
4
|
Oliveira MP, Andrey M, Rieder SR, Kern L, Hahn DF, Riniker S, Horta BAC, Hünenberger PH. Systematic Optimization of a Fragment-Based Force Field against Experimental Pure-Liquid Properties Considering Large Compound Families: Application to Saturated Haloalkanes. J Chem Theory Comput 2020; 16:7525-7555. [DOI: 10.1021/acs.jctc.0c00683] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Affiliation(s)
- Marina P. Oliveira
- Laboratorium für Physikalische Chemie, ETH Zürich, ETH-Honggerberg, HCI, CH-8093 Zürich, Switzerland
| | - Maurice Andrey
- Laboratorium für Physikalische Chemie, ETH Zürich, ETH-Honggerberg, HCI, CH-8093 Zürich, Switzerland
| | - Salomé R. Rieder
- Laboratorium für Physikalische Chemie, ETH Zürich, ETH-Honggerberg, HCI, CH-8093 Zürich, Switzerland
| | - Leyla Kern
- Laboratorium für Physikalische Chemie, ETH Zürich, ETH-Honggerberg, HCI, CH-8093 Zürich, Switzerland
| | - David F. Hahn
- Laboratorium für Physikalische Chemie, ETH Zürich, ETH-Honggerberg, HCI, CH-8093 Zürich, Switzerland
| | - Sereina Riniker
- Laboratorium für Physikalische Chemie, ETH Zürich, ETH-Honggerberg, HCI, CH-8093 Zürich, Switzerland
| | - Bruno A. C. Horta
- Instituto de Química, Universidade Federal do Rio de Janeiro, Rio de Janeiro 21941-909, Brazil
| | - Philippe H. Hünenberger
- Laboratorium für Physikalische Chemie, ETH Zürich, ETH-Honggerberg, HCI, CH-8093 Zürich, Switzerland
| |
Collapse
|
5
|
Fan S, Iorga BI, Beckstein O. Prediction of octanol-water partition coefficients for the SAMPL6-[Formula: see text] molecules using molecular dynamics simulations with OPLS-AA, AMBER and CHARMM force fields. J Comput Aided Mol Des 2020; 34:543-560. [PMID: 31960254 PMCID: PMC7667952 DOI: 10.1007/s10822-019-00267-z] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2019] [Accepted: 11/27/2019] [Indexed: 10/25/2022]
Abstract
All-atom molecular dynamics simulations with stratified alchemical free energy calculations were used to predict the octanol-water partition coefficient [Formula: see text] of eleven small molecules as part of the SAMPL6-[Formula: see text] blind prediction challenge using four different force field parametrizations: standard OPLS-AA with transferable charges, OPLS-AA with non-transferable CM1A charges, AMBER/GAFF, and CHARMM/CGenFF. Octanol parameters for OPLS-AA, GAFF and CHARMM were validated by comparing the density as a function of temperature, the chemical potential, and the hydration free energy to experimental values. The partition coefficients were calculated from the solvation free energy for the compounds in water and pure ("dry") octanol or "wet" octanol with 27 mol% water dissolved. Absolute solvation free energies were computed by thermodynamic integration (TI) and the multistate Bennett acceptance ratio with uncorrelated samples from data generated by an established protocol using 5-ns windowed alchemical free energy perturbation (FEP) calculations with the Gromacs molecular dynamics package. Equilibration of sets of FEP simulations was quantified by a new measure of convergence based on the analysis of forward and time-reversed trajectories. The accuracy of the [Formula: see text] predictions was assessed by descriptive statistical measures such as the root mean square error (RMSE) of the data set compared to the experimental values. Discarding the first 1 ns of each 5-ns window as an equilibration phase had a large effect on the GAFF data, where it improved the RMSE by up to 0.8 log units, while the effect for other data sets was smaller or marginally worsened the agreement. Overall, CGenFF gave the best prediction with RMSE 1.2 log units, although for only eight molecules because the current CGenFF workflow for Gromacs does not generate files for certain halogen-containing compounds. Over all eleven compounds, GAFF gave an RMSE of 1.5. The effect of using a mixed water/octanol solvent slightly decreased the accuracy for CGenFF and GAFF and slightly increased it for OPLS-AA. The GAFF and OPLS-AA results displayed a systematic error where molecules were too hydrophobic whereas CGenFF appeared to be more balanced, at least on this small data set.
Collapse
Affiliation(s)
- Shujie Fan
- Department of Physics, Arizona State University, P.O. Box 871504, Tempe, AZ 85287-1504, USA
| | - Bogdan I Iorga
- Institut de Chimie des Substances Naturelles, CNRS UPR 2301, Université Paris-Saclay, Labex LERMIT, 1 Avenue de la Terrasse, 91198 Gif-sur-Yvette, France
| | - Oliver Beckstein
- Department of Physics and Center for Biological Physics, Arizona State University, P.O. Box 871504, Tempe, AZ 85287-1504, USA
| |
Collapse
|
6
|
Selwa E, Kenney IM, Beckstein O, Iorga BI. SAMPL6: calculation of macroscopic pK a values from ab initio quantum mechanical free energies. J Comput Aided Mol Des 2018; 32:1203-1216. [PMID: 30084080 DOI: 10.1007/s10822-018-0138-6] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2018] [Accepted: 07/21/2018] [Indexed: 12/16/2022]
Abstract
Macroscopic pKa values were calculated for all compounds in the SAMPL6 blind prediction challenge, based on quantum chemical calculations with a continuum solvation model and a linear correction derived from a small training set. Microscopic pKa values were derived from the gas-phase free energy difference between protonated and deprotonated forms together with the Conductor-like Polarizable Continuum Solvation Model and the experimental solvation free energy of the proton. pH-dependent microstate free energies were obtained from the microscopic pKas with a maximum likelihood estimator and appropriately summed to yield macroscopic pKa values or microstate populations as function of pH. We assessed the accuracy of three approaches to calculate the microscopic pKas: direct use of the quantum mechanical free energy differences and correction of the direct values for short-comings in the QM solvation model with two different linear models that we independently derived from a small training set of 38 compounds with known pKa. The predictions that were corrected with the linear models had much better accuracy [root-mean-square error (RMSE) 2.04 and 1.95 pKa units] than the direct calculation (RMSE 3.74). Statistical measures indicate that some systematic errors remain, likely due to differences in the SAMPL6 data set and the small training set with respect to their interactions with water. Overall, the current approach provides a viable physics-based route to estimate macroscopic pKa values for novel compounds with reasonable accuracy.
Collapse
Affiliation(s)
- Edithe Selwa
- Institut de Chimie des Substances Naturelles, CNRS UPR 2301, Université Paris-Saclay, Labex LERMIT, 1 Avenue de la Terrasse, 91198, Gif-sur-Yvette, France
| | - Ian M Kenney
- Department of Physics, Arizona State University, P.O. Box 871504, Tempe, AZ, 85287-1504, USA
| | - Oliver Beckstein
- Department of Physics, Arizona State University, P.O. Box 871504, Tempe, AZ, 85287-1504, USA. .,Center for Biological Physics, Arizona State University, P.O. Box 871504, Tempe, AZ, 85287-1504, USA.
| | - Bogdan I Iorga
- Institut de Chimie des Substances Naturelles, CNRS UPR 2301, Université Paris-Saclay, Labex LERMIT, 1 Avenue de la Terrasse, 91198, Gif-sur-Yvette, France.
| |
Collapse
|
7
|
König G, Reetz MT, Thiel W. 1-Butanol as a Solvent for Efficient Extraction of Polar Compounds from Aqueous Medium: Theoretical and Practical Aspects. J Phys Chem B 2018; 122:6975-6988. [PMID: 29897756 DOI: 10.1021/acs.jpcb.8b02877] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The extraction of polar molecules from aqueous solution is a challenging task in organic synthesis. 1-Butanol has been used sporadically as an eluent for polar molecules, but it is unclear which molecular features drive its efficiency. Here, we employ free energy simulations to study the partitioning of 15 solutes between water and 1-butanol. The simulations demonstrate that the high affinity of polar molecules to the wet 1-butanol phase is associated with its nanostructure. Small inverse micelles of water are able to accommodate polar solutes and locally mimic an aqueous environment. We verify the simulations based on partition coefficients between water and 1-octanol, and include a blind prediction of the water/1-butanol partition coefficient of cyclohexane-1,2-diol. The calculations are in excellent agreement with experiment, reaching root-mean-square deviations below 0.7 kcal/mol. Actual extractions of cyclohexane-1,2-diol from buffer solutions that mimic cell lysates and suspensions in biocatalytic reactions further exemplify our findings. The yields highlight that extractions with 1-butanol can be significantly more efficient than the conventional protocol based on ethyl acetate.
Collapse
Affiliation(s)
- Gerhard König
- Max-Planck-Institut für Kohlenforschung , 45470 Mülheim an der Ruhr , Germany.,Laboratory for Biomolecular Simulation Research, Center for Integrative Proteomics Research, and Department of Chemistry and Chemical Biology , Rutgers University , Piscataway , New Jersey 08854 , United States
| | - Manfred T Reetz
- Max-Planck-Institut für Kohlenforschung , 45470 Mülheim an der Ruhr , Germany.,Department of Chemistry , Philipps-University Marburg , 35032 Marburg , Germany
| | - Walter Thiel
- Max-Planck-Institut für Kohlenforschung , 45470 Mülheim an der Ruhr , Germany
| |
Collapse
|
8
|
Domanski J, Beckstein O, Iorga BI. Ligandbook: an online repository for small and drug-like molecule force field parameters. Bioinformatics 2018; 33:1747-1749. [PMID: 28130228 PMCID: PMC5447236 DOI: 10.1093/bioinformatics/btx037] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2016] [Accepted: 01/21/2017] [Indexed: 12/03/2022] Open
Abstract
Summary Ligandbook is a public database and archive for force field parameters of small and drug-like molecules. It is a repository for parameter sets that are part of published work but are not easily available to the community otherwise. Parameter sets can be downloaded and immediately used in molecular dynamics simulations. The sets of parameters are versioned with full histories and carry unique identifiers to facilitate reproducible research. Text-based search on rich metadata and chemical substructure search allow precise identification of desired compounds or functional groups. Ligandbook enables the rapid set up of reproducible molecular dynamics simulations of ligands and protein-ligand complexes. Availability and Implementation Ligandbook is available online at https://ligandbook.org and supports all modern browsers. Parameters can be searched and downloaded without registration, including access through a programmatic RESTful API. Deposition of files requires free user registration. Ligandbook is implemented in the PHP Symfony2 framework with TCL scripts using the CACTVS toolkit. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Jan Domanski
- Department of Biochemistry, University of Oxford, Oxford, UK.,Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, MD, USA
| | - Oliver Beckstein
- Department of Physics, Arizona State University, Tempe, AZ, USA.,Center for Biological Physics, Arizona State University, Tempe, AZ, USA
| | - Bogdan I Iorga
- Institut de Chimie des Substances Naturelles, CNRS UPR 2301, Université Paris-Saclay, Labex LERMIT, Gif-sur-Yvette, France
| |
Collapse
|
9
|
Gosink LJ, Overall CC, Reehl SM, Whitney PD, Mobley DL, Baker NA. Bayesian Model Averaging for Ensemble-Based Estimates of Solvation-Free Energies. J Phys Chem B 2017; 121:3458-3472. [PMID: 27966363 DOI: 10.1021/acs.jpcb.6b09198] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
This paper applies the Bayesian Model Averaging statistical ensemble technique to estimate small molecule solvation free energies. There is a wide range of methods available for predicting solvation free energies, ranging from empirical statistical models to ab initio quantum mechanical approaches. Each of these methods is based on a set of conceptual assumptions that can affect predictive accuracy and transferability. Using an iterative statistical process, we have selected and combined solvation energy estimates using an ensemble of 17 diverse methods from the fourth Statistical Assessment of Modeling of Proteins and Ligands (SAMPL) blind prediction study to form a single, aggregated solvation energy estimate. Methods that possess minimal or redundant information are pruned from the ensemble and the evaluation process repeats until aggregate predictive performance can no longer be improved. We show that this process results in a final aggregate estimate that outperforms all individual methods by reducing estimate errors by as much as 91% to 1.2 kcal mol-1 accuracy. This work provides a new approach for accurate solvation free energy prediction and lays the foundation for future work on aggregate models that can balance computational cost with prediction accuracy.
Collapse
Affiliation(s)
| | | | | | | | - David L Mobley
- Departments of Pharmaceutical Sciences and Chemistry, University of California, Irvine , Irvine, California 92697, United States
| | - Nathan A Baker
- Division of Applied Mathematics, Brown University , Providence, Rhode Island 02912, United States
| |
Collapse
|
10
|
Kenney IM, Beckstein O, Iorga BI. Prediction of cyclohexane-water distribution coefficients for the SAMPL5 data set using molecular dynamics simulations with the OPLS-AA force field. J Comput Aided Mol Des 2016; 30:1045-1058. [PMID: 27581968 DOI: 10.1007/s10822-016-9949-5] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2016] [Accepted: 08/21/2016] [Indexed: 11/29/2022]
Abstract
All-atom molecular dynamics simulations were used to predict water-cyclohexane distribution coefficients [Formula: see text] of a range of small molecules as part of the SAMPL5 blind prediction challenge. Molecules were parameterized with the transferable all-atom OPLS-AA force field, which required the derivation of new parameters for sulfamides and heterocycles and validation of cyclohexane parameters as a solvent. The distribution coefficient was calculated from the solvation free energies of the compound in water and cyclohexane. Absolute solvation free energies were computed by an established protocol using windowed alchemical free energy perturbation with thermodynamic integration. This protocol resulted in an overall root mean square error in [Formula: see text] of almost 4 log units and an overall signed error of -3 compared to experimental data. There was no substantial overall difference in accuracy between simulating in NVT and NPT ensembles. The signed error suggests a systematic error but the experimental [Formula: see text] data on their own are insufficient to uncover the source of this error. Preliminary work suggests that the major source of error lies in the hydration free energy calculations.
Collapse
Affiliation(s)
- Ian M Kenney
- Department of Physics, Arizona State University, P.O. Box 871504, Tempe, AZ, 85287-1504, USA
| | - Oliver Beckstein
- Department of Physics, Arizona State University, P.O. Box 871504, Tempe, AZ, 85287-1504, USA. .,Center for Biological Physics, Arizona State University, P.O. Box 871504, Tempe, AZ, 85287-1504, USA.
| | - Bogdan I Iorga
- Institut de Chimie des Substances Naturelles, CNRS UPR 2301, Université Paris-Saclay, Labex LERMIT, 1 Avenue de la Terrasse, 91198, Gif-sur-Yvette, France.
| |
Collapse
|
11
|
König G, Pickard FC, Huang J, Simmonett AC, Tofoleanu F, Lee J, Dral PO, Prasad S, Jones M, Shao Y, Thiel W, Brooks BR. Calculating distribution coefficients based on multi-scale free energy simulations: an evaluation of MM and QM/MM explicit solvent simulations of water-cyclohexane transfer in the SAMPL5 challenge. J Comput Aided Mol Des 2016; 30:989-1006. [PMID: 27577746 DOI: 10.1007/s10822-016-9936-x] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2016] [Accepted: 08/09/2016] [Indexed: 11/29/2022]
Abstract
One of the central aspects of biomolecular recognition is the hydrophobic effect, which is experimentally evaluated by measuring the distribution coefficients of compounds between polar and apolar phases. We use our predictions of the distribution coefficients between water and cyclohexane from the SAMPL5 challenge to estimate the hydrophobicity of different explicit solvent simulation techniques. Based on molecular dynamics trajectories with the CHARMM General Force Field, we compare pure molecular mechanics (MM) with quantum-mechanical (QM) calculations based on QM/MM schemes that treat the solvent at the MM level. We perform QM/MM with both density functional theory (BLYP) and semi-empirical methods (OM1, OM2, OM3, PM3). The calculations also serve to test the sensitivity of partition coefficients to solute polarizability as well as the interplay of the quantum-mechanical region with the fixed-charge molecular mechanics environment. Our results indicate that QM/MM with both BLYP and OM2 outperforms pure MM. However, this observation is limited to a subset of cases where convergence of the free energy can be achieved.
Collapse
Affiliation(s)
- Gerhard König
- Laboratory of Computational Biology, National Heart, Lung and Blood Institute, National Institutes of Health, Bethesda, MD, 20892, USA. .,Max-Planck-Institut für Kohlenforschung, 45470, Mülheim an der Ruhr, Germany.
| | - Frank C Pickard
- Laboratory of Computational Biology, National Heart, Lung and Blood Institute, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Jing Huang
- Laboratory of Computational Biology, National Heart, Lung and Blood Institute, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Andrew C Simmonett
- Laboratory of Computational Biology, National Heart, Lung and Blood Institute, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Florentina Tofoleanu
- Laboratory of Computational Biology, National Heart, Lung and Blood Institute, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Juyong Lee
- Laboratory of Computational Biology, National Heart, Lung and Blood Institute, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Pavlo O Dral
- Max-Planck-Institut für Kohlenforschung, 45470, Mülheim an der Ruhr, Germany
| | - Samarjeet Prasad
- Laboratory of Computational Biology, National Heart, Lung and Blood Institute, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Michael Jones
- Laboratory of Computational Biology, National Heart, Lung and Blood Institute, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Yihan Shao
- Laboratory of Computational Biology, National Heart, Lung and Blood Institute, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Walter Thiel
- Max-Planck-Institut für Kohlenforschung, 45470, Mülheim an der Ruhr, Germany
| | - Bernard R Brooks
- Laboratory of Computational Biology, National Heart, Lung and Blood Institute, National Institutes of Health, Bethesda, MD, 20892, USA
| |
Collapse
|
12
|
Pickard FC, König G, Simmonett AC, Shao Y, Brooks BR. An efficient protocol for obtaining accurate hydration free energies using quantum chemistry and reweighting from molecular dynamics simulations. Bioorg Med Chem 2016; 24:4988-4997. [PMID: 27667551 DOI: 10.1016/j.bmc.2016.08.031] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2016] [Revised: 08/18/2016] [Accepted: 08/19/2016] [Indexed: 11/15/2022]
Abstract
The non-Boltzmann Bennett (NBB) free energy estimator method is applied to 21 molecules from the blind subset of the SAMPL4 challenge. When NBB is applied with the SMD implicit solvent model, and the OLYP/DZP level of quantum chemistry, highly accurate hydration free energy calculations are obtained with respect to experiment (RMSD=0.89kcal·mol-1). Other quantum chemical methods are also tested, and the effects of solvent model, density functional, basis set are explored in this benchmarking study, providing a framework for improvements in calculating hydration free energies. We provide a practical guide for using the best QM-NBB protocols that are consistently more accurate than either pure QM or pure MM alone. In situations where high accuracy hydration free energy predictions are needed, the QM-NBB method with SMD implicit solvent should be the first choice of quantum chemists.
Collapse
Affiliation(s)
- Frank C Pickard
- National Institutes of Health - National Heart, Lung and Blood Institute, Laboratory of Computational Biology, 5635 Fishers Lane, T-900 Suite, Rockville, MD 20852, USA
| | - Gerhard König
- National Institutes of Health - National Heart, Lung and Blood Institute, Laboratory of Computational Biology, 5635 Fishers Lane, T-900 Suite, Rockville, MD 20852, USA; Max Planck Institut für Kohlenforschung, 45470 Mülheim an der Ruhr, NRW, Germany
| | - Andrew C Simmonett
- National Institutes of Health - National Heart, Lung and Blood Institute, Laboratory of Computational Biology, 5635 Fishers Lane, T-900 Suite, Rockville, MD 20852, USA
| | - Yihan Shao
- Department of Chemistry and Biochemistry University of Oklahoma Norman, OK 73019, USA
| | - Bernard R Brooks
- National Institutes of Health - National Heart, Lung and Blood Institute, Laboratory of Computational Biology, 5635 Fishers Lane, T-900 Suite, Rockville, MD 20852, USA
| |
Collapse
|
13
|
Zafar A, Reynisson J. Hydration Free Energy as a Molecular Descriptor in Drug Design: A Feasibility Study. Mol Inform 2016; 35:207-14. [PMID: 27492087 DOI: 10.1002/minf.201501035] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2015] [Accepted: 03/11/2016] [Indexed: 01/29/2023]
Abstract
In this work the idea was investigated whether calculated hydration energy (ΔGhyd ) can be used as a molecular descriptor in defining promising regions of chemical space for drug design. Calculating ΔGhyd using the Density Solvation Model (SMD) in conjunction with the density functional theory (DFT) gave an excellent correlation with experimental values. Furthermore, calculated ΔGhyd correlates reasonably well with experimental water solubility (r(2) =0.545) and also log P (r(2) =0.530). Three compound collections were used: Known drugs (n=150), drug-like compounds (n=100) and simple organic compounds (n=140). As an approximation only molecules, which do not de/protonate at physiological pH were considered. A relatively broad distribution was seen for the known drugs with an average at -15.3 kcal/mol and a standard deviation of 7.5 kcal/mol. Interestingly, much lower averages were found for the drug-like compounds (-7.5 kcal/mol) and the simple organic compounds (-3.1 kcal/mol) with tighter distributions; 4.3 and 3.2 kcal/mol, respectively. This trend was not observed for these collections when calculated log P and log S values were used. The considerable greater exothermic ΔGhyd average for the known drugs clearly indicates in order to develop a successful drug candidate value of ΔGhyd <-5 kcal/mol or less is preferable.
Collapse
Affiliation(s)
- Ayesha Zafar
- School of Chemical Sciences, University of Auckland, Private Bag 92019, Auckland 1142, New Zealand, Tel. +64-9-373-7599 ext. 83746, Fax. +64-9-373-7422
| | - Jóhannes Reynisson
- School of Chemical Sciences, University of Auckland, Private Bag 92019, Auckland 1142, New Zealand, Tel. +64-9-373-7599 ext. 83746, Fax. +64-9-373-7422.
| |
Collapse
|
14
|
Nieto-Draghi C, Fayet G, Creton B, Rozanska X, Rotureau P, de Hemptinne JC, Ungerer P, Rousseau B, Adamo C. A General Guidebook for the Theoretical Prediction of Physicochemical Properties of Chemicals for Regulatory Purposes. Chem Rev 2015; 115:13093-164. [PMID: 26624238 DOI: 10.1021/acs.chemrev.5b00215] [Citation(s) in RCA: 72] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Affiliation(s)
- Carlos Nieto-Draghi
- IFP Energies nouvelles , 1 et 4 avenue de Bois-Préau, 92852 Rueil-Malmaison, France
| | - Guillaume Fayet
- INERIS, Parc Technologique Alata, BP2 , 60550 Verneuil-en-Halatte, France
| | - Benoit Creton
- IFP Energies nouvelles , 1 et 4 avenue de Bois-Préau, 92852 Rueil-Malmaison, France
| | - Xavier Rozanska
- Materials Design S.A.R.L. , 18, rue de Saisset, 92120 Montrouge, France
| | - Patricia Rotureau
- INERIS, Parc Technologique Alata, BP2 , 60550 Verneuil-en-Halatte, France
| | | | - Philippe Ungerer
- Materials Design S.A.R.L. , 18, rue de Saisset, 92120 Montrouge, France
| | - Bernard Rousseau
- Laboratoire de Chimie-Physique, Université Paris Sud , UMR 8000 CNRS, Bât. 349, 91405 Orsay Cedex, France
| | - Carlo Adamo
- Institut de Recherche Chimie Paris, PSL Research University, CNRS, Chimie Paristech , 11 rue P. et M. Curie, F-75005 Paris, France.,Institut Universitaire de France , 103 Boulevard Saint Michel, F-75005 Paris, France
| |
Collapse
|
15
|
Prediction of hydration free energies for the SAMPL4 diverse set of compounds using molecular dynamics simulations with the OPLS-AA force field. J Comput Aided Mol Des 2014; 28:265-76. [PMID: 24557853 DOI: 10.1007/s10822-014-9727-1] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2013] [Accepted: 02/03/2014] [Indexed: 10/25/2022]
Abstract
All-atom molecular dynamics computer simulations were used to blindly predict the hydration free energies of a range of small molecules as part of the SAMPL4 challenge. Compounds were parametrized on the basis of the OPLS-AA force field using three different protocols for deriving partial charges: (1) using existing OPLS-AA atom types and charges with minor adjustments of partial charges on equivalent connecting atoms and derivation of new parameters for a number of distinct chemical groups (N-alkyl imidazole, nitrate) that were not present in the published force field; (2) calculation of quantum mechanical charges via geometry optimization, followed by electrostatic potential (ESP) fitting, using Jaguar at the LMP2/cc-pVTZ(-F) level; and (3) via geometry optimization and CHelpG charges (Gaussian09 at the HF/6-31G* level), followed by two-stage RESP fitting. The absolute hydration free energy was computed by an established protocol including alchemical free energy perturbation with thermodynamic integration. The use of standard OPLS-AA charges (protocol 1) with a number of newly parametrized charges and the use of histidine derived parameters for imidazole yielded an overall root mean square deviation of the prediction from the experimental data of 1.75 kcal/mol. The precision of our results appears to be mainly limited by relatively poor reproducibility of the Lennard-Jones contribution towards the solvation free energy, for which we observed large variability that could be traced to a strong dependence on the initial system conditions.
Collapse
|
16
|
Extended solvent-contact model approach to SAMPL4 blind prediction challenge for hydration free energies. J Comput Aided Mol Des 2014; 28:175-86. [DOI: 10.1007/s10822-014-9729-z] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2013] [Accepted: 02/10/2014] [Indexed: 10/25/2022]
|
17
|
König G, Pickard FC, Mei Y, Brooks BR. Predicting hydration free energies with a hybrid QM/MM approach: an evaluation of implicit and explicit solvation models in SAMPL4. J Comput Aided Mol Des 2014; 28:245-57. [PMID: 24504703 DOI: 10.1007/s10822-014-9708-4] [Citation(s) in RCA: 54] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2013] [Accepted: 01/11/2014] [Indexed: 12/14/2022]
Abstract
The correct representation of solute-water interactions is essential for the accurate simulation of most biological phenomena. Several highly accurate quantum methods are available to deal with solvation by using both implicit and explicit solvents. So far, however, most evaluations of those methods were based on a single conformation, which neglects solute entropy. Here, we present the first test of a novel approach to determine hydration free energies that uses molecular mechanics (MM) to sample phase space and quantum mechanics (QM) to evaluate the potential energies. Free energies are determined by using re-weighting with the Non-Boltzmann Bennett (NBB) method. In this context, the method is referred to as QM-NBB. Based on snapshots from MM sampling and accounting for their correct Boltzmann weight, it is possible to obtain hydration free energies that incorporate the effect of solute entropy. We evaluate the performance of several QM implicit solvent models, as well as explicit solvent QM/MM for the blind subset of the SAMPL4 hydration free energy challenge. While classical free energy simulations with molecular dynamics give root mean square deviations (RMSD) of 2.8 and 2.3 kcal/mol, the hybrid approach yields an improved RMSD of 1.6 kcal/mol. By selecting an appropriate functional and basis set, the RMSD can be reduced to 1 kcal/mol for calculations based on a single conformation. Results for a selected set of challenging molecules imply that this RMSD can be further reduced by using NBB to reweight MM trajectories with the SMD implicit solvent model.
Collapse
Affiliation(s)
- Gerhard König
- Laboratory of Computational Biology, National Institutes of Health, National Heart, Lung and Blood Institute, 5635 Fishers Lane, T-900 Suite, Rockville, MD, 20852, USA,
| | | | | | | |
Collapse
|
18
|
Bernadat G, Supuran CT, Iorga BI. Carbonic anhydrase binding site parameterization in OPLS-AA force field. Bioorg Med Chem 2012. [PMID: 23182217 DOI: 10.1016/j.bmc.2012.10.040] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
The parameterization of carbonic anhydrase binding site in OPLS-AA force field was performed using quantum chemistry calculations. Both OH2 and OH(-) forms of the binding site were considered, showing important differences in terms of atomic partial charges. Three different parameterization protocols were used, and the results obtained highlighted the importance of including an extended binding site in the charge calculation. The force field parameters were subsequently validated using standard molecular dynamics simulations. The results presented in this work should greatly facilitate the use of molecular dynamics simulations for studying the carbonic anhydrase, and more generally, the metallo-enzymes.
Collapse
Affiliation(s)
- Guillaume Bernadat
- Institut de Chimie des Substances Naturelles, CNRS UPR 2301, Labex LERMIT, Centre de Recherche de Gif, 1 Avenue de la Terrasse, F-91198 Gif-sur-Yvette, France
| | | | | |
Collapse
|
19
|
Combined receptor-based and ligand-based approach to delineate the mode of binding of guaianolide–endoperoxides to PfATP6. Bioorg Med Chem Lett 2012; 22:5410-4. [DOI: 10.1016/j.bmcl.2012.07.053] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2012] [Revised: 07/10/2012] [Accepted: 07/12/2012] [Indexed: 11/17/2022]
|
20
|
Owen MC, Tóth L, Jojárt B, Komáromi I, Csizmadia IG, Viskolcz B. The Effect of Newly Developed OPLS-AA Alanyl Radical Parameters on Peptide Secondary Structure. J Chem Theory Comput 2012; 8:2569-80. [DOI: 10.1021/ct300059f] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Affiliation(s)
- Michael C. Owen
- Materials Science Research Institute,
Faculty of Dentistry, Semmelweis University, Üllöi út 26. H-1085 Budapest, Hungary
- Department of Chemical
Informatics,
Faculty of Education, University of Szeged, Boldogasszony sgt. 6, H-6725 Szeged, Hungary
- Global Institute of Computational Molecular and Materials Science
- Drug Discovery Research Center
| | - László Tóth
- Thrombosis and Haemostasis Research
Group of the Hungarian Academy of Sciences at the University of Debrecen, H-4010, Hungary
| | - Balázs Jojárt
- Department of Chemical
Informatics,
Faculty of Education, University of Szeged, Boldogasszony sgt. 6, H-6725 Szeged, Hungary
- Drug Discovery Research Center
| | - István Komáromi
- Thrombosis and Haemostasis Research
Group of the Hungarian Academy of Sciences at the University of Debrecen, H-4010, Hungary
| | - Imre G. Csizmadia
- Materials Science Research Institute,
Faculty of Dentistry, Semmelweis University, Üllöi út 26. H-1085 Budapest, Hungary
- Department of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6,
Canada
- Department of Chemical
Informatics,
Faculty of Education, University of Szeged, Boldogasszony sgt. 6, H-6725 Szeged, Hungary
- Global Institute of Computational Molecular and Materials Science
- Drug Discovery Research Center
| | - Bela Viskolcz
- Department of Chemical
Informatics,
Faculty of Education, University of Szeged, Boldogasszony sgt. 6, H-6725 Szeged, Hungary
- Drug Discovery Research Center
| |
Collapse
|