1
|
Tolokh IS, Folescu DE, Onufriev AV. Inclusion of Water Multipoles into the Implicit Solvation Framework Leads to Accuracy Gains. J Phys Chem B 2024; 128:5855-5873. [PMID: 38860842 PMCID: PMC11194828 DOI: 10.1021/acs.jpcb.4c00254] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2024] [Revised: 05/28/2024] [Accepted: 05/29/2024] [Indexed: 06/12/2024]
Abstract
The current practical "workhorses" of the atomistic implicit solvation─the Poisson-Boltzmann (PB) and generalized Born (GB) models─face fundamental accuracy limitations. Here, we propose a computationally efficient implicit solvation framework, the Implicit Water Multipole GB (IWM-GB) model, that systematically incorporates the effects of multipole moments of water molecules in the first hydration shell of a solute, beyond the dipole water polarization already present at the PB/GB level. The framework explicitly accounts for coupling between polar and nonpolar contributions to the total solvation energy, which is missing from many implicit solvation models. An implementation of the framework, utilizing the GAFF force field and AM1-BCC atomic partial charges model, is parametrized and tested against the experimental hydration free energies of small molecules from the FreeSolv database. The resulting accuracy on the test set (RMSE ∼ 0.9 kcal/mol) is 12% better than that of the explicit solvation (TIP3P) treatment, which is orders of magnitude slower. We also find that the coupling between polar and nonpolar parts of the solvation free energy is essential to ensuring that several features of the IWM-GB model are physically meaningful, including the sign of the nonpolar contributions.
Collapse
Affiliation(s)
- Igor S. Tolokh
- Department
of Computer Science, Virginia Tech, Blacksburg, Virginia 24061, United States
| | - Dan E. Folescu
- Department
of Computer Science, Virginia Tech, Blacksburg, Virginia 24061, United States
- Department
of Mathematics, Virginia Tech, Blacksburg, Virginia 24061, United States
| | - Alexey V. Onufriev
- Department
of Computer Science, Virginia Tech, Blacksburg, Virginia 24061, United States
- Department
of Physics, Virginia Tech, Blacksburg, Virginia 24061, United States
- Center
for Soft Matter and Biological Physics, Virginia Tech, Blacksburg, Virginia 24061, United States
| |
Collapse
|
2
|
Folescu DE, Onufriev AV. A Closed-Form, Analytical Approximation for Apparent Surface Charge and Electric Field of Molecules. ACS OMEGA 2022; 7:26123-26136. [PMID: 35936397 PMCID: PMC9352323 DOI: 10.1021/acsomega.2c01484] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/12/2022] [Accepted: 06/10/2022] [Indexed: 06/15/2023]
Abstract
Closed-form, analytical approximations for electrostatic properties of molecules are of unique value as these can provide computational speed, versatility, and physical insight. Here, we have derived a simple, closed-form formula for the apparent surface charge (ASC) as well as for the electric field generated by a molecular charge distribution in aqueous solution. The approximation, with no fitted parameters, was tested against numerical solutions of the Poisson equation, where it has produced a significant speed-up. For neutral small molecules, the hydration free energies estimated from the closed-form ASC formula are within 0.8 kcal/mol RMSD from the numerical Poisson reference; the electric field at the surface is in quantitative agreement with the reference. Performance of the approximation was also tested on larger structures, including a protein, a DNA fragment, and a viral receptor-target complex. For all structures tested, a near-quantitative agreement with the numerical Poisson reference was achieved, except in regions of high negative curvature, where the new approximation is still qualitatively correct. A unique efficiency feature of the proposed "source-based″ closed-form approximation is that the ASC and electric field can be estimated individually at any point or surface patch, without the need to obtain the full global solution. An open-source software implementation of the method is available: https://people.cs.vt.edu/~onufriev/CODES/aasc.zip.
Collapse
Affiliation(s)
- Dan E. Folescu
- Department
of Computer Science, Virginia Tech, Blacksburg, Virginia 24061, United States
| | - Alexey V. Onufriev
- Department
of Computer Science, Virginia Tech, Blacksburg, Virginia 24061, United States
- Department
of Physics, Virginia Tech, Blacksburg, Virginia 24061, United States
- Center
for Soft Matter and Biological Physics, Virginia Tech, Blacksburg, Virginia 24061, United States
| |
Collapse
|
3
|
Fortuna A, Costa PJ. Optimized Halogen Atomic Radii for PBSA Calculations Using Off-Center Point Charges. J Chem Inf Model 2021; 61:3361-3375. [PMID: 34185532 DOI: 10.1021/acs.jcim.1c00177] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
In force-field methods, the usage of off-center point charges, also called extra points (EPs), is a common strategy to tackle the anisotropy of the electrostatic potential of covalently bonded halogens (X), thus allowing the description of halogen bonds (XBs) at the molecular mechanics/molecular dynamics (MM/MD) level. Diverse EP implementations exist in the literature differing on the charge sets and/or the X-EP distances. Poisson-Boltzmann and surface area (PBSA) calculations can be used to obtain solvation free energies (ΔGsolv) of small molecules, often to compute binding free energies (ΔGbind) at the MM-PBSA level. This method depends, among other parameters, on the empirical assignment of atomic radii (PB radii). Given the multiplicity of off-center point-charge models and the lack of specific PB radii for halogens compatible with such implementations, in this work, we assessed the performance of PBSA calculations for the estimation of ΔGsolv values in water (ΔGhyd), also conducting an optimization of the halogen PB radii (Cl, Br, and I) for each EP model. We not only expand the usage of EP models in the scope of the general AMBER force field (GAFF) but also provide the first optimized halogen PB radii in the context of the CHARMM general force field (CGenFF), thus contributing to improving the description of halogenated compounds in PBSA calculations.
Collapse
Affiliation(s)
- Andreia Fortuna
- BioISI-Biosystems & Integrative Sciences Institute, Faculty of Sciences, University of Lisboa, 1749-016 Lisboa, Portugal.,Research Institute for Medicines (iMed.ULisboa), Faculty of Pharmacy, University of Lisbon, Av. Professor Gama Pinto, 1649-003 Lisbon, Portugal
| | - Paulo J Costa
- BioISI-Biosystems & Integrative Sciences Institute, Faculty of Sciences, University of Lisboa, 1749-016 Lisboa, Portugal
| |
Collapse
|
4
|
Forouzesh N, Mishra N. An Effective MM/GBSA Protocol for Absolute Binding Free Energy Calculations: A Case Study on SARS-CoV-2 Spike Protein and the Human ACE2 Receptor. Molecules 2021; 26:2383. [PMID: 33923909 PMCID: PMC8074138 DOI: 10.3390/molecules26082383] [Citation(s) in RCA: 43] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2021] [Revised: 04/09/2021] [Accepted: 04/13/2021] [Indexed: 12/23/2022] Open
Abstract
The binding free energy calculation of protein-ligand complexes is necessary for research into virus-host interactions and the relevant applications in drug discovery. However, many current computational methods of such calculations are either inefficient or inaccurate in practice. Utilizing implicit solvent models in the molecular mechanics generalized Born surface area (MM/GBSA) framework allows for efficient calculations without significant loss of accuracy. Here, GBNSR6, a new flavor of the generalized Born model, is employed in the MM/GBSA framework for measuring the binding affinity between SARS-CoV-2 spike protein and the human ACE2 receptor. A computational protocol is developed based on the widely studied Ras-Raf complex, which has similar binding free energy to SARS-CoV-2/ACE2. Two options for representing the dielectric boundary of the complexes are evaluated: one based on the standard Bondi radii and the other based on a newly developed set of atomic radii (OPT1), optimized specifically for protein-ligand binding. Predictions based on the two radii sets provide upper and lower bounds on the experimental references: -14.7(ΔGbindBondi)<-10.6(ΔGbindExp.)<-4.1(ΔGbindOPT1) kcal/mol. The consensus estimates of the two bounds show quantitative agreement with the experiment values. This work also presents a novel truncation method and computational strategies for efficient entropy calculations with normal mode analysis. Interestingly, it is observed that a significant decrease in the number of snapshots does not affect the accuracy of entropy calculation, while it does lower computation time appreciably. The proposed MM/GBSA protocol can be used to study the binding mechanism of new variants of SARS-CoV-2, as well as other relevant structures.
Collapse
Affiliation(s)
- Negin Forouzesh
- Department of Computer Science, California State University, Los Angeles, CA 90032, USA
| | - Nikita Mishra
- Department of Chemistry and Biochemistry, California State University, Los Angeles, CA 90032, USA;
| |
Collapse
|
5
|
Herbert JM. Dielectric continuum methods for quantum chemistry. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2021. [DOI: 10.1002/wcms.1519] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Affiliation(s)
- John M. Herbert
- Department of Chemistry and Biochemistry The Ohio State University Columbus Ohio USA
| |
Collapse
|
6
|
Forouzesh N, Mukhopadhyay A, Watson LT, Onufriev AV. Multidimensional Global Optimization and Robustness Analysis in the Context of Protein-Ligand Binding. J Chem Theory Comput 2020; 16:4669-4684. [PMID: 32450041 PMCID: PMC8594251 DOI: 10.1021/acs.jctc.0c00142] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Accuracy of protein-ligand binding free energy calculations utilizing implicit solvent models is critically affected by parameters of the underlying dielectric boundary, specifically, the atomic and water probe radii. Here, a global multidimensional optimization pipeline is developed to find optimal atomic radii specifically for protein-ligand binding calculations in implicit solvent. The computational pipeline has these three key components: (1) a massively parallel implementation of a deterministic global optimization algorithm (VTDIRECT95), (2) an accurate yet reasonably fast generalized Born implicit solvent model (GBNSR6), and (3) a novel robustness metric that helps distinguish between nearly degenerate local minima via a postprocessing step of the optimization. A graph-based "kT-connectivity" approach to explore and visualize the multidimensional energy landscape is proposed: local minima that can be reached from the global minimum without exceeding a given energy threshold (kT) are considered to be connected. As an illustration of the capabilities of the optimization pipeline, we apply it to find a global optimum in the space of just five radii: four atomic (O, H, N, and C) radii and water probe radius. The optimized radii, ρW = 1.37 Å, ρC = 1.40 Å, ρH = 1.55 Å, ρN = 2.35 Å, and ρO = 1.28 Å, lead to a closer agreement of electrostatic binding free energies with the explicit solvent reference than two commonly used sets of radii previously optimized for small molecules. At the same time, the ability of the optimizer to find the global optimum reveals fundamental limits of the common two-dielectric implicit solvation model: the computed electrostatic binding free energies are still almost 4 kcal/mol away from the explicit solvent reference. The proposed computational approach opens the possibility to further improve the accuracy of practical computational protocols for binding free energy calculations.
Collapse
Affiliation(s)
- Negin Forouzesh
- Department of Computer Science, Virginia Polytechnic Institute & State University, Blacksburg, Virginia 24061, United States
| | - Abhishek Mukhopadhyay
- Department of Physics, Virginia Polytechnic Institute & State University, Blacksburg, Virginia 24061, United States
| | - Layne T Watson
- Department of Computer Science, Virginia Polytechnic Institute & State University, Blacksburg, Virginia 24061, United States
- Department of Mathematics, Virginia Polytechnic Institute & State University, Blacksburg, Virginia 24061, United States
- Department of Aerospace and Ocean Engineering, Virginia Polytechnic Institute & State University, Blacksburg, Virginia 24061, United States
- Center for Soft Matter and Biological Physics, Virginia Polytechnic Institute & State University, Blacksburg, Virginia 24061, United States
| | - Alexey V Onufriev
- Department of Computer Science, Virginia Polytechnic Institute & State University, Blacksburg, Virginia 24061, United States
- Department of Physics, Virginia Polytechnic Institute & State University, Blacksburg, Virginia 24061, United States
- Center for Soft Matter and Biological Physics, Virginia Polytechnic Institute & State University, Blacksburg, Virginia 24061, United States
| |
Collapse
|
7
|
Lange AW, Herbert JM, Albrecht BJ, You ZQ. Intrinsically smooth discretisation of Connolly's solvent-excluded molecular surface. Mol Phys 2019. [DOI: 10.1080/00268976.2019.1644384] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Affiliation(s)
- Adrian W. Lange
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, OH, USA
| | - John M. Herbert
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, OH, USA
| | - Benjamin J. Albrecht
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, OH, USA
| | - Zhi-Qiang You
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, OH, USA
| |
Collapse
|
8
|
Nunes R, Vila-Viçosa D, Costa PJ. Tackling Halogenated Species with PBSA: Effect of Emulating the σ-Hole. J Chem Theory Comput 2019; 15:4241-4251. [PMID: 31142112 DOI: 10.1021/acs.jctc.9b00106] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
To model halogen-bond phenomena using classical force fields, an extra point (EP) of charge is frequently introduced at a given distance from the halogen (X) to emulate the σ-hole. The resulting molecular dynamics (MD) trajectories can be used in subsequent molecular mechanics (MM) combined with Poisson-Boltzmann and surface area calculations (PBSA) to estimate protein-ligand binding free energies (Δ Gbind). While EP addition improves the MM/MD description of halogen-containing systems, its effect on the calculation of solvation free energies (Δ Gsolv) using the PBSA approach is yet to be assessed. As the PBSA calculations depend, among other parameters, on the empirical assignment of radii (PB radii), a problematic issue arises, since standard halogen radii are smaller than the typical X···EP distances, thus placing the EP within the solvent dielectric. Herein, we took a common literature EP parametrization scheme, which uses X···EP = Rmin and RESP charges in the context of GAFF, and performed a comprehensive study on the performance of PBSA (using three different setups) in the calculation of Δ Gsolv values for 142 halogenated compounds (bearing Cl, Br, or I) for which the experimental values are known. By conducting an optimization (minimizing the error against experimental values), we provide a new optimized set of halogen PB radii, for each PBSA setup, that should be used in the context of the aforementioned scenario. A simultaneous optimization of PB radii and X···EP distances shows that a wide range of distance/radius pairs can be used without significant loss of accuracy, therefore laying the basis for expanding this halogen radii optimization strategy to other force fields and EP implementations. As ligand Δ Gsolv estimation is an important term in the determination of protein-ligand Δ Gbind, this work is particularly relevant in the framework of structure-based virtual screening and related computer-aided drug design routines.
Collapse
|
9
|
Abstract
It would often be useful in computer simulations to use an implicit description of solvation effects, instead of explicitly representing the individual solvent molecules. Continuum dielectric models often work well in describing the thermodynamic aspects of aqueous solvation and can be very efficient compared to the explicit treatment of the solvent. Here, we review a particular class of so-called fast implicit solvent models, generalized Born (GB) models, which are widely used for molecular dynamics (MD) simulations of proteins and nucleic acids. These approaches model hydration effects and provide solvent-dependent forces with efficiencies comparable to molecular-mechanics calculations on the solute alone; as such, they can be incorporated into MD or other conformational searching strategies in a straightforward manner. The foundations of the GB model are reviewed, followed by examples of newer, emerging models and examples of important applications. We discuss their strengths and weaknesses, both for fidelity to the underlying continuum model and for the ability to replace explicit consideration of solvent molecules in macromolecular simulations.
Collapse
Affiliation(s)
- Alexey V Onufriev
- Departments of Computer Science and Physics, Center for Soft Matter and Biological Physics, Virginia Tech, Blacksburg, Virginia 24060, USA;
| | - David A Case
- Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey 08854, USA;
| |
Collapse
|
10
|
Tolokh IS, Thomas DG, Onufriev AV. Explicit ions/implicit water generalized Born model for nucleic acids. J Chem Phys 2018; 148:195101. [PMID: 30307229 DOI: 10.1063/1.5027260] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The ion atmosphere around highly charged nucleic acid molecules plays a significant role in their dynamics, structure, and interactions. Here we utilized the implicit solvent framework to develop a model for the explicit treatment of ions interacting with nucleic acid molecules. The proposed explicit ions/implicit water model is based on a significantly modified generalized Born (GB) model and utilizes a non-standard approach to define the solute/solvent dielectric boundary. Specifically, the model includes modifications to the GB interaction terms for the case of multiple interacting solutes-disconnected dielectric boundary around the solute-ion or ion-ion pairs. A fully analytical description of all energy components for charge-charge interactions is provided. The effectiveness of the approach is demonstrated by calculating the potential of mean force for Na+-Cl- ion pair and by carrying out a set of Monte Carlo (MC) simulations of mono- and trivalent ions interacting with DNA and RNA duplexes. The monovalent (Na+) and trivalent (CoHex3+) counterion distributions predicted by the model are in close quantitative agreement with all-atom explicit water molecular dynamics simulations used as reference. Expressed in the units of energy, the maximum deviations of local ion concentrations from the reference are within k B T. The proposed explicit ions/implicit water GB model is able to resolve subtle features and differences of CoHex distributions around DNA and RNA duplexes. These features include preferential CoHex binding inside the major groove of the RNA duplex, in contrast to CoHex biding at the "external" surface of the sugar-phosphate backbone of the DNA duplex; these differences in the counterion binding patters were earlier shown to be responsible for the observed drastic differences in condensation propensities between short DNA and RNA duplexes. MC simulations of CoHex ions interacting with the homopolymeric poly(dA·dT) DNA duplex with modified (de-methylated) and native thymine bases are used to explore the physics behind CoHex-thymine interactions. The simulations suggest that the ion desolvation penalty due to proximity to the low dielectric volume of the methyl group can contribute significantly to CoHex-thymine interactions. Compared to the steric repulsion between the ion and the methyl group, the desolvation penalty interaction has a longer range and may be important to consider in the context of methylation effects on DNA condensation.
Collapse
Affiliation(s)
- Igor S Tolokh
- Department of Computer Science, Virginia Tech, Blacksburg, Virginia 24061, USA
| | - Dennis G Thomas
- Computational Biology, Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352, USA
| | - Alexey V Onufriev
- Departments of Computer Science and Physics, Center for Soft Matter and Biological Physics, Virginia Tech, Blacksburg, Virginia 24061, USA
| |
Collapse
|
11
|
Coons MP, Herbert JM. Quantum chemistry in arbitrary dielectric environments: Theory and implementation of nonequilibrium Poisson boundary conditions and application to compute vertical ionization energies at the air/water interface. J Chem Phys 2018; 148:222834. [DOI: 10.1063/1.5023916] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open
Affiliation(s)
- Marc P. Coons
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, Ohio 43210, USA
| | - John M. Herbert
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, Ohio 43210, USA
| |
Collapse
|
12
|
Izadi S, Harris RC, Fenley MO, Onufriev AV. Accuracy Comparison of Generalized Born Models in the Calculation of Electrostatic Binding Free Energies. J Chem Theory Comput 2018; 14:1656-1670. [PMID: 29378399 DOI: 10.1021/acs.jctc.7b00886] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
The need for accurate yet efficient representation of the aqueous environment in biomolecular modeling has led to the development of a variety of generalized Born (GB) implicit solvent models. While many studies have focused on the accuracy of available GB models in predicting solvation free energies, a systematic assessment of the quality of these models in binding free energy calculations, crucial for rational drug design, has not been undertaken. Here, we evaluate the accuracies of eight common GB flavors (GB-HCT, GB-OBC, GB-neck2, GBNSR6, GBSW, GBMV1, GBMV2, and GBMV3), available in major molecular dynamics packages, in predicting the electrostatic binding free energies ( ΔΔ Gel) for a diverse set of 60 biomolecular complexes belonging to four main classes: protein-protein, protein-drug, RNA-peptide, and small complexes. The GB flavors are examined in terms of their ability to reproduce the results from the Poisson-Boltzmann (PB) model, commonly used as accuracy reference in this context. We show that the agreement with the PB of ΔΔ Gel estimates varies widely between different GB models and also across different types of biomolecular complexes, with R2 correlations ranging from 0.3772 to 0.9986. A surface-based "R6" GB model recently implemented in AMBER shows the closest overall agreement with reference PB ( R2 = 0.9949, RMSD = 8.75 kcal/mol). The RNA-peptide and protein-drug complex sets appear to be most challenging for all but one model, as indicated by the large deviations from the PB in ΔΔ Gel. Small neutral complexes present the least challenge for most of the GB models tested. The quantitative demonstration of the strengths and weaknesses of the GB models across the diverse complex types provided here can be used as a guide for practical computations and future development efforts.
Collapse
Affiliation(s)
- Saeed Izadi
- Early Stage Pharmaceutical Development , Genentech Inc. , 1 DNA Way , South San Francisco , California 94080 , United States
| | - Robert C Harris
- Department of Pharmaceutical Sciences , University of Maryland School of Pharmacy , Baltimore , Maryland 21201 , United States
| | - Marcia O Fenley
- Institute of Molecular Biophysics , Florida State University , Tallahassee , Florida 32306-3408 , United States
| | | |
Collapse
|
13
|
Chakravorty A, Jia Z, Li L, Zhao S, Alexov E. Reproducing the Ensemble Average Polar Solvation Energy of a Protein from a Single Structure: Gaussian-Based Smooth Dielectric Function for Macromolecular Modeling. J Chem Theory Comput 2018; 14:1020-1032. [PMID: 29350933 PMCID: PMC9885857 DOI: 10.1021/acs.jctc.7b00756] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Typically, the ensemble average polar component of solvation energy (ΔGpolarsolv) of a macromolecule is computed using molecular dynamics (MD) or Monte Carlo (MC) simulations to generate conformational ensemble and then single/rigid conformation solvation energy calculation is performed on each snapshot. The primary objective of this work is to demonstrate that Poisson-Boltzmann (PB)-based approach using a Gaussian-based smooth dielectric function for macromolecular modeling previously developed by us (Li et al. J. Chem. Theory Comput. 2013, 9 (4), 2126-2136) can reproduce that ensemble average (ΔGpolarsolv) of a protein from a single structure. We show that the Gaussian-based dielectric model reproduces the ensemble average ΔGpolarsolv(⟨ΔGpolarsolv⟩) from an energy-minimized structure of a protein regardless of the minimization environment (structure minimized in vacuo, implicit or explicit waters, or crystal structure); the best case, however, is when it is paired with an in vacuo-minimized structure. In other minimization environments (implicit or explicit waters or crystal structure), the traditional two-dielectric model can still be selected with which the model produces correct solvation energies. Our observations from this work reflect how the ability to appropriately mimic the motion of residues, especially the salt bridge residues, influences a dielectric model's ability to reproduce the ensemble average value of polar solvation free energy from a single in vacuo-minimized structure.
Collapse
Affiliation(s)
- Arghya Chakravorty
- Computational Biophysics and Bioinformatics, Department of Physics and Astronomy, Clemson University, Clemson, South Carolina 29634, USA
| | - Zhe Jia
- Computational Biophysics and Bioinformatics, Department of Physics and Astronomy, Clemson University, Clemson, South Carolina 29634, USA
| | - Lin Li
- Computational Biophysics and Bioinformatics, Department of Physics and Astronomy, Clemson University, Clemson, South Carolina 29634, USA
| | - Shan Zhao
- Departement of Mathematics, College of Arts and Sciences, University of Alabama, Tuscaloosa, Alabama 35487, USA
| | - Emil Alexov
- Computational Biophysics and Bioinformatics, Department of Physics and Astronomy, Clemson University, Clemson, South Carolina 29634, USA.,Corresponding Author Department of Physics and Astronomy, Clemson University, Clemson, South Carolina 29634, USA.
| |
Collapse
|
14
|
Zhou HX, Pang X. Electrostatic Interactions in Protein Structure, Folding, Binding, and Condensation. Chem Rev 2018; 118:1691-1741. [PMID: 29319301 DOI: 10.1021/acs.chemrev.7b00305] [Citation(s) in RCA: 454] [Impact Index Per Article: 75.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Charged and polar groups, through forming ion pairs, hydrogen bonds, and other less specific electrostatic interactions, impart important properties to proteins. Modulation of the charges on the amino acids, e.g., by pH and by phosphorylation and dephosphorylation, have significant effects such as protein denaturation and switch-like response of signal transduction networks. This review aims to present a unifying theme among the various effects of protein charges and polar groups. Simple models will be used to illustrate basic ideas about electrostatic interactions in proteins, and these ideas in turn will be used to elucidate the roles of electrostatic interactions in protein structure, folding, binding, condensation, and related biological functions. In particular, we will examine how charged side chains are spatially distributed in various types of proteins and how electrostatic interactions affect thermodynamic and kinetic properties of proteins. Our hope is to capture both important historical developments and recent experimental and theoretical advances in quantifying electrostatic contributions of proteins.
Collapse
Affiliation(s)
- Huan-Xiang Zhou
- Department of Chemistry and Department of Physics, University of Illinois at Chicago , Chicago, Illinois 60607, United States.,Department of Physics and Institute of Molecular Biophysics, Florida State University , Tallahassee, Florida 32306, United States
| | - Xiaodong Pang
- Department of Physics and Institute of Molecular Biophysics, Florida State University , Tallahassee, Florida 32306, United States
| |
Collapse
|
15
|
Onufriev AV, Izadi S. Water models for biomolecular simulations. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2017. [DOI: 10.1002/wcms.1347] [Citation(s) in RCA: 94] [Impact Index Per Article: 13.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Affiliation(s)
- Alexey V. Onufriev
- Department of Physics; Virginia Tech; Blacksburg VA USA
- Department of Computer Science; Virginia Tech; Blacksburg VA USA
- Center for Soft Matter and Biological Physics; Virginia Tech; Blacksburg VA USA
| | - Saeed Izadi
- Early Stage Pharmaceutical Development; Genentech Inc.; South San Francisco, CA USA
| |
Collapse
|
16
|
Brini E, Fennell CJ, Fernandez-Serra M, Hribar-Lee B, Lukšič M, Dill KA. How Water's Properties Are Encoded in Its Molecular Structure and Energies. Chem Rev 2017; 117:12385-12414. [PMID: 28949513 PMCID: PMC5639468 DOI: 10.1021/acs.chemrev.7b00259] [Citation(s) in RCA: 194] [Impact Index Per Article: 27.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2017] [Indexed: 11/29/2022]
Abstract
How are water's material properties encoded within the structure of the water molecule? This is pertinent to understanding Earth's living systems, its materials, its geochemistry and geophysics, and a broad spectrum of its industrial chemistry. Water has distinctive liquid and solid properties: It is highly cohesive. It has volumetric anomalies-water's solid (ice) floats on its liquid; pressure can melt the solid rather than freezing the liquid; heating can shrink the liquid. It has more solid phases than other materials. Its supercooled liquid has divergent thermodynamic response functions. Its glassy state is neither fragile nor strong. Its component ions-hydroxide and protons-diffuse much faster than other ions. Aqueous solvation of ions or oils entails large entropies and heat capacities. We review how these properties are encoded within water's molecular structure and energies, as understood from theories, simulations, and experiments. Like simpler liquids, water molecules are nearly spherical and interact with each other through van der Waals forces. Unlike simpler liquids, water's orientation-dependent hydrogen bonding leads to open tetrahedral cage-like structuring that contributes to its remarkable volumetric and thermal properties.
Collapse
Affiliation(s)
- Emiliano Brini
- Laufer
Center for Physical and Quantitative Biology, Department of Physics and Astronomy, and Department of
Chemistry, Stony Brook University, Stony Brook, New York 11794, United States
| | - Christopher J. Fennell
- Department
of Chemistry, Oklahoma State University, Stillwater, Oklahoma 74078, United States
| | - Marivi Fernandez-Serra
- Laufer
Center for Physical and Quantitative Biology, Department of Physics and Astronomy, and Department of
Chemistry, Stony Brook University, Stony Brook, New York 11794, United States
| | - Barbara Hribar-Lee
- Faculty
of Chemistry and Chemical Technology, University
of Ljubljana, Večna
pot 113, SI-1000 Ljubljana, Slovenia
| | - Miha Lukšič
- Faculty
of Chemistry and Chemical Technology, University
of Ljubljana, Večna
pot 113, SI-1000 Ljubljana, Slovenia
| | - Ken A. Dill
- Laufer
Center for Physical and Quantitative Biology, Department of Physics and Astronomy, and Department of
Chemistry, Stony Brook University, Stony Brook, New York 11794, United States
| |
Collapse
|
17
|
Forouzesh N, Izadi S, Onufriev AV. Grid-Based Surface Generalized Born Model for Calculation of Electrostatic Binding Free Energies. J Chem Inf Model 2017; 57:2505-2513. [DOI: 10.1021/acs.jcim.7b00192] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
| | - Saeed Izadi
- Early Stage Pharmaceutical
Development, Genentech Inc., 1 DNA
Way, South San Francisco, California 94080, United States
| | - Alexey V. Onufriev
- Center
for Soft Matter and Biological Physics, Virginia Polytechnic Institute and State University, Blacksburg, Virginia 24061, United States
| |
Collapse
|
18
|
Setny P, Dudek A. Explicit Solvent Hydration Benchmark for Proteins with Application to the PBSA Method. J Chem Theory Comput 2017; 13:2762-2776. [PMID: 28498675 DOI: 10.1021/acs.jctc.7b00247] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Explicit and implicit solvent models have a proven record of delivering hydration free energies of small, druglike solutes in reasonable agreement with experiment. Hydration of macromolecules, such as proteins, is to a large extent uncharted territory, with few results shedding light on quantitative consistency between different solvent models, let alone their ability to reproduce real water. In this work, based on extensive explicit solvent simulations employing TIP3P and SPC/E water models we analyze hydration free energy changes between fixed conformations of 5 diverse proteins, including large multidomain structures. For the two solvent models we find better agreement in electrostatic rather than nonpolar contributions (RMSE of 2.3 and 2.7 kcal/mol, respectively), even though absolute values of the latter are typically an order of magnitude smaller. We also highlight the importance of finite size corrections to relative protein hydration free energies, which turn out to be rather large, on the order of several kcal/mol, and are necessary for proper interpretation of results obtained under periodic boundary conditions. We further compare gathered data with predictions of the implicit solvent approach based on the Poisson equation and the surface or volume based nonpolar term. We find definitely lesser consistency than between the two explicit models (RMSE between implicit and TIP3 results of 11.3 and 8.4 kcal/mol for electrostatic and nonpolar contributions, respectively). In the process we determine the value of the protein dielectric constant and the geometric model for the dielectric boundary that provide for the best agreement. Finally, we evaluate the usefulness of surface and volume based models of nonpolar contributions to hydration free energy of large biomolecules.
Collapse
Affiliation(s)
- Piotr Setny
- Centre of New Technologies, University of Warsaw , Banacha 2c, 02-097 Warsaw, Poland
| | - Anita Dudek
- Centre of New Technologies, University of Warsaw , Banacha 2c, 02-097 Warsaw, Poland
| |
Collapse
|
19
|
Katkova EV, Onufriev AV, Aguilar B, Sulimov VB. Accuracy comparison of several common implicit solvent models and their implementations in the context of protein-ligand binding. J Mol Graph Model 2016; 72:70-80. [PMID: 28064081 DOI: 10.1016/j.jmgm.2016.12.011] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2016] [Revised: 12/07/2016] [Accepted: 12/15/2016] [Indexed: 11/18/2022]
Abstract
In this study several commonly used implicit solvent models are compared with respect to their accuracy of estimating solvation energies of small molecules and proteins, as well as desolvation penalty in protein-ligand binding. The test set consists of 19 small proteins, 104 small molecules, and 15 protein-ligand complexes. We compared predicted hydration energies of small molecules with their experimental values; the results of the solvation and desolvation energy calculations for small molecules, proteins and protein-ligand complexes in water were also compared with Thermodynamic Integration calculations based on TIP3P water model and Amber12 force field. The following implicit solvent (water) models considered here are: PCM (Polarized Continuum Model implemented in DISOLV and MCBHSOLV programs), GB (Generalized Born method implemented in DISOLV program, S-GB, and GBNSR6 stand-alone version), COSMO (COnductor-like Screening Model implemented in the DISOLV program and the MOPAC package) and the Poisson-Boltzmann model (implemented in the APBS program). Different parameterizations of the molecules were examined: we compared MMFF94 force field, Amber12 force field and the quantum-chemical semi-empirical PM7 method implemented in the MOPAC package. For small molecules, all of the implicit solvent models tested here yield high correlation coefficients (0.87-0.93) between the calculated solvation energies and the experimental values of hydration energies. For small molecules high correlation (0.82-0.97) with the explicit solvent energies is seen as well. On the other hand, estimated protein solvation energies and protein-ligand binding desolvation energies show substantial discrepancy (up to 10kcal/mol) with the explicit solvent reference. The correlation of polar protein solvation energies and protein-ligand desolvation energies with the corresponding explicit solvent results is 0.65-0.99 and 0.76-0.96 respectively, though this difference in correlations is caused more by different parameterization and less by methods and indicates the need for further improvement of implicit solvent models parameterization. Within the same parameterization, various implicit methods give practically the same correlation with results obtained in explicit solvent model for ligands and proteins: e.g. correlation values of polar ligand solvation energies and the corresponding energies in the frame of explicit solvent were 0.953-0.966 for the APBS program, the GBNSR6 program and all models used in the DISOLV program. The DISOLV program proved to be on a par with the other used programs in the case of proteins and ligands solvation energy calculation. However, the solution of the Poisson-Boltzmann equation (APBS program) and Generalized Born method (implemented in the GBNSR6 program) proved to be the most accurate in calculating the desolvation energies of complexes.
Collapse
Affiliation(s)
- E V Katkova
- Dimonta, Ltd., Nagornaya Street 15, Bldg 8, Moscow, 117186, Russia; Research Computer Center, Lomonosov Moscow State University, Leninskie Gory 1,Bldg 4, Moscow, 119992, Russia.
| | - A V Onufriev
- Departments of Computer Science and Physics, Center for Soft Matter and Biological Physics, Virginia Tech, Blacksburg, VA, USA
| | - B Aguilar
- Institute for Systems Biology, Seattle, WA, USA
| | - V B Sulimov
- Dimonta, Ltd., Nagornaya Street 15, Bldg 8, Moscow, 117186, Russia; Research Computer Center, Lomonosov Moscow State University, Leninskie Gory 1,Bldg 4, Moscow, 119992, Russia
| |
Collapse
|
20
|
Izadi S, Aguilar B, Onufriev AV. Protein-Ligand Electrostatic Binding Free Energies from Explicit and Implicit Solvation. J Chem Theory Comput 2015; 11:4450-9. [PMID: 26575935 PMCID: PMC5217485 DOI: 10.1021/acs.jctc.5b00483] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Accurate yet efficient computational models of solvent environment are central for most calculations that rely on atomistic modeling, such as prediction of protein-ligand binding affinities. In this study, we evaluate the accuracy of a recently developed generalized Born implicit solvent model, GBNSR6 (Aguilar et al. J. Chem. Theory Comput. 2010, 6, 3613-3639), in estimating the electrostatic solvation free energies (ΔG(pol)) and binding free energies (ΔΔG(pol)) for small protein-ligand complexes. We also compare estimates based on three different explicit solvent models (TIP3P, TIP4PEw, and OPC). The two main findings are as follows. First, the deviation (RMSD = 7.04 kcal/mol) of GBNSR6 binding affinities from commonly used TIP3P reference values is comparable to the deviations between explicit models themselves, e.g. TIP4PEw vs TIP3P (RMSD = 5.30 kcal/mol). A simple uniform adjustment of the atomic radii by a single scaling factor reduces the RMS deviation of GBNSR6 from TIP3P to within the above "error margin" - differences between ΔΔG(pol) estimated by different common explicit solvent models. The simple radii scaling virtually eliminates the systematic deviation (ΔΔG(pol)) between GBNSR6 and two out of the three explicit water models and significantly reduces the deviation from the third explicit model. Second, the differences between electrostatic binding energy estimates from different explicit models is disturbingly large; for example, the deviation between TIP4PEw and TIP3P estimates of ΔΔG(pol) values can be up to ∼50% or ∼9 kcal/mol, which is significantly larger than the "chemical accuracy" goal of ∼1 kcal/mol. The absolute ΔG(pol) calculated with different explicit models could differ by tens of kcal/mol. These discrepancies point to unacceptably high sensitivity of binding affinity estimates to the choice of common explicit water models. The absence of a clear "gold standard" among these models strengthens the case for the use of accurate implicit solvation models for binding energetics, which may be orders of magnitude faster.
Collapse
Affiliation(s)
- Saeed Izadi
- Department of Biomedical Engineering and Mechanics, Department of Computer Science, and Departments of Computer Science and Physics, Virginia Tech , Blacksburg, Virginia 24060, United States
| | - Boris Aguilar
- Department of Biomedical Engineering and Mechanics, Department of Computer Science, and Departments of Computer Science and Physics, Virginia Tech , Blacksburg, Virginia 24060, United States
| | - Alexey V Onufriev
- Department of Biomedical Engineering and Mechanics, Department of Computer Science, and Departments of Computer Science and Physics, Virginia Tech , Blacksburg, Virginia 24060, United States
| |
Collapse
|
21
|
Li L, Wang L, Alexov E. On the energy components governing molecular recognition in the framework of continuum approaches. Front Mol Biosci 2015; 2:5. [PMID: 25988173 PMCID: PMC4429657 DOI: 10.3389/fmolb.2015.00005] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2014] [Accepted: 02/04/2015] [Indexed: 01/14/2023] Open
Abstract
Molecular recognition is a process that brings together several biological macromolecules to form a complex and one of the most important characteristics of the process is the binding free energy. Various approaches exist to model the binding free energy, provided the knowledge of the 3D structures of bound and unbound molecules. Among them, continuum approaches are quite appealing due to their computational efficiency while at the same time providing predictions with reasonable accuracy. Here we review recent developments in the field emphasizing on the importance of adopting adequate description of physical processes taking place upon the binding. In particular, we focus on the efforts aiming at capturing some of the atomistic details of the binding phenomena into the continuum framework. When possible, the energy components are reviewed independently of each other. However, it is pointed out that rigorous approaches should consider all energy contributions on the same footage. The two major schemes for utilizing the individual energy components to predict binding affinity are outlined as well.
Collapse
Affiliation(s)
- Lin Li
- Computational Biophysics and Bioinformatics, Department of Physics, Clemson University Clemson, SC, USA
| | - Lin Wang
- Computational Biophysics and Bioinformatics, Department of Physics, Clemson University Clemson, SC, USA
| | - Emil Alexov
- Computational Biophysics and Bioinformatics, Department of Physics, Clemson University Clemson, SC, USA
| |
Collapse
|
22
|
Alexov E, Luo R. Editorial. JOURNAL OF THEORETICAL & COMPUTATIONAL CHEMISTRY 2014. [DOI: 10.1142/s0219633614020015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Affiliation(s)
- Emil Alexov
- Computational Biophysics and Bioinformatics, Department of Physics, Clemson University, Clemson, SC 29634, USA
| | - Ray Luo
- Department of Biomedical Engineering, Department of Molecular Biology and Biochemistry, University of California, Irvine, CA 92697, USA
| |
Collapse
|