1
|
Tolokh IS, Folescu DE, Onufriev AV. Inclusion of Water Multipoles into the Implicit Solvation Framework Leads to Accuracy Gains. J Phys Chem B 2024; 128:5855-5873. [PMID: 38860842 PMCID: PMC11194828 DOI: 10.1021/acs.jpcb.4c00254] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2024] [Revised: 05/28/2024] [Accepted: 05/29/2024] [Indexed: 06/12/2024]
Abstract
The current practical "workhorses" of the atomistic implicit solvation─the Poisson-Boltzmann (PB) and generalized Born (GB) models─face fundamental accuracy limitations. Here, we propose a computationally efficient implicit solvation framework, the Implicit Water Multipole GB (IWM-GB) model, that systematically incorporates the effects of multipole moments of water molecules in the first hydration shell of a solute, beyond the dipole water polarization already present at the PB/GB level. The framework explicitly accounts for coupling between polar and nonpolar contributions to the total solvation energy, which is missing from many implicit solvation models. An implementation of the framework, utilizing the GAFF force field and AM1-BCC atomic partial charges model, is parametrized and tested against the experimental hydration free energies of small molecules from the FreeSolv database. The resulting accuracy on the test set (RMSE ∼ 0.9 kcal/mol) is 12% better than that of the explicit solvation (TIP3P) treatment, which is orders of magnitude slower. We also find that the coupling between polar and nonpolar parts of the solvation free energy is essential to ensuring that several features of the IWM-GB model are physically meaningful, including the sign of the nonpolar contributions.
Collapse
Affiliation(s)
- Igor S. Tolokh
- Department
of Computer Science, Virginia Tech, Blacksburg, Virginia 24061, United States
| | - Dan E. Folescu
- Department
of Computer Science, Virginia Tech, Blacksburg, Virginia 24061, United States
- Department
of Mathematics, Virginia Tech, Blacksburg, Virginia 24061, United States
| | - Alexey V. Onufriev
- Department
of Computer Science, Virginia Tech, Blacksburg, Virginia 24061, United States
- Department
of Physics, Virginia Tech, Blacksburg, Virginia 24061, United States
- Center
for Soft Matter and Biological Physics, Virginia Tech, Blacksburg, Virginia 24061, United States
| |
Collapse
|
2
|
Bass L, Elder LH, Folescu DE, Forouzesh N, Tolokh IS, Karpatne A, Onufriev AV. Improving the Accuracy of Physics-Based Hydration-Free Energy Predictions by Machine Learning the Remaining Error Relative to the Experiment. J Chem Theory Comput 2024; 20:396-410. [PMID: 38149593 PMCID: PMC10950260 DOI: 10.1021/acs.jctc.3c00981] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2023]
Abstract
The accuracy of computational models of water is key to atomistic simulations of biomolecules. We propose a computationally efficient way to improve the accuracy of the prediction of hydration-free energies (HFEs) of small molecules: the remaining errors of the physics-based models relative to the experiment are predicted and mitigated by machine learning (ML) as a postprocessing step. Specifically, the trained graph convolutional neural network attempts to identify the "blind spots" in the physics-based model predictions, where the complex physics of aqueous solvation is poorly accounted for, and partially corrects for them. The strategy is explored for five classical solvent models representing various accuracy/speed trade-offs, from the fast analytical generalized Born (GB) to the popular TIP3P explicit solvent model; experimental HFEs of small neutral molecules from the FreeSolv set are used for the training and testing. For all of the models, the ML correction reduces the resulting root-mean-square error relative to the experiment for HFEs of small molecules, without significant overfitting and with negligible computational overhead. For example, on the test set, the relative accuracy improvement is 47% for the fast analytical GB, making it, after the ML correction, almost as accurate as uncorrected TIP3P. For the TIP3P model, the accuracy improvement is about 39%, bringing the ML-corrected model's accuracy below the 1 kcal/mol threshold. In general, the relative benefit of the ML corrections is smaller for more accurate physics-based models, reaching the lower limit of about 20% relative accuracy gain compared with that of the physics-based treatment alone. The proposed strategy of using ML to learn the remaining error of physics-based models offers a distinct advantage over training ML alone directly on reference HFEs: it preserves the correct overall trend, even well outside of the training set.
Collapse
Affiliation(s)
- Lewis Bass
- Department of Computer Engineering, Virginia Tech, Blacksburg, Virginia 24061, United States
| | - Luke H Elder
- Department of Computer Science, Virginia Tech, Blacksburg, Virginia 24061, United States
| | - Dan E Folescu
- Department of Computer Science, Virginia Tech, Blacksburg, Virginia 24061, United States
- Department of Mathematics, Virginia Tech, Blacksburg, Virginia 24061, United States
| | - Negin Forouzesh
- Department of Computer Science, California State University, Los Angeles, California 90032, United States
| | - Igor S Tolokh
- Department of Computer Science, Virginia Tech, Blacksburg, Virginia 24061, United States
| | - Anuj Karpatne
- Department of Computer Science, Virginia Tech, Blacksburg, Virginia 24061, United States
| | - Alexey V Onufriev
- Department of Computer Science, Virginia Tech, Blacksburg, Virginia 24061, United States
- Department of Physics, Virginia Tech, Blacksburg, Virginia 24061, United States
- Center for Soft Matter and Biological Physics, Virginia Tech, Blacksburg, Virginia 24061, United States
| |
Collapse
|
3
|
Rahbar M, Stein CJ. A Statistical Perspective on Microsolvation. J Phys Chem A 2023; 127:2176-2193. [PMID: 36854176 DOI: 10.1021/acs.jpca.2c08763] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/02/2023]
Abstract
The lack of a procedure to determine equilibrium thermodynamic properties of a small system interacting with a bath is frequently seen as a weakness of conventional statistical mechanics. A typical example for such a small system is a solute surrounded by an explicit solvation shell. One way to approach this problem is to enclose the small system of interest in a large bath of explicit solvent molecules, considerably larger than the system itself. The explicit inclusion of the solvent degrees of freedom is obviously limited by the available computational resources. A potential remedy to this problem is a microsolvation approach where only a few explicit solvent molecules are considered and surrounded by an implicit solvent bath. Still, the sampling of the solvent degrees of freedom is challenging with conventional grand canonical Monte Carlo methods, since no single chemical potential for the solvent molecules can be defined in the realm of small-system thermodynamics. In this work, a statistical thermodynamic model based on the grand canonical ensemble is proposed that avoids the conventional system size limitations and accurately characterizes the properties of the system of interest subject to the thermodynamic constraints of the bath. We extend an existing microsolvation approach to a generalized multibath "microstatistical" model and show that the previously derived approaches result as a limit of our model. The framework described here is universal and we validate our method numerically for a Lennard-Jones model fluid.
Collapse
Affiliation(s)
- Mohammad Rahbar
- Theoretische Physik and CENIDE, Universität Duisburg-Essen, D-47048 Duisburg, Germany
| | - Christopher J Stein
- Theoretische Physik and CENIDE, Universität Duisburg-Essen, D-47048 Duisburg, Germany
| |
Collapse
|
4
|
Abstract
Recently, machine learning (ML) has established itself in various worldwide benchmarking competitions in computational biology, including Critical Assessment of Structure Prediction (CASP) and Drug Design Data Resource (D3R) Grand Challenges. However, the intricate structural complexity and high ML dimensionality of biomolecular datasets obstruct the efficient application of ML algorithms in the field. In addition to data and algorithm, an efficient ML machinery for biomolecular predictions must include structural representation as an indispensable component. Mathematical representations that simplify the biomolecular structural complexity and reduce ML dimensionality have emerged as a prime winner in D3R Grand Challenges. This review is devoted to the recent advances in developing low-dimensional and scalable mathematical representations of biomolecules in our laboratory. We discuss three classes of mathematical approaches, including algebraic topology, differential geometry, and graph theory. We elucidate how the physical and biological challenges have guided the evolution and development of these mathematical apparatuses for massive and diverse biomolecular data. We focus the performance analysis on protein-ligand binding predictions in this review although these methods have had tremendous success in many other applications, such as protein classification, virtual screening, and the predictions of solubility, solvation free energies, toxicity, partition coefficients, protein folding stability changes upon mutation, etc.
Collapse
Affiliation(s)
- Duc Duy Nguyen
- Department of Mathematics, Michigan State University, MI 48824, USA.
| | - Zixuan Cang
- Department of Mathematics, Michigan State University, MI 48824, USA.
| | - Guo-Wei Wei
- Department of Mathematics, Michigan State University, MI 48824, USA. and Department of Biochemistry and Molecular Biology, Michigan State University, MI 48824, USA and Department of Electrical and Computer Engineering, Michigan State University, MI 48824, USA
| |
Collapse
|
5
|
Wei H, Luo A, Qiu T, Luo R, Qi R. Improved Poisson-Boltzmann Methods for High-Performance Computing. J Chem Theory Comput 2019; 15:6190-6202. [PMID: 31525962 DOI: 10.1021/acs.jctc.9b00602] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
Implicit solvent models based on the Poisson-Boltzmann equation (PBE) have been widely used to study electrostatic interactions in biophysical processes. These models often treat the solvent and solute regions as high and low dielectric continua, leading to a large jump in dielectrics across the molecular surface which is difficult to handle. Higher order interface schemes are often needed to seek higher accuracy for PBE applications. However, these methods are usually very liberal in the use of grid points nearby the molecular surface, making them difficult to use on high-performance computing platforms. Alternatively, the harmonic average (HA) method has been used to approximate dielectric interface conditions near the molecular surface with surprisingly good convergence and is well suited for high-performance computing. By adopting a 7-point stencil, the HA method is advantageous in generating simple 7-banded coefficient matrices, which greatly facilitate linear system solution with dense data parallelism, on high-performance computing platforms such as a graphics processing unit (GPU). However, the HA method is limited due to its lower accuracy. Therefore, it would be of great interest for high-performance applications to develop more accurate methods while retaining the simplicity and effectiveness of the 7-point stencil discretization scheme. In this study, we have developed two new algorithms based on the spirit of the HA method by introducing more physical interface relations and imposing the discretized Poisson's equation to the second order, respectively. Our testing shows that, for typical biomolecules, the new methods significantly improve the numerical accuracy to that comparable to the second-order solvers and with ∼65% overall efficiency gain on widely available high-performance GPU platforms.
Collapse
|
6
|
Nguyen DD, Wei GW. DG-GL: Differential geometry-based geometric learning of molecular datasets. INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN BIOMEDICAL ENGINEERING 2019; 35:e3179. [PMID: 30693661 PMCID: PMC6598676 DOI: 10.1002/cnm.3179] [Citation(s) in RCA: 47] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/25/2018] [Revised: 11/21/2018] [Accepted: 12/06/2018] [Indexed: 05/11/2023]
Abstract
MOTIVATION Despite its great success in various physical modeling, differential geometry (DG) has rarely been devised as a versatile tool for analyzing large, diverse, and complex molecular and biomolecular datasets because of the limited understanding of its potential power in dimensionality reduction and its ability to encode essential chemical and biological information in differentiable manifolds. RESULTS We put forward a differential geometry-based geometric learning (DG-GL) hypothesis that the intrinsic physics of three-dimensional (3D) molecular structures lies on a family of low-dimensional manifolds embedded in a high-dimensional data space. We encode crucial chemical, physical, and biological information into 2D element interactive manifolds, extracted from a high-dimensional structural data space via a multiscale discrete-to-continuum mapping using differentiable density estimators. Differential geometry apparatuses are utilized to construct element interactive curvatures in analytical forms for certain analytically differentiable density estimators. These low-dimensional differential geometry representations are paired with a robust machine learning algorithm to showcase their descriptive and predictive powers for large, diverse, and complex molecular and biomolecular datasets. Extensive numerical experiments are carried out to demonstrate that the proposed DG-GL strategy outperforms other advanced methods in the predictions of drug discovery-related protein-ligand binding affinity, drug toxicity, and molecular solvation free energy. AVAILABILITY AND IMPLEMENTATION http://weilab.math.msu.edu/DG-GL/ Contact: wei@math.msu.edu.
Collapse
Affiliation(s)
- Duc Duy Nguyen
- Department of Mathematics, Michigan State University, East Lansing, 48824, Michigan
| | - Guo-Wei Wei
- Department of Mathematics, Michigan State University, East Lansing, 48824, Michigan
- Department of Electrical and Computer Engineering, Michigan State University, MI 48824, USA
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824, Michigan
| |
Collapse
|
7
|
Wei H, Luo R, Qi R. An efficient second-order poisson-boltzmann method. J Comput Chem 2019; 40:1257-1269. [PMID: 30776135 PMCID: PMC6422926 DOI: 10.1002/jcc.25783] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2018] [Revised: 11/29/2018] [Accepted: 01/06/2019] [Indexed: 11/08/2022]
Abstract
Immersed interface method (IIM) is a promising high-accuracy numerical scheme for the Poisson-Boltzmann model that has been widely used to study electrostatic interactions in biomolecules. However, the IIM suffers from instability and slow convergence for typical applications. In this study, we introduced both analytical interface and surface regulation into IIM to address these issues. The analytical interface setup leads to better accuracy and its convergence closely follows a quadratic manner as predicted by theory. The surface regulation further speeds up the convergence for nontrivial biomolecules. In addition, uncertainties of the numerical energies for tested systems are also reduced by about half. More interestingly, the analytical setup significantly improves the linear solver efficiency and stability by generating more precise and better-conditioned linear systems. Finally, we implemented the bottleneck linear system solver on GPUs to further improve the efficiency of the method, so it can be widely used for practical biomolecular applications. © 2019 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Haixin Wei
- Department of Chemical Engineering and Materials Science, University of California, Irvine, California, 92697
| | - Ray Luo
- Department of Chemical Engineering and Materials Science, University of California, Irvine, California, 92697.,Department of Molecular Biology and Biochemistry, University of California, Irvine, California, 92697.,Department of Biomedical Engineering, University of California, Irvine, California, 92697
| | - Ruxi Qi
- Department of Molecular Biology and Biochemistry, University of California, Irvine, California, 92697
| |
Collapse
|
8
|
Onufriev AV, Izadi S. Water models for biomolecular simulations. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2017. [DOI: 10.1002/wcms.1347] [Citation(s) in RCA: 94] [Impact Index Per Article: 13.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Affiliation(s)
- Alexey V. Onufriev
- Department of Physics; Virginia Tech; Blacksburg VA USA
- Department of Computer Science; Virginia Tech; Blacksburg VA USA
- Center for Soft Matter and Biological Physics; Virginia Tech; Blacksburg VA USA
| | - Saeed Izadi
- Early Stage Pharmaceutical Development; Genentech Inc.; South San Francisco, CA USA
| |
Collapse
|
9
|
Jurrus E, Engel D, Star K, Monson K, Brandi J, Felberg LE, Brookes DH, Wilson L, Chen J, Liles K, Chun M, Li P, Gohara DW, Dolinsky T, Konecny R, Koes DR, Nielsen JE, Head-Gordon T, Geng W, Krasny R, Wei GW, Holst MJ, McCammon JA, Baker NA. Improvements to the APBS biomolecular solvation software suite. Protein Sci 2017; 27:112-128. [PMID: 28836357 DOI: 10.1002/pro.3280] [Citation(s) in RCA: 1160] [Impact Index Per Article: 165.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2017] [Revised: 08/21/2017] [Accepted: 08/22/2017] [Indexed: 12/11/2022]
Abstract
The Adaptive Poisson-Boltzmann Solver (APBS) software was developed to solve the equations of continuum electrostatics for large biomolecular assemblages that have provided impact in the study of a broad range of chemical, biological, and biomedical applications. APBS addresses the three key technology challenges for understanding solvation and electrostatics in biomedical applications: accurate and efficient models for biomolecular solvation and electrostatics, robust and scalable software for applying those theories to biomolecular systems, and mechanisms for sharing and analyzing biomolecular electrostatics data in the scientific community. To address new research applications and advancing computational capabilities, we have continually updated APBS and its suite of accompanying software since its release in 2001. In this article, we discuss the models and capabilities that have recently been implemented within the APBS software package including a Poisson-Boltzmann analytical and a semi-analytical solver, an optimized boundary element solver, a geometry-based geometric flow solvation model, a graph theory-based algorithm for determining pKa values, and an improved web-based visualization tool for viewing electrostatics.
Collapse
Affiliation(s)
| | - Dave Engel
- Pacific Northwest National Laboratory, Richland, Washington
| | - Keith Star
- Pacific Northwest National Laboratory, Richland, Washington
| | - Kyle Monson
- Pacific Northwest National Laboratory, Richland, Washington
| | - Juan Brandi
- Pacific Northwest National Laboratory, Richland, Washington
| | | | | | | | - Jiahui Chen
- Southern Methodist University, Dallas, Texas
| | - Karina Liles
- Pacific Northwest National Laboratory, Richland, Washington
| | - Minju Chun
- Pacific Northwest National Laboratory, Richland, Washington
| | - Peter Li
- Pacific Northwest National Laboratory, Richland, Washington
| | | | | | - Robert Konecny
- University of California San Diego, San Diego, California
| | - David R Koes
- University of Pittsburgh, Pittsburgh, Pennsylvania
| | | | | | - Weihua Geng
- Southern Methodist University, Dallas, Texas
| | | | - Guo-Wei Wei
- Michigan State University, East Lansing, Michigan
| | | | | | - Nathan A Baker
- Pacific Northwest National Laboratory, Richland, Washington.,Brown University, Providence, Rhode Island
| |
Collapse
|
10
|
Gosink LJ, Overall CC, Reehl SM, Whitney PD, Mobley DL, Baker NA. Bayesian Model Averaging for Ensemble-Based Estimates of Solvation-Free Energies. J Phys Chem B 2017; 121:3458-3472. [PMID: 27966363 DOI: 10.1021/acs.jpcb.6b09198] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
This paper applies the Bayesian Model Averaging statistical ensemble technique to estimate small molecule solvation free energies. There is a wide range of methods available for predicting solvation free energies, ranging from empirical statistical models to ab initio quantum mechanical approaches. Each of these methods is based on a set of conceptual assumptions that can affect predictive accuracy and transferability. Using an iterative statistical process, we have selected and combined solvation energy estimates using an ensemble of 17 diverse methods from the fourth Statistical Assessment of Modeling of Proteins and Ligands (SAMPL) blind prediction study to form a single, aggregated solvation energy estimate. Methods that possess minimal or redundant information are pruned from the ensemble and the evaluation process repeats until aggregate predictive performance can no longer be improved. We show that this process results in a final aggregate estimate that outperforms all individual methods by reducing estimate errors by as much as 91% to 1.2 kcal mol-1 accuracy. This work provides a new approach for accurate solvation free energy prediction and lays the foundation for future work on aggregate models that can balance computational cost with prediction accuracy.
Collapse
Affiliation(s)
| | | | | | | | - David L Mobley
- Departments of Pharmaceutical Sciences and Chemistry, University of California, Irvine , Irvine, California 92697, United States
| | - Nathan A Baker
- Division of Applied Mathematics, Brown University , Providence, Rhode Island 02912, United States
| |
Collapse
|
11
|
|
12
|
Shi YZ, Jin L, Wang FH, Zhu XL, Tan ZJ. Predicting 3D Structure, Flexibility, and Stability of RNA Hairpins in Monovalent and Divalent Ion Solutions. Biophys J 2016; 109:2654-2665. [PMID: 26682822 DOI: 10.1016/j.bpj.2015.11.006] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2015] [Revised: 10/09/2015] [Accepted: 11/06/2015] [Indexed: 10/24/2022] Open
Abstract
A full understanding of RNA-mediated biology would require the knowledge of three-dimensional (3D) structures, structural flexibility, and stability of RNAs. To predict RNA 3D structures and stability, we have previously proposed a three-bead coarse-grained predictive model with implicit salt/solvent potentials. In this study, we further develop the model by improving the implicit-salt electrostatic potential and including a sequence-dependent coaxial stacking potential to enable the model to simulate RNA 3D structure folding in divalent/monovalent ion solutions. The model presented here can predict 3D structures of RNA hairpins with bulges/internal loops (<77 nucleotides) from their sequences at the corresponding experimental ion conditions with an overall improved accuracy compared to the experimental data; the model also makes reliable predictions for the flexibility of RNA hairpins with bulge loops of different lengths at several divalent/monovalent ion conditions. In addition, the model successfully predicts the stability of RNA hairpins with various loops/stems in divalent/monovalent ion solutions.
Collapse
Affiliation(s)
- Ya-Zhou Shi
- Department of Physics and Key Laboratory of Artificial Micro- and Nano-structures of the Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan, China
| | - Lei Jin
- Department of Physics and Key Laboratory of Artificial Micro- and Nano-structures of the Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan, China
| | - Feng-Hua Wang
- Engineering Training Center, Jianghan University, Wuhan, China
| | - Xiao-Long Zhu
- Department of Physics, School of Physics and Information Engineering, Jianghan University, Wuhan, China
| | - Zhi-Jie Tan
- Department of Physics and Key Laboratory of Artificial Micro- and Nano-structures of the Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan, China.
| |
Collapse
|
13
|
Cumberworth A, Bui JM, Gsponer J. Free energies of solvation in the context of protein folding: Implications for implicit and explicit solvent models. J Comput Chem 2015; 37:629-40. [DOI: 10.1002/jcc.24235] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2015] [Revised: 09/25/2015] [Accepted: 10/06/2015] [Indexed: 12/13/2022]
Affiliation(s)
| | | | - Jörg Gsponer
- Center for High-Throughput Biology, UBC; Vancouver Canada
| |
Collapse
|
14
|
Wang B, Wei GW. Parameter optimization in differential geometry based solvation models. J Chem Phys 2015; 143:134119. [PMID: 26450304 PMCID: PMC4602332 DOI: 10.1063/1.4932342] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2015] [Accepted: 09/22/2015] [Indexed: 01/01/2023] Open
Abstract
Differential geometry (DG) based solvation models are a new class of variational implicit solvent approaches that are able to avoid unphysical solvent-solute boundary definitions and associated geometric singularities, and dynamically couple polar and non-polar interactions in a self-consistent framework. Our earlier study indicates that DG based non-polar solvation model outperforms other methods in non-polar solvation energy predictions. However, the DG based full solvation model has not shown its superiority in solvation analysis, due to its difficulty in parametrization, which must ensure the stability of the solution of strongly coupled nonlinear Laplace-Beltrami and Poisson-Boltzmann equations. In this work, we introduce new parameter learning algorithms based on perturbation and convex optimization theories to stabilize the numerical solution and thus achieve an optimal parametrization of the DG based solvation models. An interesting feature of the present DG based solvation model is that it provides accurate solvation free energy predictions for both polar and non-polar molecules in a unified formulation. Extensive numerical experiment demonstrates that the present DG based solvation model delivers some of the most accurate predictions of the solvation free energies for a large number of molecules.
Collapse
Affiliation(s)
- Bao Wang
- Department of Mathematics, Michigan State University, East Lansing, Michigan 48824, USA
| | - G W Wei
- Department of Mathematics, Michigan State University, East Lansing, Michigan 48824, USA
| |
Collapse
|
15
|
Daily MD, Chun J, Heredia-Langner A, Wei G, Baker NA. Origin of parameter degeneracy and molecular shape relationships in geometric-flow calculations of solvation free energies. J Chem Phys 2014; 139:204108. [PMID: 24289345 DOI: 10.1063/1.4832900] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Implicit solvent models are important tools for calculating solvation free energies for chemical and biophysical studies since they require fewer computational resources but can achieve accuracy comparable to that of explicit-solvent models. In past papers, geometric flow-based solvation models have been established for solvation analysis of small and large compounds. In the present work, the use of realistic experiment-based parameter choices for the geometric flow models is studied. We find that the experimental parameters of solvent internal pressure p = 172 MPa and surface tension γ = 72 mN/m produce solvation free energies within 1 RT of the global minimum root-mean-squared deviation from experimental data over the expanded set. Our results demonstrate that experimental values can be used for geometric flow solvent model parameters, thus eliminating the need for additional parameterization. We also examine the correlations between optimal values of p and γ which are strongly anti-correlated. Geometric analysis of the small molecule test set shows that these results are inter-connected with an approximately linear relationship between area and volume in the range of molecular sizes spanned by the data set. In spite of this considerable degeneracy between the surface tension and pressure terms in the model, both terms are important for the broader applicability of the model.
Collapse
Affiliation(s)
- Michael D Daily
- Fundamental and Computational Sciences Directorate, Pacific Northwest National Laboratory, Richland, Washington 99352, USA
| | | | | | | | | |
Collapse
|
16
|
Li L, Fennell CJ, Dill KA. Field-SEA: a model for computing the solvation free energies of nonpolar, polar, and charged solutes in water. J Phys Chem B 2013; 118:6431-7. [PMID: 24299013 PMCID: PMC4065164 DOI: 10.1021/jp4115139] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
![]()
Previous
work describes a computational solvation model called semi-explicit
assembly (SEA). The SEA water model computes the free energies of
solvation of nonpolar and polar solutes in water with good efficiency
and accuracy. However, SEA gives systematic errors in the solvation
free energies of ions and charged solutes. Here, we describe field-SEA,
an improved treatment that gives accurate solvation free energies
of charged solutes, including monatomic and polyatomic ions and model
dipeptides, as well as nonpolar and polar molecules. Field-SEA is
computationally inexpensive for a given solute because explicit-solvent
model simulations are relegated to a precomputation step and because
it represents solvating waters in terms of a solute’s free-energy
field. In essence, field-SEA approximates the physics of explicit-model
simulations within a computationally efficient framework. A key finding
is that an atom’s solvation shell inherits characteristics
of a neighboring atom, especially strongly charged neighbors. Field-SEA
may be useful where there is a need for solvation free-energy computations
that are faster than explicit-solvent simulations and more accurate
than traditional implicit-solvent simulations for a wide range of
solutes.
Collapse
Affiliation(s)
- Libo Li
- Laufer Center for Physical and Quantitative Biology, and Departments of Physics and Chemistry, Stony Brook University , Stony Brook, New York 11794, United States
| | | | | |
Collapse
|
17
|
Wei GW. Multiscale Multiphysics and Multidomain Models I: Basic Theory. JOURNAL OF THEORETICAL & COMPUTATIONAL CHEMISTRY 2013; 12:10.1142/S021963361341006X. [PMID: 25382892 PMCID: PMC4220694 DOI: 10.1142/s021963361341006x] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
This work extends our earlier two-domain formulation of a differential geometry based multiscale paradigm into a multidomain theory, which endows us the ability to simultaneously accommodate multiphysical descriptions of aqueous chemical, physical and biological systems, such as fuel cells, solar cells, nanofluidics, ion channels, viruses, RNA polymerases, molecular motors and large macromolecular complexes. The essential idea is to make use of the differential geometry theory of surfaces as a natural means to geometrically separate the macroscopic domain of solvent from the microscopic domain of solute, and dynamically couple continuum and discrete descriptions. Our main strategy is to construct energy functionals to put on an equal footing of multiphysics, including polar (i.e., electrostatic) solvation, nonpolar solvation, chemical potential, quantum mechanics, fluid mechanics, molecular mechanics, coarse grained dynamics and elastic dynamics. The variational principle is applied to the energy functionals to derive desirable governing equations, such as multidomain Laplace-Beltrami (LB) equations for macromolecular morphologies, multidomain Poisson-Boltzmann (PB) equation or Poisson equation for electrostatic potential, generalized Nernst-Planck (NP) equations for the dynamics of charged solvent species, generalized Navier-Stokes (NS) equation for fluid dynamics, generalized Newton's equations for molecular dynamics (MD) or coarse-grained dynamics and equation of motion for elastic dynamics. Unlike the classical PB equation, our PB equation is an integral-differential equation due to solvent-solute interactions. To illustrate the proposed formalism, we have explicitly constructed three models, a multidomain solvation model, a multidomain charge transport model and a multidomain chemo-electro-fluid-MD-elastic model. Each solute domain is equipped with distinct surface tension, pressure, dielectric function, and charge density distribution. In addition to long-range Coulombic interactions, various non-electrostatic solvent-solute interactions are considered in the present modeling. We demonstrate the consistency between the non-equilibrium charge transport model and the equilibrium solvation model by showing the systematical reduction of the former to the latter at equilibrium. This paper also offers a brief review of the field.
Collapse
Affiliation(s)
- Guo-Wei Wei
- Department of Mathematics Michigan State University, MI 48824, USA Department of Electrical and Computer Engineering Michigan State University, MI 48824, USA Department of Biochemistry and Molecular Biology Michigan State University, MI 48824, USA
| |
Collapse
|
18
|
Xia K, Opron K, Wei GW. Multiscale multiphysics and multidomain models--flexibility and rigidity. J Chem Phys 2013; 139:194109. [PMID: 24320318 PMCID: PMC3855066 DOI: 10.1063/1.4830404] [Citation(s) in RCA: 60] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2013] [Accepted: 10/24/2013] [Indexed: 11/14/2022] Open
Abstract
The emerging complexity of large macromolecules has led to challenges in their full scale theoretical description and computer simulation. Multiscale multiphysics and multidomain models have been introduced to reduce the number of degrees of freedom while maintaining modeling accuracy and achieving computational efficiency. A total energy functional is constructed to put energies for polar and nonpolar solvation, chemical potential, fluid flow, molecular mechanics, and elastic dynamics on an equal footing. The variational principle is utilized to derive coupled governing equations for the above mentioned multiphysical descriptions. Among these governing equations is the Poisson-Boltzmann equation which describes continuum electrostatics with atomic charges. The present work introduces the theory of continuum elasticity with atomic rigidity (CEWAR). The essence of CEWAR is to formulate the shear modulus as a continuous function of atomic rigidity. As a result, the dynamics complexity of a macromolecular system is separated from its static complexity so that the more time-consuming dynamics is handled with continuum elasticity theory, while the less time-consuming static analysis is pursued with atomic approaches. We propose a simple method, flexibility-rigidity index (FRI), to analyze macromolecular flexibility and rigidity in atomic detail. The construction of FRI relies on the fundamental assumption that protein functions, such as flexibility, rigidity, and energy, are entirely determined by the structure of the protein and its environment, although the structure is in turn determined by all the interactions. As such, the FRI measures the topological connectivity of protein atoms or residues and characterizes the geometric compactness of the protein structure. As a consequence, the FRI does not resort to the interaction Hamiltonian and bypasses matrix diagonalization, which underpins most other flexibility analysis methods. FRI's computational complexity is of O(N(2)) at most, where N is the number of atoms or residues, in contrast to O(N(3)) for Hamiltonian based methods. We demonstrate that the proposed FRI gives rise to accurate prediction of protein B-Factor for a set of 263 proteins. We show that a parameter free FRI is able to achieve about 95% accuracy of the parameter optimized FRI. An interpolation algorithm is developed to construct continuous atomic flexibility functions for visualization and use with CEWAR.
Collapse
Affiliation(s)
- Kelin Xia
- Department of Mathematics, Michigan State University, East Lansing, Michigan 48824, USA
| | | | | |
Collapse
|
19
|
Abstract
Accurate and efficient treatment of electrostatics is a crucial step in computational analyses of biomolecular structures and dynamics. In this study, we have explored a second-order finite-difference numerical method to solve the widely used Poisson-Boltzmann equation for electrostatic analyses of realistic bio-molecules. The so-called immersed interface method was first validated and found to be consistent with the classical weighted harmonic averaging method for a diversified set of test biomolecules. The numerical accuracy and convergence behaviors of the new method were next analyzed in its computation of numerical reaction field grid potentials, energies, and atomic solvation forces. Overall similar convergence behaviors were observed as those by the classical method. Interestingly, the new method was found to deliver more accurate and better-converged grid potentials than the classical method on or nearby the molecular surface, though the numerical advantage of the new method is reduced when grid potentials are extrapolated to the molecular surface. Our exploratory study indicates the need for further improving interpolation/extrapolation schemes in addition to the developments of higher-order numerical methods that have attracted most attention in the field.
Collapse
|