1
|
Liu X, Zheng L, Cong Y, Gong Z, Yin Z, Zhang JZH, Liu Z, Sun Z. Comprehensive evaluation of end-point free energy techniques in carboxylated-pillar[6]arene host-guest binding: II. regression and dielectric constant. J Comput Aided Mol Des 2022; 36:879-894. [PMID: 36394776 DOI: 10.1007/s10822-022-00487-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2022] [Accepted: 10/29/2022] [Indexed: 11/18/2022]
Abstract
End-point free energy calculations as a powerful tool have been widely applied in protein-ligand and protein-protein interactions. It is often recognized that these end-point techniques serve as an option of intermediate accuracy and computational cost compared with more rigorous statistical mechanic models (e.g., alchemical transformation) and coarser molecular docking. However, it is observed that this intermediate level of accuracy does not hold in relatively simple and prototypical host-guest systems. Specifically, in our previous work investigating a set of carboxylated-pillar[6]arene host-guest complexes, end-point methods provide free energy estimates deviating significantly from the experimental reference, and the rank of binding affinities is also incorrectly computed. These observations suggest the unsuitability and inapplicability of standard end-point free energy techniques in host-guest systems, and alteration and development are required to make them practically usable. In this work, we consider two ways to improve the performance of end-point techniques. The first one is the PBSA_E regression that varies the weights of different free energy terms in the end-point calculation procedure, while the second one is considering the interior dielectric constant as an additional variable in the end-point equation. By detailed investigation of the calculation procedure and the simulation outcome, we prove that these two treatments (i.e., regression and dielectric constant) are manipulating the end-point equation in a somehow similar way, i.e., weakening the electrostatic contribution and strengthening the non-polar terms, although there are still many detailed differences between these two methods. With the trained end-point scheme, the RMSE of the computed affinities is improved from the standard ~ 12 kcal/mol to ~ 2.4 kcal/mol, which is comparable to another altered end-point method (ELIE) trained with system-specific data. By tuning PBSA_E weighting factors with the host-specific data, it is possible to further decrease the prediction error to ~ 2.1 kcal/mol. These observations along with the extremely efficient optimized-structure computation procedure suggest the regression (i.e., PBSA_E as well as its GBSA_E extension) as a practically applicable solution that brings end-point methods back into the library of usable tools for host-guest binding. However, the dielectric-constant-variable scheme cannot effectively minimize the experiment-calculation discrepancy for absolute binding affinities, but is able to improve the calculation of affinity ranks. This phenomenon is somehow different from the protein-ligand case and suggests the difference between host-guest and biomacromolecular (protein-ligand and protein-protein) systems. Therefore, the spectrum of tools usable for protein-ligand complexes could be unsuitable for host-guest binding, and numerical validations are necessary to screen out really workable solutions in these 'prototypical' situations.
Collapse
Affiliation(s)
- Xiao Liu
- School of Mathematics, Physics and Statistics, Shanghai University of Engineering Science, Shanghai, 201620, China.
| | - Lei Zheng
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai, 200062, China
| | - Yalong Cong
- School of Chemistry and Molecular Engineering, East China Normal University, Shanghai, 200062, China
| | - Zhihao Gong
- School of Micro-Nano Electronics, Zhejiang University, Hangzhou, 310027, China.,Hangzhou Global Scientific and Technological Innovation Center, Zhejiang University, Hangzhou, 310027, China
| | - Zhixiang Yin
- School of Mathematics, Physics and Statistics, Shanghai University of Engineering Science, Shanghai, 201620, China
| | - John Z H Zhang
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai, 200062, China. .,School of Chemistry and Molecular Engineering, East China Normal University, Shanghai, 200062, China. .,Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, Guangdong, China. .,Department of Chemistry, New York University, NY, NY, 10003, USA.
| | - Zhirong Liu
- College of Chemistry and Molecular Engineering, Peking University, Beijing, 100871, China
| | - Zhaoxi Sun
- College of Chemistry and Molecular Engineering, Peking University, Beijing, 100871, China.
| |
Collapse
|
2
|
Liu X, Zheng L, Qin C, Zhang JZH, Sun Z. Comprehensive evaluation of end-point free energy techniques in carboxylated-pillar[6]arene host-guest binding: I. Standard procedure. J Comput Aided Mol Des 2022; 36:735-752. [PMID: 36136209 DOI: 10.1007/s10822-022-00475-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2022] [Accepted: 09/06/2022] [Indexed: 10/14/2022]
Abstract
Despite the massive application of end-point free energy methods in protein-ligand and protein-protein interactions, computational understandings about their performance in relatively simple and prototypical host-guest systems are limited. In this work, we present a comprehensive benchmark calculation with standard end-point free energy techniques in a recent host-guest dataset containing 13 host-guest pairs involving the carboxylated-pillar[6]arene host. We first assess the charge schemes for solutes by comparing the charge-produced electrostatics with many ab initio references, in order to obtain a preliminary albeit detailed view of the charge quality. Then, we focus on four modelling details of end-point free energy calculations, including the docking procedure for the generation of initial condition, the charge scheme for host and guest molecules, the water model used in explicit-solvent sampling, and the end-point methods for free energy estimation. The binding thermodynamics obtained with different modelling schemes are compared with experimental references, and some practical guidelines on maximizing the performance of end-point methods in practical host-guest systems are summarized. Further, we compare our simulation outcome with predictions in the grand challenge and discuss further developments to improve the prediction quality of end-point free energy methods. Overall, unlike the widely acknowledged applicability in protein-ligand binding, the standard end-point calculations cannot produce useful outcomes in host-guest binding and thus are not recommended unless alterations are performed.
Collapse
Affiliation(s)
- Xiao Liu
- School of Mathematics, Physics and Statistics, Shanghai University of Engineering Science, Shanghai, 201620, China.
| | - Lei Zheng
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai, 200062, China
| | - Chu Qin
- School of Mathematics, Physics and Statistics, Shanghai University of Engineering Science, Shanghai, 201620, China
| | - John Z H Zhang
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai, 200062, China.,School of Chemistry and Molecular Engineering, East China Normal University, Shanghai, 200062, China.,Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, Guangdong, China.,Department of Chemistry, New York University, New York, NY, 10003, USA
| | - Zhaoxi Sun
- College of Chemistry and Molecular Engineering, Peking University, Beijing, 100871, China.
| |
Collapse
|
3
|
Huai Z, Yang H, Sun Z. Binding thermodynamics and interaction patterns of human purine nucleoside phosphorylase-inhibitor complexes from extensive free energy calculations. J Comput Aided Mol Des 2021; 35:643-656. [PMID: 33759016 DOI: 10.1007/s10822-021-00382-w] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2020] [Accepted: 03/13/2021] [Indexed: 11/29/2022]
Abstract
Human purine nucleoside phosphorylase (hPNP) plays a significant role in the catabolism of deoxyguanosine. The trimeric protein is an important target in the treatment of T-cell cancers and autoimmune disorders. Experimental studies on the inhibition of the hPNP observe that the first ligand bound to one of three subunits effectively inhibits the protein, while the binding of more ligands to the subsequent sites shows negative cooperativities. In this work, we performed extensive end-point and alchemical free energy calculations to determine the binding thermodynamics of the trimeric protein-ligand system. 13 Immucillin inhibitors with experimental results are under calculation. Two widely accepted charge schemes for small molecules including AM1-BCC and RESP are adopted for ligands. The results of RESP are in better agreement with the experimental reference. Further investigations of the interaction networks in the protein-ligand complexes reveal that several residues play significant roles in stabilizing the complex structure. The most commonly observed ones include PHE200, GLU201, MET219, and ASN243. The conformations of the protein in different protein-ligand complexes are observed to be similar. We expect these insights to aid the development of potent drugs targeting hPNP.
Collapse
Affiliation(s)
- Zhe Huai
- State Key Laboratory of Precision Spectroscopy, School of Physics and Electronic Science, East China Normal University, Shanghai, 200062, China
| | - Huaiyu Yang
- College of Engineering, Hebei Normal University, Shijiazhuang, 050024, China
| | - Zhaoxi Sun
- State Key Laboratory of Precision Spectroscopy, School of Physics and Electronic Science, East China Normal University, Shanghai, 200062, China.
| |
Collapse
|
4
|
Huai Z, Yang H, Li X, Sun Z. SAMPL7 TrimerTrip host-guest binding affinities from extensive alchemical and end-point free energy calculations. J Comput Aided Mol Des 2021; 35:117-29. [PMID: 33037549 DOI: 10.1007/s10822-020-00351-9] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2020] [Accepted: 09/30/2020] [Indexed: 12/12/2022]
Abstract
The prediction of host-guest binding affinities with computational modelling is still a challenging task. In the 7th statistical assessment of the modeling of proteins and ligands (SAMPL) challenge, a new host named TrimerTrip was synthesized and the thermodynamic parameters of 16 structurally diverse guests binding to the host were characterized. In the TrimerTrip-guest challenge, only structures of the host and the guests are provided, which indicates that the predictions of both the binding poses and the binding affinities are under assessment. In this work, starting from the binding poses obtained from our previous enhanced sampling simulations in the configurational space, we perform extensive alchemical and end-point free energy calculations to calculate the host-guest binding affinities retrospectively. The alchemical predictions with two widely accepted charge schemes (i.e. AM1-BCC and RESP) are in good agreement with the experimental reference, while the end-point estimates perform poorly in reproducing the experimental binding affinities. Aside from the absolute value of the binding affinity, the rank of binding free energies is also crucial in drug design. Surprisingly, the end-point MM/PBSA method seems very powerful in reproducing the experimental rank of binding affinities. Although the length of our simulations is long and the intermediate spacing is dense, the convergence behavior is not very good, which may arise from the flexibility of the host molecule. Enhanced sampling techniques in the configurational space may be required to obtain fully converged sampling. Further, as the length of sampling in alchemical free energy calculations already achieves several hundred ns, performing direct simulations of the binding/unbinding event in the physical space could be more useful and insightful. More details about the binding pathway and mechanism could be obtained in this way. The nonequilibrium method could also be a nice choice if one insists to use the alchemical method, as the intermediate sampling is avoided to some extent.
Collapse
|