1
|
Kalayan J, Chakravorty A, Warwicker J, Henchman RH. Total free energy analysis of fully hydrated proteins. Proteins 2023; 91:74-90. [PMID: 35964252 PMCID: PMC10087023 DOI: 10.1002/prot.26411] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2022] [Revised: 08/04/2022] [Accepted: 08/09/2022] [Indexed: 12/15/2022]
Abstract
The total free energy of a hydrated biomolecule and its corresponding decomposition of energy and entropy provides detailed information about regions of thermodynamic stability or instability. The free energies of four hydrated globular proteins with different net charges are calculated from a molecular dynamics simulation, with the energy coming from the system Hamiltonian and entropy using multiscale cell correlation. Water is found to be most stable around anionic residues, intermediate around cationic and polar residues, and least stable near hydrophobic residues, especially when more buried, with stability displaying moderate entropy-enthalpy compensation. Conversely, anionic residues in the proteins are energetically destabilized relative to singly solvated amino acids, while trends for other residues are less clear-cut. Almost all residues lose intraresidue entropy when in the protein, enthalpy changes are negative on average but may be positive or negative, and the resulting overall stability is moderate for some proteins and negligible for others. The free energy of water around single amino acids is found to closely match existing hydrophobicity scales. Regarding the effect of secondary structure, water is slightly more stable around loops, of intermediate stability around β strands and turns, and least stable around helices. An interesting asymmetry observed is that cationic residues stabilize a residue when bonded to its N-terminal side but destabilize it when on the C-terminal side, with a weaker reversed trend for anionic residues.
Collapse
Affiliation(s)
- Jas Kalayan
- Division of Pharmacy and Optometry, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, UK
| | - Arghya Chakravorty
- Department of Chemistry and Biophysics, University of Michigan, Ann Arbor, Michigan, USA
| | - Jim Warwicker
- Manchester Institute of Biotechnology and School of Biological Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, UK
| | - Richard H Henchman
- Sydney Medical School, Faculty of Medicine and Health, University of Sydney, Sydney, Australia
| |
Collapse
|
2
|
Kawama K, Fukushima Y, Ikeguchi M, Ohta M, Yoshidome T. gr Predictor: A Deep Learning Model for Predicting the Hydration Structures around Proteins. J Chem Inf Model 2022; 62:4460-4473. [PMID: 36068974 DOI: 10.1021/acs.jcim.2c00987] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Among the factors affecting biological processes such as protein folding and ligand binding, hydration, which is represented by a three-dimensional water site distribution function around the protein, is crucial. The typical methods for computing the distribution functions, including molecular dynamics simulations and the three-dimensional reference interaction site model (3D-RISM) theory, require a long computation time ranging from hours to tens of hours. Here, we propose a deep learning (DL) model that rapidly estimates the distribution functions around proteins obtained using the 3D-RISM theory from the protein 3D structure. The distribution functions predicted using our DL model are in good agreement with those obtained using the 3D-RISM theory. Particularly, the coefficient of determination between the distribution function obtained by the DL model and that obtained using the 3D-RISM theory is approximately 0.98. Furthermore, using a graphics processing unit, the prediction by the DL model is completed in less than 1 min, more than 2 orders of magnitude faster than the calculation time of the 3D-RISM theory. The position of water molecules around the protein was estimated based on the distribution function obtained by our DL model, and the position of waters estimated by our DL model was in good agreement with that of water molecules estimated using the 3D-RISM theory and of crystallographic waters. Therefore, our DL model provides a practical and efficient way to calculate the three-dimensional water site distribution functions and to estimate the position of water molecules around the protein. The program called "gr Predictor" is available under the GNU General Public License from https://github.com/YoshidomeGroup-Hydration/gr-predictor.
Collapse
Affiliation(s)
- Kosuke Kawama
- Department of Applied Physics, Graduate School of Engineering, Tohoku University, Sendai 980-8579, Japan
| | - Yusaku Fukushima
- Department of Applied Physics, Graduate School of Engineering, Tohoku University, Sendai 980-8579, Japan
| | - Mitsunori Ikeguchi
- AI-Driven Drug Discovery Collaborative Unit, HPC- and AI-Driven Drug Development Platform Division, Center for Computational Science, RIKEN, 1-7-29, Suehiro-cho, Tsurumi-ku, Yokohama 230-0045, Japan.,Graduate School of Medical Life Science, Yokohama City University, 1-7-29, Suehiro-cho, Tsurumi-ku, Yokohama 230-0045, Japan
| | - Masateru Ohta
- AI-Driven Drug Discovery Collaborative Unit, HPC- and AI-Driven Drug Development Platform Division, Center for Computational Science, RIKEN, 1-7-29, Suehiro-cho, Tsurumi-ku, Yokohama 230-0045, Japan
| | - Takashi Yoshidome
- Department of Applied Physics, Graduate School of Engineering, Tohoku University, Sendai 980-8579, Japan
| |
Collapse
|
3
|
Osaki K, Ekimoto T, Yamane T, Ikeguchi M. 3D-RISM-AI: A Machine Learning Approach to Predict Protein-Ligand Binding Affinity Using 3D-RISM. J Phys Chem B 2022; 126:6148-6158. [PMID: 35969673 PMCID: PMC9421647 DOI: 10.1021/acs.jpcb.2c03384] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2022] [Revised: 07/27/2022] [Indexed: 11/30/2022]
Abstract
Hydration free energy (HFE) is a key factor in improving protein-ligand binding free energy (BFE) prediction accuracy. The HFE itself can be calculated using the three-dimensional reference interaction model (3D-RISM); however, the BFE predictions solely evaluated using 3D-RISM are not correlated to the experimental BFE for abundant protein-ligand pairs. In this study, to predict the BFE for multiple sets of protein-ligand pairs, we propose a machine learning approach incorporating the HFEs obtained using 3D-RISM, termed 3D-RISM-AI. In the learning process, structural metrics, intra-/intermolecular energies, and HFEs obtained via 3D-RISM of ∼4000 complexes in the PDBbind database (ver. 2018) were used. The BFEs predicted using 3D-RISM-AI were well correlated to the experimental data (Pearson's correlation coefficient of 0.80 and root-mean-square error of 1.91 kcal/mol). As important factors for the prediction, the difference in the solvent accessible surface area between the bound and unbound structures and the hydration properties of the ligands were detected during the learning process.
Collapse
Affiliation(s)
- Kazu Osaki
- Graduate
School of Medical Life Science, Yokohama
City University, 1-7-29 Suehiro-cho, Tsurumi-ku, Yokohama 230-0045, Japan
| | - Toru Ekimoto
- Graduate
School of Medical Life Science, Yokohama
City University, 1-7-29 Suehiro-cho, Tsurumi-ku, Yokohama 230-0045, Japan
| | - Tsutomu Yamane
- Center
for Computational Science, RIKEN, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama 230-0045, Japan
| | - Mitsunori Ikeguchi
- Graduate
School of Medical Life Science, Yokohama
City University, 1-7-29 Suehiro-cho, Tsurumi-ku, Yokohama 230-0045, Japan
- Center
for Computational Science, RIKEN, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama 230-0045, Japan
| |
Collapse
|
4
|
Martínez L. ComplexMixtures.jl: Investigating the structure of solutions of complex-shaped molecules from a solvent-shell perspective. J Mol Liq 2022. [DOI: 10.1016/j.molliq.2021.117945] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
|
5
|
Huang P, Xing H, Zou X, Han Q, Liu K, Sun X, Wu J, Fan J. Accurate Prediction of Hydration Sites of Proteins Using Energy Model With Atom Embedding. Front Mol Biosci 2021; 8:756075. [PMID: 34616774 PMCID: PMC8488165 DOI: 10.3389/fmolb.2021.756075] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2021] [Accepted: 09/02/2021] [Indexed: 11/13/2022] Open
Abstract
We propose a method based on neural networks to accurately predict hydration sites in proteins. In our approach, high-quality data of protein structures are used to parametrize our neural network model, which is a differentiable score function that can evaluate an arbitrary position in 3D structures on proteins and predict the nearest water molecule that is not present. The score function is further integrated into our water placement algorithm to generate explicit hydration sites. In experiments on the OppA protein dataset used in previous studies and our selection of protein structures, our method achieves the highest model quality in terms of F1 score, compared to several previous studies.
Collapse
Affiliation(s)
- Pin Huang
- College of Life Sciences, Beijing Normal University, Beijing, China.,Accutar Biotechnology Inc., Brooklyn, NY, United States
| | - Haoming Xing
- Accutar Biotechnology Inc., Brooklyn, NY, United States
| | - Xun Zou
- Accutar Biotechnology Inc., Brooklyn, NY, United States
| | - Qi Han
- Accutar Biotechnology Inc., Brooklyn, NY, United States
| | - Ke Liu
- Accutar Biotechnology Inc., Brooklyn, NY, United States
| | - Xiangyan Sun
- Accutar Biotechnology Inc., Brooklyn, NY, United States
| | - Junqiu Wu
- Accutar Biotechnology Inc., Brooklyn, NY, United States
| | - Jie Fan
- Accutar Biotechnology Inc., Brooklyn, NY, United States
| |
Collapse
|
6
|
Yoshidome T, Ikeguchi M, Ohta M. Comprehensive 3D-RISM analysis of the hydration of small molecule binding sites in ligand-free protein structures. J Comput Chem 2020; 41:2406-2419. [PMID: 32815201 PMCID: PMC7540010 DOI: 10.1002/jcc.26406] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2020] [Revised: 07/28/2020] [Accepted: 08/05/2020] [Indexed: 12/25/2022]
Abstract
Hydration is a critical factor in the ligand binding process. Herein, to examine the hydration states of ligand binding sites, the three‐dimensional distribution function for the water oxygen site, gO(r), is computed for 3,706 ligand‐free protein structures based on the corresponding small molecule–protein complexes using the 3D‐RISM theory. For crystallographic waters (CWs) close to the ligand, gO(r) reveals that several CWs are stabilized by interaction networks formed between the ligand, CW, and protein. Based on the gO(r) for the crystallographic binding pose of the ligand, hydrogen bond interactions are dominant in the highly hydrated regions while weak interactions such as CH‐O are dominant in the moderately hydrated regions. The polar heteroatoms of the ligand occupy the highly hydrated and moderately hydrated regions in the crystallographic (correct) and wrongly docked (incorrect) poses, respectively. Thus, the gO(r) of polar heteroatoms may be used to distinguish the correct binding poses.
Collapse
Affiliation(s)
- Takashi Yoshidome
- Department of Applied Physics, Graduate School of Engineering, Tohoku University, Sendai, Japan
| | - Mitsunori Ikeguchi
- Drug Development Data Intelligence Platform Group, Medical Science Innovation Hub Program, Cluster of Science, Technology and Innovation Hub, RIKEN, Yokohama, Japan.,Graduate School of Medical Life Science, Yokohama City University, Yokohama, Japan
| | - Masateru Ohta
- Drug Development Data Intelligence Platform Group, Medical Science Innovation Hub Program, Cluster of Science, Technology and Innovation Hub, RIKEN, Yokohama, Japan
| |
Collapse
|