1
|
Mishra A, Kaur U, Singh A. Fisetin 8-C-glucoside as entry inhibitor in SARS CoV-2 infection: molecular modelling study. J Biomol Struct Dyn 2022; 40:5128-5137. [PMID: 33382023 PMCID: PMC7784833 DOI: 10.1080/07391102.2020.1868335] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2020] [Accepted: 12/18/2020] [Indexed: 11/03/2022]
Abstract
Coronaviruses are RNA viruses that infect varied species including humans. TMPRSS2 is gateway for SARS CoV-2 entry into the host cell. It causes proteolytic activation of spike protein and discharge of the peptide into host cell. The TMPRSS2 inhibition could be one of the approaches to stop the viral entry, therefore, interaction pattern and binding energies for Fisetin and TMPRSS2 have been explored in the present study. TMPRSS2 peptide was used for homology modelling and then for further study. Molecular docking score and MMGBSA Binding energy of Fisetin was better than Nafamostat, a known inhibitor of TMPRSS2. Post docking MM-GBSA free energy for Fisetin and Nafamostat was -42.78 and -21.11 kcal/mol, respectively. Fisetin forms H bond with Val 25, His 41, Lys 42, Lys 45, Glu 44, Ser186. Nafamostat formed H bonds with Lys 85, Asp 90, Asp 203. RMSD plots of TMPRSS2, TMPRSS2-Fisetin and TMPRSS2-Nafamostat complex showed stable profile with very small fluctuation during entire simulation of 150 ns. Significant decrease in TMPRSS2-Fisetin and TMPRSS2-Nafamostat complex fluctuation occurred around His 41, Glu 44, Gly 136, Ser 186 in RMSF study. During simulation Fisetin interaction was observed with residues Val 25, His 41, Glu 44, Lys 45, Lys 87, Gly 136, Gln 183, Ser 186 likewise interaction of Nafamostat with Lys 85, Asp 90, Asn 163, Asp 203 and Ser 205. Post simulation MM-GBSA free energy was found to be -51.87 ± 4.3 and -48.23 ± 4.39 kcal/mol for TMPRSS2 with Fisetin and Nafamostat, respectively.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Abha Mishra
- School of Biochemical Engineering, Indian Institute of Technology, Banaras Hindu University, Varanasi, India
| | - Upinder Kaur
- Department of Pharmacology, All India Institute of Medical Sciences, Gorakhpur, India
| | - Amit Singh
- Department of Pharmacology, Institute of Medical Sciences, Banaras Hindu University, Varanasi, India
| |
Collapse
|
2
|
Aggarwal R, Gupta A, Chelur V, Jawahar CV, Priyakumar UD. DeepPocket: Ligand Binding Site Detection and Segmentation using 3D Convolutional Neural Networks. J Chem Inf Model 2021; 62:5069-5079. [PMID: 34374539 DOI: 10.1021/acs.jcim.1c00799] [Citation(s) in RCA: 35] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
A structure-based drug design pipeline involves the development of potential drug molecules or ligands that form stable complexes with a given receptor at its binding site. A prerequisite to this is finding druggable and functionally relevant binding sites on the 3D structure of the protein. Although several methods for detecting binding sites have been developed beforehand, a majority of them surprisingly fail in the identification and ranking of binding sites accurately. The rapid adoption and success of deep learning algorithms in various sections of structural biology beckons the usage of such algorithms for accurate binding site detection. As a combination of geometry based software and deep learning, we report a novel framework, DeepPocket that utilizes 3D convolutional neural networks for the rescoring of pockets identified by Fpocket and further segments these identified cavities on the protein surface. Apart from this, we also propose another data set SC6K containing protein structures submitted in the Protein Data Bank (PDB) from January 1st, 2018, until February 28th, 2020, for ligand binding site (LBS) detection. DeepPocket's results on various binding site data sets and SC6K highlight its better performance over current state-of-the-art methods and good generalization ability over novel structures.
Collapse
Affiliation(s)
- Rishal Aggarwal
- International Institute of Information Technology, Hyderabad 500 032, India
| | - Akash Gupta
- International Institute of Information Technology, Hyderabad 500 032, India
| | - Vineeth Chelur
- International Institute of Information Technology, Hyderabad 500 032, India
| | - C V Jawahar
- International Institute of Information Technology, Hyderabad 500 032, India
| | - U Deva Priyakumar
- International Institute of Information Technology, Hyderabad 500 032, India
| |
Collapse
|
3
|
Lu ZC, Jiang F, Wu YD. Phosphate binding sites prediction in phosphorylation-dependent protein-protein interactions. Bioinformatics 2021; 37:4712-4718. [PMID: 34270697 DOI: 10.1093/bioinformatics/btab525] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2021] [Revised: 06/07/2021] [Accepted: 07/13/2021] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Phosphate binding plays an important role in modulating protein-protein interactions, which are ubiquitous in various biological processes. Accurate prediction of phosphate binding sites is an important but challenging task. Small size and diversity of phosphate binding sites lead to a substantial challenge for developing accurate prediction methods. RESULTS Here we present the phosphate binding site predictor (PBSP), a novel and accurate approach to identifying phosphate binding sites from protein structures. PBSP combines an energy-based ligand-binding sites identification method with reverse focused docking using a phosphate probe. We show that PBSP outperforms not only general ligand binding sites predictors but also other existing phospholigand-specific binding sites predictors. It achieves ∼95% success rate for top 10 predicted sites with an average Matthews correlation coefficient (MCC) value of 0.84 for successful predictions. PBSP can accurately predict phosphate binding modes, with average position error of 1.4 Å and 2.4 Å in bound and unbound datasets, respectively. Lastly, visual inspection of the predictions is conducted. Reasons for failed predictions are further analyzed and possible ways to improve the performance are provided. These results demonstrate a novel and accurate approach to phosphate binding sites identification in protein structures. AVAILABILITY The software and benchmark datasets are freely available at http://web.pkusz.edu.cn/wu/PBSP/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Zheng-Chang Lu
- Lab of Computational Chemistry and Drug Design, State Key Laboratory of Chemical Oncogenomics, Peking University Shenzhen Graduate School, Shenzhen, 518055, China.,Shenzhen Bay Laboratory, Shenzhen, 518055, China
| | - Fan Jiang
- Lab of Computational Chemistry and Drug Design, State Key Laboratory of Chemical Oncogenomics, Peking University Shenzhen Graduate School, Shenzhen, 518055, China.,NanoAI Biotech Co., Ltd, Shenzhen, 518118, China
| | - Yun-Dong Wu
- Lab of Computational Chemistry and Drug Design, State Key Laboratory of Chemical Oncogenomics, Peking University Shenzhen Graduate School, Shenzhen, 518055, China.,Shenzhen Bay Laboratory, Shenzhen, 518055, China.,College of Chemistry and Molecular Engineering, Peking University, Beijing, 100871, China
| |
Collapse
|
4
|
Hendrix SG, Chang KY, Ryu Z, Xie ZR. DeepDISE: DNA Binding Site Prediction Using a Deep Learning Method. Int J Mol Sci 2021; 22:ijms22115510. [PMID: 34073705 PMCID: PMC8197219 DOI: 10.3390/ijms22115510] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2021] [Revised: 04/30/2021] [Accepted: 05/19/2021] [Indexed: 11/18/2022] Open
Abstract
It is essential for future research to develop a new, reliable prediction method of DNA binding sites because DNA binding sites on DNA-binding proteins provide critical clues about protein function and drug discovery. However, the current prediction methods of DNA binding sites have relatively poor accuracy. Using 3D coordinates and the atom-type of surface protein atom as the input, we trained and tested a deep learning model to predict how likely a voxel on the protein surface is to be a DNA-binding site. Based on three different evaluation datasets, the results show that our model not only outperforms several previous methods on two commonly used datasets, but also demonstrates its robust performance to be consistent among the three datasets. The visualized prediction outcomes show that the binding sites are also mostly located in correct regions. We successfully built a deep learning model to predict the DNA binding sites on target proteins. It demonstrates that 3D protein structures plus atom-type information on protein surfaces can be used to predict the potential binding sites on a protein. This approach should be further extended to develop the binding sites of other important biological molecules.
Collapse
Affiliation(s)
- Samuel Godfrey Hendrix
- Computational Drug Discovery Laboratory, School of Electrical and Computer Engineering, College of Engineering, University of Georgia, Athens, GA 30602, USA; (S.G.H.); (Z.R.)
| | - Kuan Y. Chang
- Department of Computer Science and Engineering, National Taiwan Ocean University, Keelung 202, Taiwan;
| | - Zeezoo Ryu
- Computational Drug Discovery Laboratory, School of Electrical and Computer Engineering, College of Engineering, University of Georgia, Athens, GA 30602, USA; (S.G.H.); (Z.R.)
- Department of Computer Science, Franklin College of Arts and Sciences, University of Georgia, Athens, GA 30602, USA
| | - Zhong-Ru Xie
- Computational Drug Discovery Laboratory, School of Electrical and Computer Engineering, College of Engineering, University of Georgia, Athens, GA 30602, USA; (S.G.H.); (Z.R.)
- Correspondence:
| |
Collapse
|
5
|
Brackenridge DA, McGuffin LJ. Proteins and Their Interacting Partners: An Introduction to Protein-Ligand Binding Site Prediction Methods with a Focus on FunFOLD3. Methods Mol Biol 2021; 2365:43-58. [PMID: 34432238 DOI: 10.1007/978-1-0716-1665-9_3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Proteins are essential molecules with a diverse range of functions; elucidating their biological and biochemical characteristics can be difficult and time consuming using in vitro and/or in vivo methods. Additionally, in vivo protein-ligand binding site elucidation is unable to keep place with current growth in sequencing, leaving the majority of new protein sequences without known functions. Therefore, the development of new methods, which aim to predict the protein-ligand interactions and ligand-binding site residues directly from amino acid sequences, is becoming increasingly important. In silico prediction can utilise either sequence information, structural information or a combination of both. In this chapter, we will discuss the broad range of methods for ligand-binding site prediction from protein structure and we will describe our method, FunFOLD3, for the prediction of protein-ligand interactions and ligand-binding sites based on template-based modelling. Additionally, we will describe the step-by-step instructions using the FunFOLD3 downloadable application along with examples from the Critical Assessment of Techniques for Protein Structure Prediction (CASP) where FunFOLD3 has been used to aid ligand and ligand-binding site prediction. Finally, we will introduce our newer method, FunFOLD3-D, a version of FunFOLD3 which aims to improve template-based protein-ligand binding site prediction through the integration of docking, using AutoDock Vina.
Collapse
|
6
|
Predicting binding sites from unbound versus bound protein structures. Sci Rep 2020; 10:15856. [PMID: 32985584 PMCID: PMC7522209 DOI: 10.1038/s41598-020-72906-7] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2020] [Accepted: 07/27/2020] [Indexed: 11/30/2022] Open
Abstract
We present the application of seven binding-site prediction algorithms to a meticulously curated dataset of ligand-bound and ligand-free crystal structures for 304 unique protein sequences (2528 crystal structures). We probe the influence of starting protein structures on the results of binding-site prediction, so the dataset contains a minimum of two ligand-bound and two ligand-free structures for each protein. We use this dataset in a brief survey of five geometry-based, one energy-based, and one machine-learning-based methods: Surfnet, Ghecom, LIGSITEcsc, Fpocket, Depth, AutoSite, and Kalasanty. Distributions of the F scores and Matthew’s correlation coefficients for ligand-bound versus ligand-free structure performance show no statistically significant difference in structure type versus performance for most methods. Only Fpocket showed a statistically significant but low magnitude enhancement in performance for holo structures. Lastly, we found that most methods will succeed on some crystal structures and fail on others within the same protein family, despite all structures being relatively high-quality structures with low structural variation. We expected better consistency across varying protein conformations of the same sequence. Interestingly, the success or failure of a given structure cannot be predicted by quality metrics such as resolution, Cruickshank Diffraction Precision index, or unresolved residues. Cryptic sites were also examined.
Collapse
|
7
|
Wu Y, Lou L, Xie ZR. A Pilot Study of All-Computational Drug Design Protocol-From Structure Prediction to Interaction Analysis. Front Chem 2020; 8:81. [PMID: 32117898 PMCID: PMC7028743 DOI: 10.3389/fchem.2020.00081] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2019] [Accepted: 01/24/2020] [Indexed: 11/13/2022] Open
Abstract
Speeding up the drug discovery process is of great significance. To achieve that, high-efficiency methods should be exploited. The conventional wet-bench methods hardly meet the high-speed demand due to time-consuming experiments. Conversely, in silico approaches are much more efficient for drug discovery and design. However, in silico approaches usually serve as a supportive role in research processes. To fully exert the strength of computational methods, we propose a protocol which integrates various in silico approaches, from de novo protein structure prediction to ligand-protein interaction simulation. As a proof of concept, human SK2/calmodulin complex was used as a target for validation. First, we obtained a predicted structure of SK2/calmodulin and predicted binding sites which were consistent with the literature data. Then we investigated the ligand-protein interaction via virtual mutagenesis, flexible docking, and binding affinity calculation. As a result, the binding energies of mutants have similar trends compared with the EC50 values (R = 0.6 for NS309 in V481 mutants). The results indicate that our protocol can be applied to the drug design of structure unknown proteins. Our study also demonstrates that the integration of in silico approaches is feasible and it facilitates the acceleration of new drug discovery.
Collapse
Affiliation(s)
- Yifei Wu
- Computational Drug Discovery Laboratory, School of Electrical and Computer Engineering, College of Engineering, University of Georgia, Athens, GA, United States
| | - Lei Lou
- Computational Drug Discovery Laboratory, School of Electrical and Computer Engineering, College of Engineering, University of Georgia, Athens, GA, United States
| | - Zhong-Ru Xie
- Computational Drug Discovery Laboratory, School of Electrical and Computer Engineering, College of Engineering, University of Georgia, Athens, GA, United States
| |
Collapse
|
8
|
Chen X, Liu H, Xie W, Yang Y, Wang Y, Fan Y, Hua Y, Zhu L, Zhao J, Lu T, Chen Y, Zhang Y. Investigation of Crystal Structures in Structure-Based Virtual Screening for Protein Kinase Inhibitors. J Chem Inf Model 2019; 59:5244-5262. [PMID: 31689093 DOI: 10.1021/acs.jcim.9b00684] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Protein kinases are important drug targets in several therapeutic areas ,and structure-based virtual screening (SBVS) is an important strategy in discovering lead compounds for kinase targets. However, there are multiple crystal structures available for each target, and determining which one is the most favorable is a key step in molecular docking for SBVS due to the ligand induce-fit effect. This work aimed to find the most desirable crystal structures for molecular docking by a comprehensive analysis of the protein kinase database which covers 190 different kinases from all eight main kinase families. Through an integrated self-docking and cross-docking evaluation, 86 targets were eventually evaluated on a total of 2608 crystal structures. Results showed that molecular docking has great capability in reproducing conformation of crystallized ligands and for each target, the most favorable crystal structure was selected, and the AGC family outperformed the other family targets based on RMSD comparison. In addition, RMSD values, GlideScore, and corresponding bioactivity data were compared and demonstrated certain relationships. This work provides great convenience for researchers to directly select the optimal crystal structure in SBVS-based kinase drug design and further validates the effectiveness of molecular docking in drug discovery.
Collapse
Affiliation(s)
- Xingye Chen
- Laboratory of Molecular Design and Drug Discovery, School of Science , China Pharmaceutical University , 639 Longmian Avenue , Nanjing 211198 , China
| | - Haichun Liu
- Laboratory of Molecular Design and Drug Discovery, School of Science , China Pharmaceutical University , 639 Longmian Avenue , Nanjing 211198 , China
| | - Wuchen Xie
- Laboratory of Molecular Design and Drug Discovery, School of Science , China Pharmaceutical University , 639 Longmian Avenue , Nanjing 211198 , China
| | - Yan Yang
- Laboratory of Molecular Design and Drug Discovery, School of Science , China Pharmaceutical University , 639 Longmian Avenue , Nanjing 211198 , China
| | - Yuchen Wang
- Laboratory of Molecular Design and Drug Discovery, School of Science , China Pharmaceutical University , 639 Longmian Avenue , Nanjing 211198 , China
| | - Yuanrong Fan
- Laboratory of Molecular Design and Drug Discovery, School of Science , China Pharmaceutical University , 639 Longmian Avenue , Nanjing 211198 , China
| | - Yi Hua
- Laboratory of Molecular Design and Drug Discovery, School of Science , China Pharmaceutical University , 639 Longmian Avenue , Nanjing 211198 , China
| | - Lu Zhu
- Laboratory of Molecular Design and Drug Discovery, School of Science , China Pharmaceutical University , 639 Longmian Avenue , Nanjing 211198 , China
| | - Junnan Zhao
- Laboratory of Molecular Design and Drug Discovery, School of Science , China Pharmaceutical University , 639 Longmian Avenue , Nanjing 211198 , China
| | - Tao Lu
- Laboratory of Molecular Design and Drug Discovery, School of Science , China Pharmaceutical University , 639 Longmian Avenue , Nanjing 211198 , China.,State Key Laboratory of Natural Medicines , China Pharmaceutical University , 24 Tongjiaxiang , Nanjing 210009 , China
| | - Yadong Chen
- Laboratory of Molecular Design and Drug Discovery, School of Science , China Pharmaceutical University , 639 Longmian Avenue , Nanjing 211198 , China
| | - Yanmin Zhang
- Laboratory of Molecular Design and Drug Discovery, School of Science , China Pharmaceutical University , 639 Longmian Avenue , Nanjing 211198 , China
| |
Collapse
|
9
|
Krivák R, Hoksza D. P2Rank: machine learning based tool for rapid and accurate prediction of ligand binding sites from protein structure. J Cheminform 2018; 10:39. [PMID: 30109435 PMCID: PMC6091426 DOI: 10.1186/s13321-018-0285-8] [Citation(s) in RCA: 181] [Impact Index Per Article: 30.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2017] [Accepted: 06/29/2018] [Indexed: 01/29/2023] Open
Abstract
Background Ligand binding site prediction from protein structure has many applications related to elucidation of protein function and structure based drug discovery. It often represents only one step of many in complex computational drug design efforts. Although many methods have been published to date, only few of them are suitable for use in automated pipelines or for processing large datasets.
These use cases require stability and speed, which disqualifies many of the recently introduced tools that are either template based or available only as web servers. Results We present P2Rank, a stand-alone template-free tool for prediction of ligand binding sites based on machine learning. It is based on prediction of ligandability of local chemical neighbourhoods that are centered on points placed on the solvent accessible surface of a protein.
We show that P2Rank outperforms several existing tools, which include two widely used stand-alone tools (Fpocket, SiteHound), a comprehensive consensus based tool (MetaPocket 2.0), and a recent deep learning based method (DeepSite). P2Rank belongs to the fastest available tools (requires under 1 s for prediction on one protein), with additional advantage of multi-threaded implementation. Conclusions P2Rank is a new open source software package for ligand binding site prediction from protein structure. It is available as a user-friendly stand-alone command line program and a Java library. P2Rank has a lightweight installation and does not depend on other bioinformatics tools or large structural or sequence databases. Thanks to its speed and ability to make fully automated predictions, it is particularly well suited for processing large datasets or as a component of scalable structural bioinformatics pipelines. Electronic supplementary material The online version of this article (10.1186/s13321-018-0285-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Radoslav Krivák
- Department of Software Engineering, Charles University, Prague, Czech Republic.
| | - David Hoksza
- Department of Software Engineering, Charles University, Prague, Czech Republic.
| |
Collapse
|
10
|
Zeng L, Shin WH, Zhu X, Park SH, Park C, Tao WA, Kihara D. Discovery of Nicotinamide Adenine Dinucleotide Binding Proteins in the Escherichia coli Proteome Using a Combined Energetic- and Structural-Bioinformatics-Based Approach. J Proteome Res 2016; 16:470-480. [PMID: 28152599 DOI: 10.1021/acs.jproteome.6b00624] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Protein-ligand interaction plays a critical role in regulating the biochemical functions of proteins. Discovering protein targets for ligands is vital to new drug development. Here, we present a strategy that combines experimental and computational approaches to identify ligand-binding proteins in a proteomic scale. For the experimental part, we coupled pulse proteolysis with filter-assisted sample preparation (FASP) and quantitative mass spectrometry. Under denaturing conditions, ligand binding affected protein stability, which resulted in altered protein abundance after pulse proteolysis. For the computational part, we used the software Patch-Surfer2.0. We applied the integrated approach to identify nicotinamide adenine dinucleotide (NAD)-binding proteins in the Escherichia coli proteome, which has over 4200 proteins. Pulse proteolysis and Patch-Surfer2.0 identified 78 and 36 potential NAD-binding proteins, respectively, including 12 proteins that were consistently detected by the two approaches. Interestingly, the 12 proteins included 8 that are not previously known as NAD binders. Further validation of these eight proteins showed that their binding affinities to NAD computed by AutoDock Vina are higher than their cognate ligands and also that their protein ratios in the pulse proteolysis are consistent with known NAD-binding proteins. These results strongly suggest that these eight proteins are indeed newly identified NAD binders.
Collapse
Affiliation(s)
| | | | - Xiaolei Zhu
- School of Life Science, Anhui University , Hefei, Anhui 230601, China
| | - Sung Hoon Park
- Research Institute of Food and Biotechnology, SPC Group , Seoul 06737, South Korea
| | | | | | | |
Collapse
|
11
|
Cao C, Xu S. Improving the performance of the PLB index for ligand-binding site prediction using dihedral angles and the solvent-accessible surface area. Sci Rep 2016; 6:33232. [PMID: 27619067 PMCID: PMC5020399 DOI: 10.1038/srep33232] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2016] [Accepted: 08/23/2016] [Indexed: 12/02/2022] Open
Abstract
Protein ligand-binding site prediction is highly important for protein function determination and structure-based drug design. Over the past twenty years, dozens of computational methods have been developed to address this problem. Soga et al. identified ligand cavities based on the preferences of amino acids for the ligand-binding site (RA) and proposed the propensity for ligand binding (PLB) index to rank the cavities on the protein surface. However, we found that residues exhibit different RAs in response to changes in solvent exposure. Furthermore, previous studies have suggested that some dihedral angles of amino acids in specific regions of the Ramachandran plot are preferred at the functional sites of proteins. Based on these discoveries, the amino acid solvent-accessible surface area and dihedral angles were combined with the RA and PLB to obtain two new indexes, multi-factor RA (MF-RA) and multi-factor PLB (MF-PLB). MF-PLB, PLB and other methods were tested using two benchmark databases and two particular ligand-binding sites. The results show that MF-PLB can improve the success rate of PLB for both ligand-bound and ligand-unbound structures, particularly for top choice prediction.
Collapse
Affiliation(s)
- Chen Cao
- College of Computer Science and Technology, Jilin University, Changchun, Jilin, China
- Key Laboratory of Symbol Computation and Knowledge Engineering of the Ministry of Education, Jilin University, Changchun, Jilin, China
| | - Shutan Xu
- Department of Biochemistry and Molecular Biology, Institute of Bioinformatics, University of Georgia, Athens, GA, USA
| |
Collapse
|
12
|
Brown JA, Espiritu MV, Abraham J, Thorpe IF. Computational predictions suggest that structural similarity in viral polymerases may lead to comparable allosteric binding sites. Virus Res 2016; 222:80-93. [PMID: 27262620 PMCID: PMC4969206 DOI: 10.1016/j.virusres.2016.05.029] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2016] [Revised: 05/27/2016] [Accepted: 05/31/2016] [Indexed: 01/18/2023]
Abstract
The identification of ligand-binding sites is often the first step in drug targeting and design. To date there are numerous computational tools available to predict ligand binding sites. These tools can guide or mitigate the need for experimental methods to identify binding sites, which often require significant resources and time. Here, we evaluate four ligand-binding site predictor (LBSP) tools for their ability to predict allosteric sites within the Hepatitis C Virus (HCV) polymerase. Our results show that the LISE LBSP is able to identify all three target allosteric sites within the HCV polymerase as well as a known allosteric site in the Coxsackievirus polymerase. LISE was then employed to identify novel binding sites within the polymerases of the Dengue, West Nile, and Foot-and-mouth Disease viruses. Our results suggest that all three viral polymerases have putative sites that share structural or chemical similarities with allosteric pockets of the HCV polymerase. Thus, these binding locations may represent an evolutionarily conserved structural feature of several viral polymerases that could be exploited for the development of small molecule therapeutics.
Collapse
Affiliation(s)
- Jodian A Brown
- Department of Chemistry & Biochemistry, University of Maryland Baltimore County, Baltimore, MD 21250, USA
| | - Marie V Espiritu
- Department of Chemistry & Biochemistry, University of Maryland Baltimore County, Baltimore, MD 21250, USA
| | - Joel Abraham
- Department of Chemistry & Biochemistry, University of Maryland Baltimore County, Baltimore, MD 21250, USA
| | - Ian F Thorpe
- Department of Chemistry & Biochemistry, University of Maryland Baltimore County, Baltimore, MD 21250, USA.
| |
Collapse
|
13
|
Jian JW, Elumalai P, Pitti T, Wu CY, Tsai KC, Chang JY, Peng HP, Yang AS. Predicting Ligand Binding Sites on Protein Surfaces by 3-Dimensional Probability Density Distributions of Interacting Atoms. PLoS One 2016; 11:e0160315. [PMID: 27513851 PMCID: PMC4981321 DOI: 10.1371/journal.pone.0160315] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2015] [Accepted: 07/18/2016] [Indexed: 11/18/2022] Open
Abstract
Predicting ligand binding sites (LBSs) on protein structures, which are obtained either from experimental or computational methods, is a useful first step in functional annotation or structure-based drug design for the protein structures. In this work, the structure-based machine learning algorithm ISMBLab-LIG was developed to predict LBSs on protein surfaces with input attributes derived from the three-dimensional probability density maps of interacting atoms, which were reconstructed on the query protein surfaces and were relatively insensitive to local conformational variations of the tentative ligand binding sites. The prediction accuracy of the ISMBLab-LIG predictors is comparable to that of the best LBS predictors benchmarked on several well-established testing datasets. More importantly, the ISMBLab-LIG algorithm has substantial tolerance to the prediction uncertainties of computationally derived protein structure models. As such, the method is particularly useful for predicting LBSs not only on experimental protein structures without known LBS templates in the database but also on computationally predicted model protein structures with structural uncertainties in the tentative ligand binding sites.
Collapse
Affiliation(s)
- Jhih-Wei Jian
- Genomics Research Center, Academia Sinica, Taipei, Taiwan 115
- Institute of Biomedical Informatics, National Yang-Ming University, Taipei, Taiwan 11221
- Bioinformatics Program, Taiwan International Graduate Program, Institute of Information Science, Academia Sinica, Taipei, Taiwan 115
| | | | - Thejkiran Pitti
- Genomics Research Center, Academia Sinica, Taipei, Taiwan 115
- Bioinformatics Program, Taiwan International Graduate Program, Institute of Information Science, Academia Sinica, Taipei, Taiwan 115
- Institute of Bioinformatics and Structural Biology, National Tsing Hua University, Hsinchu, Taiwan 30013
| | - Chih Yuan Wu
- Genomics Research Center, Academia Sinica, Taipei, Taiwan 115
| | - Keng-Chang Tsai
- Genomics Research Center, Academia Sinica, Taipei, Taiwan 115
| | - Jeng-Yih Chang
- Genomics Research Center, Academia Sinica, Taipei, Taiwan 115
| | - Hung-Pin Peng
- Genomics Research Center, Academia Sinica, Taipei, Taiwan 115
| | - An-Suei Yang
- Genomics Research Center, Academia Sinica, Taipei, Taiwan 115
- * E-mail:
| |
Collapse
|
14
|
Gao J, Zhang Q, Liu M, Zhu L, Wu D, Cao Z, Zhu R. bSiteFinder, an improved protein-binding sites prediction server based on structural alignment: more accurate and less time-consuming. J Cheminform 2016; 8:38. [PMID: 27403208 PMCID: PMC4939519 DOI: 10.1186/s13321-016-0149-z] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2016] [Accepted: 06/30/2016] [Indexed: 11/10/2022] Open
Abstract
MOTIVATION Protein-binding sites prediction lays a foundation for functional annotation of protein and structure-based drug design. As the number of available protein structures increases, structural alignment based algorithm becomes the dominant approach for protein-binding sites prediction. However, the present algorithms underutilize the ever increasing numbers of three-dimensional protein-ligand complex structures (bound protein), and it could be improved on the process of alignment, selection of templates and clustering of template. Herein, we built so far the largest database of bound templates with stringent quality control. And on this basis, bSiteFinder as a protein-binding sites prediction server was developed. RESULTS By introducing Homology Indexing, Chain Length Indexing, Stability of Complex and Optimized Multiple-Templates Clustering into our algorithm, the efficiency of our server has been significantly improved. Further, the accuracy was approximately 2-10 % higher than that of other algorithms for the test with either bound dataset or unbound dataset. For 210 bound dataset, bSiteFinder achieved high accuracies up to 94.8 % (MCC 0.95). For another 48 bound/unbound dataset, bSiteFinder achieved high accuracies up to 93.8 % for bound proteins (MCC 0.95) and 85.4 % for unbound proteins (MCC 0.72). Our bSiteFinder server is freely available at http://binfo.shmtu.edu.cn/bsitefinder/, and the source code is provided at the methods page. CONCLUSION An online bSiteFinder server is freely available at http://binfo.shmtu.edu.cn/bsitefinder/. Our work lays a foundation for functional annotation of protein and structure-based drug design. With ever increasing numbers of three-dimensional protein-ligand complex structures, our server should be more accurate and less time-consuming.Graphical Abstract bSiteFinder (http://binfo.shmtu.edu.cn/bsitefinder/) as a protein-binding sites prediction server was developed based on the largest database of bound templates so far with stringent quality control. By introducing Homology Indexing, Chain Length Indexing, Stability of Complex and Optimized Multiple-Templates Clustering into our algorithm, the efficiency of our server have been significantly improved. What's more, the accuracy was approximately 2-10 % higher than that of other algorithms for the test with either bound dataset or unbound dataset.
Collapse
Affiliation(s)
- Jun Gao
- Department of Bioinformatics, Tongji University, Shanghai, 200092 People's Republic of China ; School of Information Engineering, Shanghai Maritime University, Shanghai, 201306 People's Republic of China
| | - Qingchen Zhang
- Department of Bioinformatics, Tongji University, Shanghai, 200092 People's Republic of China
| | - Min Liu
- School of Information Engineering, Shanghai Maritime University, Shanghai, 201306 People's Republic of China
| | - Lixin Zhu
- Digestive Diseases and Nutrition Center, Department of Pediatrics, The State University of New York at Buffalo, Buffalo, NY 14260 USA ; Genomics, Environment, and Microbiome Community of Excellence, The State University of New York at Buffalo, Buffalo, NY 14203 USA ; Institute of Digestive Diseases, Longhua Hospital, Shanghai University of Traditional Chinese Medicine, Shanghai, 200032 People's Republic of China
| | - Dingfeng Wu
- Department of Bioinformatics, Tongji University, Shanghai, 200092 People's Republic of China
| | - Zhiwei Cao
- Department of Bioinformatics, Tongji University, Shanghai, 200092 People's Republic of China
| | - Ruixin Zhu
- Department of Bioinformatics, Tongji University, Shanghai, 200092 People's Republic of China
| |
Collapse
|
15
|
Abstract
Ligand binding is required for many proteins to function properly. A large number of bioinformatics tools have been developed to predict ligand binding sites as a first step in understanding a protein's function or to facilitate docking computations in virtual screening based drug design. The prediction usually requires only the three-dimensional structure (experimentally determined or computationally modeled) of the target protein to be searched for ligand binding site(s), and Web servers have been built, allowing the free and simple use of prediction tools. In this chapter, we review the underlying concepts of the methods used by various tools, and discuss their different features and the related issues of ligand binding site prediction. Some cautionary notes about the use of these tools are also provided.
Collapse
Affiliation(s)
- Zhong-Ru Xie
- Institute of Biomedical Sciences, Academia Sinica, 128 Academia Road, Section 2, Nankang, Taipei, 115, Taiwan
| | | |
Collapse
|
16
|
Zhu X, Xiong Y, Kihara D. Large-scale binding ligand prediction by improved patch-based method Patch-Surfer2.0. ACTA ACUST UNITED AC 2014; 31:707-13. [PMID: 25359888 DOI: 10.1093/bioinformatics/btu724] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
MOTIVATION Ligand binding is a key aspect of the function of many proteins. Thus, binding ligand prediction provides important insight in understanding the biological function of proteins. Binding ligand prediction is also useful for drug design and examining potential drug side effects. RESULTS We present a computational method named Patch-Surfer2.0, which predicts binding ligands for a protein pocket. By representing and comparing pockets at the level of small local surface patches that characterize physicochemical properties of the local regions, the method can identify binding pockets of the same ligand even if they do not share globally similar shapes. Properties of local patches are represented by an efficient mathematical representation, 3D Zernike Descriptor. Patch-Surfer2.0 has significant technical improvements over our previous prototype, which includes a new feature that captures approximate patch position with a geodesic distance histogram. Moreover, we constructed a large comprehensive database of ligand binding pockets that will be searched against by a query. The benchmark shows better performance of Patch-Surfer2.0 over existing methods. AVAILABILITY AND IMPLEMENTATION http://kiharalab.org/patchsurfer2.0/ CONTACT: dkihara@purdue.edu SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Xiaolei Zhu
- Department of Biological Science, Purdue University, West Lafayette, IN 47906, USA and Department of Computer Science, Purdue University, West Lafayette, IN 47906, USA
| | - Yi Xiong
- Department of Biological Science, Purdue University, West Lafayette, IN 47906, USA and Department of Computer Science, Purdue University, West Lafayette, IN 47906, USA
| | - Daisuke Kihara
- Department of Biological Science, Purdue University, West Lafayette, IN 47906, USA and Department of Computer Science, Purdue University, West Lafayette, IN 47906, USA Department of Biological Science, Purdue University, West Lafayette, IN 47906, USA and Department of Computer Science, Purdue University, West Lafayette, IN 47906, USA
| |
Collapse
|
17
|
Abstract
The amount of known protein structures is continuously growing, exhibited in over 95,000 3D structures freely available via the PDB. Over the last decade, pharmaceutical research has sparked interest in computationally extracting information from this large data pool, resulting in a homology-driven knowledge transfer from annotated to new structures. Studying protein structures with respect to understanding and modulating their functional behavior means analyzing their centers of action. Therefore, the detection and description of potential binding sites on the protein surface is a major step towards protein classification and assessment. Subsequently, these representations can be incorporated to compare proteins, and to predict their druggability or function. Especially in the context of target identification and polypharmacology, automated tools for large-scale target comparisons are highly needed. In this article, developments for automated structure-based target assessment are reviewed and remaining challenges as well as future perspectives are discussed.
Collapse
|
18
|
|
19
|
Lee HS, Im W. Ligand binding site detection by local structure alignment and its performance complementarity. J Chem Inf Model 2013; 53:2462-70. [PMID: 23957286 DOI: 10.1021/ci4003602] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
Accurate determination of potential ligand binding sites (BS) is a key step for protein function characterization and structure-based drug design. Despite promising results of template-based BS prediction methods using global structure alignment (GSA), there is room to improve the performance by properly incorporating local structure alignment (LSA) because BS are local structures and often similar for proteins with dissimilar global folds. We present a template-based ligand BS prediction method using G-LoSA, our LSA tool. A large benchmark set validation shows that G-LoSA predicts drug-like ligands' positions in single-chain protein targets more precisely than TM-align, a GSA-based method, while the overall success rate of TM-align is better. G-LoSA is particularly efficient for accurate detection of local structures conserved across proteins with diverse global topologies. Recognizing the performance complementarity of G-LoSA to TM-align and a nontemplate geometry-based method, fpocket, a robust consensus scoring method, CMCS-BSP (Complementary Methods and Consensus Scoring for ligand Binding Site Prediction), is developed and shows improvement on prediction accuracy.
Collapse
Affiliation(s)
- Hui Sun Lee
- Department of Molecular Biosciences and Center for Bioinformatics, The University of Kansas , 2030 Becker Drive, Lawrence, Kansas 66047, United States
| | | |
Collapse
|
20
|
Lavanya P, Ramaiah S, Anbarasu A. Influence of C-H...O interactions on the structural stability of β-lactamases. J Biol Phys 2013; 39:649-63. [PMID: 23996409 DOI: 10.1007/s10867-013-9324-x] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2013] [Accepted: 05/26/2013] [Indexed: 01/31/2023] Open
Abstract
β-Lactamases produced by pathogenic bacteria cleave β-lactam antibiotics and render them ineffective. Understanding the principles that govern the structural stability of β-lactamases requires elucidation of the nature of the interactions that are involved in stabilization. In the present study, we systematically analyze the influence of CH...O interactions on determining the specificity and stability of β-lactamases in relation to environmental preferences. It is interesting to note that all the residues located in the active site of β-lactamases are involved in CH...O interactions. A significant percentage of CH...O interactions have a higher conservation score and short-range interactions are the predominant type of interactions in β-lactamases. These results will be useful in understanding the stability patterns of β-lactamases.
Collapse
Affiliation(s)
- P Lavanya
- Medical & Biological Computing Laboratory, School of Biosciences and Technology, VIT University, Vellore 632014, Tamil Nadu, India
| | | | | |
Collapse
|
21
|
Xie ZR, Liu CK, Hsiao FC, Yao A, Hwang MJ. LISE: a server using ligand-interacting and site-enriched protein triangles for prediction of ligand-binding sites. Nucleic Acids Res 2013; 41:W292-6. [PMID: 23609546 PMCID: PMC3692107 DOI: 10.1093/nar/gkt300] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
LISE is a web server for a novel method for predicting small molecule binding sites on proteins. It differs from a number of servers currently available for such predictions in two aspects. First, rather than relying on knowledge of similar protein structures, identification of surface cavities or estimation of binding energy, LISE computes a score by counting geometric motifs extracted from sub-structures of interaction networks connecting protein and ligand atoms. These network motifs take into account spatial and physicochemical properties of ligand-interacting protein surface atoms. Second, LISE has now been more thoroughly tested, as, in addition to the evaluation we previously reported using two commonly used small benchmark test sets and targets of two community-based experiments on ligand-binding site predictions, we now report an evaluation using a large non-redundant data set containing >2000 protein–ligand complexes. This unprecedented test, the largest ever reported to our knowledge, demonstrates LISE’s overall accuracy and robustness. Furthermore, we have identified some hard to predict protein classes and provided an estimate of the performance that can be expected from a state-of-the-art binding site prediction server, such as LISE, on a proteome scale. The server is freely available at http://lise.ibms.sinica.edu.tw.
Collapse
Affiliation(s)
- Zhong-Ru Xie
- Institute of Biomedical Sciences, Academia Sinica, Taipei 115, Taiwan
| | | | | | | | | |
Collapse
|