1
|
Ramatenki V, Dumpati R, Vadija R, Vellanki S, Potlapally SR, Rondla R, Vuruputuri U. Targeting the ubiquitin-conjugating enzyme E2D4 for cancer drug discovery-a structure-based approach. J Chem Biol 2016; 10:51-67. [PMID: 28405240 DOI: 10.1007/s12154-016-0164-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2016] [Accepted: 12/05/2016] [Indexed: 10/20/2022] Open
Abstract
Cancer progression is a global burden. The incidence and mortality now reach 30 million deaths per year. Several pathways of cancer are under investigation for the discovery of effective therapeutics. The present study highlights the structural details of the ubiquitin protein 'Ubiquitin-conjugating enzyme E2D4' (UBE2D4) for the novel lead structure identification in cancer drug discovery process. The evaluation of 3D structure of UBE2D4 was carried out using homology modelling techniques. The optimized structure was validated by standard computational protocols. The active site region of the UBE2D4 was identified using computational tools like CASTp, Q-site Finder and SiteMap. The hydrophobic pocket which is responsible for binding with its natural receptor ubiquitin ligase CHIP (C-terminal of Hsp 70 interacting protein) was identified through protein-protein docking study. Corroborating the results obtained from active site prediction tools and protein-protein docking study, the domain of UBE2D4 which is responsible for cancer cell progression is sorted out for further docking study. Virtual screening with large structural database like CB_Div Set and Asinex BioDesign small molecular structural database was carried out. The obtained new ligand molecules that have shown affinity towards UBE2D4 were considered for ADME prediction studies. The identified new ligand molecules with acceptable parameters of docking, ADME are considered as potent UBE2D4 enzyme inhibitors for cancer therapy.
Collapse
Affiliation(s)
- Vishwanath Ramatenki
- Department of Chemistry, University College of Science, Osmania University, Hyderabad, Telangana 500007 India
| | - Ramakrishna Dumpati
- Department of Chemistry, University College of Science, Osmania University, Hyderabad, Telangana 500007 India
| | - Rajender Vadija
- Department of Chemistry, University College of Science, Osmania University, Hyderabad, Telangana 500007 India
| | - Santhiprada Vellanki
- Department of Chemistry, University College of Science, Osmania University, Hyderabad, Telangana 500007 India
| | - Sarita Rajender Potlapally
- Department of Chemistry, Nizam College, Osmania University, Basheerbagh, Hyderabad, Telangana 500001 India
| | - Rohini Rondla
- Department of Chemistry, University College of Science, Osmania University, Hyderabad, Telangana 500007 India
| | - Uma Vuruputuri
- Department of Chemistry, University College of Science, Osmania University, Hyderabad, Telangana 500007 India
| |
Collapse
|
2
|
Flores DI, Sotelo-Mundo RR, Brizuela CA. A simple extension to the CMASA method for the prediction of catalytic residues in the presence of single point mutations. PLoS One 2014; 9:e108513. [PMID: 25268770 PMCID: PMC4182483 DOI: 10.1371/journal.pone.0108513] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2014] [Accepted: 08/31/2014] [Indexed: 11/23/2022] Open
Abstract
The automatic identification of catalytic residues still remains an important challenge in structural bioinformatics. Sequence-based methods are good alternatives when the query shares a high percentage of identity with a well-annotated enzyme. However, when the homology is not apparent, which occurs with many structures from the structural genome initiative, structural information should be exploited. A local structural comparison is preferred to a global structural comparison when predicting functional residues. CMASA is a recently proposed method for predicting catalytic residues based on a local structure comparison. The method achieves high accuracy and a high value for the Matthews correlation coefficient. However, point substitutions or a lack of relevant data strongly affect the performance of the method. In the present study, we propose a simple extension to the CMASA method to overcome this difficulty. Extensive computational experiments are shown as proof of concept instances, as well as for a few real cases. The results show that the extension performs well when the catalytic site contains mutated residues or when some residues are missing. The proposed modification could correctly predict the catalytic residues of a mutant thymidylate synthase, 1EVF. It also successfully predicted the catalytic residues for 3HRC despite the lack of information for a relevant side chain atom in the PDB file.
Collapse
Affiliation(s)
- David I. Flores
- Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada, Ensenada, Baja California, México
| | | | - Carlos A. Brizuela
- Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada, Ensenada, Baja California, México
- * E-mail:
| |
Collapse
|
3
|
Kisslov I, Naamati A, Shakarchy N, Pines O. Dual-targeted proteins tend to be more evolutionarily conserved. Mol Biol Evol 2014; 31:2770-9. [PMID: 25063438 DOI: 10.1093/molbev/msu221] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
In eukaryotic cells, identical proteins can be located in more than a single subcellular compartment, a phenomenon termed dual targeting. We hypothesized that dual-targeted proteins should be more evolutionary conserved than exclusive mitochondrial proteins, due to separate selective pressures administered by the different compartments to maintain the functions associated with the protein sequences. We employed codon usage bias, propensity for gene loss, phylogenetic relationships, conservation analysis at the DNA level, and gene expression, to test our hypothesis. Our findings indicate that, indeed, dual-targeted proteins are significantly more conserved than their exclusively targeted counterparts. We then used this trait of gene conservation, together with previously identified traits of dual-targeted proteins (such as protein net charge and mitochondrial targeting sequence strength) to 1) create, for the first time (due to addition of conservation parameters), a tool for the prediction of dual-targeted mitochondrial proteins based on protein and mRNA sequences, and 2) show that molecular mechanisms involving one versus two translation products are not correlated with specific dual-targeting parameters. Finally, we discuss what evolutionary pressure maintains protein dual targeting in eukaryotes and deduce, as we initially hypothesized, that it is the discrete functions of these proteins in the different subcellular compartments, regardless of their dual-targeting mechanism.
Collapse
Affiliation(s)
- Irit Kisslov
- Department of Microbiology Molecular Genetics, IMRIC, Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Adi Naamati
- Department of Microbiology Molecular Genetics, IMRIC, Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel Department of Microbiology Molecular Genetics, IMRIC, Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Nitzan Shakarchy
- Department of Microbiology Molecular Genetics, IMRIC, Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Ophry Pines
- Department of Microbiology Molecular Genetics, IMRIC, Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel CREATE-NUS-HUJ Cellular & Molecular Mechanisms of Inflammation Program, National University of Singapore, Singapore
| |
Collapse
|
4
|
Khazanov NA, Carlson HA. Exploring the composition of protein-ligand binding sites on a large scale. PLoS Comput Biol 2013; 9:e1003321. [PMID: 24277997 PMCID: PMC3836696 DOI: 10.1371/journal.pcbi.1003321] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2013] [Accepted: 09/23/2013] [Indexed: 12/21/2022] Open
Abstract
The residue composition of a ligand binding site determines the interactions available for diffusion-mediated ligand binding, and understanding general composition of these sites is of great importance if we are to gain insight into the functional diversity of the proteome. Many structure-based drug design methods utilize such heuristic information for improving prediction or characterization of ligand-binding sites in proteins of unknown function. The Binding MOAD database if one of the largest curated sets of protein-ligand complexes, and provides a source of diverse, high-quality data for establishing general trends of residue composition from currently available protein structures. We present an analysis of 3,295 non-redundant proteins with 9,114 non-redundant binding sites to identify residues over-represented in binding regions versus the rest of the protein surface. The Binding MOAD database delineates biologically-relevant “valid” ligands from “invalid” small-molecule ligands bound to the protein. Invalids are present in the crystallization medium and serve no known biological function. Contacts are found to differ between these classes of ligands, indicating that residue composition of biologically relevant binding sites is distinct not only from the rest of the protein surface, but also from surface regions capable of opportunistic binding of non-functional small molecules. To confirm these trends, we perform a rigorous analysis of the variation of residue propensity with respect to the size of the dataset and the content bias inherent in structure sets obtained from a large protein structure database. The optimal size of the dataset for establishing general trends of residue propensities, as well as strategies for assessing the significance of such trends, are suggested for future studies of binding-site composition. Describing the general structure of protein binding sites is fundamentally important for guiding drug design and better understanding structure-function relationships. Here, we analyze small molecules bound to proteins within our large database, Binding MOAD (Mother of All Databases, pronounced like “mode” as a pun referring to ligand-binding modes). We focus on different contacts across the residues in the binding sites, and we normalize the data relative to the protein's entire surface. A key feature of this study is the use of a “control” where we compare real, functional binding sites to the random contacts seen for crystallographic additives against the protein surface. Controls are required in experimental biology, but they are ill-defined in many computational approaches. This allows us to describe how true binding sites are unique on the protein surface and distinct from random patches that attract common, small molecules.
Collapse
Affiliation(s)
- Nickolay A. Khazanov
- Bioinformatics Graduate Program, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Heather A. Carlson
- Bioinformatics Graduate Program, University of Michigan, Ann Arbor, Michigan, United States of America
- Department of Medicinal Chemistry, University of Michigan, Ann Arbor, Michigan, United States of America
- * E-mail:
| |
Collapse
|
5
|
Rahimi A, Madadkar-Sobhani A, Touserkani R, Goliaei B. Efficacy of function specific 3D-motifs in enzyme classification according to their EC-numbers. J Theor Biol 2013; 336:36-43. [PMID: 23871713 DOI: 10.1016/j.jtbi.2013.07.003] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2013] [Revised: 06/05/2013] [Accepted: 07/02/2013] [Indexed: 11/28/2022]
Abstract
Due to the increasing number of protein structures with unknown function originated from structural genomics projects, protein function prediction has become an important subject in bioinformatics. Among diverse function prediction methods, exploring known 3D-motifs, which are associated with functional elements in unknown protein structures is one of the most biologically meaningful methods. Homologous enzymes inherit such motifs in their active sites from common ancestors. However, slight differences in the properties of these motifs, results in variation in the reactions and substrates of the enzymes. In this study, we examined the possibility of discriminating highly related active site patterns according to their EC-numbers by 3D-motifs. For each EC-number, the spatial arrangement of an active site, which has minimum average distance to other active sites with the same function, was selected as a representative 3D-motif. In order to characterize the motifs, various points in active site elements were tested. The results demonstrated the possibility of predicting full EC-number of enzymes by 3D-motifs. However, the discriminating power of 3D-motifs varies among different enzyme families and depends on selecting the appropriate points and features.
Collapse
Affiliation(s)
- Amir Rahimi
- Department of Bioinformatics, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran, Iran
| | | | | | | |
Collapse
|
6
|
Janda JO, Meier A, Merkl R. CLIPS-4D: a classifier that distinguishes structurally and functionally important residue-positions based on sequence and 3D data. ACTA ACUST UNITED AC 2013; 29:3029-35. [PMID: 24048358 DOI: 10.1093/bioinformatics/btt519] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
MOTIVATION The precise identification of functionally and structurally important residues of a protein is still an open problem, and state-of-the-art classifiers predict only one or at most two different categories. RESULT We have implemented the classifier CLIPS-4D, which predicts in a mutually exclusively manner a role in catalysis, ligand-binding or protein stability for each residue-position of a protein. Each prediction is assigned a P-value, which enables the statistical assessment and the selection of predictions with similar quality. CLIPS-4D requires as input a multiple sequence alignment and a 3D structure of one protein in PDB format. A comparison with existing methods confirmed state-of-the-art prediction quality, even though CLIPS-4D classifies more specifically than other methods. CLIPS-4D was implemented as a multiclass support vector machine, which exploits seven sequence-based and two structure-based features, each of which was shown to contribute to classification quality. The classification of ligand-binding sites profited most from the 3D features, which were the assessment of the solvent accessible surface area and the identification of surface pockets. In contrast, five additionally tested 3D features did not increase the classification performance achieved with evolutionary signals deduced from the multiple sequence alignment.
Collapse
Affiliation(s)
- Jan-Oliver Janda
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, D-93040 Regensburg, Germany and Faculty of Mathematics and Computer Science, University of Hagen, D-58084 Hagen, Germany
| | | | | |
Collapse
|
7
|
Han L, Zhang YJ, Song J, Liu MS, Zhang Z. Identification of catalytic residues using a novel feature that integrates the microenvironment and geometrical location properties of residues. PLoS One 2012; 7:e41370. [PMID: 22829945 PMCID: PMC3400608 DOI: 10.1371/journal.pone.0041370] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2012] [Accepted: 06/20/2012] [Indexed: 11/18/2022] Open
Abstract
Enzymes play a fundamental role in almost all biological processes and identification of catalytic residues is a crucial step for deciphering the biological functions and understanding the underlying catalytic mechanisms. In this work, we developed a novel structural feature called MEDscore to identify catalytic residues, which integrated the microenvironment (ME) and geometrical properties of amino acid residues. Firstly, we converted a residue's ME into a series of spatially neighboring residue pairs, whose likelihood of being located in a catalytic ME was deduced from a benchmark enzyme dataset. We then calculated an ME-based score, termed as MEscore, by summing up the likelihood of all residue pairs. Secondly, we defined a parameter called Dscore to measure the relative distance of a residue to the center of the protein, provided that catalytic residues are typically located in the center of the protein structure. Finally, we defined the MEDscore feature based on an effective nonlinear integration of MEscore and Dscore. When evaluated on a well-prepared benchmark dataset using five-fold cross-validation tests, MEDscore achieved a robust performance in identifying catalytic residues with an AUC1.0 of 0.889. At a ≤ 10% false positive rate control, MEDscore correctly identified approximately 70% of the catalytic residues. Remarkably, MEDscore achieved a competitive performance compared with the residue conservation score (e.g. CONscore), the most informative singular feature predominantly employed to identify catalytic residues. To the best of our knowledge, MEDscore is the first singular structural feature exhibiting such an advantage. More importantly, we found that MEDscore is complementary with CONscore and a significantly improved performance can be achieved by combining CONscore with MEDscore in a linear manner. As an implementation of this work, MEDscore has been made freely accessible at http://protein.cau.edu.cn/mepi/.
Collapse
Affiliation(s)
- Lei Han
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, People's Republic of China
| | - Yong-Jun Zhang
- State Key Laboratory for Biology of Plant Diseases and Insect Pests, Institute of Plant Protection, Chinese Academy of Agricultural Sciences, Beijing, People's Republic of China
| | - Jiangning Song
- National Engineering Laboratory for Industrial Enzymes and Key Laboratory of Systems Microbial Biotechnology, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, People's Republic of China
- Department of Biochemistry and Molecular Biology, Faculty of Medicine, Monash University, Melbourne, Victoria, Australia
| | - Ming S. Liu
- CSIRO - Mathematics, Informatics and Statistics, Clayton, Victoria, Australia
- * E-mail: (MSL); (ZZ)
| | - Ziding Zhang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, People's Republic of China
- * E-mail: (MSL); (ZZ)
| |
Collapse
|
8
|
Janda JO, Busch M, Kück F, Porfenenko M, Merkl R. CLIPS-1D: analysis of multiple sequence alignments to deduce for residue-positions a role in catalysis, ligand-binding, or protein structure. BMC Bioinformatics 2012; 13:55. [PMID: 22480135 PMCID: PMC3391178 DOI: 10.1186/1471-2105-13-55] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2011] [Accepted: 04/05/2012] [Indexed: 11/12/2022] Open
Abstract
Background One aim of the in silico characterization of proteins is to identify all residue-positions, which are crucial for function or structure. Several sequence-based algorithms exist, which predict functionally important sites. However, with respect to sequence information, many functionally and structurally important sites are hard to distinguish and consequently a large number of incorrectly predicted functional sites have to be expected. This is why we were interested to design a new classifier that differentiates between functionally and structurally important sites and to assess its performance on representative datasets. Results We have implemented CLIPS-1D, which predicts a role in catalysis, ligand-binding, or protein structure for residue-positions in a mutually exclusive manner. By analyzing a multiple sequence alignment, the algorithm scores conservation as well as abundance of residues at individual sites and their local neighborhood and categorizes by means of a multiclass support vector machine. A cross-validation confirmed that residue-positions involved in catalysis were identified with state-of-the-art quality; the mean MCC-value was 0.34. For structurally important sites, prediction quality was considerably higher (mean MCC = 0.67). For ligand-binding sites, prediction quality was lower (mean MCC = 0.12), because binding sites and structurally important residue-positions share conservation and abundance values, which makes their separation difficult. We show that classification success varies for residues in a class-specific manner. This is why our algorithm computes residue-specific p-values, which allow for the statistical assessment of each individual prediction. CLIPS-1D is available as a Web service at http://www-bioinf.uni-regensburg.de/. Conclusions CLIPS-1D is a classifier, whose prediction quality has been determined separately for catalytic sites, ligand-binding sites, and structurally important sites. It generates hypotheses about residue-positions important for a set of homologous proteins and focuses on conservation and abundance signals. Thus, the algorithm can be applied in cases where function cannot be transferred from well-characterized proteins by means of sequence comparison.
Collapse
Affiliation(s)
- Jan-Oliver Janda
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, 93040 Regensburg, Germany.
| | | | | | | | | |
Collapse
|