1
|
Narunsky A, Kessel A, Solan R, Alva V, Kolodny R, Ben-Tal N. On the evolution of protein-adenine binding. Proc Natl Acad Sci U S A 2020; 117:4701-4709. [PMID: 32079721 PMCID: PMC7060716 DOI: 10.1073/pnas.1911349117] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Proteins' interactions with ancient ligands may reveal how molecular recognition emerged and evolved. We explore how proteins recognize adenine: a planar rigid fragment found in the most common and ancient ligands. We have developed a computational pipeline that extracts protein-adenine complexes from the Protein Data Bank, structurally superimposes their adenine fragments, and detects the hydrogen bonds mediating the interaction. Our analysis extends the known motifs of protein-adenine interactions in the Watson-Crick edge of adenine and shows that all of adenine's edges may contribute to molecular recognition. We further show that, on the proteins' side, binding is often mediated by specific amino acid segments ("themes") that recur across different proteins, such that different proteins use the same themes when binding the same adenine-containing ligands. We identify numerous proteins that feature these themes and are thus likely to bind adenine-containing ligands. Our analysis suggests that adenine binding has emerged multiple times in evolution.
Collapse
Affiliation(s)
- Aya Narunsky
- Department of Biochemistry and Molecular Biology, George S. Wise Faculty of Life Sciences, Tel Aviv University, 69978 Ramat Aviv, Israel
| | - Amit Kessel
- Department of Biochemistry and Molecular Biology, George S. Wise Faculty of Life Sciences, Tel Aviv University, 69978 Ramat Aviv, Israel
| | - Ron Solan
- Department of Biochemistry and Molecular Biology, George S. Wise Faculty of Life Sciences, Tel Aviv University, 69978 Ramat Aviv, Israel
| | - Vikram Alva
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, 72076 Tübingen, Germany
| | - Rachel Kolodny
- Department of Computer Science, University of Haifa, Mount Carmel, 3498838 Haifa, Israel
| | - Nir Ben-Tal
- Department of Biochemistry and Molecular Biology, George S. Wise Faculty of Life Sciences, Tel Aviv University, 69978 Ramat Aviv, Israel;
| |
Collapse
|
2
|
Bhagavat R, Srinivasan N, Chandra N. Deciphering common recognition principles of nucleoside mono/di and tri-phosphates binding in diverse proteins via structural matching of their binding sites. Proteins 2017; 85:1699-1712. [PMID: 28547747 DOI: 10.1002/prot.25328] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2017] [Revised: 05/04/2017] [Accepted: 05/20/2017] [Indexed: 12/14/2022]
Abstract
Nucleoside triphosphate (NTP) ligands are of high biological importance and are essential for all life forms. A pre-requisite for them to participate in diverse biochemical processes is their recognition by diverse proteins. It is thus of great interest to understand the basis for such recognition in different proteins. Towards this, we have used a structural bioinformatics approach and analyze structures of 4677 NTP complexes available in Protein Data Bank (PDB). Binding sites were extracted and compared exhaustively using PocketMatch, a sensitive in-house site comparison algorithm, which resulted in grouping the entire dataset into 27 site-types. Each of these site-types represent a structural motif comprised of two or more residue conservations, derived using another in-house tool for superposing binding sites, PocketAlign. The 27 site-types could be grouped further into 9 super-types by considering partial similarities in the sites, which indicated that the individual site-types comprise different combinations of one or more site features. A scan across PDB using the 27 structural motifs determined the motifs to be specific to NTP binding sites, and a computational alanine mutagenesis indicated that residues identified to be highly conserved in the motifs are also most contributing to binding. Alternate orientations of the ligand in several site-types were observed and rationalized, indicating the possibility of some residues serving as anchors for NTP recognition. The presence of multiple site-types and the grouping of multiple folds into each site-type is strongly suggestive of convergent evolution. Knowledge of determinants obtained from this study will be useful for detecting function in unknown proteins. Proteins 2017; 85:1699-1712. © 2017 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Raghu Bhagavat
- Department of Biochemistry, Molecular Biophysics Unit, National Mathematics Initiative, Indian Institute of Science, Bangalore, 560012, Karnataka, India
| | - Narayanaswamy Srinivasan
- Department of Biochemistry, Molecular Biophysics Unit, National Mathematics Initiative, Indian Institute of Science, Bangalore, 560012, Karnataka, India
| | - Nagasuma Chandra
- Department of Biochemistry, Molecular Biophysics Unit, National Mathematics Initiative, Indian Institute of Science, Bangalore, 560012, Karnataka, India
| |
Collapse
|
3
|
Singh S, Tanneeru K, Guruprasad L. Structure and dynamics of H. pylori 98-10 C5-cytosine specific DNA methyltransferase in complex with S-adenosyl-l-methionine and DNA. MOLECULAR BIOSYSTEMS 2016; 12:3111-23. [PMID: 27470658 DOI: 10.1039/c6mb00306k] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Helicobacter pylori is a Gram-negative bacterium that inhabits the human gastrointestinal tract, and some strains of this bacterium cause gastric ulcers and cancer. DNA methyltransferases (MTases) are promising drug targets for the treatment of cancer and other diseases that are also caused by epigenetic alternations of the genome. The C5-cytosine specific DNA methyltransferase from H. pylori (M. Hpy C5mC) catalyzes the transfer of the methyl group from the cofactor S-adenosyl-l-methionine (AdoMet) to the flipped cytosine of the substrate DNA. Herein we report the sequence analyses, 3-D structure modeling and molecular dynamics simulations of M. Hpy C5mC, when complexed with AdoMet as well as DNA. We analyzed the protein-DNA interactions prominently established by the flipped cytosine and the interactions between the protein and cofactor in the active site. We propose that the contacts made by cytosine O2 with Arg155 and Arg157, and the water-mediated interactions with cytosine N3 may be essential for the activity of methyl transfer as well as the deprotonation at the C5 position in our C5mC model. Specific recognition of DNA was mediated mainly by residues from Ser221-Arg229 and Ser243-Gln246 of the target recognition domain (TRD) and some residues of the loop Ser75-Lys83 from the large domain. These findings are further supported by alanine scanning mutagenesis studies. The results reported here explain the sequence, structure and binding features necessary for the recognition between the cofactor and the substrate by the key epigenetic enzyme, M. Hpy C5mC.
Collapse
Affiliation(s)
- Swati Singh
- School of Chemistry, University of Hyderabad, Hyderabad, 500046, India.
| | | | | |
Collapse
|
4
|
Laurino P, Tóth-Petróczy Á, Meana-Pañeda R, Lin W, Truhlar DG, Tawfik DS. An Ancient Fingerprint Indicates the Common Ancestry of Rossmann-Fold Enzymes Utilizing Different Ribose-Based Cofactors. PLoS Biol 2016; 14:e1002396. [PMID: 26938925 PMCID: PMC4777477 DOI: 10.1371/journal.pbio.1002396] [Citation(s) in RCA: 66] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2015] [Accepted: 01/29/2016] [Indexed: 01/30/2023] Open
Abstract
Nucleoside-based cofactors are presumed to have preceded proteins. The Rossmann fold is one of the most ancient and functionally diverse protein folds, and most Rossmann enzymes utilize nucleoside-based cofactors. We analyzed an omnipresent Rossmann ribose-binding interaction: a carboxylate side chain at the tip of the second β-strand (β2-Asp/Glu). We identified a canonical motif, defined by the β2-topology and unique geometry. The latter relates to the interaction being bidentate (both ribose hydroxyls interacting with the carboxylate oxygens), to the angle between the carboxylate and the ribose, and to the ribose's ring configuration. We found that this canonical motif exhibits hallmarks of divergence rather than convergence. It is uniquely found in Rossmann enzymes that use different cofactors, primarily SAM (S-adenosyl methionine), NAD (nicotinamide adenine dinucleotide), and FAD (flavin adenine dinucleotide). Ribose-carboxylate bidentate interactions in other folds are not only rare but also have a different topology and geometry. We further show that the canonical geometry is not dictated by a physical constraint--geometries found in noncanonical interactions have similar calculated bond energies. Overall, these data indicate the divergence of several major Rossmann-fold enzyme classes, with different cofactors and catalytic chemistries, from a common pre-LUCA (last universal common ancestor) ancestor that possessed the β2-Asp/Glu motif.
Collapse
Affiliation(s)
- Paola Laurino
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot, Israel
| | - Ágnes Tóth-Petróczy
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot, Israel
| | - Rubén Meana-Pañeda
- Department of Chemistry, Chemical Theory Center, and Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota, United States of America
| | - Wei Lin
- Department of Chemistry, Chemical Theory Center, and Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota, United States of America
| | - Donald G. Truhlar
- Department of Chemistry, Chemical Theory Center, and Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota, United States of America
| | - Dan S. Tawfik
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot, Israel
- * E-mail:
| |
Collapse
|
5
|
Wang J, Luttrell J, Zhang N, Khan S, Shi N, Wang MX, Kang JQ, Wang Z, Xu D. Exploring Human Diseases and Biological Mechanisms by Protein Structure Prediction and Modeling. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2016; 939:39-61. [PMID: 27807743 PMCID: PMC6829626 DOI: 10.1007/978-981-10-1503-8_3] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
Protein structure prediction and modeling provide a tool for understanding protein functions by computationally constructing protein structures from amino acid sequences and analyzing them. With help from protein prediction tools and web servers, users can obtain the three-dimensional protein structure models and gain knowledge of functions from the proteins. In this chapter, we will provide several examples of such studies. As an example, structure modeling methods were used to investigate the relation between mutation-caused misfolding of protein and human diseases including epilepsy and leukemia. Protein structure prediction and modeling were also applied in nucleotide-gated channels and their interaction interfaces to investigate their roles in brain and heart cells. In molecular mechanism studies of plants, rice salinity tolerance mechanism was studied via structure modeling on crucial proteins identified by systems biology analysis; trait-associated protein-protein interactions were modeled, which sheds some light on the roles of mutations in soybean oil/protein content. In the age of precision medicine, we believe protein structure prediction and modeling will play more and more important roles in investigating biomedical mechanism of diseases and drug design.
Collapse
Affiliation(s)
- Juexin Wang
- Department of Computer Science, University of Missouri, Columbia, MO, 65211, USA
- Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO, 65211, USA
| | - Joseph Luttrell
- School of Computing, University of Southern Mississippi, 118 College Drive, Hattiesburg, MS, 39406, USA
| | - Ning Zhang
- Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO, 65211, USA
- Informatics Institute, University of Missouri, Columbia, MO, 65211, USA
| | - Saad Khan
- Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO, 65211, USA
- Informatics Institute, University of Missouri, Columbia, MO, 65211, USA
| | - NianQing Shi
- Department of Medicine, Division of Cardiovascular Medicine, University of Wisconsin, Room 8418, 1111 Highland Ave, Madison, WI, 53706, USA
| | - Michael X Wang
- Department of Pathology and Anatomical Sciences, University of Missouri, Columbia, MO, 65211, USA
| | - Jing-Qiong Kang
- Department of Neurology, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
| | - Zheng Wang
- School of Computing, University of Southern Mississippi, 118 College Drive, Hattiesburg, MS, 39406, USA
| | - Dong Xu
- Department of Computer Science, University of Missouri, Columbia, MO, 65211, USA.
- Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO, 65211, USA.
- Informatics Institute, University of Missouri, Columbia, MO, 65211, USA.
| |
Collapse
|
6
|
Zheng Z, Goncearenco A, Berezovsky IN. Nucleotide binding database NBDB--a collection of sequence motifs with specific protein-ligand interactions. Nucleic Acids Res 2015; 44:D301-7. [PMID: 26507856 PMCID: PMC4702817 DOI: 10.1093/nar/gkv1124] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2015] [Accepted: 10/14/2015] [Indexed: 11/14/2022] Open
Abstract
NBDB database describes protein motifs, elementary functional loops (EFLs) that are involved in binding of nucleotide-containing ligands and other biologically relevant cofactors/coenzymes, including ATP, AMP, ATP, GMP, GDP, GTP, CTP, PAP, PPS, FMN, FAD(H), NAD(H), NADP, cAMP, cGMP, c-di-AMP and c-di-GMP, ThPP, THD, F-420, ACO, CoA, PLP and SAM. The database is freely available online at http://nbdb.bii.a-star.edu.sg. In total, NBDB contains data on 249 motifs that work in interactions with 24 ligands. Sequence profiles of EFL motifs were derived de novo from nonredundant Uniprot proteome sequences. Conserved amino acid residues in the profiles interact specifically with distinct chemical parts of nucleotide-containing ligands, such as nitrogenous bases, phosphate groups, ribose, nicotinamide, and flavin moieties. Each EFL profile in the database is characterized by a pattern of corresponding ligand–protein interactions found in crystallized ligand–protein complexes. NBDB database helps to explore the determinants of nucleotide and cofactor binding in different protein folds and families. NBDB can also detect fragments that match to profiles of particular EFLs in the protein sequence provided by user. Comprehensive information on sequence, structures, and interactions of EFLs with ligands provides a foundation for experimental and computational efforts on design of required protein functions.
Collapse
Affiliation(s)
- Zejun Zheng
- Bioinformatics Institute, Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, 138671, Singapore
| | | | - Igor N Berezovsky
- Bioinformatics Institute, Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, 138671, Singapore Department of Biological Sciences (DBS), National University of Singapore (NUS), 8 Medical Drive, 117579, Singapore
| |
Collapse
|
7
|
NAD captureSeq indicates NAD as a bacterial cap for a subset of regulatory RNAs. Nature 2014; 519:374-7. [PMID: 25533955 DOI: 10.1038/nature14020] [Citation(s) in RCA: 188] [Impact Index Per Article: 18.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2013] [Accepted: 10/27/2014] [Indexed: 11/08/2022]
Abstract
A distinctive feature of prokaryotic gene expression is the absence of 5'-capped RNA. In eukaryotes, 5',5'-triphosphate-linked 7-methylguanosine protects messenger RNA from degradation and modulates maturation, localization and translation. Recently, the cofactor nicotinamide adenine dinucleotide (NAD) was reported as a covalent modification of bacterial RNA. Given the central role of NAD in redox biochemistry, posttranslational protein modification and signalling, its attachment to RNA indicates that there are unknown functions of RNA in these processes and undiscovered pathways in RNA metabolism and regulation. The unknown identity of NAD-modified RNAs has so far precluded functional analyses. Here we identify NAD-linked RNAs from bacteria by chemo-enzymatic capture and next-generation sequencing (NAD captureSeq). Among those identified, specific regulatory small RNAs (sRNAs) and sRNA-like 5'-terminal fragments of certain mRNAs are particularly abundant. Analogous to a eukaryotic cap, 5'-NAD modification is shown in vitro to stabilize RNA against 5'-processing by the RNA-pyrophosphohydrolase RppH and against endonucleolytic cleavage by ribonuclease (RNase) E. The nudix phosphohydrolase NudC decaps NAD-RNA and thereby triggers RNase-E-mediated RNA decay, while being inactive against triphosphate-RNA. In vivo, ∼13% of the abundant sRNA RNAI is NAD-capped in the presence, and ∼26% in the absence, of functional NudC. To our knowledge, this is the first description of a cap-like structure and a decapping machinery in bacteria.
Collapse
|
8
|
Kuppuraj G, Kruise D, Yura K. Conformational behavior of flavin adenine dinucleotide: conserved stereochemistry in bound and free states. J Phys Chem B 2014; 118:13486-97. [PMID: 25389798 DOI: 10.1021/jp507629n] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Metabolic enzymes utilize the cofactor flavin adenine dinucleotide (FAD) to catalyze essential biochemical reactions. Because these enzymes have been implicated in disease pathways, it will be necessary to target them via FAD-based structural analogues that can either activate/inhibit the enzymatic activity. To achieve this, it is important to explore the conformational space of FAD in the enzyme-bound and free states. Herein, we analyze X-ray crystallographic data of the enzyme-bound FAD conformations and sample conformations of the molecule in explicit water by molecular dynamics (MD) simulations. Enzyme-bound FAD conformations segregate into five distinct groups based on dihedral angle principal component analysis (PCA). A notable feature in the bound FADs is that the adenine base and isoalloxazine ring are oppositely oriented relative to the pyrophosphate axis characterized by near trans hypothetical dihedral angle "δV" values. Not surprisingly, MD simulations in water show final compact but not perfectly stacked ring structures in FAD. Simulation data did not reveal noticeable changes in overall conformational dynamics of the dinucleotide in reduced and oxidized forms and in the presence and/or absence of ions. During unfolding-folding dynamics, the riboflavin moiety is more flexible than the adenosine monophosphate group in the molecule. Conversely, the isoalloxazine ring is more stable than the variable adenine base. The pyrophosphate group depicts an unusually highly organized fluctuation illustrated by its dihedral angle distribution. Conformations sampled from enzymes and MD are quantified. The extent to which the protein shifts the distribution from the unbound state is discussed in terms of prevalent FAD shapes and dihedral angle population.
Collapse
Affiliation(s)
- Gopi Kuppuraj
- Center for Informational Biology, Ochanomizu University , 2-1-1 Otsuka, Bunkyo, Tokyo 112-8610, Japan
| | | | | |
Collapse
|
9
|
Afzal AM, Al-Shubailly F, Leader DP, Milner-White EJ. Bridging of anions by hydrogen bonds in nest motifs and its significance for Schellman loops and other larger motifs within proteins. Proteins 2014; 82:3023-31. [DOI: 10.1002/prot.24663] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2014] [Revised: 07/30/2014] [Accepted: 08/05/2014] [Indexed: 01/13/2023]
Affiliation(s)
- Avid M. Afzal
- College of Medical, Veterinary and Life Sciences; University of Glasgow; Glasgow G12 8QQ United Kingdom
| | - Fawzia Al-Shubailly
- College of Medical, Veterinary and Life Sciences; University of Glasgow; Glasgow G12 8QQ United Kingdom
| | - David P. Leader
- College of Medical, Veterinary and Life Sciences; University of Glasgow; Glasgow G12 8QQ United Kingdom
| | - E. James Milner-White
- College of Medical, Veterinary and Life Sciences; University of Glasgow; Glasgow G12 8QQ United Kingdom
| |
Collapse
|
10
|
Hleap JS, Susko E, Blouin C. Defining structural and evolutionary modules in proteins: a community detection approach to explore sub-domain architecture. BMC STRUCTURAL BIOLOGY 2013; 13:20. [PMID: 24131821 PMCID: PMC4016585 DOI: 10.1186/1472-6807-13-20] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/20/2013] [Accepted: 10/11/2013] [Indexed: 12/23/2022]
Abstract
Background Assessing protein modularity is important to understand protein evolution. Still the question of the existence of a sub-domain modular architecture remains. We propose a graph-theory approach with significance and power testing to identify modules in protein structures. In the first step, clusters are determined by optimizing the partition that maximizes the modularity score. Second, each cluster is tested for significance. Significant clusters are referred to as modules. Evolutionary modules are identified by analyzing homologous structures. Dynamic modules are inferred from sets of snapshots of molecular simulations. We present here a methodology to identify sub-domain architecture robustly, biologically meaningful, and statistically supported. Results The robustness of this new method is tested using simulated data with known modularity. Modules are correctly identified even when there is a low correlation between landmarks within a module. We also analyzed the evolutionary modularity of a data set of α-amylase catalytic domain homologs, and the dynamic modularity of the Niemann-Pick C1 (NPC1) protein N-terminal domain. The α-amylase contains an (α/β)8 barrel (TIM barrel) with the polysaccharides cleavage site and a calcium-binding domain. In this data set we identified four robust evolutionary modules, one of which forms the minimal functional TIM barrel topology. The NPC1 protein is involved in the intracellular lipid metabolism coordinating sterol trafficking. NPC1 N-terminus is the first luminal domain which binds to cholesterol and its oxygenated derivatives. Our inferred dynamic modules in the protein NPC1 are also shown to match functional components of the protein related to the NPC1 disease. Conclusions A domain compartmentalization can be found and described in correlation space. To our knowledge, there is no other method attempting to identify sub-domain architecture from the correlation among residues. Most attempts made focus on sequence motifs of protein-protein interactions, binding sites, or sequence conservancy. We were able to describe functional/structural sub-domain architecture related to key residues for starch cleavage, calcium, and chloride binding sites in the α-amylase, and sterol opening-defining modules and disease-related residues in the NPC1. We also described the evolutionary sub-domain architecture of the α-amylase catalytic domain, identifying the already reported minimum functional TIM barrel.
Collapse
Affiliation(s)
- Jose Sergio Hleap
- Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, NS, B3H 4R2, Canada.
| | | | | |
Collapse
|
11
|
Parca L, Ferré F, Ausiello G, Helmer-Citterich M. Nucleos: a web server for the identification of nucleotide-binding sites in protein structures. Nucleic Acids Res 2013; 41:W281-5. [PMID: 23703207 PMCID: PMC3692072 DOI: 10.1093/nar/gkt390] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Nucleos is a web server for the identification of nucleotide-binding sites in protein structures. Nucleos compares the structure of a query protein against a set of known template 3D binding sites representing nucleotide modules, namely the nucleobase, carbohydrate and phosphate. Structural features, clustering and conservation are used to filter and score the predictions. The predicted nucleotide modules are then joined to build whole nucleotide-binding sites, which are ranked by their score. The server takes as input either the PDB code of the query protein structure or a user-submitted structure in PDB format. The output of Nucleos is composed of ranked lists of predicted nucleotide-binding sites divided by nucleotide type (e.g. ATP-like). For each ranked prediction, Nucleos provides detailed information about the score, the template structure and the structural match for each nucleotide module composing the nucleotide-binding site. The predictions on the query structure and the template-binding sites can be viewed directly on the web through a graphical applet. In 98% of the cases, the modules composing correct predictions belong to proteins with no homology relationship between each other, meaning that the identification of brand-new nucleotide-binding sites is possible using information from non-homologous proteins. Nucleos is available at http://nucleos.bio.uniroma2.it/nucleos/.
Collapse
Affiliation(s)
- Luca Parca
- Department of Biology, Centre for Molecular Bioinformatics, University of Rome Tor Vergata, Via della Ricerca Scientifica snc, 00133 Rome, Italy
| | | | | | | |
Collapse
|
12
|
Nucleotide binding architecture for secreted cytotoxic endoribonucleases. Biochimie 2012; 95:1087-97. [PMID: 23274129 DOI: 10.1016/j.biochi.2012.12.015] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2012] [Accepted: 12/13/2012] [Indexed: 12/20/2022]
Abstract
Vertebrate secreted RNases are small cationic protein endowed with an endoribonuclease activity that belong to the RNase A superfamily and display diverse cytotoxic activities. In an effort to unravel their mechanism of action, we have analysed their nucleotide binding recognition patterns. General shared features with other nucleotide binding proteins were deduced from overall statistics on the available structure complexes at the Protein Data Bank and compared with the particularities of selected representative endoribonuclease families. Results were compared with other endoribonuclease representative families and with the overall protein-nucleotide interaction features. Preferred amino acids and atom types involved in pair bonding interactions were identified, defining the spatial motives for phosphate, base and ribose building blocks. Together with the conserved catalytic triad at the active site, variability was observed for secondary binding subsites that may contribute to the proper substrate alignment and could explain the distinct substrate preference patterns. Highly conserved binding patterns were identified for the pyrimidine and purine subsites at the main and secondary base subsites. Particular substitution could be ascribed to specific adenine or guanine specificities. Distribution of evolutionary conserved residues were compared to search for the structure determinants that underlie their diverse catalytic efficiency and those that may account for putative physiological substrate targets or other non-catalytic biological activities that contribute to the antipathogen role of the RNases involved in the host defence system. A side by side comparison with another endoribonuclease superfamily of secreted cytotoxic proteins, the microbial RNases, was carried on to analyse the common features and peculiarities that rule their substrate recognition. The data provides the structural basis for the development of applied therapies targeting cellular nucleotide polymers.
Collapse
|
13
|
Di Paola L, De Ruvo M, Paci P, Santoni D, Giuliani A. Protein Contact Networks: An Emerging Paradigm in Chemistry. Chem Rev 2012. [DOI: 10.1021/cr3002356] [Citation(s) in RCA: 173] [Impact Index Per Article: 14.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Affiliation(s)
- L. Di Paola
- Faculty of Engineering, Università CAMPUS BioMedico, Via A. del Portillo,
21, 00128 Roma, Italy
| | | | | | - D. Santoni
- BioMathLab, CNR-Institute of Systems Analysis and Computer Science (IASI), viale Manzoni 30, 00185
Roma, Italy
| | - A. Giuliani
- Environment
and Health Department, Istituto Superiore di Sanità, Viale Regina Elena
299, 00161, Roma, Italy
| |
Collapse
|
14
|
Parca L, Gherardini PF, Truglio M, Mangone I, Ferrè F, Helmer-Citterich M, Ausiello G. Identification of nucleotide-binding sites in protein structures: a novel approach based on nucleotide modularity. PLoS One 2012; 7:e50240. [PMID: 23209685 PMCID: PMC3507729 DOI: 10.1371/journal.pone.0050240] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2012] [Accepted: 10/22/2012] [Indexed: 01/30/2023] Open
Abstract
Nucleotides are involved in several cellular processes, ranging from the transmission of genetic information, to energy transfer and storage. Both sequence and structure based methods have been developed to predict the location of nucleotide-binding sites in proteins. Here we propose a novel methodology that leverages the observation that nucleotide-binding sites have a modular structure. Nucleotides are composed of identifiable fragments, i.e. the phosphate, the nucleobase and the carbohydrate moieties. These fragments are bound by specific structural motifs that recur in proteins of different fold. Moreover these motifs behave as modules and are found in different combinations across fold space. Our method predicts binding sites for each nucleotide fragment by comparing a query protein with a database of templates extracted from proteins of known structure. Whenever a similarity is found the fragment bound by the template is transferred on the query protein, thus identifying a putative binding site. Predictions falling inside the surface of the protein are discarded, and the remaining ones are scored using clustering and conservation. The method is able to rank as first a correct prediction in the 48%, 48% and 68% of the analyzed proteins for the nucleobase, carbohydrate and phosphate respectively, while considering the first five predictions the performances change to 71%, 65% and 86% respectively. Furthermore we attempted to reconstruct the full structure of the binding site, starting from the predicted positions of the fragments. We calculated that in the 59% of the analyzed proteins the method ranks as first a reconstructed binding site or a part of it. Finally we tested the reliability of our method in a real world case in which it has to predict nucleotide-binding sites in unbound proteins. We analyzed proteins whose structure has been solved with and without the nucleotide and observed only little variations in the method performance.
Collapse
Affiliation(s)
- Luca Parca
- Department of Biology, University of Rome “Tor Vergata”, Rome, Italy
| | | | - Mauro Truglio
- Department of Biology, University of Rome “Tor Vergata”, Rome, Italy
| | - Iolanda Mangone
- Department of Biology, University of Rome “Tor Vergata”, Rome, Italy
| | - Fabrizio Ferrè
- Department of Biology, University of Rome “Tor Vergata”, Rome, Italy
| | | | - Gabriele Ausiello
- Department of Biology, University of Rome “Tor Vergata”, Rome, Italy
| |
Collapse
|
15
|
Wu CY, Hwa YH, Chen YC, Lim C. Hidden relationship between conserved residues and locally conserved phosphate-binding structures in NAD(P)-binding proteins. J Phys Chem B 2012; 116:5644-52. [PMID: 22530587 DOI: 10.1021/jp3014332] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
A one-dimensional (1D) motif usually comprises conserved essential residues involved in catalysis, ligand binding, or maintaining a specific structure. However, it cannot be easily detected in proteins with low sequence identity because it is difficult to (1) identify protein sequences suspected to contain the motif, and (2) align sequences with little sequence identity to spot the conserved residues. Here, we present a strategy for discovering phosphate-binding 1D motifs in NAD(P)-binding proteins sharing low sequence identity that overcomes these two hurdles by determining all distinct locally conserved pyrophosphate-binding structures and aligning the same-length sequences comprising each of these structures to identify the conserved residues. We show that the sequence motifs derived from the distinct pyrophosphate-binding structures yield different numbers/spacing of conserved Gly residues. We also show that they depend on the side chain orientations and cofactor type (NAD or NADP). Thus, sequence motifs derived from local similarity of backbone structures without consideration of the cofactor type and/or side chain orientations would reduce their reliability in annotating protein function from sequence alone. The three-dimensional (3D) and 1D motifs comprising conserved residues in nonredundant proteins reveal hidden relationships between the protein structure/function and sequence as well as protein-cofactor interactions.
Collapse
Affiliation(s)
- Chih Yuan Wu
- Institute of Biomedical Sciences, Academia Sinica , Taipei 115, Taiwan
| | | | | | | |
Collapse
|
16
|
Bianchi V, Gherardini PF, Helmer-Citterich M, Ausiello G. Identification of binding pockets in protein structures using a knowledge-based potential derived from local structural similarities. BMC Bioinformatics 2012; 13 Suppl 4:S17. [PMID: 22536963 PMCID: PMC3434446 DOI: 10.1186/1471-2105-13-s4-s17] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
Background The identification of ligand binding sites is a key task in the annotation of proteins with known structure but uncharacterized function. Here we describe a knowledge-based method exploiting the observation that unrelated binding sites share small structural motifs that bind the same chemical fragments irrespective of the nature of the ligand as a whole. Results PDBinder compares a query protein against a library of binding and non-binding protein surface regions derived from the PDB. The results of the comparison are used to derive a propensity value for each residue which is correlated with the likelihood that the residue is part of a ligand binding site. The method was applied to two different problems: i) the prediction of ligand binding residues and ii) the identification of which surface cleft harbours the binding site. In both cases PDBinder performed consistently better than existing methods. PDBinder has been trained on a non-redundant set of 1356 high-quality protein-ligand complexes and tested on a set of 239 holo and apo complex pairs. We obtained an MCC of 0.313 on the holo set with a PPV of 0.413 while on the apo set we achieved an MCC of 0.271 and a PPV of 0.372. Conclusions We show that PDBinder performs better than existing methods. The good performance on the unbound proteins is extremely important for real-world applications where the location of the binding site is unknown. Moreover, since our approach is orthogonal to those used in other programs, the PDBinder propensity value can be integrated in other algorithms further increasing the final performance.
Collapse
Affiliation(s)
- Valerio Bianchi
- Centre for Molecular Bioinformatics, Department of Biology, University of Rome Tor Vergata, Via della Ricerca Scientifica snc, Rome 00133, Italy
| | | | | | | |
Collapse
|
17
|
Tarrío R, Ayala FJ, Rodríguez-Trelles F. The Vein Patterning 1 (VEP1) gene family laterally spread through an ecological network. PLoS One 2011; 6:e22279. [PMID: 21818306 PMCID: PMC3144213 DOI: 10.1371/journal.pone.0022279] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2011] [Accepted: 06/18/2011] [Indexed: 11/23/2022] Open
Abstract
Lateral gene transfer (LGT) is a major evolutionary mechanism in prokaryotes. Knowledge about LGT— particularly, multicellular— eukaryotes has only recently started to accumulate. A widespread assumption sees the gene as the unit of LGT, largely because little is yet known about how LGT chances are affected by structural/functional features at the subgenic level. Here we trace the evolutionary trajectory of VEin Patterning 1, a novel gene family known to be essential for plant development and defense. At the subgenic level VEP1 encodes a dinucleotide-binding Rossmann-fold domain, in common with members of the short-chain dehydrogenase/reductase (SDR) protein family. We found: i) VEP1 likely originated in an aerobic, mesophilic and chemoorganotrophic α-proteobacterium, and was laterally propagated through nets of ecological interactions, including multiple LGTs between phylogenetically distant green plant/fungi-associated bacteria, and five independent LGTs to eukaryotes. Of these latest five transfers, three are ancient LGTs, implicating an ancestral fungus, the last common ancestor of land plants and an ancestral trebouxiophyte green alga, and two are recent LGTs to modern embryophytes. ii) VEP1's rampant LGT behavior was enabled by the robustness and broad utility of the dinucleotide-binding Rossmann-fold, which provided a platform for the evolution of two unprecedented departures from the canonical SDR catalytic triad. iii) The fate of VEP1 in eukaryotes has been different in different lineages, being ubiquitous and highly conserved in land plants, whereas fungi underwent multiple losses. And iv) VEP1-harboring bacteria include non-phytopathogenic and phytopathogenic symbionts which are non-randomly distributed with respect to the type of harbored VEP1 gene. Our findings suggest that VEP1 may have been instrumental for the evolutionary transition of green plants to land, and point to a LGT-mediated ‘Trojan Horse’ mechanism for the evolution of bacterial pathogenesis against plants. VEP1 may serve as tool for revealing microbial interactions in plant/fungi-associated environments.
Collapse
Affiliation(s)
- Rosa Tarrío
- Universidad de Santiago de Compostela, CIBERER, Genome Medicine Group, Santiago de Compostela, Spain
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, California, United States of America
| | - Francisco J. Ayala
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, California, United States of America
| | - Francisco Rodríguez-Trelles
- Grup de Biologia Evolutiva, Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, Barcelona, Spain
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, California, United States of America
- * E-mail:
| |
Collapse
|
18
|
Parca L, Mangone I, Gherardini PF, Ausiello G, Helmer-Citterich M. Phosfinder: a web server for the identification of phosphate-binding sites on protein structures. Nucleic Acids Res 2011; 39:W278-82. [PMID: 21622655 PMCID: PMC3125782 DOI: 10.1093/nar/gkr389] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Phosfinder is a web server for the identification of phosphate binding sites in protein structures. Phosfinder uses a structural comparison algorithm to scan a query structure against a set of known 3D phosphate binding motifs. Whenever a structural similarity between the query protein and a phosphate binding motif is detected, the phosphate bound by the known motif is added to the protein structure thus representing a putative phosphate binding site. Predicted binding sites are then evaluated according to (i) their position with respect to the query protein solvent-excluded surface and (ii) the conservation of the binding residues in the protein family. The server accepts as input either the PDB code of the protein to be analyzed or a user-submitted structure in PDB format. All the search parameters are user modifiable. Phosfinder outputs a list of predicted binding sites with detailed information about their structural similarity with known phosphate binding motifs, and the conservation of the residues involved. A graphical applet allows the user to visualize the predicted binding sites on the query protein structure. The results on a set of 52 apo/holo structure pairs show that the performance of our method is largely unaffected by ligand-induced conformational changes. Phosfinder is available at http://phosfinder.bio.uniroma2.it.
Collapse
Affiliation(s)
- Luca Parca
- Centre for Molecular Bioinformatics, Department of Biology, University of Rome Tor Vergata, Via della Ricerca Scientifica snc, 00133 Rome, Italy
| | | | | | | | | |
Collapse
|
19
|
Haupt VJ, Schroeder M. Old friends in new guise: repositioning of known drugs with structural bioinformatics. Brief Bioinform 2011; 12:312-26. [DOI: 10.1093/bib/bbr011] [Citation(s) in RCA: 106] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
|
20
|
Abstract
Bacteria prefer to grow attached to themselves or an interface, and it is important for an array of applications to make biofilms disperse. Here we report simultaneously the discovery and protein engineering of BdcA (formerly YjgI) for biofilm dispersal using the universal signal 3,5-cyclic diguanylic acid (c-di-GMP). The bdcA deletion reduced biofilm dispersal, and production of BdcA increased biofilm dispersal to wild-type level. Since BdcA increases motility and extracellular DNA production while decreasing exopolysaccharide, cell length and aggregation, we reasoned that BdcA decreases the concentration of c-di-GMP, the intracellular messenger that controls cell motility through flagellar rotation and biofilm formation through synthesis of curli and cellulose. Consistently, c-di-GMP levels increase upon deleting bdcA, and purified BdcA binds c-di-GMP but does not act as a phosphodiesterase. Additionally, BdcR (formerly YjgJ) is a negative regulator of bdcA. To increase biofilm dispersal, we used protein engineering to evolve BdcA for greater c-di-GMP binding and found that the single amino acid change E50Q causes nearly complete removal of biofilms via dispersal without affecting initial biofilm formation.
Collapse
Affiliation(s)
- Qun Ma
- Department of Chemical Engineering, 220 Jack E. Brown Building, Texas A & M University, College Station, TX 77843-3122
| | - Zhonghua Yang
- Department of Chemical Engineering, 220 Jack E. Brown Building, Texas A & M University, College Station, TX 77843-3122
- College of Chemical Engineering and Technology, Wuhan University of Science and Technology, Wuhan 430081
| | - Mingming Pu
- Department of Chemical Engineering, 220 Jack E. Brown Building, Texas A & M University, College Station, TX 77843-3122
| | - Wolfgang Peti
- Department of Molecular Pharmacology, Physiology, and Biotechnology and Brown University, Providence, RI 02912
| | - Thomas K. Wood
- Department of Chemical Engineering, 220 Jack E. Brown Building, Texas A & M University, College Station, TX 77843-3122
- Department of Biology, 220 Jack E. Brown Building, Texas A & M University, College Station, TX 77843-3122
- Department of Civil Engineering, 220 Jack E. Brown Building, Texas A & M University, College Station, TX 77843-3122
| |
Collapse
|
21
|
Kinnings SL, Xie L, Fung KH, Jackson RM, Xie L, Bourne PE. The Mycobacterium tuberculosis drugome and its polypharmacological implications. PLoS Comput Biol 2010; 6:e1000976. [PMID: 21079673 PMCID: PMC2973814 DOI: 10.1371/journal.pcbi.1000976] [Citation(s) in RCA: 88] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2010] [Accepted: 09/24/2010] [Indexed: 11/26/2022] Open
Abstract
We report a computational approach that integrates structural bioinformatics, molecular modelling and systems biology to construct a drug-target network on a structural proteome-wide scale. The approach has been applied to the genome of Mycobacterium tuberculosis (M.tb), the causative agent of one of today's most widely spread infectious diseases. The resulting drug-target interaction network for all structurally characterized approved drugs bound to putative M.tb receptors, we refer to as the ‘TB-drugome’. The TB-drugome reveals that approximately one-third of the drugs examined have the potential to be repositioned to treat tuberculosis and that many currently unexploited M.tb receptors may be chemically druggable and could serve as novel anti-tubercular targets. Furthermore, a detailed analysis of the TB-drugome has shed new light on the controversial issues surrounding drug-target networks [1]–[3]. Indeed, our results support the idea that drug-target networks are inherently modular, and further that any observed randomness is mainly caused by biased target coverage. The TB-drugome (http://funsite.sdsc.edu/drugome/TB) has the potential to be a valuable resource in the development of safe and efficient anti-tubercular drugs. More generally the methodology may be applied to other pathogens of interest with results improving as more of their structural proteomes are determined through the continued efforts of structural biology/genomics. The worldwide increase in multi-drug resistant TB poses a great threat to human health and highlights the need to identify new anti-tubercular agents. We have developed a computational strategy to link the structural proteome of Mycobacterium tuberculosis, the causative agent of tuberculosis, to all structurally characterized approved drugs, and hence construct a proteome-wide drug-target network – the TB-drugome. The TB-drugome has the potential to be a valuable resource in the development of safe and efficient anti-tubercular drugs. More generally, the proteome-wide and multi-scale view of target and drug space may facilitate a systematic drug discovery process, which concurrently takes into account the disease mechanism and druggability of targets, the drug-likeness and ADMET properties of chemical compounds, and the genetic dispositions of individuals. Ultimately it may help to reduce the high attrition rate in drug development through a better understanding of drug-receptor interactions on a large scale.
Collapse
Affiliation(s)
- Sarah L. Kinnings
- Institute of Molecular and Cellular Biology and Astbury Centre for Structural Molecular Biology, University of Leeds, Leeds, United Kingdom
- San Diego Supercomputer Center, University of California, San Diego, La Jolla, California, United States of America
| | - Li Xie
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, California, United States of America
| | - Kingston H. Fung
- Bioinformatics Program, University of California, San Diego, La Jolla, California, United States of America
| | - Richard M. Jackson
- Institute of Molecular and Cellular Biology and Astbury Centre for Structural Molecular Biology, University of Leeds, Leeds, United Kingdom
| | - Lei Xie
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, California, United States of America
- * E-mail: (LX); (PEB)
| | - Philip E. Bourne
- San Diego Supercomputer Center, University of California, San Diego, La Jolla, California, United States of America
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, California, United States of America
- * E-mail: (LX); (PEB)
| |
Collapse
|
22
|
Parca L, Gherardini PF, Helmer-Citterich M, Ausiello G. Phosphate binding sites identification in protein structures. Nucleic Acids Res 2010; 39:1231-42. [PMID: 20974634 PMCID: PMC3045618 DOI: 10.1093/nar/gkq987] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023] Open
Abstract
Nearly half of known protein structures interact with phosphate-containing ligands, such as nucleotides and other cofactors. Many methods have been developed for the identification of metal ions-binding sites and some for bigger ligands such as carbohydrates, but none is yet available for the prediction of phosphate-binding sites. Here we describe Pfinder, a method that predicts binding sites for phosphate groups, both in the form of ions or as parts of other non-peptide ligands, in proteins of known structure. Pfinder uses the Query3D local structural comparison algorithm to scan a protein structure for the presence of a number of structural motifs identified for their ability to bind the phosphate chemical group. Pfinder has been tested on a data set of 52 proteins for which both the apo and holo forms were available. We obtained at least one correct prediction in 63% of the holo structures and in 62% of the apo. The ability of Pfinder to recognize a phosphate-binding site in unbound protein structures makes it an ideal tool for functional annotation and for complementing docking and drug design methods. The Pfinder program is available at http://pdbfun.uniroma2.it/pfinder.
Collapse
Affiliation(s)
- Luca Parca
- Department of Biology, Centre for Molecular Bioinformatics, University of Rome Tor Vergata, Via della Ricerca Scientifica snc, 00133 Rome, Italy
| | | | | | | |
Collapse
|
23
|
|