1
|
Janhvi S, Saini S, Singh V, Sharma T, Rao A. ProGlycProt V3.0: updated insights into prokaryotic glycoproteins and their glycosyltransferases. Glycobiology 2024; 34:cwad103. [PMID: 38153163 DOI: 10.1093/glycob/cwad103] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Revised: 11/15/2023] [Accepted: 12/18/2023] [Indexed: 12/29/2023] Open
Abstract
ProGlycProt is a comprehensive database of experimentally validated information about protein glycosylation in prokaryotes, including the glycoproteins, glycosyltransferases, and their accessory enzymes. The first release of ProGlycProt featured experimentally validated information on glycoproteins only. For the second release in 2019, the size and scope of the database were expanded twofold, and experimental data on cognate glycosyltransferases and their accessory proteins was incorporated. The growing research and technology interest in microbial glycoproteins and their enzymes is evident from the steady rise in academic publications and patents in this area. Accordingly, the third update comprises a new section on patents related to glycosylation methods, novel glycosyltransferases, and technologies developed therefrom. The structure gallery is reorganized, wherein the number and quality of the models are upgraded with the help of AlphaFold2. Over the years, the influx of experimental proteomics data into public repositories like PRIDE has surged. Harnessing this legacy data for in-silico glycoprotein identification is a smart approach. Version 3.0 adds 45 N-glycoprotein entries annotated from MS datasets available on PRIDE and reviewed by independent research groups. With a 67% rise in entries corresponding to 119 genera of prokaryotes, the ProGlycProt continues to be the exclusive database of experimentally validated comprehensive information about protein glycosylation in prokaryotes.
Collapse
Affiliation(s)
- Shreya Janhvi
- CSIR-Institute of Microbial Technology, Sector 39A, Chandigarh 160036, India
| | - Shikha Saini
- CSIR-Institute of Microbial Technology, Sector 39A, Chandigarh 160036, India
| | - Vaidhvi Singh
- CSIR-Institute of Microbial Technology, Sector 39A, Chandigarh 160036, India
| | - Tarun Sharma
- CSIR-Institute of Microbial Technology, Sector 39A, Chandigarh 160036, India
| | - Alka Rao
- CSIR-Institute of Microbial Technology, Sector 39A, Chandigarh 160036, India
- Academy of Scientific and Innovation Research (AcSIR), Sector 19, Kamla Nehru Nagar, Ghaziabad, Uttar Pradesh 201002, India
| |
Collapse
|
2
|
Weigle AT, Feng J, Shukla D. Thirty years of molecular dynamics simulations on posttranslational modifications of proteins. Phys Chem Chem Phys 2022; 24:26371-26397. [PMID: 36285789 PMCID: PMC9704509 DOI: 10.1039/d2cp02883b] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/06/2023]
Abstract
Posttranslational modifications (PTMs) are an integral component to how cells respond to perturbation. While experimental advances have enabled improved PTM identification capabilities, the same throughput for characterizing how structural changes caused by PTMs equate to altered physiological function has not been maintained. In this Perspective, we cover the history of computational modeling and molecular dynamics simulations which have characterized the structural implications of PTMs. We distinguish results from different molecular dynamics studies based upon the timescales simulated and analysis approaches used for PTM characterization. Lastly, we offer insights into how opportunities for modern research efforts on in silico PTM characterization may proceed given current state-of-the-art computing capabilities and methodological advancements.
Collapse
Affiliation(s)
- Austin T Weigle
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
| | - Jiangyan Feng
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
| | - Diwakar Shukla
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
- Center for Biophysics and Quantitative Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
- Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
- Department of Plant Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA.
| |
Collapse
|
3
|
Sharma Y, Ahlawat S, Rao A. Biochemical characterization of an inverting S/O-HexNAc-transferase and evidence of S-linked glycosylation in Actinobacteria. Glycobiology 2021; 32:148-161. [PMID: 34420053 DOI: 10.1093/glycob/cwab089] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2021] [Revised: 07/31/2021] [Accepted: 08/09/2021] [Indexed: 11/13/2022] Open
Abstract
Antimicrobial peptides harboring S- and or O-linked glycans are known as glycocins. Glycocins were first discovered and best characterized in Firmicutes. S-glycosylation is an enzymatic process catalyzed by S-glycosyltransferases of the GT2 family. Using a heterologous expression system, here we describe an inverting S/O-HexNAc-transferase (SvGT), encoded by ORF AQF52_3101 of S. venezuelae ATCC 15439, along with its acceptor substrate (SvC), encoded by ORF AQF52_3099. Using in vitro and in vivo assays, we define the distinct donor specificity, acceptor specificity, regioselectivity, chemoselectivity, and Y(G/A/K/Q/E ≠ ΔG)(C/S/T ≠ Y/N)(G/A ≠ P/Q)G as the minimum acceptor sequon of SvGT. Although UDP-GlcNAc served as the donor in the cellular milieu, SvGT could also utilize UDP-Glc and UDP-GalNAc as donors in vitro. Using mass spectrometry and western blotting, we provide evidence that an anti-O-GlcNAc antibody (CTD110.6) cross-reacts with S-GlcNAc and may be used to detect S-GlcNAcylated glycoconjugates directly. With an understanding of enzyme specificities, we finally employed SvGT to generate two proof-of-concept neoglycocins against L. monocytogenes. In conclusion, this study provides the first experimental evidence for S-glycosylation in Actinobacteria and the application of its S/O-HexNAc-transferase in glycocin engineering.
Collapse
Affiliation(s)
- Yogita Sharma
- CSIR-Institute of Microbial Technology, Sector 39A, Chandigarh, 160036, India
| | - Shimona Ahlawat
- CSIR-Institute of Microbial Technology, Sector 39A, Chandigarh, 160036, India
| | - Alka Rao
- CSIR-Institute of Microbial Technology, Sector 39A, Chandigarh, 160036, India.,Academy of Scientific and Innovation Research (AcSIR), Sector 19, Kamla Nehru Nagar, Ghaziabad, 201002, India
| |
Collapse
|
4
|
Huang J, Wu M, Zhang Y, Kong S, Liu M, Jiang B, Yang P, Cao W. OGP: A Repository of Experimentally Characterized O-Glycoproteins to Facilitate Studies on O-Glycosylation. GENOMICS PROTEOMICS & BIOINFORMATICS 2021; 19:611-618. [PMID: 33581334 PMCID: PMC9039567 DOI: 10.1016/j.gpb.2020.05.003] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/14/2019] [Revised: 03/17/2020] [Accepted: 05/31/2020] [Indexed: 11/16/2022]
Abstract
Numerous studies on cancers, biopharmaceuticals, and clinical trials have necessitated comprehensive and precise analysis of protein O-glycosylation. However, the lack of updated and convenient databases deters the storage of and reference to emerging O-glycoprotein data. To resolve this issue, an O-glycoprotein repository named OGP was established in this work. It was constructed with a collection of O-glycoprotein data from different sources. OGP contains 9354 O-glycosylation sites and 11,633 site-specific O-glycans mapping to 2133 O-glycoproteins, and it is the largest O-glycoprotein repository thus far. Based on the recorded O-glycosylation sites, an O-glycosylation site prediction tool was developed. Moreover, an OGP-based website is already available (https://www.oglyp.org/). The website comprises four specially designed and user-friendly modules: statistical analysis, database search, site prediction, and data submission. The first version of OGP repository and the website allow users to obtain various O-glycoprotein-related information, such as protein accession Nos., O-glycosylation sites, O-glycopeptide sequences, site-specific O-glycan structures, experimental methods, and potential O-glycosylation sites. O-glycosylation data mining can be performed efficiently on this website, which will greatly facilitate related studies. In addition, the database is accessible from OGP website (https://www.oglyp.org/download.php).
Collapse
Affiliation(s)
- Jiangming Huang
- Department of Chemistry and Institutes of Biomedical Sciences, Fudan University, Shanghai 200032, China; The Fifth People's Hospital, Fudan University, and the Shanghai Key Laboratory of Medical Epigenetics, the International Co-laboratory of Medical Epigenetics and Metabolism, Ministry of Science and Technology, Fudan University, Shanghai 200032, China
| | - Mengxi Wu
- Department of Chemistry and Institutes of Biomedical Sciences, Fudan University, Shanghai 200032, China; The Fifth People's Hospital, Fudan University, and the Shanghai Key Laboratory of Medical Epigenetics, the International Co-laboratory of Medical Epigenetics and Metabolism, Ministry of Science and Technology, Fudan University, Shanghai 200032, China
| | - Yang Zhang
- Department of Chemistry and Institutes of Biomedical Sciences, Fudan University, Shanghai 200032, China; The Fifth People's Hospital, Fudan University, and the Shanghai Key Laboratory of Medical Epigenetics, the International Co-laboratory of Medical Epigenetics and Metabolism, Ministry of Science and Technology, Fudan University, Shanghai 200032, China
| | - Siyuan Kong
- Department of Chemistry and Institutes of Biomedical Sciences, Fudan University, Shanghai 200032, China
| | - Mingqi Liu
- Department of Chemistry and Institutes of Biomedical Sciences, Fudan University, Shanghai 200032, China
| | - Biyun Jiang
- Department of Chemistry and Institutes of Biomedical Sciences, Fudan University, Shanghai 200032, China; The Fifth People's Hospital, Fudan University, and the Shanghai Key Laboratory of Medical Epigenetics, the International Co-laboratory of Medical Epigenetics and Metabolism, Ministry of Science and Technology, Fudan University, Shanghai 200032, China
| | - Pengyuan Yang
- Department of Chemistry and Institutes of Biomedical Sciences, Fudan University, Shanghai 200032, China; The Fifth People's Hospital, Fudan University, and the Shanghai Key Laboratory of Medical Epigenetics, the International Co-laboratory of Medical Epigenetics and Metabolism, Ministry of Science and Technology, Fudan University, Shanghai 200032, China; NHC Key Laboratory of Glycoconjugates Research (Fudan University), Shanghai 200032, China.
| | - Weiqian Cao
- Department of Chemistry and Institutes of Biomedical Sciences, Fudan University, Shanghai 200032, China; The Fifth People's Hospital, Fudan University, and the Shanghai Key Laboratory of Medical Epigenetics, the International Co-laboratory of Medical Epigenetics and Metabolism, Ministry of Science and Technology, Fudan University, Shanghai 200032, China; NHC Key Laboratory of Glycoconjugates Research (Fudan University), Shanghai 200032, China.
| |
Collapse
|
5
|
Scherbinina SI, Toukach PV. Three-Dimensional Structures of Carbohydrates and Where to Find Them. Int J Mol Sci 2020; 21:E7702. [PMID: 33081008 PMCID: PMC7593929 DOI: 10.3390/ijms21207702] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2020] [Revised: 10/15/2020] [Accepted: 10/16/2020] [Indexed: 02/06/2023] Open
Abstract
Analysis and systematization of accumulated data on carbohydrate structural diversity is a subject of great interest for structural glycobiology. Despite being a challenging task, development of computational methods for efficient treatment and management of spatial (3D) structural features of carbohydrates breaks new ground in modern glycoscience. This review is dedicated to approaches of chemo- and glyco-informatics towards 3D structural data generation, deposition and processing in regard to carbohydrates and their derivatives. Databases, molecular modeling and experimental data validation services, and structure visualization facilities developed for last five years are reviewed.
Collapse
Affiliation(s)
- Sofya I. Scherbinina
- N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Science, Leninsky prospect 47, 119991 Moscow, Russia
- Higher Chemical College, D. Mendeleev University of Chemical Technology of Russia, Miusskaya Square 9, 125047 Moscow, Russia
| | - Philip V. Toukach
- N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Science, Leninsky prospect 47, 119991 Moscow, Russia
| |
Collapse
|
6
|
Amera GM, Khan RJ, Jha RK, Pathak A, Muthukumaran J, Singh AK. Prioritization of Mur family drug targets against A. baumannii and identification of their homologous proteins through molecular phylogeny, primary sequence, and structural analysis. J Genet Eng Biotechnol 2020; 18:33. [PMID: 32725318 PMCID: PMC7387395 DOI: 10.1186/s43141-020-00048-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2020] [Accepted: 07/01/2020] [Indexed: 12/22/2022]
Abstract
Background The World Health Organization (WHO) report stated that Acinetobacter baumannii had been classified as one of the most important pathogenic bacteria causing nosocomial infection in hospital patients due to multi-drug resistance (MDR). It is vital to find out new bacterial drug targets and annotated their structure and function for the exploration of new anti-bacterial agents. The present study utilized a systematic route to prioritize the potential drug targets that belong to Mur family of Acinetobacter baumannii and identify their homologous proteins using a computational approach such as sequence similarity search, multiple sequence alignment, phylogenetic analysis, protein sequence, and protein structure analysis. Results From the results of protein sequence analysis of eight Mur family proteins, they divided into three main enzymatic classes namely transferases (MurG, MurA and MraY), ligases (MurC, MurD, MurE, and MurF), and oxidoreductase (MurB). Based on the results of intra-comparative protein sequence analysis and enzymatic classification, we have chosen MurB, MurE, and MurG as the prioritized drug targets from A. baumannii and subjected them for further detailed studies of inter-species comparison. This inter-species comparison help us to explore the sequential and structural properties of homologous proteins in other species and hence, opens a gateway for new target identification and using common inhibitor for different bacterial species caused by various diseases. The pairwise sequence alignment results between A. baumannii’s MurB with A. calcoaceticus’s MurB, A. baumannii’s MurE with A. seifertii’s MurE, and A. baumannii’s MurG with A. pittii’s MurG showed that every group of the proteins are highly similar with each other and they showed sequence identity of 95.7% and sequence similarity of 97.2%. Conclusion Together with the results of secondary and three-dimensional structure predictions explained that three selected proteins (MurB, MurE, and MurG) from A. baumannii and their related proteins (AcMurB, AsMurE, and ApMurG) belong to mixed αβ class and they are very similar.
Collapse
Affiliation(s)
- Gizachew Muluneh Amera
- Department of Biotechnology, School of Engineering and Technology, Sharda University, Greater Noida, UP, 201310, India
| | - Rameez Jabeer Khan
- Department of Biotechnology, School of Engineering and Technology, Sharda University, Greater Noida, UP, 201310, India
| | - Rajat Kumar Jha
- Department of Biotechnology, School of Engineering and Technology, Sharda University, Greater Noida, UP, 201310, India
| | - Amita Pathak
- Department of Chemistry, Indian Institute of Technology, Hauz Khas, New Delhi, 110016, India
| | - Jayaraman Muthukumaran
- Department of Biotechnology, School of Engineering and Technology, Sharda University, Greater Noida, UP, 201310, India
| | - Amit Kumar Singh
- Department of Biotechnology, School of Engineering and Technology, Sharda University, Greater Noida, UP, 201310, India.
| |
Collapse
|
7
|
Choudhary P, Nagar R, Singh V, Bhat AH, Sharma Y, Rao A. ProGlycProt V2.0, a repository of experimentally validated glycoproteins and protein glycosyltransferases of prokaryotes. Glycobiology 2020; 29:461-468. [PMID: 30835791 DOI: 10.1093/glycob/cwz013] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2018] [Revised: 02/22/2019] [Accepted: 03/01/2019] [Indexed: 01/05/2023] Open
Abstract
Knowledge of glycosylation status and glycan-pattern of proteins are of considerable medical, academic and application interest. ProGlycProt V2.0 (www.proglycprot.org) therefore, is conceived and maintained as an exclusive web-resource providing comprehensive information on experimentally validated glycoproteins and protein glycosyltransferases (GTs) of prokaryotic origin. The second release of ProGlycProt features a major update with a 191% increase in the total number of entries, manually collected and curated from 607 peer-reviewed publications, on the subject. Protein GTs from prokaryotes that catalyze a varied range of glycan linkages are amenable glycoengineering tools. Therefore, the second release presents content that is greatly expanded and reorganized in two sub-databases: ProGPdb and ProGTdb. While ProGPdb provides information about validated glycoproteins (222 entries), ProGTdb catalogs enzymes/proteins that are instrumental in protein glycosylation, directly (122) or as accessory proteins (182). ProGlycProt V2.0 remains highly cross-referenced yet exclusive and complementary in content to other related databases. The second release further features enhanced search capability, a "compare" entries option and an innovative geoanalytical tool (MapView) facilitating location-assisted search-cum filtering of the entries using geo-positioning information of researchers/groups cited in the ProGlycProt V2.0 databases. Thus, ProGlycProt V2.0 continues to serve as a useful one-point web-resource on various evidence-based information on protein glycosylation in prokaryotes.
Collapse
Affiliation(s)
| | - Rupa Nagar
- CSIR-Institute of Microbial Technology, Sector 39 A, Chandigarh, India
| | - Vaidhvi Singh
- CSIR-Institute of Microbial Technology, Sector 39 A, Chandigarh, India
| | | | - Yogita Sharma
- CSIR-Institute of Microbial Technology, Sector 39 A, Chandigarh, India
| | - Alka Rao
- CSIR-Institute of Microbial Technology, Sector 39 A, Chandigarh, India
| |
Collapse
|
8
|
Bhat AH, Maity S, Giri K, Ambatipudi K. Protein glycosylation: Sweet or bitter for bacterial pathogens? Crit Rev Microbiol 2019; 45:82-102. [PMID: 30632429 DOI: 10.1080/1040841x.2018.1547681] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]
Abstract
Protein glycosylation systems in many bacteria are often associated with crucial biological processes like pathogenicity, immune evasion and host-pathogen interactions, implying the significance of protein-glycan linkage. Similarly, host protein glycosylation has been implicated in antimicrobial activity as well as in promoting growth of beneficial strains. In fact, few pathogens notably modulate host glycosylation machineries to facilitate their survival. To date, diverse chemical and biological strategies have been developed for conjugate vaccine production for disease control. Bioconjugate vaccines, largely being produced by glycoengineering using PglB (the N-oligosaccharyltransferase from Campylobacter jejuni) in suitable bacterial hosts, have been highly promising with respect to their effectiveness in providing protective immunity and ease of production. Recently, a novel method of glycoconjugate vaccine production involving an O-oligosaccharyltransferase, PglL from Neisseria meningitidis, has been optimized. Nevertheless, many questions on defining antigenic determinants, glycosylation markers, species-specific differences in glycosylation machineries, etc. still remain unanswered, necessitating further exploration of the glycosylation systems of important pathogens. Hence, in this review, we will discuss the impact of bacterial protein glycosylation on its pathogenesis and the interaction of pathogens with host protein glycosylation, followed by a discussion on strategies used for bioconjugate vaccine development.
Collapse
Affiliation(s)
- Aadil Hussain Bhat
- a Department of Biotechnology , Indian Institute of Technology Roorkee , Roorkee , Uttarakhand 247667 , India
| | - Sudipa Maity
- a Department of Biotechnology , Indian Institute of Technology Roorkee , Roorkee , Uttarakhand 247667 , India
| | - Kuldeep Giri
- a Department of Biotechnology , Indian Institute of Technology Roorkee , Roorkee , Uttarakhand 247667 , India
| | - Kiran Ambatipudi
- a Department of Biotechnology , Indian Institute of Technology Roorkee , Roorkee , Uttarakhand 247667 , India
| |
Collapse
|
9
|
Fujinami D, Taguchi Y, Kohda D. Asn-linked oligosaccharide chain of a crenarchaeon, Pyrobaculum calidifontis, is reminiscent of the eukaryotic high-mannose-type glycan. Glycobiology 2017; 27:701-712. [PMID: 28510654 DOI: 10.1093/glycob/cwx044] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2017] [Revised: 04/18/2017] [Accepted: 05/13/2017] [Indexed: 12/19/2022] Open
Abstract
Pyrobaculum calidifontis is a hyperthermophilic archaeon that belongs to the phylum Crenarchaeota. In contrast to the phylum Euryarchaeota, only the N-glycan structure of the genus Sulfolobus is known in Crenarchaeota. Here, we enriched glycoproteins from cultured P. calidifontis cells, by ConA lectin chromatography. The MASCOT search identified proteins with at least one potential N-glycosylation site. The tandem mass spectrometry (MS/MS) analysis of 12 small tryptic glycopeptides confirmed the canonical N-glycosylation consensus in P. calidifontis. We determined the N-linked oligosaccharide structure produced by an in vitro enzymatic oligosaccharyl transfer reaction. Pyrobaculum calidifontis cells were cultured in rich medium supplemented with 13C-glucose, for the metabolic labeling of N-oligosaccharide donors. An incubation with a synthetic peptide substrate produced glycopeptides with isotopically labeled oligosaccharide moieties. The MS and nuclear magnetic resonance analyses revealed that the P. calidifontisN-glycan has a biantennary, high-mannose-type structure consisting of up to 11 monosaccharide residues. The base portion of the P. calidifontisN-glycan strongly resembles the eukaryotic core structure, α-Man-(1-3)-(α-Man-(1-6)-)β-Man-(1-4)-β-GlcNAc-(1-4)-β-GlcNAc-Asn. Structural differences exist in the anomeric configuration between Man and GlcNAc, and the chitobiose structure is chemically modified: one GlcNAc residue is oxidized to glucoronate, and the GlcNAc residues are both modified with an additional acetamido group at the C-3 position. As a result, the core structure of the P. calidifontisN-glycan is α-Man-(1-3)-(α-Man-(1-6)-)α-Man-(1-4)-β-GlcANAc3NAc-(1-4)-β-GlcNAc3NAc-Asn, in which the unique features of the P. calidifontisN-glycan are underlined. In spite of these differences, the structure of the P. calidifontisN-glycan is the most similar to the eukaryotic counterparts, among all archaeal N-glycans reported to date.
Collapse
Affiliation(s)
- Daisuke Fujinami
- Division of Structural Biology, Medical Institute of Bioregulation, Kyushu University, Maidashi 3-1-1, Higashi-ku, Fukuoka 812-8582, Japan
| | - Yuya Taguchi
- Division of Structural Biology, Medical Institute of Bioregulation, Kyushu University, Maidashi 3-1-1, Higashi-ku, Fukuoka 812-8582, Japan
| | - Daisuke Kohda
- Division of Structural Biology, Medical Institute of Bioregulation, Kyushu University, Maidashi 3-1-1, Higashi-ku, Fukuoka 812-8582, Japan.,Research Center for Advanced Immunology, Kyushu University, Maidashi 3-1-1, Higashi-ku, Fukuoka 812-8582, Japan.,Research Center for Live-Protein Dynamics, Kyushu University, Maidashi 3-1-1, Higashi-ku, Fukuoka 812-8582, Japan
| |
Collapse
|
10
|
Nagar R, Rao A. An iterative glycosyltransferase EntS catalyzes transfer and extension of O- and S-linked monosaccharide in enterocin 96. Glycobiology 2017; 27:766-776. [PMID: 28498962 DOI: 10.1093/glycob/cwx042] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2016] [Revised: 05/05/2017] [Accepted: 05/09/2017] [Indexed: 01/18/2023] Open
Abstract
Glycosyltransferases are essential tools for in vitro glycoengineering. Bacteria harbor an unexplored variety of protein glycosyltransferases. Here, we describe a peptide glycosyltransferase (EntS) encoded by ORF0417 of Enterococcus faecalis TX0104. EntS di-glycosylates linear peptide of enterocin 96 - a known antibacterial, in vitro. It is capable of transferring as well as extending the glycan onto the peptide in an iterative sequential dissociative manner. It can catalyze multiple linkages: Glc/Gal(-O)Ser/Thr, Glc/Gal(-S)Cys and Glc/Gal(β)Glc/Gal(-O/S)Ser/Thr/Cys, in one pot. Using EntS generated glycovariants of enterocin 96 peptide, size and identity of the glycan are found to influence bioactivity of the peptide. The study identifies EntS as an enzyme worth pursuing, for in vitro peptide glycoengineering.
Collapse
Affiliation(s)
- Rupa Nagar
- CSIR-Institute of Microbial Technology, Sector 39A, Chandigarh 160036, India
| | - Alka Rao
- CSIR-Institute of Microbial Technology, Sector 39A, Chandigarh 160036, India
| |
Collapse
|
11
|
Harvey DJ. Analysis of carbohydrates and glycoconjugates by matrix-assisted laser desorption/ionization mass spectrometry: An update for 2011-2012. MASS SPECTROMETRY REVIEWS 2017; 36:255-422. [PMID: 26270629 DOI: 10.1002/mas.21471] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/19/2014] [Accepted: 01/15/2015] [Indexed: 06/04/2023]
Abstract
This review is the seventh update of the original article published in 1999 on the application of MALDI mass spectrometry to the analysis of carbohydrates and glycoconjugates and brings coverage of the literature to the end of 2012. General aspects such as theory of the MALDI process, matrices, derivatization, MALDI imaging, and fragmentation are covered in the first part of the review and applications to various structural types constitute the remainder. The main groups of compound are oligo- and poly-saccharides, glycoproteins, glycolipids, glycosides, and biopharmaceuticals. Much of this material is presented in tabular form. Also discussed are medical and industrial applications of the technique, studies of enzyme reactions, and applications to chemical synthesis. © 2015 Wiley Periodicals, Inc. Mass Spec Rev 36:255-422, 2017.
Collapse
Affiliation(s)
- David J Harvey
- Department of Biochemistry, Oxford Glycobiology Institute, University of Oxford, Oxford, OX1 3QU, UK
| |
Collapse
|
12
|
Eichler J, Koomey M. Sweet New Roles for Protein Glycosylation in Prokaryotes. Trends Microbiol 2017; 25:662-672. [PMID: 28341406 DOI: 10.1016/j.tim.2017.03.001] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2017] [Revised: 02/19/2017] [Accepted: 03/01/2017] [Indexed: 12/29/2022]
Abstract
Long-held to be a post-translational modification unique to Eukarya, it is now clear that both Bacteria and Archaea also perform protein glycosylation, namely the covalent attachment of mono- to polysaccharides to specific protein targets. At the same time, many of the roles assigned to this protein-processing event in eukaryotes, such as guiding protein folding/quality control, intracellular trafficking, dictating cellular recognition events and others, do not apply or are even irrelevant to prokaryotes. As such, protein glycosylation must serve novel functions in Bacteria and Archaea. Recent efforts have begun to elucidate some of these prokaryote-specific roles, which are addressed in this review.
Collapse
Affiliation(s)
- Jerry Eichler
- Department of Life Sciences, Ben Gurion University of the Negev, Beersheva 84105, Israel.
| | - Michael Koomey
- Department of Biosciences, University of Oslo, 0316 Oslo, Norway
| |
Collapse
|
13
|
Abstract
Databases play an increasingly important role in biology. They archive, store, maintain, and share information on genes, genomes, expression data, protein sequences and structures, metabolites and reactions, interactions, and pathways. All these data are critically important to microbiologists. Furthermore, microbiology has its own databases that deal with model microorganisms, microbial diversity, physiology, and pathogenesis. Thousands of biological databases are currently available, and it becomes increasingly difficult to keep up with their development. The purpose of this minireview is to provide a brief survey of current databases that are of interest to microbiologists.
Collapse
|
14
|
Gupta S, Chavan S, Deobagkar DN, Deobagkar DD. Bio/chemoinformatics in India: an outlook. Brief Bioinform 2014; 16:710-31. [PMID: 25159593 DOI: 10.1093/bib/bbu028] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2014] [Accepted: 07/28/2014] [Indexed: 12/25/2022] Open
Abstract
With the advent of significant establishment and development of Internet facilities and computational infrastructure, an overview on bio/chemoinformatics is presented along with its multidisciplinary facts, promises and challenges. The Government of India has paved the way for more profound research in biological field with the use of computational facilities and schemes/projects to collaborate with scientists from different disciplines. Simultaneously, the growth of available biomedical data has provided fresh insight into the nature of redundant and compensatory data. Today, bioinformatics research in India is characterized by a powerful grid computing systems, great variety of biological questions addressed and the close collaborations between scientists and clinicians, with a full spectrum of focuses ranging from database building and methods development to biological discoveries. In fact, this outlook provides a resourceful platform highlighting the funding agencies, institutes and industries working in this direction, which would certainly be of great help to students seeking their career in bioinformatics. Thus, in short, this review highlights the current bio/chemoinformatics trend, educations, status, diverse applicability and demands for further development.
Collapse
|
15
|
Dave S, Nanduri R, Dkhar HK, Bhagyaraj E, Rao A, Gupta P. Nuclear MEK1 sequesters PPARγ and bisects MEK1/ERK signaling: a non-canonical pathway of retinoic acid inhibition of adipocyte differentiation. PLoS One 2014; 9:e100862. [PMID: 24959884 PMCID: PMC4069188 DOI: 10.1371/journal.pone.0100862] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2014] [Accepted: 05/31/2014] [Indexed: 02/07/2023] Open
Abstract
Uncontrolled adipogenesis and adipocyte proliferation have been connected to human comorbidities. Retinoic acid (RA) is known to inhibit adipocyte differentiation, however the underlying mechanisms have not been adequately understood. This study reports that RA acting as a ligand to RA receptors (RARs and RXRs) is not a sine qua non to the inhibition of adipogenesis. Our intriguing observation of a negative correlation between increased retinoylation and adipogenesis led us to explore retinoylated proteins in adipocytes. Exportin (CRM1) was found to be retinoylated, which in turn can affect the spatio-temporal regulation of the important signaling molecule mitogen-activated protein kinase kinase 1 (MEK1), likely by disrupting its export from the nucleus. Nuclear enrichment of MEK1 physically sequesters peroxisome proliferator-activated receptor gamma (PPARγ), the master regulator of adipogenesis, from its target genes and thus inhibits adipogenesis while also disrupting the MEK1-extracellular-signal regulated kinase (ERK) signaling cascade. This study is first to report the inhibition of adipocyte differentiation by retinoylation.
Collapse
Affiliation(s)
- Sandeep Dave
- CSIR-Institute of Microbial Technology, Chandigarh, India
| | | | | | - Ella Bhagyaraj
- CSIR-Institute of Microbial Technology, Chandigarh, India
| | - Alka Rao
- CSIR-Institute of Microbial Technology, Chandigarh, India
| | - Pawan Gupta
- CSIR-Institute of Microbial Technology, Chandigarh, India
| |
Collapse
|
16
|
Jarrell KF, Ding Y, Meyer BH, Albers SV, Kaminski L, Eichler J. N-linked glycosylation in Archaea: a structural, functional, and genetic analysis. Microbiol Mol Biol Rev 2014; 78:304-41. [PMID: 24847024 PMCID: PMC4054257 DOI: 10.1128/mmbr.00052-13] [Citation(s) in RCA: 146] [Impact Index Per Article: 14.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
N-glycosylation of proteins is one of the most prevalent posttranslational modifications in nature. Accordingly, a pathway with shared commonalities is found in all three domains of life. While excellent model systems have been developed for studying N-glycosylation in both Eukarya and Bacteria, an understanding of this process in Archaea was hampered until recently by a lack of effective molecular tools. However, within the last decade, impressive advances in the study of the archaeal version of this important pathway have been made for halophiles, methanogens, and thermoacidophiles, combining glycan structural information obtained by mass spectrometry with bioinformatic, genetic, biochemical, and enzymatic data. These studies reveal both features shared with the eukaryal and bacterial domains and novel archaeon-specific aspects. Unique features of N-glycosylation in Archaea include the presence of unusual dolichol lipid carriers, the use of a variety of linking sugars that connect the glycan to proteins, the presence of novel sugars as glycan constituents, the presence of two very different N-linked glycans attached to the same protein, and the ability to vary the N-glycan composition under different growth conditions. These advances are the focus of this review, with an emphasis on N-glycosylation pathways in Haloferax, Methanococcus, and Sulfolobus.
Collapse
Affiliation(s)
- Ken F Jarrell
- Department of Biomedical and Molecular Sciences, Queen's University, Kingston, Ontario, Canada
| | - Yan Ding
- Department of Biomedical and Molecular Sciences, Queen's University, Kingston, Ontario, Canada
| | - Benjamin H Meyer
- Molecular Biology of Archaea, Max Planck Institute for Terrestrial Microbiology, Marburg, Germany
| | - Sonja-Verena Albers
- Molecular Biology of Archaea, Max Planck Institute for Terrestrial Microbiology, Marburg, Germany
| | - Lina Kaminski
- Department of Life Sciences, Ben Gurion University, Beersheva, Israel
| | - Jerry Eichler
- Department of Life Sciences, Ben Gurion University, Beersheva, Israel
| |
Collapse
|
17
|
Craveur P, Rebehmed J, de Brevern AG. PTM-SD: a database of structurally resolved and annotated posttranslational modifications in proteins. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2014; 2014:bau041. [PMID: 24857970 PMCID: PMC4038255 DOI: 10.1093/database/bau041] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
Posttranslational modifications (PTMs) define covalent and chemical modifications of protein residues. They play important roles in modulating various biological functions. Current PTM databases contain important sequence annotations but do not provide informative 3D structural resource about these modifications. Posttranslational modification structural database (PTM-SD) provides access to structurally solved modified residues, which are experimentally annotated as PTMs. It combines different PTM information and annotation gathered from other databases, e.g. Protein DataBank for the protein structures and dbPTM and PTMCuration for fine sequence annotation. PTM-SD gives an accurate detection of PTMs in structural data. PTM-SD can be browsed by PDB id, UniProt accession number, organism and classic PTM annotation. Advanced queries can also be performed, i.e. detailed PTM annotations, amino acid type, secondary structure, SCOP class classification, PDB chain length and number of PTMs by chain. Statistics and analyses can be computed on a selected dataset of PTMs. Each PTM entry is detailed in a dedicated page with information on the protein sequence, local conformation with secondary structure and Protein Blocks. PTM-SD gives valuable information on observed PTMs in protein 3D structure, which is of great interest for studying sequence-structure- function relationships at the light of PTMs, and could provide insights for comparative modeling and PTM predictions protocols. Database URL: PTM-SD can be accessed at http://www.dsimb.inserm.fr/dsimb_tools/PTM-SD/.
Collapse
Affiliation(s)
- Pierrick Craveur
- INSERM, U 1134, DSIMB, F-75739 Paris, France, Univ Paris Diderot, Sorbonne Paris Cité, UMR-S 1134, F-75739 Paris, France, Institut National de la Transfusion Sanguine (INTS), F-75739 Paris, France and Laboratoire d'Excellence GR-Ex, F-75739 Paris, FranceINSERM, U 1134, DSIMB, F-75739 Paris, France, Univ Paris Diderot, Sorbonne Paris Cité, UMR-S 1134, F-75739 Paris, France, Institut National de la Transfusion Sanguine (INTS), F-75739 Paris, France and Laboratoire d'Excellence GR-Ex, F-75739 Paris, FranceINSERM, U 1134, DSIMB, F-75739 Paris, France, Univ Paris Diderot, Sorbonne Paris Cité, UMR-S 1134, F-75739 Paris, France, Institut National de la Transfusion Sanguine (INTS), F-75739 Paris, France and Laboratoire d'Excellence GR-Ex, F-75739 Paris, FranceINSERM, U 1134, DSIMB, F-75739 Paris, France, Univ Paris Diderot, Sorbonne Paris Cité, UMR-S 1134, F-75739 Paris, France, Institut National de la Transfusion Sanguine (INTS), F-75739 Paris, France and Laboratoire d'Excellence GR-Ex, F-75739 Paris, France
| | - Joseph Rebehmed
- INSERM, U 1134, DSIMB, F-75739 Paris, France, Univ Paris Diderot, Sorbonne Paris Cité, UMR-S 1134, F-75739 Paris, France, Institut National de la Transfusion Sanguine (INTS), F-75739 Paris, France and Laboratoire d'Excellence GR-Ex, F-75739 Paris, FranceINSERM, U 1134, DSIMB, F-75739 Paris, France, Univ Paris Diderot, Sorbonne Paris Cité, UMR-S 1134, F-75739 Paris, France, Institut National de la Transfusion Sanguine (INTS), F-75739 Paris, France and Laboratoire d'Excellence GR-Ex, F-75739 Paris, FranceINSERM, U 1134, DSIMB, F-75739 Paris, France, Univ Paris Diderot, Sorbonne Paris Cité, UMR-S 1134, F-75739 Paris, France, Institut National de la Transfusion Sanguine (INTS), F-75739 Paris, France and Laboratoire d'Excellence GR-Ex, F-75739 Paris, FranceINSERM, U 1134, DSIMB, F-75739 Paris, France, Univ Paris Diderot, Sorbonne Paris Cité, UMR-S 1134, F-75739 Paris, France, Institut National de la Transfusion Sanguine (INTS), F-75739 Paris, France and Laboratoire d'Excellence GR-Ex, F-75739 Paris, France
| | - Alexandre G de Brevern
- INSERM, U 1134, DSIMB, F-75739 Paris, France, Univ Paris Diderot, Sorbonne Paris Cité, UMR-S 1134, F-75739 Paris, France, Institut National de la Transfusion Sanguine (INTS), F-75739 Paris, France and Laboratoire d'Excellence GR-Ex, F-75739 Paris, FranceINSERM, U 1134, DSIMB, F-75739 Paris, France, Univ Paris Diderot, Sorbonne Paris Cité, UMR-S 1134, F-75739 Paris, France, Institut National de la Transfusion Sanguine (INTS), F-75739 Paris, France and Laboratoire d'Excellence GR-Ex, F-75739 Paris, FranceINSERM, U 1134, DSIMB, F-75739 Paris, France, Univ Paris Diderot, Sorbonne Paris Cité, UMR-S 1134, F-75739 Paris, France, Institut National de la Transfusion Sanguine (INTS), F-75739 Paris, France and Laboratoire d'Excellence GR-Ex, F-75739 Paris, FranceINSERM, U 1134, DSIMB, F-75739 Paris, France, Univ Paris Diderot, Sorbonne Paris Cité, UMR-S 1134, F-75739 Paris, France, Institut National de la Transfusion Sanguine (INTS), F-75739 Paris, France and Laboratoire d'Excellence GR-Ex, F-75739 Paris, France
| |
Collapse
|
18
|
Chauhan JS, Rao A, Raghava GPS. In silico platform for prediction of N-, O- and C-glycosites in eukaryotic protein sequences. PLoS One 2013; 8:e67008. [PMID: 23840574 PMCID: PMC3695939 DOI: 10.1371/journal.pone.0067008] [Citation(s) in RCA: 156] [Impact Index Per Article: 14.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2012] [Accepted: 05/17/2013] [Indexed: 11/19/2022] Open
Abstract
Glycosylation is one of the most abundant and an important post-translational modification of proteins. Glycosylated proteins (glycoproteins) are involved in various cellular biological functions like protein folding, cell-cell interactions, cell recognition and host-pathogen interactions. A large number of eukaryotic glycoproteins also have therapeutic and potential technology applications. Therefore, characterization and analysis of glycosites (glycosylated residues) in these proteins is of great interest to biologists. In order to cater these needs a number of in silico tools have been developed over the years, however, a need to get even better prediction tools remains. Therefore, in this study we have developed a new webserver GlycoEP for more accurate prediction of N-linked, O-linked and C-linked glycosites in eukaryotic glycoproteins using two larger datasets, namely, standard and advanced datasets. In case of standard datasets no two glycosylated proteins are more similar than 40%; advanced datasets are highly non-redundant where no two glycosites’ patterns (as defined in methods) have more than 60% similarity. Further, based on our results with several algorihtms developed using different machine-learning techniques, we found Support Vector Machine (SVM) as optimum tool to develop glycosite prediction models. Accordingly, using our more stringent and non-redundant advanced datasets, the SVM based models developed in this study achieved a prediction accuracy of 84.26%, 86.87% and 91.43% with corresponding MCC of 0.54, 0.20 and 0.78, for N-, O- and C-linked glycosites, respectively. The best performing models trained on advanced datasets were then implemented as a user-friendly web server GlycoEP (http://www.imtech.res.in/raghava/glycoep/). Additionally, this server provides prediction models developed on standard datasets and allows users to scan sequons in input protein sequences.
Collapse
Affiliation(s)
| | - Alka Rao
- Protein Science and Engineering, Institute of Microbial Technology, Chandigarh, India
| | | |
Collapse
|
19
|
Theillet FX, Smet-Nocca C, Liokatis S, Thongwichian R, Kosten J, Yoon MK, Kriwacki RW, Landrieu I, Lippens G, Selenko P. Cell signaling, post-translational protein modifications and NMR spectroscopy. JOURNAL OF BIOMOLECULAR NMR 2012; 54:217-36. [PMID: 23011410 PMCID: PMC4939263 DOI: 10.1007/s10858-012-9674-x] [Citation(s) in RCA: 117] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/28/2012] [Accepted: 09/07/2012] [Indexed: 05/13/2023]
Abstract
Post-translationally modified proteins make up the majority of the proteome and establish, to a large part, the impressive level of functional diversity in higher, multi-cellular organisms. Most eukaryotic post-translational protein modifications (PTMs) denote reversible, covalent additions of small chemical entities such as phosphate-, acyl-, alkyl- and glycosyl-groups onto selected subsets of modifiable amino acids. In turn, these modifications induce highly specific changes in the chemical environments of individual protein residues, which are readily detected by high-resolution NMR spectroscopy. In the following, we provide a concise compendium of NMR characteristics of the main types of eukaryotic PTMs: serine, threonine, tyrosine and histidine phosphorylation, lysine acetylation, lysine and arginine methylation, and serine, threonine O-glycosylation. We further delineate the previously uncharacterized NMR properties of lysine propionylation, butyrylation, succinylation, malonylation and crotonylation, which, altogether, define an initial reference frame for comprehensive PTM studies by high-resolution NMR spectroscopy.
Collapse
Affiliation(s)
- Francois-Xavier Theillet
- Department of NMR-Supported Structural Biology, Leibniz Institute of Molecular Pharmacology (FMP Berlin), In-cell NMR Group, Robert-Roessle Strasse 10, 13125 Berlin, German
| | - Caroline Smet-Nocca
- CNRS UMR 8576, Universite Lille Nord de France, 59655 Villeneuve d’Ascq, France
| | - Stamatios Liokatis
- Department of NMR-Supported Structural Biology, Leibniz Institute of Molecular Pharmacology (FMP Berlin), In-cell NMR Group, Robert-Roessle Strasse 10, 13125 Berlin, German
| | - Rossukon Thongwichian
- Department of NMR-Supported Structural Biology, Leibniz Institute of Molecular Pharmacology (FMP Berlin), In-cell NMR Group, Robert-Roessle Strasse 10, 13125 Berlin, German
| | - Jonas Kosten
- Department of NMR-Supported Structural Biology, Leibniz Institute of Molecular Pharmacology (FMP Berlin), In-cell NMR Group, Robert-Roessle Strasse 10, 13125 Berlin, German
| | - Mi-Kyung Yoon
- Department of Structural Biology, St. Jude Children’s Research Hospital, Memphis, TN, USA
| | - Richard W. Kriwacki
- Department of Structural Biology, St. Jude Children’s Research Hospital, Memphis, TN, USA
| | - Isabelle Landrieu
- CNRS UMR 8576, Universite Lille Nord de France, 59655 Villeneuve d’Ascq, France
| | - Guy Lippens
- CNRS UMR 8576, Universite Lille Nord de France, 59655 Villeneuve d’Ascq, France
| | - Philipp Selenko
- Department of NMR-Supported Structural Biology, Leibniz Institute of Molecular Pharmacology (FMP Berlin), In-cell NMR Group, Robert-Roessle Strasse 10, 13125 Berlin, German
| |
Collapse
|
20
|
Chauhan JS, Bhat AH, Raghava GPS, Rao A. GlycoPP: a webserver for prediction of N- and O-glycosites in prokaryotic protein sequences. PLoS One 2012; 7:e40155. [PMID: 22808107 PMCID: PMC3392279 DOI: 10.1371/journal.pone.0040155] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2012] [Accepted: 06/01/2012] [Indexed: 12/30/2022] Open
Abstract
Glycosylation is one of the most abundant post-translational modifications (PTMs) required for various structure/function modulations of proteins in a living cell. Although elucidated recently in prokaryotes, this type of PTM is present across all three domains of life. In prokaryotes, two types of protein glycan linkages are more widespread namely, N- linked, where a glycan moiety is attached to the amide group of Asn, and O- linked, where a glycan moiety is attached to the hydroxyl group of Ser/Thr/Tyr. For their biologically ubiquitous nature, significance, and technology applications, the study of prokaryotic glycoproteins is a fast emerging area of research. Here we describe new Support Vector Machine (SVM) based algorithms (models) developed for predicting glycosylated-residues (glycosites) with high accuracy in prokaryotic protein sequences. The models are based on binary profile of patterns, composition profile of patterns, and position-specific scoring matrix profile of patterns as training features. The study employ an extensive dataset of 107 N-linked and 116 O-linked glycosites extracted from 59 experimentally characterized glycoproteins of prokaryotes. This dataset includes validated N-glycosites from phyla Crenarchaeota, Euryarchaeota (domain Archaea), Proteobacteria (domain Bacteria) and validated O-glycosites from phyla Actinobacteria, Bacteroidetes, Firmicutes and Proteobacteria (domain Bacteria). In view of the current understanding that glycosylation occurs on folded proteins in bacteria, hybrid models have been developed using information on predicted secondary structures and accessible surface area in various combinations with training features. Using these models, N-glycosites and O-glycosites could be predicted with an accuracy of 82.71% (MCC 0.65) and 73.71% (MCC 0.48), respectively. An evaluation of the best performing models with 28 independent prokaryotic glycoproteins confirms the suitability of these models in predicting N- and O-glycosites in potential glycoproteins from aforementioned organisms, with reasonably high confidence. A web server GlycoPP, implementing these models is available freely at http:/www.imtech.res.in/raghava/glycopp/.
Collapse
Affiliation(s)
- Jagat S. Chauhan
- Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research, Chandigarh, India
| | - Adil H. Bhat
- Protein Science and Engineering, Institute of Microbial Technology, Council of Scientific and Industrial Research, Chandigarh, India
| | - Gajendra P. S. Raghava
- Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research, Chandigarh, India
- * E-mail: (AR); (GPSR)
| | - Alka Rao
- Protein Science and Engineering, Institute of Microbial Technology, Council of Scientific and Industrial Research, Chandigarh, India
- * E-mail: (AR); (GPSR)
| |
Collapse
|