1
|
Sasmal A, Khan N, Khedri Z, Kellman BP, Srivastava S, Verhagen A, Yu H, Bruntse AB, Diaz S, Varki N, Beddoe T, Paton AW, Paton JC, Chen X, Lewis NE, Varki A. Simple and practical sialoglycan encoding system reveals vast diversity in nature and identifies a universal sialoglycan-recognizing probe derived from AB5 toxin B subunits. Glycobiology 2022; 32:1101-1115. [PMID: 36048714 PMCID: PMC9680115 DOI: 10.1093/glycob/cwac057] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2022] [Revised: 08/20/2022] [Accepted: 08/23/2022] [Indexed: 01/07/2023] Open
Abstract
Vertebrate sialic acids (Sias) display much diversity in modifications, linkages, and underlying glycans. Slide microarrays allow high-throughput explorations of sialoglycan-protein interactions. A microarray presenting ~150 structurally defined sialyltrisaccharides with various Sias linkages and modifications still poses challenges in planning, data sorting, visualization, and analysis. To address these issues, we devised a simple 9-digit code for sialyltrisaccharides with terminal Sias and underlying two monosaccharides assigned from the nonreducing end, with 3 digits assigning a monosaccharide, its modifications, and linkage. Calculations based on the encoding system reveal >113,000 likely linear sialyltrisaccharides in nature. Notably, a biantennary N-glycan with 2 terminal sialyltrisaccharides could thus have >1010 potential combinations and a triantennary N-glycan with 3 terminal sequences, >1015 potential combinations. While all possibilities likely do not exist in nature, sialoglycans encode enormous diversity. While glycomic approaches are used to probe such diverse sialomes, naturally occurring bacterial AB5 toxin B subunits are simpler tools to track the dynamic sialome in biological systems. Sialoglycan microarray was utilized to compare sialoglycan-recognizing bacterial toxin B subunits. Unlike the poor correlation between B subunits and species phylogeny, there is stronger correlation with Sia-epitope preferences. Further supporting this pattern, we report a B subunit (YenB) from Yersinia enterocolitica (broad host range) recognizing almost all sialoglycans in the microarray, including 4-O-acetylated-Sias not recognized by a Yersinia pestis orthologue (YpeB). Differential Sia-binding patterns were also observed with phylogenetically related B subunits from Escherichia coli (SubB), Salmonella Typhi (PltB), Salmonella Typhimurium (ArtB), extra-intestinal E.coli (EcPltB), Vibrio cholera (CtxB), and cholera family homologue of E. coli (EcxB).
Collapse
Affiliation(s)
- Aniruddha Sasmal
- Glycobiology Research and Training Center, University of California San Diego, La Jolla, CA 92093, USA
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA 92093, USA
| | - Naazneen Khan
- Glycobiology Research and Training Center, University of California San Diego, La Jolla, CA 92093, USA
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA 92093, USA
| | - Zahra Khedri
- Glycobiology Research and Training Center, University of California San Diego, La Jolla, CA 92093, USA
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA 92093, USA
| | - Benjamin P Kellman
- Department of Pediatrics, University of California San Diego, La Jolla, CA 92093, USA
| | - Saurabh Srivastava
- Glycobiology Research and Training Center, University of California San Diego, La Jolla, CA 92093, USA
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA 92093, USA
| | - Andrea Verhagen
- Glycobiology Research and Training Center, University of California San Diego, La Jolla, CA 92093, USA
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA 92093, USA
| | - Hai Yu
- Department of Chemistry, University of California Davis, CA 95616, USA
| | - Anders Bech Bruntse
- Department of Pediatrics, University of California San Diego, La Jolla, CA 92093, USA
| | - Sandra Diaz
- Glycobiology Research and Training Center, University of California San Diego, La Jolla, CA 92093, USA
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA 92093, USA
| | - Nissi Varki
- Glycobiology Research and Training Center, University of California San Diego, La Jolla, CA 92093, USA
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA 92093, USA
| | - Travis Beddoe
- Department of Animal, Plant and Soil Science and Centre for AgriBioscience, La Trobe University, Bundoora, Victoria 3086, Australia
| | - Adrienne W Paton
- Research Centre for Infectious Diseases, Department of Molecular and Biomedical Science, University of Adelaide, Adelaide, SA 5005, Australia
| | - James C Paton
- Research Centre for Infectious Diseases, Department of Molecular and Biomedical Science, University of Adelaide, Adelaide, SA 5005, Australia
| | - Xi Chen
- Department of Chemistry, University of California Davis, CA 95616, USA
| | - Nathan E Lewis
- Glycobiology Research and Training Center, University of California San Diego, La Jolla, CA 92093, USA
- Department of Pediatrics, University of California San Diego, La Jolla, CA 92093, USA
| | - Ajit Varki
- Glycobiology Research and Training Center, University of California San Diego, La Jolla, CA 92093, USA
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA 92093, USA
- Research Centre for Infectious Diseases, Department of Molecular and Biomedical Science, University of Adelaide, Adelaide, SA 5005, Australia
- Department of Medicine, University of California at San Diego, La Jolla, CA 92093, USA
| |
Collapse
|
5
|
Harvey DJ. Analysis of carbohydrates and glycoconjugates by matrix-assisted laser desorption/ionization mass spectrometry: An update for 2003-2004. MASS SPECTROMETRY REVIEWS 2009; 28:273-361. [PMID: 18825656 PMCID: PMC7168468 DOI: 10.1002/mas.20192] [Citation(s) in RCA: 79] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/15/2008] [Revised: 07/07/2008] [Accepted: 07/07/2008] [Indexed: 05/13/2023]
Abstract
This review is the third update of the original review, published in 1999, on the application of matrix-assisted laser desorption/ionization (MALDI) mass spectrometry to the analysis of carbohydrates and glycoconjugates and brings the topic to the end of 2004. Both fundamental studies and applications are covered. The main topics include methodological developments, matrices, fragmentation of carbohydrates and applications to large polymeric carbohydrates from plants, glycans from glycoproteins and those from various glycolipids. Other topics include the use of MALDI MS to study enzymes related to carbohydrate biosynthesis and degradation, its use in industrial processes, particularly biopharmaceuticals and its use to monitor products of chemical synthesis where glycodendrimers and carbohydrate-protein complexes are highlighted.
Collapse
Affiliation(s)
- David J Harvey
- Department of Biochemistry, Oxford Glycobiology Institute, University of Oxford, Oxford OX1 3QU, UK.
| |
Collapse
|
7
|
Yusufi FNK, Park W, Lee MM, Lee DY. An alpha-numeric code for representing N-linked glycan structures in secreted glycoproteins. Bioprocess Biosyst Eng 2008; 32:97-107. [PMID: 18458952 DOI: 10.1007/s00449-008-0226-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2008] [Accepted: 04/16/2008] [Indexed: 11/25/2022]
Abstract
Advances in high-throughput techniques have led to the creation of increasing amounts of glycome data. The storage and analysis of this data would benefit greatly from a compact notation for describing glycan structures that can be easily stored and interpreted by computers. Towards this end, we propose a fixed-length alpha-numeric code for representing N-linked glycan structures commonly found in secreted glycoproteins from mammalian cell cultures. This code, GlycoDigit, employs a pre-assigned alpha-numeric index to represent the monosaccharides attached in different branches to the core glycan structure. The present branch-centric representation allows us to visualize the structure while the numerical nature of the code makes it machine readable. In addition, a difference operator can be defined to quantitatively differentiate between glycan structures for further analysis. The usefulness and applicability of GlycoDigit were demonstrated by constructing and visualizing an N-linked glycosylation network.
Collapse
Affiliation(s)
- Faraaz Noor Khan Yusufi
- Bioprocessing Technology Institute, Biomedical Sciences Institutes, Agency for Science, Technology and Research (A*STAR), Singapore
| | | | | | | |
Collapse
|
8
|
Werz DB, Ranzinger R, Herget S, Adibekian A, von der Lieth CW, Seeberger PH. Exploring the structural diversity of mammalian carbohydrates ("glycospace") by statistical databank analysis. ACS Chem Biol 2007; 2:685-91. [PMID: 18041818 DOI: 10.1021/cb700178s] [Citation(s) in RCA: 209] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The diversity of three major classes of mammalian carbohydrates, mainly glycolipids and O- and N-linked glycans, deposited in the databank GLYCOSCIENCES.de was subjected to statistical analyses. Size, chain length, and branching complexity were accessed and revealed that the average oligosaccharide is composed of about eight monosaccharide units. About a quarter of all oligosaccharides are strictly linear, and the remainder are branched at least once. Glucosamine, galactose, and mannose are dominating and comprise ~75% of the monosaccharides within mammalian oligosaccharide frameworks. alpha-Linked sialic acid, alpha-linked fucose, and beta-linked galactose decorate the majority of reducing termini. Glucose as the most abundant carbohydrate in mammals plays only a very minor role within these structures. Particular emphasis was placed on analyzing the way the monosaccharide units are linked within the oligomeric framework. Just 11 monosaccharide connections account for >75% of all linkages. Thus, the number of structural combinations found in nature, the part of the occupied mammalian glycospace, is much smaller than expected. As a result, a potential set of building blocks for oligosaccharide assembly is presented. This potential building block set was correlated with the accessible 3299 mammalian carbohydrate structures in the GLYCOSCIENCES.de databank. Only 36 building blocks are required to construct 75% of the 3299 mammalian oligosaccharides.
Collapse
Affiliation(s)
- Daniel B. Werz
- Laboratory for Organic Chemistry, Swiss Federal Institute of Technology (ETH) Zürich, HCI F 315, Wolfgang-Pauli-Str. 10, CH-8093 Zürich, Switzerland
- Present address: Institute for Organic and Biomolecular Chemistry, Georg-August-Universität Göttingen, Tammannstr. 2, D-37077 Göttingen, Germany
| | - René Ranzinger
- German Cancer Research Center (DKFZ), Im Neuenheimer Feld 280, D-69120 Heidelberg, Germany
| | - Stephan Herget
- German Cancer Research Center (DKFZ), Im Neuenheimer Feld 280, D-69120 Heidelberg, Germany
| | - Alexander Adibekian
- Laboratory for Organic Chemistry, Swiss Federal Institute of Technology (ETH) Zürich, HCI F 315, Wolfgang-Pauli-Str. 10, CH-8093 Zürich, Switzerland
| | | | - Peter H. Seeberger
- Laboratory for Organic Chemistry, Swiss Federal Institute of Technology (ETH) Zürich, HCI F 315, Wolfgang-Pauli-Str. 10, CH-8093 Zürich, Switzerland
| |
Collapse
|
9
|
von der Lieth CW, Lütteke T, Frank M. The role of informatics in glycobiology research with special emphasis on automatic interpretation of MS spectra. Biochim Biophys Acta Gen Subj 2005; 1760:568-77. [PMID: 16459020 DOI: 10.1016/j.bbagen.2005.12.004] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2005] [Revised: 12/01/2005] [Accepted: 12/01/2005] [Indexed: 12/17/2022]
Abstract
This paper reviews the current status of bioinformatics applications and databases in glycobiology, which are based on bioinformatics approaches as well as informatics for glycobiology where an explicit encoding of glycan structures is required. The availability of the complete sequence of the human genome has accelerated the systematic identification of so far unidentified glycogenes considerably in many areas of glycobiology using well-established bioinfomatics tools. Although there has been an immense development of new glyco-related data collections as well as informatics tools and several efforts have been started to cross-link and reference the various data deposited in distributed databases, informatics for glycobiology and glycomics is still poorly developed compared to the genomics and proteomics area. The development of algorithms for the automatic interpretation of MS spectra - currently, a severe bottleneck, which hampers the rapid and reliable interpretation of MS data in high-throughput glycomics projects - is reviewed. A comprehensive list of web resources is given. Several lines of progression are discussed. There is an urgent need for the development of decentralised input facilities of experimentally determined glycan structures. Simultaneously, agreements of standards for the structural description of glycans as well as formats for the related data have to be established. The integration of glycomics with genomics/proteomics has to increase.
Collapse
Affiliation(s)
- Claus-W von der Lieth
- German Cancer Research Center, Spectroscopic Department (B090), Molecular Modelling, Heidelberg, Germany.
| | | | | |
Collapse
|
10
|
Lütteke T, Bohne-Lang A, Loss A, Goetz T, Frank M, von der Lieth CW. GLYCOSCIENCES.de: an Internet portal to support glycomics and glycobiology research. Glycobiology 2005; 16:71R-81R. [PMID: 16239495 DOI: 10.1093/glycob/cwj049] [Citation(s) in RCA: 174] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The development of glycan-related databases and bioinformatics applications is considerably lagging behind compared with the wealth of available data and software tools in genomics and proteomics. Because the encoding of glycan structures is more complex, most of the bioinformatics approaches cannot be applied to glycan structures. No standard procedures exist where glycan structures found in various species, organs, tissues or cells can be routinely deposited. In this article the concepts of the GLYCOSCIENCES.de portal are described. It is demonstrated how an efficient structure-based cross-linking of various glycan-related data originating from different resources can be accomplished using a single user interface. The structure oriented retrieval options-exact structure, substructure, motif, composition and sugar components-are discussed. The types of available data-references, composition, spatial structures, nuclear magnetic resonance (NMR) shifts (experimental and estimated), theoretically calculated fragments and Protein Database (PDB) entries-are exemplified for Man(3.) The free availability and unrestricted use of glycan-related data is an absolute prerequisite to efficiently share distributed resources. Additionally, there is an urgent need to agree to a generally accepted exchange format as well as to a common software interface. An open access repository for glyco-related experimental data will secure that the loss of primary data will be considerably reduced.
Collapse
Affiliation(s)
- Thomas Lütteke
- Spectroscopic Department(B9090), German Cancer Research Center, Molecular Modelling, Im Neuenheimer Feld 280, D-69120 Heidelberg, Germany
| | | | | | | | | | | |
Collapse
|