1
|
Joeres R, Bojar D, Kalinina OV. GlyLES: Grammar-based Parsing of Glycans from IUPAC-condensed to SMILES. J Cheminform 2023; 15:37. [PMID: 36959676 PMCID: PMC10035253 DOI: 10.1186/s13321-023-00704-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Accepted: 02/18/2023] [Indexed: 03/25/2023] Open
Abstract
Glycans are important polysaccharides on cellular surfaces that are bound to glycoproteins and glycolipids. These are one of the most common post-translational modifications of proteins in eukaryotic cells. They play important roles in protein folding, cell-cell interactions, and other extracellular processes. Changes in glycan structures may influence the course of different diseases, such as infections or cancer. Glycans are commonly represented using the IUPAC-condensed notation. IUPAC-condensed is a textual representation of glycans operating on the same topological level as the Symbol Nomenclature for Glycans (SNFG) that assigns colored, geometrical shapes to the main monomers. These symbols are then connected in tree-like structures, visualizing the glycan structure on a topological level. Yet for a representation on the atomic level, notations such as SMILES should be used. To our knowledge, there is no easy-to-use, general, open-source, and offline tool to convert the IUPAC-condensed notation to SMILES. Here, we present the open-access Python package GlyLES for the generalizable generation of SMILES representations out of IUPAC-condensed representations. GlyLES uses a grammar to read in the monomer tree from the IUPAC-condensed notation. From this tree, the tool can compute the atomic structures of each monomer based on their IUPAC-condensed descriptions. In the last step, it merges all monomers into the atomic structure of a glycan in the SMILES notation. GlyLES is the first package that allows conversion from the IUPAC-condensed notation of glycans to SMILES strings. This may have multiple applications, including straightforward visualization, substructure search, molecular modeling and docking, and a new featurization strategy for machine-learning algorithms. GlyLES is available at https://github.com/kalininalab/GlyLES.
Collapse
Affiliation(s)
- Roman Joeres
- grid.7490.a0000 0001 2238 295XHelmholtz Institute for Pharmaceutical Research Saarland (HIPS), Helmholtz Centre for Infection Research (HZI), Saarbruecken, Germany
- grid.11749.3a0000 0001 2167 7588Center for Bioinformatics, Saarland University, Saarbruecken, Germany
| | - Daniel Bojar
- grid.8761.80000 0000 9919 9582Department of Chemistry and Molecular Biology, University of Gothenburg, Gothenburg, Sweden
- grid.8761.80000 0000 9919 9582Wallenberg Centre for Molecular and Translational Medicine, University of Gothenburg, Gothenburg, Sweden
| | - Olga V. Kalinina
- grid.7490.a0000 0001 2238 295XHelmholtz Institute for Pharmaceutical Research Saarland (HIPS), Helmholtz Centre for Infection Research (HZI), Saarbruecken, Germany
- grid.11749.3a0000 0001 2167 7588Center for Bioinformatics, Saarland University, Saarbruecken, Germany
- grid.11749.3a0000 0001 2167 7588Faculty of Medicine, Saarland University, Homburg, Germany
| |
Collapse
|
2
|
Toukach PV, Shirkovskaya AI. Carbohydrate Structure Database and Other Glycan Databases as an Important Element of Glycoinformatics. RUSSIAN JOURNAL OF BIOORGANIC CHEMISTRY 2022. [DOI: 10.1134/s1068162022030190] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
3
|
Toukach PV, Egorova KS. Source files of the Carbohydrate Structure Database: the way to sophisticated analysis of natural glycans. Sci Data 2022; 9:131. [PMID: 35354826 PMCID: PMC8968703 DOI: 10.1038/s41597-022-01186-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2021] [Accepted: 02/03/2022] [Indexed: 11/18/2022] Open
Abstract
The Carbohydrate Structure Database (CSDB, http://csdb.glycoscience.ru/ ) is a free curated repository storing various data on glycans of bacterial, fungal and plant origins. Currently, it maintains a close-to-full coverage on bacterial and fungal carbohydrates up to the year 2020. The CSDB web-interface provides free access to the database content and dedicated tools. Still, the number of these tools and the types of the corresponding analyses is limited, whereas the database itself contains data that can be used in a broader scope of analytical studies. In this paper, we present CSDB source data files and a self-contained SQL dump, and exemplify their possible application in glycan-related studies. By using CSDB in an SQL format, the user can gain access to the chain length distribution or charge distribution (as an example) in a given set of glycans defined according to specific structural, taxonomic, or other parameters, whereas the source text dump files can be imported to any dedicated database with a specific internal architecture differing from that of CSDB.
Collapse
Affiliation(s)
- Philip V Toukach
- N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Leninsky prospect 47, Moscow, 119991, Russia.
| | - Ksenia S Egorova
- N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Leninsky prospect 47, Moscow, 119991, Russia.
| |
Collapse
|
4
|
Structure of the Polysaccharide Secreted by Vibrio alginolyticus CNCM I-5035 (Epidermist 4.0 TM). Mar Drugs 2020; 18:md18100509. [PMID: 33050246 PMCID: PMC7600630 DOI: 10.3390/md18100509] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2020] [Accepted: 10/06/2020] [Indexed: 01/27/2023] Open
Abstract
Vibrio alginolyticus (CNCM I-5035) secretes an exopolysaccharide used as ingredient in cosmetic industry under the trademark Epidermist 4.0TM. It is appreciated for its ability to improve the physical and chemical barrier functions of the skin by notably increasing the keratinocyte differentiation and epidermal renewal. Composition analyses and in depth characterization of the polysaccharides as well as oligosaccharides obtained by mild acid hydrolyses revealed that it was composed of a repetition unit of three residues: d-galactose (d-Gal), d-N-acetylglucosamine (GlcNAc) and l-N-acetylguluronic acid, of which 30% (M/M) was acetylated in position 3. The complete structure of the polysaccharide was resolved giving the repetition unit: [→3)-α-d-Gal-(1→4)-α-l-GulNAcA/α-l-3OAc-GulNAcA-(1→4)-β-d-GlcNAc-(1→].
Collapse
|
5
|
Databases and Bioinformatic Tools for Glycobiology and Glycoproteomics. Int J Mol Sci 2020; 21:ijms21186727. [PMID: 32937895 PMCID: PMC7556027 DOI: 10.3390/ijms21186727] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2020] [Revised: 09/03/2020] [Accepted: 09/11/2020] [Indexed: 02/07/2023] Open
Abstract
Glycosylation plays critical roles in various biological processes and is closely related to diseases. Deciphering the glycocode in diverse cells and tissues offers opportunities to develop new disease biomarkers and more effective recombinant therapeutics. In the past few decades, with the development of glycobiology, glycomics, and glycoproteomics technologies, a large amount of glycoscience data has been generated. Subsequently, a number of glycobiology databases covering glycan structure, the glycosylation sites, the protein scaffolds, and related glycogenes have been developed to store, analyze, and integrate these data. However, these databases and tools are not well known or widely used by the public, including clinicians and other researchers who are not in the field of glycobiology, but are interested in glycoproteins. In this study, the representative databases of glycan structure, glycoprotein, glycan-protein interactions, glycogenes, and the newly developed bioinformatic tools and integrated portal for glycoproteomics are reviewed. We hope this overview could assist readers in searching for information on glycoproteins of interest, and promote further clinical application of glycobiology.
Collapse
|
6
|
Toukach PV, Egorova KS. New Features of Carbohydrate Structure Database Notation (CSDB Linear), As Compared to Other Carbohydrate Notations. J Chem Inf Model 2019; 60:1276-1289. [DOI: 10.1021/acs.jcim.9b00744] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Philip V. Toukach
- N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Leninsky prosect 47, Moscow, Russia 119991
- National Research University Higher School of Economics, Myasnitskaya 20, Moscow, Russia 101000
| | - Ksenia S. Egorova
- N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Leninsky prosect 47, Moscow, Russia 119991
| |
Collapse
|
7
|
Hager FF, Sützl L, Stefanović C, Blaukopf M, Schäffer C. Pyruvate Substitutions on Glycoconjugates. Int J Mol Sci 2019; 20:E4929. [PMID: 31590345 PMCID: PMC6801904 DOI: 10.3390/ijms20194929] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2019] [Revised: 09/25/2019] [Accepted: 09/27/2019] [Indexed: 12/15/2022] Open
Abstract
Glycoconjugates are the most diverse biomolecules of life. Mostly located at the cell surface, they translate into cell-specific "barcodes" and offer a vast repertoire of functions, including support of cellular physiology, lifestyle, and pathogenicity. Functions can be fine-tuned by non-carbohydrate modifications on the constituting monosaccharides. Among these modifications is pyruvylation, which is present either in enol or ketal form. The most commonly best-understood example of pyruvylation is enol-pyruvylation of N-acetylglucosamine, which occurs at an early stage in the biosynthesis of the bacterial cell wall component peptidoglycan. Ketal-pyruvylation, in contrast, is present in diverse classes of glycoconjugates, from bacteria to algae to yeast-but not in humans. Mild purification strategies preventing the loss of the acid-labile ketal-pyruvyl group have led to a collection of elucidated pyruvylated glycan structures. However, knowledge of involved pyruvyltransferases creating a ring structure on various monosaccharides is scarce, mainly due to the lack of knowledge of fingerprint motifs of these enzymes and the unavailability of genome sequences of the organisms undergoing pyruvylation. This review compiles the current information on the widespread but under-investigated ketal-pyruvylation of monosaccharides, starting with different classes of pyruvylated glycoconjugates and associated functions, leading to pyruvyltransferases, their specificity and sequence space, and insight into pyruvate analytics.
Collapse
Affiliation(s)
- Fiona F Hager
- Department of NanoBiotechnology, NanoGlycobiology unit, Universität für Bodenkultur Wien, Muthgasse 11, A-1190 Vienna, Austria.
| | - Leander Sützl
- Department of Food Science and Technology, Food Biotechnology Laboratory, Muthgasse 11, Universität für Bodenkultur Wien, A-1190 Vienna, Austria.
| | - Cordula Stefanović
- Department of NanoBiotechnology, NanoGlycobiology unit, Universität für Bodenkultur Wien, Muthgasse 11, A-1190 Vienna, Austria.
| | - Markus Blaukopf
- Department of Chemistry, Division of Organic Chemistry, Universität für Bodenkultur Wien, Muthgasse 18, A-1190 Vienna, Austria.
| | - Christina Schäffer
- Department of NanoBiotechnology, NanoGlycobiology unit, Universität für Bodenkultur Wien, Muthgasse 11, A-1190 Vienna, Austria.
| |
Collapse
|
8
|
Gannesen AV, Zdorovenko EL, Botchkova EA, Hardouin J, Massier S, Kopitsyn DS, Gorbachevskii MV, Kadykova AA, Shashkov AS, Zhurina MV, Netrusov AI, Knirel YA, Plakunov VK, Feuilloley MGJ. Composition of the Biofilm Matrix of Cutibacterium acnes Acneic Strain RT5. Front Microbiol 2019; 10:1284. [PMID: 31293526 PMCID: PMC6598116 DOI: 10.3389/fmicb.2019.01284] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2019] [Accepted: 05/23/2019] [Indexed: 12/11/2022] Open
Abstract
In skin, Cutibacterium acnes (former Propionibacterium acnes) can behave as an opportunistic pathogen, depending on the strain and environmental conditions. Acneic strains of C. acnes form biofilms inside skin-gland hollows, inducing inflammation and skin disorders. The essential exogenous products of C. acnes accumulate in the extracellular matrix of the biofilm, conferring essential bacterial functions to this structure. However, little is known about the actual composition of the biofilm matrix of C. acnes. Here, we developed a new technique for the extraction of the biofilm matrix of Gram-positive bacteria without the use of chemical or enzymatic digestion, known to be a source of artifacts. Our method is based on the physical separation of the cells and matrix of sonicated biofilms by ultracentrifugation through a CsCl gradient. Biofilms were grown on the surface of cellulose acetate filters, and the biomass was collected without contamination by the growth medium. The biofilm matrix of the acneic C. acnes RT5 strain appears to consist mainly of polysaccharides. The following is the ratio of the main matrix components: 62.6% polysaccharides, 9.6% proteins, 4.0% DNA, and 23.8% other compounds (porphyrins precursors and other). The chemical structure of the major polysaccharide was determined using a nuclear magnetic resonance technique, the formula being →6)-α-D-Galp-(1→4)-β-D-ManpNAc3NAcA-(1→6)-α-D-Glcp-(1→4)-β-D-ManpNAc3NAcA-(1→3)-β-GalpNAc-(1→. We detected 447 proteins in the matrix, of which the most abundant were the chaperonin GroL, the elongation factors EF-Tu and EF-G, several enzymes of glycolysis, and proteins of unknown function. The matrix also contained more than 20 hydrolases of various substrata, pathogenicity factors, and many intracellular proteins and enzymes. We also performed surface-enhanced Raman spectroscopy analysis of the C. acnes RT5 matrix for the first time, providing the surface-enhanced Raman scattering (SERS) profiles of the C. acnes RT5 biofilm matrix and biofilm biomass. The difference between the matrix and biofilm biomass spectra showed successful matrix extraction rather than simply the presence of cell debris after sonication. These data show the complexity of the biofilm matrix composition and should be essential for the development of new anti-C. acnes biofilms and potential antibiofilm drugs.
Collapse
Affiliation(s)
- Andrei V. Gannesen
- Winogradsky Institute of Microbiology, Federal Research Centre «Fundamentals of Biotechnology», Russian Academy of Sciences, Moscow, Russia
| | - Evelina L. Zdorovenko
- Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow, Russia
| | - Ekaterina A. Botchkova
- Winogradsky Institute of Microbiology, Federal Research Centre «Fundamentals of Biotechnology», Russian Academy of Sciences, Moscow, Russia
- Department of Physical and Colloid Chemistry, Gubkin University, Moscow, Russia
| | - Julie Hardouin
- Laboratory of Polymers, Biopolymers, Surfaces UMR 6270 PBS, Rouen University, Rouen, France
| | - Sebastien Massier
- Laboratory of Polymers, Biopolymers, Surfaces UMR 6270 PBS, Rouen University, Rouen, France
| | - Dmitry S. Kopitsyn
- Department of Physical and Colloid Chemistry, Gubkin University, Moscow, Russia
| | | | - Alexandra A. Kadykova
- Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow, Russia
- Higher Chemical College of the Russian Academy of Sciences, Mendeleyev University of Chemical Technology of Russia, Moscow, Russia
| | - Alexander S. Shashkov
- Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow, Russia
| | - Marina V. Zhurina
- Winogradsky Institute of Microbiology, Federal Research Centre «Fundamentals of Biotechnology», Russian Academy of Sciences, Moscow, Russia
| | | | - Yuriy A. Knirel
- Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow, Russia
| | - Vladimir K. Plakunov
- Winogradsky Institute of Microbiology, Federal Research Centre «Fundamentals of Biotechnology», Russian Academy of Sciences, Moscow, Russia
| | - Marc G. J. Feuilloley
- EA4312 Laboratory of Microbiology Signals and Microenvironment, Rouen University, Evreux, France
| |
Collapse
|
9
|
do Prado SBR, Castro-Alves VC, Ferreira GF, Fabi JP. Ingestion of Non-digestible Carbohydrates From Plant-Source Foods and Decreased Risk of Colorectal Cancer: A Review on the Biological Effects and the Mechanisms of Action. Front Nutr 2019; 6:72. [PMID: 31157230 PMCID: PMC6529955 DOI: 10.3389/fnut.2019.00072] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2018] [Accepted: 04/30/2019] [Indexed: 12/13/2022] Open
Abstract
The hypothesis that links the increase in the intake of plant-source foods to a decrease in colorectal cancer (CRC) risk has almost 50 years. Nowadays, systematic reviews and meta-analysis of case-control and cohort studies confirmed the association between dietary patterns and CRC risk, in which the non-digestible carbohydrates (NDC) from plant-source foods are known to play beneficial effects. However, the mechanisms behind the physicochemical properties and biological effects induced by NDC on the decrease of CRC development and progression remain not fully understood. NDC from plant-source foods consist mainly of complex carbohydrates from plant cell wall including pectin and hemicellulose, which vary among foods in structure and in composition, therefore in both physicochemical properties and biological effects. In the present review, we highlighted the mechanisms and described the recent findings showing how these complex NDC from plant-source foods are related to a decrease in CRC risk through induction of both physicochemical effects in the gastrointestinal tract, fermentation-related effects, and direct effects resulting from the interaction between NDC and cellular components including toll-like receptors and galectin-3. Studies support that the definition of the structure-function relationship-especially regarding the fermentation-related effects of NDC, as well as the direct effects of these complex carbohydrates in cells-is crucial for understanding the possible NDC anticancer effects. The dietary recommendations for the intake of NDC are usually quantitative, describing a defined amount of intake per day. However, as NDC from plant-source foods can exert effects that vary widely according to the NDC structure, the dietary recommendations for the intake of NDC plant-source foods are expected to change from a quantitative to a qualitative perspective in the next few years, as occurred for lipid recommendations. Thus, further studies are necessary to define whether specific and well-characterized NDC from plant-source foods induce beneficial effects related to a decrease in CRC risk, thereby improving nutritional recommendations of healthy individuals and CRC patients.
Collapse
Affiliation(s)
- Samira Bernardino Ramos do Prado
- Department of Food Science and Experimental Nutrition, School of Pharmaceutical Sciences, University of São Paulo, São Paulo, Brazil
- Food Research Center (FoRC), CEPID-FAPESP (Research, Innovation and Dissemination Centers, São Paulo Research Foundation), São Paulo, Brazil
| | - Victor Costa Castro-Alves
- Department of Food Science and Experimental Nutrition, School of Pharmaceutical Sciences, University of São Paulo, São Paulo, Brazil
- Food Research Center (FoRC), CEPID-FAPESP (Research, Innovation and Dissemination Centers, São Paulo Research Foundation), São Paulo, Brazil
| | - Gabrielle Fernandez Ferreira
- Department of Food Science and Experimental Nutrition, School of Pharmaceutical Sciences, University of São Paulo, São Paulo, Brazil
| | - João Paulo Fabi
- Department of Food Science and Experimental Nutrition, School of Pharmaceutical Sciences, University of São Paulo, São Paulo, Brazil
- Food Research Center (FoRC), CEPID-FAPESP (Research, Innovation and Dissemination Centers, São Paulo Research Foundation), São Paulo, Brazil
- Food and Nutrition Research Center (NAPAN), University of São Paulo, São Paulo, Brazil
| |
Collapse
|
10
|
Egorova KS, Toukach PV. Glykoinformatik: Brücken zwischen isolierten Inseln im Datenmeer. Angew Chem Int Ed Engl 2018. [DOI: 10.1002/ange.201803576] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Affiliation(s)
- Ksenia S. Egorova
- Zelinsky Institute of Organic ChemistryRussian Academy of Sciences Leninsky Prospect 47 Moscow 119991 Russland
| | - Philip V. Toukach
- Zelinsky Institute of Organic ChemistryRussian Academy of Sciences Leninsky Prospect 47 Moscow 119991 Russland
| |
Collapse
|
11
|
Egorova KS, Toukach PV. Glycoinformatics: Bridging Isolated Islands in the Sea of Data. Angew Chem Int Ed Engl 2018; 57:14986-14990. [DOI: 10.1002/anie.201803576] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2018] [Indexed: 11/07/2022]
Affiliation(s)
- Ksenia S. Egorova
- Zelinsky Institute of Organic ChemistryRussian Academy of Sciences Leninsky Prospect 47 Moscow 119991 Russia
| | - Philip V. Toukach
- Zelinsky Institute of Organic ChemistryRussian Academy of Sciences Leninsky Prospect 47 Moscow 119991 Russia
| |
Collapse
|
12
|
|
13
|
Egorova KS, Kondakova AN, Toukach PV. Carbohydrate Structure Database: tools for statistical analysis of bacterial, plant and fungal glycomes. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2015; 2015:bav073. [PMID: 26337239 PMCID: PMC4559136 DOI: 10.1093/database/bav073] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/05/2015] [Accepted: 07/01/2015] [Indexed: 12/02/2022]
Abstract
Carbohydrates are biological blocks participating in diverse and crucial processes both at cellular and organism levels. They protect individual cells, establish intracellular interactions, take part in the immune reaction and participate in many other processes. Glycosylation is considered as one of the most important modifications of proteins and other biologically active molecules. Still, the data on the enzymatic machinery involved in the carbohydrate synthesis and processing are scattered, and the advance on its study is hindered by the vast bulk of accumulated genetic information not supported by any experimental evidences for functions of proteins that are encoded by these genes. In this article, we present novel instruments for statistical analysis of glycomes in taxa. These tools may be helpful for investigating carbohydrate-related enzymatic activities in various groups of organisms and for comparison of their carbohydrate content. The instruments are developed on the Carbohydrate Structure Database (CSDB) platform and are available freely on the CSDB web-site at http://csdb.glycoscience.ru. Database URL: http://csdb.glycoscience.ru
Collapse
Affiliation(s)
- K S Egorova
- N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Leninskiy prospect 47, 119991 Moscow, Russia
| | - A N Kondakova
- N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Leninskiy prospect 47, 119991 Moscow, Russia
| | - Ph V Toukach
- N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Leninskiy prospect 47, 119991 Moscow, Russia
| |
Collapse
|
14
|
Toukach PV, Egorova KS. Carbohydrate structure database merged from bacterial, archaeal, plant and fungal parts. Nucleic Acids Res 2015; 44:D1229-36. [PMID: 26286194 PMCID: PMC4702937 DOI: 10.1093/nar/gkv840] [Citation(s) in RCA: 134] [Impact Index Per Article: 14.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2015] [Accepted: 08/07/2015] [Indexed: 12/31/2022] Open
Abstract
The Carbohydrate Structure Databases (CSDBs, http://csdb.glycoscience.ru) store structural, bibliographic, taxonomic, NMR spectroscopic, and other data on natural carbohydrates and their derivatives published in the scientific literature. The CSDB project was launched in 2005 for bacterial saccharides (as BCSDB). Currently, it includes two parts, the Bacterial CSDB and the Plant&Fungal CSDB. In March 2015, these databases were merged to the single CSDB. The combined CSDB includes information on bacterial and archaeal glycans and derivatives (the coverage is close to complete), as well as on plant and fungal glycans and glycoconjugates (almost all structures published up to 1998). CSDB is regularly updated via manual expert annotation of original publications. Both newly annotated data and data imported from other databases are manually curated. The CSDB data are exportable in a number of modern formats, such as GlycoRDF. CSDB provides additional services for simulation of (1)H, (13)C and 2D NMR spectra of saccharides, NMR-based structure prediction, glycan-based taxon clustering and other.
Collapse
Affiliation(s)
- Philip V Toukach
- N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow 119991, Russia
| | - Ksenia S Egorova
- N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow 119991, Russia
| |
Collapse
|