1
|
Toukach P. Carbohydrate Structure Database: current state and recent developments. Anal Bioanal Chem 2024:10.1007/s00216-024-05383-w. [PMID: 38914734 DOI: 10.1007/s00216-024-05383-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2024] [Revised: 05/18/2024] [Accepted: 05/28/2024] [Indexed: 06/26/2024]
Abstract
Carbohydrate Structure Database (CSDB) is a curated glycan data collection and a glycoinformatic platform. In this report, its database, analytical, and other components that have appeared for the recent years are reviewed. The major improvements were achieving close-to-full coverage on glycans from microorganisms, launching modules for glycosyltransferases and saccharide conformations, online glycan builder and 3D modeler, NMR simulator, NMR-based structure predictor, and other tools.
Collapse
Affiliation(s)
- Philip Toukach
- N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow, Russia.
- Faculty of Chemistry, National Research University Higher School of Economics, Moscow, Russia.
| |
Collapse
|
2
|
Schnider B, M’Rad Y, el Ahmadie J, de Brevern AG, Imberty A, Lisacek F. HumanLectome, an update of UniLectin for the annotation and prediction of human lectins. Nucleic Acids Res 2024; 52:D1683-D1693. [PMID: 37889052 PMCID: PMC10767822 DOI: 10.1093/nar/gkad905] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2023] [Revised: 10/03/2023] [Accepted: 10/06/2023] [Indexed: 10/28/2023] Open
Abstract
The UniLectin portal (https://unilectin.unige.ch/) was designed in 2019 with the goal of centralising curated and predicted data on carbohydrate-binding proteins known as lectins. UniLectin is also intended as a support for the study of lectomes (full lectin set) of organisms or tissues. The present update describes the inclusion of several new modules and details the latest (https://unilectin.unige.ch/humanLectome/), covering our knowledge of the human lectome and comprising 215 unevenly characterised lectins, particularly in terms of structural information. Each HumanLectome entry is protein-centric and compiles evidence of carbohydrate recognition domain(s), specificity, 3D-structure, tissue-based expression and related genomic data. Other recent improvements regarding interoperability and accessibility are outlined.
Collapse
Affiliation(s)
- Boris Schnider
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, CH-1211 Geneva, Switzerland
- Computer Science Department, University of Geneva, CH-1227 Geneva, Switzerland
| | - Yacine M’Rad
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, CH-1211 Geneva, Switzerland
- Computer Science Department, University of Geneva, CH-1227 Geneva, Switzerland
| | - Jalaa el Ahmadie
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, CH-1211 Geneva, Switzerland
- Computer Science Department, University of Geneva, CH-1227 Geneva, Switzerland
- University Grenoble Alpes, CNRS, CERMAV, F-38000 Grenoble, France
| | - Alexandre G de Brevern
- Université Paris Cité and Université de la Réunion and Université des Antilles, INSERM, BIGR, DSIMB Bioinformatics Team, F-75014 Paris, France
| | - Anne Imberty
- University Grenoble Alpes, CNRS, CERMAV, F-38000 Grenoble, France
| | - Frederique Lisacek
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, CH-1211 Geneva, Switzerland
- Computer Science Department, University of Geneva, CH-1227 Geneva, Switzerland
- Section of Biology, University of Geneva, CH-1205 Geneva, Switzerland
| |
Collapse
|
3
|
Cheng T, Ono T, Shiota M, Yamada I, Aoki-Kinoshita KF, Bolton EE. Bridging glycoinformatics and cheminformatics: integration efforts between GlyCosmos and PubChem. Glycobiology 2023; 33:454-463. [PMID: 37129482 PMCID: PMC10284107 DOI: 10.1093/glycob/cwad028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2022] [Revised: 03/27/2023] [Accepted: 03/28/2023] [Indexed: 05/03/2023] Open
Abstract
The GlyCosmos Glycoscience Portal (https://glycosmos.org) and PubChem (https://pubchem.ncbi.nlm.nih.gov/) are major portals for glycoscience and chemistry, respectively. GlyCosmos is a portal for glycan-related repositories, including GlyTouCan, GlycoPOST, and UniCarb-DR, as well as for glycan-related data resources that have been integrated from a variety of 'omics databases. Glycogenes, glycoproteins, lectins, pathways, and disease information related to glycans are accessible from GlyCosmos. PubChem, on the other hand, is a chemistry-based portal at the National Center for Biotechnology Information. PubChem provides information not only on chemicals, but also genes, proteins, pathways, as well as patents, bioassays, and more, from hundreds of data resources from around the world. In this work, these 2 portals have made substantial efforts to integrate their complementary data to allow users to cross between these 2 domains. In addition to glycan structures, key information, such as glycan-related genes, relevant diseases, glycoproteins, and pathways, was integrated and cross-linked with one another. The interfaces were designed to enable users to easily find, access, download, and reuse data of interest across these resources. Use cases are described illustrating and highlighting the type of content that can be investigated. In total, these integrations provide life science researchers improved awareness and enhanced access to glycan-related information.
Collapse
Affiliation(s)
- Tiejun Cheng
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, United States
| | - Tamiko Ono
- Glycan and Life Systems Integration Center, Soka University, 1-236 Tangi-machi, Hachioji, Tokyo 192-8577, Japan
| | - Masaaki Shiota
- Glycan and Life Systems Integration Center, Soka University, 1-236 Tangi-machi, Hachioji, Tokyo 192-8577, Japan
| | - Issaku Yamada
- Laboratory of Glycoinformatics, The Noguchi Institute, 1-9-7 Kaga, Itabashi, Tokyo 173-0003, Japan
| | - Kiyoko F Aoki-Kinoshita
- Glycan and Life Systems Integration Center, Soka University, 1-236 Tangi-machi, Hachioji, Tokyo 192-8577, Japan
| | - Evan E Bolton
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, United States
| |
Collapse
|
4
|
Abstract
Glycans, carbohydrate molecules in the realm of biology, are present as biomedically important glycoconjugates and a characteristic aspect is that their structures in many instances are branched. In determining the primary structure of a glycan, the sugar components including the absolute configuration and ring form, anomeric configuration, linkage(s), sequence, and substituents should be elucidated. Solution state NMR spectroscopy offers a unique opportunity to resolve all these aspects at atomic resolution. During the last two decades, advancement of both NMR experiments and spectrometer hardware have made it possible to unravel carbohydrate structure more efficiently. These developments applicable to glycans include, inter alia, NMR experiments that reduce spectral overlap, use selective excitations, record tilted projections of multidimensional spectra, acquire spectra by multiple receivers, utilize polarization by fast-pulsing techniques, concatenate pulse-sequence modules to acquire several spectra in a single measurement, acquire pure shift correlated spectra devoid of scalar couplings, employ stable isotope labeling to efficiently obtain homo- and/or heteronuclear correlations, as well as those that rely on dipolar cross-correlated interactions for sequential information. Refined computer programs for NMR spin simulation and chemical shift prediction aid the structural elucidation of glycans, which are notorious for their limited spectral dispersion. Hardware developments include cryogenically cold probes and dynamic nuclear polarization techniques, both resulting in enhanced sensitivity as well as ultrahigh field NMR spectrometers with a 1H NMR resonance frequency higher than 1 GHz, thus improving resolution of resonances. Taken together, the developments have made and will in the future make it possible to elucidate carbohydrate structure in great detail, thereby forming the basis for understanding of how glycans interact with other molecules.
Collapse
Affiliation(s)
- Carolina Fontana
- Departamento
de Química del Litoral, CENUR Litoral Norte, Universidad de la República, Paysandú 60000, Uruguay
| | - Göran Widmalm
- Department
of Organic Chemistry, Arrhenius Laboratory, Stockholm University, S-106 91 Stockholm, Sweden,
| |
Collapse
|
5
|
Harvey DJ. Analysis of carbohydrates and glycoconjugates by matrix-assisted laser desorption/ionization mass spectrometry: An update for 2019-2020. MASS SPECTROMETRY REVIEWS 2022:e21806. [PMID: 36468275 DOI: 10.1002/mas.21806] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
This review is the tenth update of the original article published in 1999 on the application of matrix-assisted laser desorption/ionization (MALDI) mass spectrometry to the analysis of carbohydrates and glycoconjugates and brings coverage of the literature to the end of 2020. Also included are papers that describe methods appropriate to analysis by MALDI, such as sample preparation techniques, even though the ionization method is not MALDI. The review is basically divided into three sections: (1) general aspects such as theory of the MALDI process, matrices, derivatization, MALDI imaging, fragmentation, quantification and the use of arrays. (2) Applications to various structural types such as oligo- and polysaccharides, glycoproteins, glycolipids, glycosides and biopharmaceuticals, and (3) other areas such as medicine, industrial processes and glycan synthesis where MALDI is extensively used. Much of the material relating to applications is presented in tabular form. The reported work shows increasing use of incorporation of new techniques such as ion mobility and the enormous impact that MALDI imaging is having. MALDI, although invented nearly 40 years ago is still an ideal technique for carbohydrate analysis and advancements in the technique and range of applications show little sign of diminishing.
Collapse
Affiliation(s)
- David J Harvey
- Nuffield Department of Medicine, Target Discovery Institute, University of Oxford, Oxford, UK
- Department of Chemistry, University of Oxford, Oxford, Oxfordshire, United Kingdom
| |
Collapse
|
6
|
Abstract
Artificial intelligence (AI) methods have been and are now being increasingly integrated in prediction software implemented in bioinformatics and its glycoscience branch known as glycoinformatics. AI techniques have evolved in the past decades, and their applications in glycoscience are not yet widespread. This limited use is partly explained by the peculiarities of glyco-data that are notoriously hard to produce and analyze. Nonetheless, as time goes, the accumulation of glycomics, glycoproteomics, and glycan-binding data has reached a point where even the most recent deep learning methods can provide predictors with good performance. We discuss the historical development of the application of various AI methods in the broader field of glycoinformatics. A particular focus is placed on shining a light on challenges in glyco-data handling, contextualized by lessons learnt from related disciplines. Ending on the discussion of state-of-the-art deep learning approaches in glycoinformatics, we also envision the future of glycoinformatics, including development that need to occur in order to truly unleash the capabilities of glycoscience in the systems biology era.
Collapse
Affiliation(s)
- Daniel Bojar
- Department
of Chemistry and Molecular Biology, University
of Gothenburg, Gothenburg 41390, Sweden
- Wallenberg
Centre for Molecular and Translational Medicine, University of Gothenburg, Gothenburg 41390, Sweden
| | - Frederique Lisacek
- Proteome
Informatics Group, Swiss Institute of Bioinformatics, CH-1227 Geneva, Switzerland
- Computer
Science Department & Section of Biology, University of Geneva, route de Drize 7, CH-1227, Geneva, Switzerland
| |
Collapse
|
7
|
Abstract
Glycoscience assembles all the scientific disciplines involved in studying various molecules and macromolecules containing carbohydrates and complex glycans. Such an ensemble involves one of the most extensive sets of molecules in quantity and occurrence since they occur in all microorganisms and higher organisms. Once the compositions and sequences of these molecules are established, the determination of their three-dimensional structural and dynamical features is a step toward understanding the molecular basis underlying their properties and functions. The range of the relevant computational methods capable of addressing such issues is anchored by the specificity of stereoelectronic effects from quantum chemistry to mesoscale modeling throughout molecular dynamics and mechanics and coarse-grained and docking calculations. The Review leads the reader through the detailed presentations of the applications of computational modeling. The illustrations cover carbohydrate-carbohydrate interactions, glycolipids, and N- and O-linked glycans, emphasizing their role in SARS-CoV-2. The presentation continues with the structure of polysaccharides in solution and solid-state and lipopolysaccharides in membranes. The full range of protein-carbohydrate interactions is presented, as exemplified by carbohydrate-active enzymes, transporters, lectins, antibodies, and glycosaminoglycan binding proteins. A final section features a list of 150 tools and databases to help address the many issues of structural glycobioinformatics.
Collapse
Affiliation(s)
- Serge Perez
- Centre de Recherche sur les Macromolecules Vegetales, University of Grenoble-Alpes, Centre National de la Recherche Scientifique, Grenoble F-38041, France
| | - Olga Makshakova
- FRC Kazan Scientific Center of Russian Academy of Sciences, Kazan Institute of Biochemistry and Biophysics, Kazan 420111, Russia
| |
Collapse
|
8
|
Mariethoz J, Alocci D, Karlsson NG, Packer NH, Lisacek F. An Interactive View of Glycosylation. Methods Mol Biol 2022; 2370:41-65. [PMID: 34611864 DOI: 10.1007/978-1-0716-1685-7_3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
The present chapter focuses on the interactive and explorative aspects of bioinformatics resources that have been recently released in glycobiology. The comparative analysis of data in a field where knowledge is scattered, incomplete, and disconnected from main biology requires efficient visualization, integration, and interactive tools that are currently only partially implemented. This overview highlights converging efforts toward building a consistent picture of protein glycosylation.
Collapse
Affiliation(s)
- Julien Mariethoz
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, Geneva, Switzerland
- Computer Science Department, University of Geneva, Geneva, Switzerland
| | - Davide Alocci
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, Geneva, Switzerland
| | - Niclas G Karlsson
- Department of Medical Biochemistry and Cell Biology, Institute of Biomedicine, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden
| | - Nicolle H Packer
- Department of Molecular Sciences and ARC Centre of Excellence for Nanoscale Biophotonics, Macquarie University, Sydney, NSW, Australia
- Institute for Glycomics, Griffith University, Gold Coast, QLD, Australia
| | - Frédérique Lisacek
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, University of Geneva, Geneva, Switzerland.
| |
Collapse
|
9
|
Sehnal D, Bittrich S, Deshpande M, Svobodová R, Berka K, Bazgier V, Velankar S, Burley SK, Koča J, Rose AS. Mol* Viewer: modern web app for 3D visualization and analysis of large biomolecular structures. Nucleic Acids Res 2021; 49:W431-W437. [PMID: 33956157 PMCID: PMC8262734 DOI: 10.1093/nar/gkab314] [Citation(s) in RCA: 472] [Impact Index Per Article: 157.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2021] [Revised: 04/12/2021] [Accepted: 04/26/2021] [Indexed: 12/31/2022] Open
Abstract
Large biomolecular structures are being determined experimentally on a daily basis using established techniques such as crystallography and electron microscopy. In addition, emerging integrative or hybrid methods (I/HM) are producing structural models of huge macromolecular machines and assemblies, sometimes containing 100s of millions of non-hydrogen atoms. The performance requirements for visualization and analysis tools delivering these data are increasing rapidly. Significant progress in developing online, web-native three-dimensional (3D) visualization tools was previously accomplished with the introduction of the LiteMol suite and NGL Viewers. Thereafter, Mol* development was jointly initiated by PDBe and RCSB PDB to combine and build on the strengths of LiteMol (developed by PDBe) and NGL (developed by RCSB PDB). The web-native Mol* Viewer enables 3D visualization and streaming of macromolecular coordinate and experimental data, together with capabilities for displaying structure quality, functional, or biological context annotations. High-performance graphics and data management allows users to simultaneously visualise up to hundreds of (superimposed) protein structures, stream molecular dynamics simulation trajectories, render cell-level models, or display huge I/HM structures. It is the primary 3D structure viewer used by PDBe and RCSB PDB. It can be easily integrated into third-party services. Mol* Viewer is open source and freely available at https://molstar.org/.
Collapse
Affiliation(s)
- David Sehnal
- CEITEC - Central European Institute of Technology, Masaryk University, Brno 625 00, Czech Republic.,National Centre for Biomolecular Research, Faculty of Science, Masaryk University, Brno 602 00, Czech Republic.,Protein Data Bank in Europe (PDBe), European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Sebastian Bittrich
- Research Collaboratory for Structural Bioinformatics (RCSB), San Diego Supercomputer Center, University of California San Diego, San Diego, CA 92093-0743, USA
| | - Mandar Deshpande
- Protein Data Bank in Europe (PDBe), European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Radka Svobodová
- CEITEC - Central European Institute of Technology, Masaryk University, Brno 625 00, Czech Republic.,National Centre for Biomolecular Research, Faculty of Science, Masaryk University, Brno 602 00, Czech Republic
| | - Karel Berka
- Department of Physical Chemistry, Faculty of Science, Palacký University Olomouc, Olomouc 771 46, Czech Republic
| | - Václav Bazgier
- Department of Physical Chemistry, Faculty of Science, Palacký University Olomouc, Olomouc 771 46, Czech Republic
| | - Sameer Velankar
- Protein Data Bank in Europe (PDBe), European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Stephen K Burley
- Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB), Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ 08854-8076, USA.,Rutgers Cancer Institute of New Jersey, Rutgers, The State University of New Jersey, New Brunswick, NJ 08903-2681, USA.,Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB), San Diego Supercomputer Center and Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, San Diego, CA 92093-0654, USA
| | - Jaroslav Koča
- CEITEC - Central European Institute of Technology, Masaryk University, Brno 625 00, Czech Republic.,National Centre for Biomolecular Research, Faculty of Science, Masaryk University, Brno 602 00, Czech Republic
| | - Alexander S Rose
- Research Collaboratory for Structural Bioinformatics (RCSB), San Diego Supercomputer Center, University of California San Diego, San Diego, CA 92093-0743, USA
| |
Collapse
|
10
|
Lisacek F, Alagesan K, Hayes C, Lippold S, de Haan N. Bioinformatics in Immunoglobulin Glycosylation Analysis. EXPERIENTIA SUPPLEMENTUM (2012) 2021; 112:205-233. [PMID: 34687011 DOI: 10.1007/978-3-030-76912-3_6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Analytical methods developed for studying immunoglobulin glycosylation rely heavily on software tailored for this purpose. Many of these tools are now used in high-throughput settings, especially for the glycomic characterization of IgG. A collection of these tools, and the databases they rely on, are presented in this chapter. Specific applications are detailed in examples of immunoglobulin glycomics and glycoproteomics data processing workflows. The results obtained in the glycoproteomics workflow are emphasized with the use of dedicated visualizing tools. These tools enable the user to highlight glycan properties and their differential expression.
Collapse
Affiliation(s)
- Frédérique Lisacek
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, Geneva, Switzerland.
- Computer Science Department, University of Geneva, Geneva, Switzerland.
- Section of Biology, University of Geneva, Geneva, Switzerland.
| | | | - Catherine Hayes
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, Geneva, Switzerland
- Computer Science Department, University of Geneva, Geneva, Switzerland
| | - Steffen Lippold
- Center for Proteomics and Metabolomics, Leiden University Medical Center, Leiden, The Netherlands
| | - Noortje de Haan
- Copenhagen Center for Glycomics, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
11
|
Sehnal D, Svobodová R, Berka K, Rose AS, Burley SK, Velankar S, Koča J. High-performance macromolecular data delivery and visualization for the web. Acta Crystallogr D Struct Biol 2020; 76:1167-1173. [PMID: 33263322 PMCID: PMC7709201 DOI: 10.1107/s2059798320014515] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2020] [Accepted: 11/01/2020] [Indexed: 11/11/2022] Open
Abstract
Biomacromolecular structural data make up a vital and crucial scientific resource that has grown not only in terms of its amount but also in its size and complexity. Furthermore, these data are accompanied by large and increasing amounts of experimental data. Additionally, the macromolecular data are enriched with value-added annotations describing their biological, physicochemical and structural properties. Today, the scientific community requires fast and fully interactive web visualization to exploit this complex structural information. This article provides a survey of the available cutting-edge web services that address this challenge. Specifically, it focuses on data-delivery problems, discusses the visualization of a single structure, including experimental data and annotations, and concludes with a focus on the results of molecular-dynamics simulations and the visualization of structural ensembles.
Collapse
Affiliation(s)
- David Sehnal
- CEITEC – Central European Institute of Technology, Masaryk University, Kamenice 753/5, 625 00 Brno, Czech Republic
- National Centre for Biomolecular Research, Faculty of Science, Masaryk University, Kamenice 753/5, 625 00 Brno, Czech Republic
- Protein Data Bank in Europe (PDBe), European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL–EBI), Wellcome Genome Campus, Hinxton CB10 1SD, United Kingdom
| | - Radka Svobodová
- CEITEC – Central European Institute of Technology, Masaryk University, Kamenice 753/5, 625 00 Brno, Czech Republic
- National Centre for Biomolecular Research, Faculty of Science, Masaryk University, Kamenice 753/5, 625 00 Brno, Czech Republic
| | - Karel Berka
- Regional Centre of Advanced Technologies and Materials, Department of Physical Chemistry, Faculty of Science, Palacký University Olomouc, Šlechtitelů 241/27, 779 00 Olomouc, Czech Republic
| | - Alexander S. Rose
- Research Collaboratory for Structural Bioinformatics (RCSB), San Diego Supercomputer Center, University of California San Diego, 9500 Gilman Drive, La Jolla, San Diego, CA 92093-0743, USA
| | - Stephen K. Burley
- RCSB Protein Data Bank, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, 174 Frelinghuysen Road, Piscataway, NJ 08854-8076, USA
- Cancer Institute of New Jersey, Rutgers, The State University of New Jersey, 195 Little Albany Street, New Brunswick, NJ 08903-2681, USA
- RCSB Protein Data Bank, San Diego Supercomputer Center and Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, 9500 Gilman Drive, La Jolla, CA 92093-0654, USA
| | - Sameer Velankar
- Protein Data Bank in Europe (PDBe), European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL–EBI), Wellcome Genome Campus, Hinxton CB10 1SD, United Kingdom
| | - Jaroslav Koča
- CEITEC – Central European Institute of Technology, Masaryk University, Kamenice 753/5, 625 00 Brno, Czech Republic
- National Centre for Biomolecular Research, Faculty of Science, Masaryk University, Kamenice 753/5, 625 00 Brno, Czech Republic
| |
Collapse
|
12
|
Scherbinina SI, Toukach PV. Three-Dimensional Structures of Carbohydrates and Where to Find Them. Int J Mol Sci 2020; 21:E7702. [PMID: 33081008 PMCID: PMC7593929 DOI: 10.3390/ijms21207702] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2020] [Revised: 10/15/2020] [Accepted: 10/16/2020] [Indexed: 02/06/2023] Open
Abstract
Analysis and systematization of accumulated data on carbohydrate structural diversity is a subject of great interest for structural glycobiology. Despite being a challenging task, development of computational methods for efficient treatment and management of spatial (3D) structural features of carbohydrates breaks new ground in modern glycoscience. This review is dedicated to approaches of chemo- and glyco-informatics towards 3D structural data generation, deposition and processing in regard to carbohydrates and their derivatives. Databases, molecular modeling and experimental data validation services, and structure visualization facilities developed for last five years are reviewed.
Collapse
Affiliation(s)
- Sofya I. Scherbinina
- N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Science, Leninsky prospect 47, 119991 Moscow, Russia
- Higher Chemical College, D. Mendeleev University of Chemical Technology of Russia, Miusskaya Square 9, 125047 Moscow, Russia
| | - Philip V. Toukach
- N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Science, Leninsky prospect 47, 119991 Moscow, Russia
| |
Collapse
|
13
|
Lal K, Bermeo R, Perez S. Computational tools for drawing, building and displaying carbohydrates: a visual guide. Beilstein J Org Chem 2020; 16:2448-2468. [PMID: 33082879 PMCID: PMC7537382 DOI: 10.3762/bjoc.16.199] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2020] [Accepted: 09/17/2020] [Indexed: 01/08/2023] Open
Abstract
Drawing and visualisation of molecular structures are some of the most common tasks carried out in structural glycobiology, typically using various software. In this perspective article, we outline developments in the computational tools for the sketching, visualisation and modelling of glycans. The article also provides details on the standard representation of glycans, and glycoconjugates, which helps the communication of structure details within the scientific community. We highlight the comparative analysis of the available tools which could help researchers to perform various tasks related to structure representation and model building of glycans. These tools can be useful for glycobiologists or any researcher looking for a ready to use, simple program for the sketching or building of glycans.
Collapse
Affiliation(s)
- Kanhaya Lal
- Univ. Grenoble Alpes, CNRS, CERMAV, 38000 Grenoble, France
- Dipartimento di Chimica, Università Degli Studi di Milano, via Golgi 19, I-20133, Italy
| | - Rafael Bermeo
- Univ. Grenoble Alpes, CNRS, CERMAV, 38000 Grenoble, France
- Dipartimento di Chimica, Università Degli Studi di Milano, via Golgi 19, I-20133, Italy
| | - Serge Perez
- Univ. Grenoble Alpes, CNRS, CERMAV, 38000 Grenoble, France
| |
Collapse
|
14
|
Harvey DJ. NEGATIVE ION MASS SPECTROMETRY FOR THE ANALYSIS OF N-LINKED GLYCANS. MASS SPECTROMETRY REVIEWS 2020; 39:586-679. [PMID: 32329121 DOI: 10.1002/mas.21622] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/27/2019] [Revised: 12/13/2019] [Accepted: 12/22/2019] [Indexed: 05/03/2023]
Abstract
N-glycans from glycoproteins are complex, branched structures whose structural determination presents many analytical problems. Mass spectrometry, usually conducted in positive ion mode, often requires extensive sample manipulation, usually by derivatization such as permethylation, to provide the necessary structure-revealing fragment ions. The newer but, so far, lesser used negative ion techniques, on the contrary, provide a wealth of structural information not present in positive ion spectra that greatly simplify the analysis of these compounds and can usually be conducted without the need for derivatization. This review describes the use of negative ion mass spectrometry for the structural analysis of N-linked glycans and emphasises the many advantages that can be gained by this mode of operation. Biosynthesis and structures of the compounds are described followed by methods for release of the glycans from the protein. Methods for ionization are discussed with emphasis on matrix-assisted laser desorption/ionization (MALDI) and methods for producing negative ions from neutral compounds. Acidic glycans naturally give deprotonated species under most ionization conditions. Fragmentation of negative ions is discussed next with particular reference to those ions that are diagnostic for specific features such as the branching topology of the glycans and substitution positions of moieties such as fucose and sulfate, features that are often difficult to identify easily by conventional techniques such as positive ion fragmentation and exoglycosidase digestions. The advantages of negative over positive ions for this structural work are emphasised with an example of a series of glycans where all other methods failed to produce a structure. Fragmentation of derivatized glycans is discussed next, both with respect to derivatives at the reducing terminus of the molecules, and to methods for neutralization of the acidic groups on sialic acids to both stabilize them for MALDI analysis and to produce the diagnostic fragments seen with the neutral glycans. The use of ion mobility, combined with conventional mass spectrometry is described with emphasis on its use to extract clean glycan spectra both before and after fragmentation, to separate isomers and its use to extract additional information from separated fragment ions. A section on applications follows with examples of the identification of novel structures from lower organisms and tables listing the use of negative ions for structural identification of specific glycoproteins, glycans from viruses and uses in the biopharmaceutical industry and in medicine. The review concludes with a summary of the advantages and disadvantages of the technique. © 2020 John Wiley & Sons Ltd. Mass Spec Rev.
Collapse
Affiliation(s)
- David J Harvey
- Nuffield Department of Medicine, Target Discovery Institute, Roosevelt Drive, Oxford, OX3 7FZ, United Kingdom
- Centre for Biological Sciences, Faculty of Natural and Environmental Sciences, University of Southampton, Life Sciences Building 85, Highfield Campus, Southampton, SO17 1BJ, United Kingdom
| |
Collapse
|
15
|
Atanasova M, Bagdonas H, Agirre J. Structural glycobiology in the age of electron cryo-microscopy. Curr Opin Struct Biol 2020; 62:70-78. [DOI: 10.1016/j.sbi.2019.12.003] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2019] [Revised: 11/20/2019] [Accepted: 12/02/2019] [Indexed: 01/05/2023]
|
16
|
Neelamegham S, Aoki-Kinoshita K, Bolton E, Frank M, Lisacek F, Lütteke T, O'Boyle N, Packer NH, Stanley P, Toukach P, Varki A, Woods RJ. Updates to the Symbol Nomenclature for Glycans guidelines. Glycobiology 2020; 29:620-624. [PMID: 31184695 DOI: 10.1093/glycob/cwz045] [Citation(s) in RCA: 257] [Impact Index Per Article: 64.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2019] [Revised: 05/15/2019] [Accepted: 06/06/2019] [Indexed: 11/14/2022] Open
Abstract
The Symbol Nomenclature for Glycans (SNFG) is a community-curated standard for the depiction of monosaccharides and complex glycans using various colored-coded, geometric shapes, along with defined text additions. It is hosted by the National Center for Biotechnology Information (NCBI) at the NCBI-Glycans Page (www.ncbi.nlm.nih.gov/glycans/snfg.html). Several changes have been made to the SNFG page in the past year to update the rules for depicting glycans using the SNFG, to include more examples of use, particularly for non-mammalian organisms, and to provide guidelines for the depiction of ambiguous glycan structures. This Glycoforum article summarizes these recent changes.
Collapse
Affiliation(s)
- Sriram Neelamegham
- Department of Chemical & Biological Engineering and Medicine, State University of New York, 906 Furnas Hall, Buffalo, NY 14260, USA
| | - Kiyoko Aoki-Kinoshita
- Glycan & Life System Integration Center (GaLSIC), Faculty of Science and Engineering, Soka University, 1-236 Tangi-machi, Hachioji, Tokyo 192-8577, Japan
| | - Evan Bolton
- National Library of Medicine, 8600 Rockville Pike, Bldg. 38A, Room 8S810, Bethesda, MD 20896, USA
| | - Martin Frank
- Biognos AB, Generatorsgatan 1 / Box 8963, 402 74 Göteborg, Sweden
| | - Frederique Lisacek
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, Computer Science Department, University of Geneva, route de Drize 7, CH - 1227 Geneva Switzerland, and also Section of Biology, University of Geneva, Geneva, Switzerland
| | - Thomas Lütteke
- GIP GmbH, Strahlenberger Str. 112, 63067 Offenbach, Germany
| | - Noel O'Boyle
- NextMove Software, Innovation Centre, Cambridge Science Park, Milton Road, Cambridge, CB4 0EY, UK
| | - Nicolle H Packer
- Department of Molecular Sciences, Faculty of Science & Engineering, Rm 307, Building E8C, Macquarie University, Sydney, NSW 2109, Australia
| | - Pamela Stanley
- Department of Cell Biology, Albert Einstein College of Medicine, 1300 Morris Park Ave, New York, NY, 10461, USA
| | - Philip Toukach
- Laboratory of Carbohydrate Chemistry, Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences. 119991 Moscow, Leninsky prospect 47, Russia
| | - Ajit Varki
- Glycobiology Research and Training Center, University of California, San Diego, La Jolla, CA 92093, USA
| | - Robert J Woods
- Complex Carbohydrate Research Center, University of Georgia, 315 Riverbend Rd, Athens, GA, 30602, USA
| | | |
Collapse
|
17
|
Armstrong DR, Berrisford JM, Conroy MJ, Gutmanas A, Anyango S, Choudhary P, Clark AR, Dana JM, Deshpande M, Dunlop R, Gane P, Gáborová R, Gupta D, Haslam P, Koča J, Mak L, Mir S, Mukhopadhyay A, Nadzirin N, Nair S, Paysan-Lafosse T, Pravda L, Sehnal D, Salih O, Smart O, Tolchard J, Varadi M, Svobodova-Vařeková R, Zaki H, Kleywegt GJ, Velankar S. PDBe: improved findability of macromolecular structure data in the PDB. Nucleic Acids Res 2020; 48:D335-D343. [PMID: 31691821 PMCID: PMC7145656 DOI: 10.1093/nar/gkz990] [Citation(s) in RCA: 58] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2019] [Revised: 10/11/2019] [Accepted: 10/25/2019] [Indexed: 11/23/2022] Open
Abstract
The Protein Data Bank in Europe (PDBe), a founding member of the Worldwide Protein Data Bank (wwPDB), actively participates in the deposition, curation, validation, archiving and dissemination of macromolecular structure data. PDBe supports diverse research communities in their use of macromolecular structures by enriching the PDB data and by providing advanced tools and services for effective data access, visualization and analysis. This paper details the enrichment of data at PDBe, including mapping of RNA structures to Rfam, and identification of molecules that act as cofactors. PDBe has developed an advanced search facility with ∼100 data categories and sequence searches. New features have been included in the LiteMol viewer at PDBe, with updated visualization of carbohydrates and nucleic acids. Small molecules are now mapped more extensively to external databases and their visual representation has been enhanced. These advances help users to more easily find and interpret macromolecular structure data in order to solve scientific problems.
Collapse
Affiliation(s)
- David R Armstrong
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - John M Berrisford
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Matthew J Conroy
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Aleksandras Gutmanas
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Stephen Anyango
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Preeti Choudhary
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Alice R Clark
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Jose M Dana
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Mandar Deshpande
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Roisin Dunlop
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Paul Gane
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Romana Gáborová
- CEITEC - Central European Institute of Technology, Masaryk University Brno, Kamenice 5, 625 00 Brno-Bohunice, Czech Republic
| | - Deepti Gupta
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Pauline Haslam
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Jaroslav Koča
- CEITEC - Central European Institute of Technology, Masaryk University Brno, Kamenice 5, 625 00 Brno-Bohunice, Czech Republic
| | - Lora Mak
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Saqib Mir
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Abhik Mukhopadhyay
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Nurul Nadzirin
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Sreenath Nair
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Typhaine Paysan-Lafosse
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
- InterPro, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Lukas Pravda
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - David Sehnal
- CEITEC - Central European Institute of Technology, Masaryk University Brno, Kamenice 5, 625 00 Brno-Bohunice, Czech Republic
| | - Osman Salih
- Electron Microscopy Data Bank (EMDB), European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Oliver Smart
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - James Tolchard
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Mihaly Varadi
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Radka Svobodova-Vařeková
- CEITEC - Central European Institute of Technology, Masaryk University Brno, Kamenice 5, 625 00 Brno-Bohunice, Czech Republic
| | - Hossam Zaki
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Gerard J Kleywegt
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
- Electron Microscopy Data Bank (EMDB), European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Sameer Velankar
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| |
Collapse
|
18
|
A Bioinformatics View of Glycan⁻Virus Interactions. Viruses 2019; 11:v11040374. [PMID: 31018588 PMCID: PMC6521074 DOI: 10.3390/v11040374] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2019] [Revised: 04/05/2019] [Accepted: 04/15/2019] [Indexed: 02/06/2023] Open
Abstract
Evidence of the mediation of glycan molecules in the interaction between viruses and their hosts is accumulating and is now partially reflected in several online databases. Bioinformatics provides convenient and efficient means of searching, visualizing, comparing, and sometimes predicting, interactions in numerous and diverse molecular biology applications related to the -omics fields. As viromics is gaining momentum, bioinformatics support is increasingly needed. We propose a survey of the current resources for searching, visualizing, comparing, and possibly predicting host–virus interactions that integrate the presence and role of glycans. To the best of our knowledge, we have mapped the specialized and general-purpose databases with the appropriate focus. With an illustration of their potential usage, we also discuss the strong and weak points of the current bioinformatics landscape in the context of understanding viral infection and the immune response to it.
Collapse
|
19
|
Alocci D, Mariethoz J, Gastaldello A, Gasteiger E, Karlsson NG, Kolarich D, Packer NH, Lisacek F. GlyConnect: Glycoproteomics Goes Visual, Interactive, and Analytical. J Proteome Res 2018; 18:664-677. [DOI: 10.1021/acs.jproteome.8b00766] [Citation(s) in RCA: 62] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Affiliation(s)
- Davide Alocci
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, Rue Michel-Servet 1, CH-1211 Geneva, Switzerland
- Computer Science Department, University of Geneva, CH-1227 Geneva, Switzerland
| | - Julien Mariethoz
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, Rue Michel-Servet 1, CH-1211 Geneva, Switzerland
- Computer Science Department, University of Geneva, CH-1227 Geneva, Switzerland
| | - Alessandra Gastaldello
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, Rue Michel-Servet 1, CH-1211 Geneva, Switzerland
- Computer Science Department, University of Geneva, CH-1227 Geneva, Switzerland
| | - Elisabeth Gasteiger
- Swiss-Prot Group, SIB Swiss Institute of Bioinformatics, CH-1211 Geneva, Switzerland
| | - Niclas G. Karlsson
- Department of Medical Biochemistry and Cell Biology, Institute of Biomedicine, University of Gothenburg, 40530 Gothenburg, Sweden
| | - Daniel Kolarich
- Institute for Glycomics, Griffith University, Southport, Queensland 4215, Australia
- ARC Centre for Nanoscale BioPhotonics, Macquarie University and Griffith University, Sydney, New South Wales 2109, Australia
| | - Nicolle H. Packer
- Institute for Glycomics, Griffith University, Southport, Queensland 4215, Australia
- ARC Centre for Nanoscale BioPhotonics, Macquarie University and Griffith University, Sydney, New South Wales 2109, Australia
- Department of Molecular Sciences, Macquarie University, Sydney, New South Wales 2109, Australia
| | - Frédérique Lisacek
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, Rue Michel-Servet 1, CH-1211 Geneva, Switzerland
- Computer Science Department, University of Geneva, CH-1227 Geneva, Switzerland
- Section of Biology, University of Geneva, CH-1211 Geneva, Switzerland
| |
Collapse
|