26
|
Ranzinger R, Aoki-Kinoshita KF, Campbell MP, Kawano S, Lütteke T, Okuda S, Shinmachi D, Shikanai T, Sawaki H, Toukach P, Matsubara M, Yamada I, Narimatsu H. GlycoRDF: an ontology to standardize glycomics data in RDF. Bioinformatics 2015; 31:919-25. [PMID: 25388145 PMCID: PMC4380026 DOI: 10.1093/bioinformatics/btu732] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2014] [Revised: 10/12/2014] [Accepted: 10/28/2014] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Over the last decades several glycomics-based bioinformatics resources and databases have been created and released to the public. Unfortunately, there is no common standard in the representation of the stored information or a common machine-readable interface allowing bioinformatics groups to easily extract and cross-reference the stored information. RESULTS An international group of bioinformatics experts in the field of glycomics have worked together to create a standard Resource Description Framework (RDF) representation for glycomics data, focused on glycan sequences and related biological source, publications and experimental data. This RDF standard is defined by the GlycoRDF ontology and will be used by database providers to generate common machine-readable exports of the data stored in their databases. AVAILABILITY AND IMPLEMENTATION The ontology, supporting documentation and source code used by database providers to generate standardized RDF are available online (http://www.glycoinfo.org/GlycoRDF/).
Collapse
|
27
|
Campbell MP, Royle L, Rudd PM. GlycoBase and autoGU: resources for interpreting HPLC-glycan data. Methods Mol Biol 2015; 1273:17-28. [PMID: 25753700 DOI: 10.1007/978-1-4939-2343-4_2] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
The biological relevance of protein glycosylation has made glycomics, the comprehensive study to identify all glycans in an organism, indispensable in many research fields. Determining the structure and functional relationship of glycoproteins requires the comprehensive characterization of glycan structures by a range of analytical methods. High performance liquid chromatography (HPLC) is a well-established technology commonly used for the complete structural elucidation of N- and O-linked glycans; however, the analysis of data is a major bottleneck and robust bioinformatic solutions are required. This chapter describes the availability of databases and tools, GlycoBase and autoGU developed in conjunction with the EUROCarbDB initiative, to assist the interpretation of HPLC-glycan data collections.
Collapse
|
28
|
Gotz L, Abrahams JL, Mariethoz J, Rudd PM, Karlsson NG, Packer NH, Campbell MP, Lisacek F. GlycoDigest: a tool for the targeted use of exoglycosidase digestions in glycan structure determination. Bioinformatics 2014; 30:3131-3. [PMID: 25015990 DOI: 10.1093/bioinformatics/btu425] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
UNLABELLED Sequencing oligosaccharides by exoglycosidases, either sequentially or in an array format, is a powerful tool to unambiguously determine the structure of complex N- and O-link glycans. Here, we introduce GlycoDigest, a tool that simulates exoglycosidase digestion, based on controlled rules acquired from expert knowledge and experimental evidence available in GlycoBase. The tool allows the targeted design of glycosidase enzyme mixtures by allowing researchers to model the action of exoglycosidases, thereby validating and improving the efficiency and accuracy of glycan analysis. AVAILABILITY AND IMPLEMENTATION http://www.glycodigest.org.
Collapse
|
29
|
York WS, Agravat S, Aoki-Kinoshita KF, McBride R, Campbell MP, Costello CE, Dell A, Feizi T, Haslam SM, Karlsson N, Khoo KH, Kolarich D, Liu Y, Novotny M, Packer NH, Paulson JC, Rapp E, Ranzinger R, Rudd PM, Smith DF, Struwe WB, Tiemeyer M, Wells L, Zaia J, Kettner C. MIRAGE: the minimum information required for a glycomics experiment. Glycobiology 2014; 24:402-6. [PMID: 24653214 PMCID: PMC3976285 DOI: 10.1093/glycob/cwu018] [Citation(s) in RCA: 89] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022] Open
Abstract
The MIRAGE (minimum information required for a glycomics experiment) initiative was founded in Seattle, WA, in November 2011 in order to develop guidelines for reporting the qualitative and quantitative results obtained by diverse types of glycomics analyses, including the conditions and techniques that were applied to prepare the glycans for analysis and generate the primary data along with the tools and parameters that were used to process and annotate this data. These guidelines must address a broad range of issues, as glycomics data are inherently complex and are generated using diverse methods, including mass spectrometry (MS), chromatography, glycan array-binding assays, nuclear magnetic resonance (NMR) and other rapidly developing technologies. The acceptance of these guidelines by scientists conducting research on biological systems in which glycans have a significant role will facilitate the evaluation and reproduction of glycomics experiments and data that is reported in scientific journals and uploaded to glycomics databases. As a first step, MIRAGE guidelines for glycan analysis by MS have been recently published (Kolarich D, Rapp E, Struwe WB, Haslam SM, Zaia J., et al. 2013. The minimum information required for a glycomics experiment (MIRAGE) project – Improving the standards for reporting mass spectrometry-based glycoanalytic data. Mol. Cell Proteomics. 12:991–995), allowing them to be implemented and evaluated in the context of real-world glycobiology research. In this paper, we set out the historical context, organization structure and overarching objectives of the MIRAGE initiative.
Collapse
|
30
|
Katayama T, Wilkinson MD, Aoki-Kinoshita KF, Kawashima S, Yamamoto Y, Yamaguchi A, Okamoto S, Kawano S, Kim JD, Wang Y, Wu H, Kano Y, Ono H, Bono H, Kocbek S, Aerts J, Akune Y, Antezana E, Arakawa K, Aranda B, Baran J, Bolleman J, Bonnal RJ, Buttigieg PL, Campbell MP, Chen YA, Chiba H, Cock PJ, Cohen KB, Constantin A, Duck G, Dumontier M, Fujisawa T, Fujiwara T, Goto N, Hoehndorf R, Igarashi Y, Itaya H, Ito M, Iwasaki W, Kalaš M, Katoda T, Kim T, Kokubu A, Komiyama Y, Kotera M, Laibe C, Lapp H, Lütteke T, Marshall MS, Mori T, Mori H, Morita M, Murakami K, Nakao M, Narimatsu H, Nishide H, Nishimura Y, Nystrom-Persson J, Ogishima S, Okamura Y, Okuda S, Oshita K, Packer NH, Prins P, Ranzinger R, Rocca-Serra P, Sansone S, Sawaki H, Shin SH, Splendiani A, Strozzi F, Tadaka S, Toukach P, Uchiyama I, Umezaki M, Vos R, Whetzel PL, Yamada I, Yamasaki C, Yamashita R, York WS, Zmasek CM, Kawamoto S, Takagi T. BioHackathon series in 2011 and 2012: penetration of ontology and linked data in life science domains. J Biomed Semantics 2014; 5:5. [PMID: 24495517 PMCID: PMC3978116 DOI: 10.1186/2041-1480-5-5] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2013] [Accepted: 11/26/2013] [Indexed: 01/24/2023] Open
Abstract
The application of semantic technologies to the integration of biological data and the interoperability of bioinformatics analysis and visualization tools has been the common theme of a series of annual BioHackathons hosted in Japan for the past five years. Here we provide a review of the activities and outcomes from the BioHackathons held in 2011 in Kyoto and 2012 in Toyama. In order to efficiently implement semantic technologies in the life sciences, participants formed various sub-groups and worked on the following topics: Resource Description Framework (RDF) models for specific domains, text mining of the literature, ontology development, essential metadata for biological databases, platforms to enable efficient Semantic Web technology development and interoperability, and the development of applications for Semantic Web data. In this review, we briefly introduce the themes covered by these sub-groups. The observations made, conclusions drawn, and software development projects that emerged from these activities are discussed.
Collapse
|
31
|
Campbell MP, Ranzinger R, Lütteke T, Mariethoz J, Hayes CA, Zhang J, Akune Y, Aoki-Kinoshita KF, Damerell D, Carta G, York WS, Haslam SM, Narimatsu H, Rudd PM, Karlsson NG, Packer NH, Lisacek F. Toolboxes for a standardised and systematic study of glycans. BMC Bioinformatics 2014; 15 Suppl 1:S9. [PMID: 24564482 PMCID: PMC4016020 DOI: 10.1186/1471-2105-15-s1-s9] [Citation(s) in RCA: 58] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
Abstract
Background Recent progress in method development for characterising the branched structures of complex carbohydrates has now enabled higher throughput technology. Automation of structure analysis then calls for software development since adding meaning to large data collections in reasonable time requires corresponding bioinformatics methods and tools. Current glycobioinformatics resources do cover information on the structure and function of glycans, their interaction with proteins or their enzymatic synthesis. However, this information is partial, scattered and often difficult to find to for non-glycobiologists. Methods Following our diagnosis of the causes of the slow development of glycobioinformatics, we review the "objective" difficulties encountered in defining adequate formats for representing complex entities and developing efficient analysis software. Results Various solutions already implemented and strategies defined to bridge glycobiology with different fields and integrate the heterogeneous glyco-related information are presented. Conclusions Despite the initial stage of our integrative efforts, this paper highlights the rapid expansion of glycomics, the validity of existing resources and the bright future of glycobioinformatics.
Collapse
|
32
|
Aoki-Kinoshita KF, Bolleman J, Campbell MP, Kawano S, Kim JD, Lütteke T, Matsubara M, Okuda S, Ranzinger R, Sawaki H, Shikanai T, Shinmachi D, Suzuki Y, Toukach P, Yamada I, Packer NH, Narimatsu H. Introducing glycomics data into the Semantic Web. J Biomed Semantics 2013; 4:39. [PMID: 24280648 PMCID: PMC4177142 DOI: 10.1186/2041-1480-4-39] [Citation(s) in RCA: 45] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2013] [Accepted: 10/17/2013] [Indexed: 12/24/2022] Open
Abstract
BACKGROUND Glycoscience is a research field focusing on complex carbohydrates (otherwise known as glycans)a, which can, for example, serve as "switches" that toggle between different functions of a glycoprotein or glycolipid. Due to the advancement of glycomics technologies that are used to characterize glycan structures, many glycomics databases are now publicly available and provide useful information for glycoscience research. However, these databases have almost no link to other life science databases. RESULTS In order to implement support for the Semantic Web most efficiently for glycomics research, the developers of major glycomics databases agreed on a minimal standard for representing glycan structure and annotation information using RDF (Resource Description Framework). Moreover, all of the participants implemented this standard prototype and generated preliminary RDF versions of their data. To test the utility of the converted data, all of the data sets were uploaded into a Virtuoso triple store, and several SPARQL queries were tested as "proofs-of-concept" to illustrate the utility of the Semantic Web in querying across databases which were originally difficult to implement. CONCLUSIONS We were able to successfully retrieve information by linking UniCarbKB, GlycomeDB and JCGGDB in a single SPARQL query to obtain our target information. We also tested queries linking UniProt with GlycoEpitope as well as lectin data with GlycomeDB through PDB. As a result, we have been able to link proteomics data with glycomics data through the implementation of Semantic Web technologies, allowing for more flexible queries across these domains.
Collapse
|
33
|
Campbell MP, Peterson R, Mariethoz J, Gasteiger E, Akune Y, Aoki-Kinoshita KF, Lisacek F, Packer NH. UniCarbKB: building a knowledge platform for glycoproteomics. Nucleic Acids Res 2013; 42:D215-21. [PMID: 24234447 PMCID: PMC3964942 DOI: 10.1093/nar/gkt1128] [Citation(s) in RCA: 129] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
The UniCarb KnowledgeBase (UniCarbKB; http://unicarbkb.org) offers public access to a growing, curated database of information on the glycan structures of glycoproteins. UniCarbKB is an international effort that aims to further our understanding of structures, pathways and networks involved in glycosylation and glyco-mediated processes by integrating structural, experimental and functional glycoscience information. This initiative builds upon the success of the glycan structure database GlycoSuiteDB, together with the informatic standards introduced by EUROCarbDB, to provide a high-quality and updated resource to support glycomics and glycoproteomics research. UniCarbKB provides comprehensive information concerning glycan structures, and published glycoprotein information including global and site-specific attachment information. For the first release over 890 references, 3740 glycan structure entries and 400 glycoproteins have been curated. Further, 598 protein glycosylation sites have been annotated with experimentally confirmed glycan structures from the literature. Among these are 35 glycoproteins, 502 structures and 60 publications previously not included in GlycoSuiteDB. This article provides an update on the transformation of GlycoSuiteDB (featured in previous NAR Database issues and hosted by ExPASy since 2009) to UniCarbKB and its integration with UniProtKB and GlycoMod. Here, we introduce a refactored database, supported by substantial new curated data collections and intuitive user-interfaces that improve database searching.
Collapse
|
34
|
Everest-Dass AV, Abrahams JL, Kolarich D, Packer NH, Campbell MP. Structural feature ions for distinguishing N- and O-linked glycan isomers by LC-ESI-IT MS/MS. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2013; 24:895-906. [PMID: 23605685 DOI: 10.1007/s13361-013-0610-4] [Citation(s) in RCA: 91] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/18/2012] [Revised: 02/22/2013] [Accepted: 02/28/2013] [Indexed: 05/13/2023]
Abstract
Glycomics is the comprehensive study of glycan expression in an organism, cell, or tissue that relies on effective analytical technologies to understand glycan structure-function relationships. Owing to the macro- and micro-heterogeneity of oligosaccharides, detailed structure characterization has required an orthogonal approach, such as a combination of specific exoglycosidase digestions, LC-MS/MS, and the development of bioinformatic resources to comprehensively profile a complex biological sample. Liquid chromatography-electrospray ionization-mass spectrometry (LC-ESI-MS/MS) has emerged as a key tool in the structural analysis of oligosaccharides because of its high sensitivity, resolution, and robustness. Here, we present a strategy that uses LC-ESI-MS/MS to characterize over 200 N- and O-glycans from human saliva glycoproteins, complemented by sequential exoglycosidase treatment, to further verify the annotated glycan structures. Fragment-specific substructure diagnostic ions were collated from an extensive screen of the literature available on the detailed structural characterization of oligosaccharides and, together with other specific glycan structure feature ions derived from cross-ring and glycosidic-linkage fragmentation, were used to characterize the glycans and differentiate isomers. The availability of such annotated mass spectrometric fragmentation spectral libraries of glycan structures, together with such substructure diagnostic ions, will be key inputs for the future development of the automated elucidation of oligosaccharide structures from MS/MS data.
Collapse
|
35
|
Collins ES, Galligan MC, Saldova R, Adamczyk B, Abrahams JL, Campbell MP, Ng CT, Veale DJ, Murphy TB, Rudd PM, Fitzgerald O. Glycosylation status of serum in inflammatory arthritis in response to anti-TNF treatment. Rheumatology (Oxford) 2013; 52:1572-82. [PMID: 23681398 DOI: 10.1093/rheumatology/ket189] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
OBJECTIVE Glycosylation is the most common post-translational modification and is altered in disease. The typical glycosylation change in patients with inflammatory arthritis (IA) is a decrease in galactosylation levels on IgG. The aim of this study is to evaluate the effect of anti-TNF therapy on whole serum glycosylation from IA patients and determine whether these alterations in the glycome change upon treatment of the disease. METHODS Serum samples were collected from 54 IA patients before treatment and at 1 and 12 months after commencing anti-TNF therapy. N-linked glycans from whole serum samples were analysed using a high-throughput hydrophilic interaction liquid chromatography-based method. RESULTS Glycosylation on the serum proteins of IA patients changed significantly with anti-TNF treatment. We observed an increase in galactosylated glycans from IgG, also an increase in core-fucosylated biantennary galactosylated glycans and a decrease in sialylated triantennary glycans with and without outer arm fucose. This increase in galactosylated IgG glycans suggests a reversing of the N-glycome towards normal healthy profiles. These changes are strongly correlated with decreasing CRP, suggesting a link between glycosylation changes and decreases in inflammatory processes. CONCLUSION Glycosylation changes in the serum of IA patients on anti-TNF therapy are strongly associated with a decrease in inflammatory processes and reflect the effect of anti-TNF on the immune system.
Collapse
|
36
|
Everest-Dass AV, Kolarich D, Campbell MP, Packer NH. Tandem mass spectra of glycan substructures enable the multistage mass spectrometric identification of determinants on oligosaccharides. RAPID COMMUNICATIONS IN MASS SPECTROMETRY : RCM 2013; 27:931-939. [PMID: 23592194 DOI: 10.1002/rcm.6527] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/04/2012] [Accepted: 01/29/2013] [Indexed: 06/02/2023]
Abstract
RATIONALE Glycosylation of proteins and lipids affects many biological processes, such as host-pathogen interactions, cell communication, and initiation of the immune responses. Terminal glycan substructures, or determinants, often govern the function or recognition of the carrier glycoconjugate and modulate these processes. In this study we describe a strategy using multistage mass spectrometry to identify and confirm these glycan substructures. METHODS An online tandem mass spectrometry (MS(2)) spectral fragment library of glycan substructures that typically occur at the non-reducing terminus of glycoconjugates was created to enable the easier identification and confirmation of glycan determinants on oligosaccharides released from glycoproteins. Oligosaccharides were separated by porous graphitized carbon capillary chromatography and analysed by ion trap MS. Candidate product ions that constitute the glycan substructure mass were identified in the MS(2) product ion spectrum, and used as the precursor ion for subsequent MS(3) fragmentation. The resulting MS(3) spectrum was matched against the MS(2) spectral fragment library to identify the glycan substructure(s) that comprise the parent oligosaccharide. RESULTS Thirty biologically important terminal glycan determinants commonly observed on glycoconjugates were fragmented by positive and negative ion mass spectrometry and the MS(2) product ion masses manually annotated and stored in the UniCarb-DB online database. Negative ion tandem mass spectra were especially useful in assigning isobaric glycan structures. We have applied this strategy for the identification of the sulphation, blood group antigens and sialic acid linkages on complex N-and O-glycans released from glycoproteins. CONCLUSIONS We show the potential of these glycan substructure MS(2) spectra in the negative ionization mode to facilitate the assignment of determinants on N- and O-linked glycans released from glycoproteins. Comparing the structural feature ions of known glycan reference substructures assists in the annotation of complex glycan product ion spectra, and can remove the need for other orthogonal confirmation analyses such as sequential glycosidase digestion.
Collapse
|
37
|
Galligan MC, Saldova R, Campbell MP, Rudd PM, Murphy TB. Greedy feature selection for glycan chromatography data with the generalized Dirichlet distribution. BMC Bioinformatics 2013; 14:155. [PMID: 23651459 PMCID: PMC3703279 DOI: 10.1186/1471-2105-14-155] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2012] [Accepted: 03/20/2013] [Indexed: 11/25/2022] Open
Abstract
Background Glycoproteins are involved in a diverse range of biochemical and biological processes. Changes in protein glycosylation are believed to occur in many diseases, particularly during cancer initiation and progression. The identification of biomarkers for human disease states is becoming increasingly important, as early detection is key to improving survival and recovery rates. To this end, the serum glycome has been proposed as a potential source of biomarkers for different types of cancers. High-throughput hydrophilic interaction liquid chromatography (HILIC) technology for glycan analysis allows for the detailed quantification of the glycan content in human serum. However, the experimental data from this analysis is compositional by nature. Compositional data are subject to a constant-sum constraint, which restricts the sample space to a simplex. Statistical analysis of glycan chromatography datasets should account for their unusual mathematical properties. As the volume of glycan HILIC data being produced increases, there is a considerable need for a framework to support appropriate statistical analysis. Proposed here is a methodology for feature selection in compositional data. The principal objective is to provide a template for the analysis of glycan chromatography data that may be used to identify potential glycan biomarkers. Results A greedy search algorithm, based on the generalized Dirichlet distribution, is carried out over the feature space to search for the set of “grouping variables” that best discriminate between known group structures in the data, modelling the compositional variables using beta distributions. The algorithm is applied to two glycan chromatography datasets. Statistical classification methods are used to test the ability of the selected features to differentiate between known groups in the data. Two well-known methods are used for comparison: correlation-based feature selection (CFS) and recursive partitioning (rpart). CFS is a feature selection method, while recursive partitioning is a learning tree algorithm that has been used for feature selection in the past. Conclusions The proposed feature selection method performs well for both glycan chromatography datasets. It is computationally slower, but results in a lower misclassification rate and a higher sensitivity rate than both correlation-based feature selection and the classification tree method.
Collapse
|
38
|
Campbell MP, Nguyen-Khuong T, Hayes CA, Flowers SA, Alagesan K, Kolarich D, Packer NH, Karlsson NG. Validation of the curation pipeline of UniCarb-DB: building a global glycan reference MS/MS repository. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2013; 1844:108-16. [PMID: 23624262 DOI: 10.1016/j.bbapap.2013.04.018] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/31/2012] [Revised: 04/01/2013] [Accepted: 04/16/2013] [Indexed: 10/26/2022]
Abstract
The UniCarb-DB database is an emerging public glycomics data repository, containing over 500 tandem mass spectra (as of March 2013) of glycans released from glycoproteins. A major challenge in glycomics research is to provide and maintain high-quality datasets that will offer the necessary diversity to support the development of accurate bioinformatics tools for data deposition and analysis. The role of UniCarb-DB, as an archival database, is to provide the glycomics community with open-access to a comprehensive LC MS/MS library of N- and O- linked glycans released from glycoproteins that have been annotated with glycosidic and cross-ring fragmentation ions, retention times, and associated experimental metadata descriptions. Here, we introduce the UniCarb-DB data submission pipeline and its practical application to construct a library of LC-MS/MS glycan standards that forms part of this database. In this context, an independent consortium of three laboratories was established to analyze the same 23 commercially available oligosaccharide standards, all by using graphitized carbon-liquid chromatography (LC) electrospray ionization (ESI) ion trap mass spectrometry in the negative ion mode. A dot product score was calculated for each spectrum in the three sets of data as a measure of the comparability that is necessary for use of such a collection in library-based spectral matching and glycan structural identification. The effects of charge state, de-isotoping and threshold levels on the quality of the input data are shown. The provision of well-characterized oligosaccharide fragmentation data provides the opportunity to identify determinants of specific glycan structures, and will contribute to the confidence level of algorithms that assign glycan structures to experimental MS/MS spectra. This article is part of a Special Issue entitled: Computational Proteomics in the Post-Identification Era. Guest Editors: Martin Eisenacher and Christian Stephan.
Collapse
|
39
|
Kolarich D, Rapp E, Struwe WB, Haslam SM, Zaia J, McBride R, Agravat S, Campbell MP, Kato M, Ranzinger R, Kettner C, York WS. The minimum information required for a glycomics experiment (MIRAGE) project: improving the standards for reporting mass-spectrometry-based glycoanalytic data. Mol Cell Proteomics 2013; 12:991-5. [PMID: 23378518 DOI: 10.1074/mcp.o112.026492] [Citation(s) in RCA: 89] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
The MIRAGE guidelines are being developed in response to a critical need in the glycobiology community to clarify glycoanalytic results so that they are more readily evaluated (in terms of their scope and depth) and to facilitate the reproduction of important results in the laboratory. The molecular and biological complexity of the glycosylation process makes thorough reporting of the results of a glycomics experiment a highly challenging endeavor. The resulting data specify the identity and quantity of complex structures, the precise molecular features of which are sometimes inferred using prior knowledge, such as familiarity with a particular biosynthetic mechanism. Specifying the exact methods and assumptions that were used to assign and quantify reported structures allows the interested scientist to appreciate the scope and depth of the analysis. Mass spectrometry (MS) is the most widely used tool for glycomics experiments. The interpretation and reproducibility of MS-based glycomics data depend on comprehensive meta-data describing the instrumentation, instrument setup, and data acquisition protocols. The MIRAGE guidelines for MS-based glycomics have been designed to facilitate the collection and sharing of this critical information in order to assist the glycoanalyst in generating data sets with maximum information content and biological relevance.
Collapse
|
40
|
Harvey DJ, Merry AH, Royle L, Campbell MP, Rudd PM. Symbol nomenclature for representing glycan structures: Extension to cover different carbohydrate types. Proteomics 2011; 11:4291-5. [PMID: 21954138 DOI: 10.1002/pmic.201100300] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2011] [Revised: 08/08/2011] [Accepted: 08/10/2011] [Indexed: 11/10/2022]
Abstract
This Viewpoint article addresses comments made on our original article describing a symbolic system for the depiction of N- and O-linked carbohydrate structures and proposes a method for extending the symbol set to include monosaccharides commonly found in carbohydrates present in bacteria and plants. As before, basic monosaccharides are shown by shape with one or more additions such as solid fill or additions of lines, crosses or dots to represent functional groups. The use of colour to differentiate constituent monosaccharides is avoided, thus enabling the system to be used in a variety of formats. Linkage and anomericity are shown by the angle and type of line connecting the symbols. In this extended version, new symbols are proposed for additional hexoses and it is proposed that pyranose and furanose forms of the monosaccharides could be shown by solid or broken outlines to the symbols. Conventions for depicting the presence of multiple functional groups such as deoxy-(NH(2))(2) are also discussed. It is hoped that these proposals will stimulate discussion so that a consensus can be reached as to how the glycobiology community can best convey complex information in a simple manner.
Collapse
|
41
|
Campbell MP, Hayes CA, Struwe WB, Wilkins MR, Aoki-Kinoshita KF, Harvey DJ, Rudd PM, Kolarich D, Lisacek F, Karlsson NG, Packer NH. UniCarbKB: Putting the pieces together for glycomics research. Proteomics 2011; 11:4117-21. [DOI: 10.1002/pmic.201100302] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2011] [Revised: 07/27/2011] [Accepted: 07/29/2011] [Indexed: 12/21/2022]
|
42
|
Sumer-Bayraktar Z, Kolarich D, Campbell MP, Ali S, Packer NH, Thaysen-Andersen M. N-glycans modulate the function of human corticosteroid-binding globulin. Mol Cell Proteomics 2011; 10:M111.009100. [PMID: 21558494 DOI: 10.1074/mcp.m111.009100] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Human corticosteroid-binding globulin (CBG), a heavily glycosylated protein containing six N-linked glycosylation sites, transports cortisol and other corticosteroids in blood circulation. Here, we investigate the biological importance of the N-glycans of CBG derived from human serum by performing a structural and functional characterization of CBG N-glycosylation. Liquid chromatography-tandem MS-based glycoproteomics and glycomics combined with exoglycosidase treatment revealed 26 complex type N-glycoforms, all of which were terminated with α2,3-linked neuraminic acid (NeuAc) residues. The CBG N-glycans showed predominantly bi- and tri-antennary branching, but higher branching was also observed. N-glycans from all six N-glycosylation sites were identified with high site occupancies (70.5-99.5%) and glycoforms from all sites contained a relatively low degree of core-fucosylation (0-34.9%). CBG showed site-specific glycosylation and the site-to-site differences in core-fucosylation and branching could be in silico correlated with the accessibility to the individual glycosylation sites on the maturely folded protein. Deglycosylated and desialylated CBG analogs were generated to investigate the biological importance of CBG N-glycans. As a functional assay, MCF-7 cells were challenged with native and glycan-modified CBG and the amount of cAMP, which is produced as a quantitative response upon CBG binding to its cell surface receptor, was used to evaluate the CBG:receptor interaction. The removal of both CBG N-glycans and NeuAc residues increased the production of cAMP significantly. This confirms that N-glycans are involved in the CBG:receptor interaction and indicates that the modulation is performed by steric and/or electrostatic means through the terminal NeuAc residues.
Collapse
|
43
|
von der Lieth CW, Freire AA, Blank D, Campbell MP, Ceroni A, Damerell DR, Dell A, Dwek RA, Ernst B, Fogh R, Frank M, Geyer H, Geyer R, Harrison MJ, Henrick K, Herget S, Hull WE, Ionides J, Joshi HJ, Kamerling JP, Leeflang BR, Lütteke T, Lundborg M, Maass K, Merry A, Ranzinger R, Rosen J, Royle L, Rudd PM, Schloissnig S, Stenutz R, Vranken WF, Widmalm G, Haslam SM. EUROCarbDB: An open-access platform for glycoinformatics. Glycobiology 2011; 21:493-502. [PMID: 21106561 PMCID: PMC3055595 DOI: 10.1093/glycob/cwq188] [Citation(s) in RCA: 104] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2010] [Revised: 11/03/2010] [Accepted: 11/03/2010] [Indexed: 01/03/2023] Open
Abstract
The EUROCarbDB project is a design study for a technical framework, which provides sophisticated, freely accessible, open-source informatics tools and databases to support glycobiology and glycomic research. EUROCarbDB is a relational database containing glycan structures, their biological context and, when available, primary and interpreted analytical data from high-performance liquid chromatography, mass spectrometry and nuclear magnetic resonance experiments. Database content can be accessed via a web-based user interface. The database is complemented by a suite of glycoinformatics tools, specifically designed to assist the elucidation and submission of glycan structure and experimental data when used in conjunction with contemporary carbohydrate research workflows. All software tools and source code are licensed under the terms of the Lesser General Public License, and publicly contributed structures and data are freely accessible. The public test version of the web interface to the EUROCarbDB can be found at http://www.ebi.ac.uk/eurocarb.
Collapse
|
44
|
Hayes CA, Karlsson NG, Struwe WB, Lisacek F, Rudd PM, Packer NH, Campbell MP. UniCarb-DB: a database resource for glycomic discovery. Bioinformatics 2011; 27:1343-4. [DOI: 10.1093/bioinformatics/btr137] [Citation(s) in RCA: 111] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
45
|
Illingworth CJR, Scott PD, Parkes KEB, Snell CR, Campbell MP, Reynolds CA. Connectivity and binding-site recognition: Applications relevant to drug design. J Comput Chem 2010; 31:2677-88. [DOI: 10.1002/jcc.21561] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
46
|
Artemenko NV, Campbell MP, Rudd PM. GlycoExtractor: A Web-Based Interface for High Throughput Processing of HPLC-Glycan Data. J Proteome Res 2010; 9:2037-41. [DOI: 10.1021/pr901213u] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
47
|
Campbell MP, Royle L, Radcliffe CM, Dwek RA, Rudd PM. GlycoBase and autoGU: tools for HPLC-based glycan analysis. Bioinformatics 2008; 24:1214-6. [PMID: 18344517 DOI: 10.1093/bioinformatics/btn090] [Citation(s) in RCA: 220] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
SUMMARY The development of robust high-performance liquid chromatography (HPLC) technologies continues to improve the detailed analysis and sequencing of glycan structures released from glycoproteins. Here, we present a database (GlycoBase) and analytical tool (autoGU) to assist the interpretation and assignment of HPLC-glycan profiles. GlycoBase is a relational database which contains the HPLC elution positions for over 350 2-AB labelled N-glycan structures together with predicted products of exoglycosidase digestions. AutoGU assigns provisional structures to each integrated HPLC peak and, when used in combination with exoglycosidase digestions, progressively assigns each structure automatically based on the footprint data. These tools are potentially very promising and facilitate basic research as well as the quantitative high-throughput analysis of low concentrations of glycans released from glycoproteins. AVAILABILITY http://glycobase.ucd.ie
Collapse
|
48
|
Royle L, Campbell MP, Radcliffe CM, White DM, Harvey DJ, Abrahams JL, Kim YG, Henry GW, Shadick NA, Weinblatt ME, Lee DM, Rudd PM, Dwek RA. HPLC-based analysis of serum N-glycans on a 96-well plate platform with dedicated database software. Anal Biochem 2007; 376:1-12. [PMID: 18194658 DOI: 10.1016/j.ab.2007.12.012] [Citation(s) in RCA: 373] [Impact Index Per Article: 21.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2007] [Revised: 12/04/2007] [Accepted: 12/06/2007] [Indexed: 11/29/2022]
Abstract
We present a robust, fully automatable technology platform that includes computer software for the detailed analysis of low femtomoles of N-linked sugars released from glycoproteins. Features include (i) sample immobilization in 96-well plates, glycan release, and fluorescent labeling; (ii) quantitative HPLC analysis, including monosaccharide sequence, linkage, and arm-specific information for charged and neutral glycans; (iii) automatic structural assignment of peaks from HPLC profiles via web-based software that accesses our database (GlycoBase) of more than 350 N-glycan structures, including 117 present in the human serum glycome; and (iv) software (autoGU) that progressively analyzes data from exoglycosidase digestions to produce a refined list of final structures. The N-glycans from a plate of 96 samples can be released and purified in 2 or 3 days and profiled in 2 days. This strategy can be used for (i) identification and screening of disease biomarkers and (ii) monitoring the production of therapeutic glycoproteins, allowing optimization of production conditions. This technology is also suitable for preparing released glycans for other analytical techniques. Here we demonstrate its application to rheumatoid arthritis using 5 microl of patient serum.
Collapse
|
49
|
Thummer RP, Campbell MP, Dean MK, Frusher MJ, Scott PD, Reynolds CA. Entropy and oligomerization in GPCRs. J Mol Neurosci 2005; 26:113-22. [PMID: 16012184 DOI: 10.1385/jmn:26:2-3:113] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
Evolutionary trace (ET) and entropy are two related methods for analyzing a multiple sequence alignment to determine functionally important residues in proteins. In this article, these methods have been enhanced with a view to reinvestigate the issue ofGPCR dimerization and oligomerization. In particular, cluster analysis has replaced the subjective visual analysis element of the original ET method. Previous applications of the ET method predicted two dimerization interfaces on the external transmembrane lipid-facing region of GPCRs; these were discussed in terms of dimerization and linear oligomers. Removing the subjective element of the ET method gives rise to the prediction of functionally important residues on the external face of each transmembrane helix for a large number of class A GPCRs. These results are consistent with a growing body of experimental information that, taken over many receptor subtypes, has implicated each transmembrane helix in dimeric interactions. In this application, entropy gave superior results to those obtained from the ET method in that its use gives rise to higher z-scores and fewer instances of z-scores below 3.
Collapse
|
50
|
Engle J, Safi HJ, Miller CC, Campbell MP, Harlin SA, Letsou GV, Lloyd MD KS, Root DB. The impact of diaphragm management on prolonged ventilator support after thoracoabdominal aortic repair. J Vasc Surg 1999; 29:150-6. [PMID: 9882799 DOI: 10.1016/s0741-5214(99)70356-3] [Citation(s) in RCA: 46] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
PURPOSE The relationship of the division of the diaphragm during thoracoabdominal aortic repair to prolonged ventilator support has not been studied. The purpose of this study was (1) to determine whether preservation of diaphragm integrity has a significant effect on postoperative ventilator duration and (2) to elucidate other pulmonary risk factors related to thoracoabdominal aortic surgery and to study the relationship of these factors to the intact diaphragm technique. METHODS Between February 1991 and January 1997, we repaired 397 descending and thoracoabdominal aortic aneurysms. Descending thoracic aneurysms were not included in the study because their repair does not include the diaphragm. A total of 256 patients participated in this study. The diaphragm was divided in 150 patients and left intact in 106 patients. Examined as potential risk factors were patient demographics, history and physical findings, aneurysm extent, urgency of the procedure, acute dissection, cross-clamp time, homologous and autologous blood product consumption, and adjunctive operative techniques. FEV1 also was considered in the 197 patients for whom preoperative spirometry was available. Prolonged mechanical ventilation was defined as ventilator support for >72 hours. Data were analyzed by univariate contingency table and multiple logistic regression methods. RESULTS Increasing age (odds ratio [OR], 1.02/y; P <.02), current smoking (OR, 2.6; P <.0008), total cross-clamp time (OR, 1.0/min; P <.008), units packed red blood cells transfused (OR, 1.06/unit; P <.008), and division of the diaphragm (OR, 2.03; P <.02) were significant, independent predictors of prolonged ventilation. Sixty-seven percent of patients (71 of 106) whose diaphragms were preserved were extubated in <72 hours compared with 52% of patients (78 of 150) who underwent diaphragm division (OR, 0.53; P <.02). CONCLUSION Independently of well known pulmonary risk factors, an intact diaphragm during thoracoabdominal aortic repair results in a higher probability of early ventilator weaning.
Collapse
|