1
|
Yan N, Song F, Ouyang Y, Li D, Tian H, Yi L, Linhardt RJ, Zhang Z. Glycan Mapping of Low-Molecular-Weight Heparin Using Mass Spectral Correction Based on Chromatography Fitting with “Glycomapping” Software. Anal Chem 2022; 94:13000-13009. [DOI: 10.1021/acs.analchem.2c01579] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Na Yan
- Jiangsu Key Laboratory of Translational Research and Therapy for Neuro-Psycho-Diseases and College of Pharmaceutical Sciences, Soochow University, Suzhou, Jiangsu 215021, China
| | - Feifan Song
- Jiangsu Key Laboratory of Translational Research and Therapy for Neuro-Psycho-Diseases and College of Pharmaceutical Sciences, Soochow University, Suzhou, Jiangsu 215021, China
| | - Yilan Ouyang
- Jiangsu Key Laboratory of Translational Research and Therapy for Neuro-Psycho-Diseases and College of Pharmaceutical Sciences, Soochow University, Suzhou, Jiangsu 215021, China
| | - Duxin Li
- Jiangsu Key Laboratory of Translational Research and Therapy for Neuro-Psycho-Diseases and College of Pharmaceutical Sciences, Soochow University, Suzhou, Jiangsu 215021, China
| | - He Tian
- Jiangsu Key Laboratory of Translational Research and Therapy for Neuro-Psycho-Diseases and College of Pharmaceutical Sciences, Soochow University, Suzhou, Jiangsu 215021, China
| | - Lin Yi
- Jiangsu Key Laboratory of Translational Research and Therapy for Neuro-Psycho-Diseases and College of Pharmaceutical Sciences, Soochow University, Suzhou, Jiangsu 215021, China
| | - Robert J. Linhardt
- Departments of Chemistry and Chemical Biology, Biology, Chemical and Biological Engineering, and Biomedical Engineering, Center for Biotechnology and Interdisciplinary Studies, Rensselaer Polytechnic Institute, 110 8th Street, Troy, New York 12180, United States
| | - Zhenqing Zhang
- Jiangsu Key Laboratory of Translational Research and Therapy for Neuro-Psycho-Diseases and College of Pharmaceutical Sciences, Soochow University, Suzhou, Jiangsu 215021, China
| |
Collapse
|
2
|
Chen Z, Wei J, Tang Y, Lin C, Costello CE, Hong P. GlycoDeNovo2: An Improved MS/MS-Based De Novo Glycan Topology Reconstruction Algorithm. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2022; 33:436-445. [PMID: 35157458 PMCID: PMC9149727 DOI: 10.1021/jasms.1c00288] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Glycan structure identification is essential to understanding the roles of glycans in various biological processes. Previously, we developed GlycoDeNovo, a de novo algorithm for reconstructing glycan topologies from tandem mass spectra (MS/MS). In this work, we introduce GlycoDeNovo2 that contains two major improvements to GlycoDeNovo. First, we use the precursor mass measured for a peak that likely corresponds to a glycan to determine its potential compositions, which are used to constrain the search space, enable parallel computation, and hence speed up topology reconstruction. Second, we developed a procedure to calculate the empirical p-value of a reconstructed topology candidate. Experimental results are provided to demonstrate the effectiveness of GlycoDeNovo2.
Collapse
Affiliation(s)
- Zizhang Chen
- Department of Computer Science, Brandeis University, Waltham, Massachusetts 02453, United States
| | - Juan Wei
- Department of Biochemistry, Boston University School of Medicine, Boston, Massachusetts 02118, United States
| | - Yang Tang
- Department of Chemistry, Boston University, Boston, Massachusetts 02215, United States
| | - Cheng Lin
- Department of Biochemistry, Boston University School of Medicine, Boston, Massachusetts 02118, United States
| | - Catherine E Costello
- Department of Biochemistry, Boston University School of Medicine, Boston, Massachusetts 02118, United States
- Department of Chemistry, Boston University, Boston, Massachusetts 02215, United States
| | - Pengyu Hong
- Department of Computer Science, Brandeis University, Waltham, Massachusetts 02453, United States
| |
Collapse
|
3
|
Wang H, Zhang J, Dong J, Hou M, Pan W, Bu D, Zhou J, Zhang Q, Wang Y, Zhao K, Li Y, Huang C, Sun S. Identification of glycan branching patterns using multistage mass spectrometry with spectra tree analysis. J Proteomics 2020; 217:103649. [PMID: 31978548 DOI: 10.1016/j.jprot.2020.103649] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2019] [Revised: 01/02/2020] [Accepted: 01/16/2020] [Indexed: 12/11/2022]
Abstract
Glycans are crucial to a wide range of biological processes, and their biological activities are closely related to the branching patterns of structures. Different from the simple linear chains of proteins, branching patterns of glycans are more complicated, making their identification extremely challenging. Tandem mass spectrometry (MS2) cannot provide sufficient structural information to deduce glycan branching patterns even with the assistance of various bioinformatic tools and algorithms.The promising technology to identify glycan branching patterns is multi-stage mass spectrometry (MSn). The production-relationship among MSn spectra of a glycan is essentially a tree, making deducing glycan structures from MSn spectra a great challenge. In the present study, we report an approach called glyBranch (glycan Branching pattern identification based on spectra tree) to fully exploit the information contained in the MSn spectra tree for glycan identification. Using 14 glycan standards, including 2 pairs with isomeric sequence, and 16 complex N-glycans isolated from RNase B and IgG, we demonstrated the successful application of glyBranch to branching pattern analysis. The source code of glyBranch is available at https://github.com/bigict/glyBranch/. We have also developed a web-server, which is freely accessible at http://glycan.ict.ac.cn/glyBranch/. SIGNIFICANCE: Glycans are crucial in various biological processes and their functions are closely related to the details of their structures; thus, the identification of glycan branching patterns is of great significance to biological studies. Multistage mass spectrometry (MSn) can provide detailed structural information by generating multiple-level fragments through consecutive fragmentation; however, the interpretation of numerous MSn spectra is extremely challenging. In this study, we present an approach called glyBranch (glycan Branching pattern identification based on spectra tree) to exploit the information contained in MSn spectra tree for glycan identification. This approach will greatly facilitate the automated identification of glycan structures and related biological studies.
Collapse
Affiliation(s)
- Hui Wang
- Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Jingwei Zhang
- Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; Department of Computer Science, Stony Brook University, Stony Brook, NY 11794, USA
| | - Junchuan Dong
- Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Meijie Hou
- Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Weiyi Pan
- Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Dongbo Bu
- Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Jinyu Zhou
- Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Qi Zhang
- Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yaojun Wang
- Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; College of Information and Electrical Engineering, China Agricultural University, 100083,China
| | - Keli Zhao
- Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yan Li
- Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Chuncui Huang
- Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China.
| | - Shiwei Sun
- Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; University of Chinese Academy of Sciences, Beijing 100049, China.
| |
Collapse
|
4
|
Wei J, Tang Y, Bai Y, Zaia J, Costello CE, Hong P, Lin C. Toward Automatic and Comprehensive Glycan Characterization by Online PGC-LC-EED MS/MS. Anal Chem 2020; 92:782-791. [PMID: 31829560 PMCID: PMC7082718 DOI: 10.1021/acs.analchem.9b03183] [Citation(s) in RCA: 44] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Despite the recent advances in mass spectrometry (MS)-based methods for glycan structural analysis, characterization of glycomes remains a significant analytical challenge, in part due to the widespread presence of isomeric structures and the need to define the many structural variables for each glycan. Interpretation of the complex tandem mass spectra of glycans is often laborious and requires substantial expertise. Broad adoption of MS methods for glycomics, within and outside the glycoscience community, has been hindered by the shortage of bioinformatics tools for rapid and accurate glycan sequencing. Here, we developed an online porous graphitic carbon liquid chromatography (PGC-LC)-electronic excitation dissociation (EED) MS/MS method that takes advantage of the superior isomer resolving power of PGC and the structural details provided by EED MS/MS for characterization of glycan mixtures. We also made improvements to GlycoDeNovo, our de novo glycan sequencing algorithm, so that it can automatically and accurately identify glycan topologies from EED tandem mass spectra acquired online. The majority of linkages can also be determined de novo, although in some cases, biological insight may be needed to fully define the glycan structure. Application of this method to the analysis of N-glycans released from ribonuclease B not only revealed the presence of 18 high-mannose structures, including new isomers not previously reported, but also provided relative quantification for each isomeric structure. With fully automated data acquisition and topology analysis, the approach presented here holds great potential for automated and comprehensive glycan characterization.
Collapse
Affiliation(s)
- Juan Wei
- Center for Biomedical Mass Spectrometry, Boston University School of Medicine, Boston, Massachusetts 02118, United States
| | - Yang Tang
- Center for Biomedical Mass Spectrometry, Boston University School of Medicine, Boston, Massachusetts 02118, United States
- Department of Chemistry, Boston University, Boston, Massachusetts 02215, United States
| | - Yu Bai
- Beijing National Laboratory for Molecular Sciences, Institute of Analytical Chemistry, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
| | - Joseph Zaia
- Center for Biomedical Mass Spectrometry, Boston University School of Medicine, Boston, Massachusetts 02118, United States
| | - Catherine E. Costello
- Center for Biomedical Mass Spectrometry, Boston University School of Medicine, Boston, Massachusetts 02118, United States
- Department of Chemistry, Boston University, Boston, Massachusetts 02215, United States
| | - Pengyu Hong
- Department of Computer Science, Brandeis University, Waltham, Massachusetts 02454, United States
| | - Cheng Lin
- Center for Biomedical Mass Spectrometry, Boston University School of Medicine, Boston, Massachusetts 02118, United States
| |
Collapse
|
5
|
De novo glycan structural identification from mass spectra using tree merging strategy. Comput Biol Chem 2019; 80:217-224. [DOI: 10.1016/j.compbiolchem.2019.03.015] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2019] [Accepted: 03/23/2019] [Indexed: 11/19/2022]
|
6
|
Sun W, Liu Y, Zhang K. An approach for N-linked glycan identification from MS/MS spectra by target-decoy strategy. Comput Biol Chem 2018; 74:391-398. [PMID: 29580737 DOI: 10.1016/j.compbiolchem.2018.03.014] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2018] [Accepted: 03/13/2018] [Indexed: 12/28/2022]
Abstract
Glycan structure determination serves as an essential step for the thorough investigation of the structure and function of protein. Currently, appropriate sample preparation followed by tandem mass spectrometry has emerged as the dominant technique for the characterization of glycans and glycopeptides. Although extensive efforts have been made to the development of computational approaches for the automated interpretation of glycopeptide spectra, the previously appeared methods lack a reasonable quality control strategy for the statistical validation of reported results. In this manuscript, we introduced a novel method that constructed a decoy glycan database based on the glycan structures in the target database, and searched the experimental spectra against both the target and decoy databases to find the best matched glycans. Specifically, a two-layer scoring scheme for calculating a normalized matching score is applied in the search procedure which enables the unbiased ranking of the matched glycans. Experimental analysis showed that our proposed method can report more structures with high confidence compared with previous approaches.
Collapse
Affiliation(s)
- Weiping Sun
- Department of Computer Science, University of Western Ontario, London, ON N6A5B7, Canada.
| | - Yi Liu
- Department of Computer Science, University of Western Ontario, London, ON N6A5B7, Canada
| | - Kaizhong Zhang
- Department of Computer Science, University of Western Ontario, London, ON N6A5B7, Canada
| |
Collapse
|
7
|
Xiao K, Wang Y, Shen Y, Han Y, Tian Z. Large-scale identification and visualization of N-glycans with primary structures using GlySeeker. RAPID COMMUNICATIONS IN MASS SPECTROMETRY : RCM 2018; 32:142-148. [PMID: 29105226 DOI: 10.1002/rcm.8023] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/04/2017] [Revised: 10/19/2017] [Accepted: 10/20/2017] [Indexed: 06/07/2023]
Abstract
RATIONALE Most of the current popular tandem mass spectrometers have the capability of resolving the primary structures (monosaccharide composition, sequence and linkage) of N-glycans; however, compositions or putative structures have mostly been reported so far. Identification and visualization tools of N-glycans are needed. METHODS The isotopic mass-to-charge ratio and envelope fingerprinting algorithm, which has been successfully used for intact protein database search and identification, was adapted for N-glycan database search, and a stand-alone N-glycan database search engine, GlySeeker, for automated N-glycan identification and visualization was developed and successfully benchmarked. Both pseudo 2D graph and one-line text formats with one-letter symbols for monosaccharides were proposed for representing N-glycans. N-glycans were identified with comprehensive interpretation of product ions and false discovery rate (FDR) control. RESULTS In a database search of reversed-phase liquid chromatography/tandem mass spectrometry (RPLC/MS/MS) datasets of the N-glycome enriched from OVCAR-3 ovarian cancer cells, with FDR ≤1% and number of best hits (NoBHs) = 1-30, 1525 N-glycans with comprehensive primary structural information (composition, sequence and linkage) were identified and visualized; among these 1525 N-glycans, 559 had NoBHs = 1, i.e. their structures were uniquely identified. This represents a large-scale identification and visualization of N-glycans with primary structures from tandem mass spectra. CONCLUSIONS A stand-alone N-glycan database search engine called GlySeeker has been developed for large-scale identification and visualization of N-glycans with comprehensive interpretation of tandem mass spectra and FDR control.
Collapse
Affiliation(s)
- Kaijie Xiao
- School of Chemical Science and Engineering and Shanghai Key Laboratory of Chemical Assessment and Sustainability, Tongji University, Shanghai, 200092, China
| | - Yue Wang
- School of Chemical Science and Engineering and Shanghai Key Laboratory of Chemical Assessment and Sustainability, Tongji University, Shanghai, 200092, China
| | - Yun Shen
- School of Chemical Science and Engineering and Shanghai Key Laboratory of Chemical Assessment and Sustainability, Tongji University, Shanghai, 200092, China
| | - Yuyin Han
- School of Chemical Science and Engineering and Shanghai Key Laboratory of Chemical Assessment and Sustainability, Tongji University, Shanghai, 200092, China
| | - Zhixin Tian
- School of Chemical Science and Engineering and Shanghai Key Laboratory of Chemical Assessment and Sustainability, Tongji University, Shanghai, 200092, China
| |
Collapse
|
8
|
Hong P, Sun H, Sha L, Pu Y, Khatri K, Yu X, Tang Y, Lin C. GlycoDeNovo - an Efficient Algorithm for Accurate de novo Glycan Topology Reconstruction from Tandem Mass Spectra. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2017; 28:2288-2301. [PMID: 28786094 PMCID: PMC5647224 DOI: 10.1007/s13361-017-1760-6] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/07/2017] [Revised: 07/03/2017] [Accepted: 07/09/2017] [Indexed: 05/15/2023]
Abstract
A major challenge in glycomics is the characterization of complex glycan structures that are essential for understanding their diverse roles in many biological processes. We present a novel efficient computational approach, named GlycoDeNovo, for accurate elucidation of the glycan topologies from their tandem mass spectra. Given a spectrum, GlycoDeNovo first builds an interpretation-graph specifying how to interpret each peak using preceding interpreted peaks. It then reconstructs the topologies of peaks that contribute to interpreting the precursor ion. We theoretically prove that GlycoDeNovo is highly efficient. A major innovative feature added to GlycoDeNovo is a data-driven IonClassifier which can be used to effectively rank candidate topologies. IonClassifier is automatically learned from experimental spectra of known glycans to distinguish B- and C-type ions from all other ion types. Our results showed that GlycoDeNovo is robust and accurate for topology reconstruction of glycans from their tandem mass spectra. Graphical Abstract ᅟ.
Collapse
Affiliation(s)
- Pengyu Hong
- Department of Computer Science, Brandeis University, Waltham, MA, 02453, USA.
| | - Hui Sun
- Department of Computer Science, Brandeis University, Waltham, MA, 02453, USA
| | - Long Sha
- Department of Computer Science, Brandeis University, Waltham, MA, 02453, USA
| | - Yi Pu
- Department of Chemistry, Boston University, Boston, MA, 02215, USA
| | - Kshitij Khatri
- Department of Biochemistry, Boston University School of Medicine, Boston, MA, 02118, USA
| | - Xiang Yu
- Department of Biochemistry, Boston University School of Medicine, Boston, MA, 02118, USA
| | - Yang Tang
- Department of Chemistry, Boston University, Boston, MA, 02215, USA
| | - Cheng Lin
- Department of Biochemistry, Boston University School of Medicine, Boston, MA, 02118, USA.
| |
Collapse
|
9
|
Hu H, Khatri K, Zaia J. Algorithms and design strategies towards automated glycoproteomics analysis. MASS SPECTROMETRY REVIEWS 2017; 36:475-498. [PMID: 26728195 PMCID: PMC4931994 DOI: 10.1002/mas.21487] [Citation(s) in RCA: 71] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/10/2015] [Accepted: 11/30/2015] [Indexed: 05/09/2023]
Abstract
Glycoproteomics involves the study of glycosylation events on protein sequences ranging from purified proteins to whole proteome scales. Understanding these complex post-translational modification (PTM) events requires elucidation of the glycan moieties (monosaccharide sequences and glycosidic linkages between residues), protein sequences, as well as site-specific attachment of glycan moieties onto protein sequences, in a spatial and temporal manner in a variety of biological contexts. Compared with proteomics, bioinformatics for glycoproteomics is immature and many researchers still rely on tedious manual interpretation of glycoproteomics data. As sample preparation protocols and analysis techniques have matured, the number of publications on glycoproteomics and bioinformatics has increased substantially; however, the lack of consensus on tool development and code reuse limits the dissemination of bioinformatics tools because it requires significant effort to migrate a computational tool tailored for one method design to alternative methods. This review discusses algorithms and methods in glycoproteomics, and refers to the general proteomics field for potential solutions. It also introduces general strategies for tool integration and pipeline construction in order to better serve the glycoproteomics community. © 2016 Wiley Periodicals, Inc. Mass Spec Rev 36:475-498, 2017.
Collapse
Affiliation(s)
- Han Hu
- Bioinformatics Program, Boston University, Boston, Massachusetts 02215, USA
- Center for Biomedical Mass Spectrometry, Department of Biochemistry, Boston University School of Medicine, Boston University, Boston, Massachusetts 02118, USA
| | - Kshitij Khatri
- Center for Biomedical Mass Spectrometry, Department of Biochemistry, Boston University School of Medicine, Boston University, Boston, Massachusetts 02118, USA
| | - Joseph Zaia
- Center for Biomedical Mass Spectrometry, Department of Biochemistry, Boston University School of Medicine, Boston University, Boston, Massachusetts 02118, USA
| |
Collapse
|
10
|
Tsai PL, Chen SF. A Brief Review of Bioinformatics Tools for Glycosylation Analysis by Mass Spectrometry. Mass Spectrom (Tokyo) 2017; 6:S0064. [PMID: 28337402 PMCID: PMC5358406 DOI: 10.5702/massspectrometry.s0064] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2016] [Accepted: 01/14/2017] [Indexed: 12/28/2022] Open
Abstract
The purpose of this review is to provide updated information regarding bioinformatic software for the use in the characterization of glycosylated structures since 2013. A comprehensive review by Woodin et al.Analyst 138: 2793-2803, 2013 (ref. 1) described two main approaches that are introduced for starting researchers in this area; analysis of released glycans and the identification of glycopeptide in enzymatic digests, respectively. Complementary to that report, this review focuses on mass spectrometry related bioinformatics tools for the characterization of N-linked and O-linked glycopeptides. Specifically, it also provides information regarding automated tools that can be used for glycan profiling using mass spectrometry.
Collapse
Affiliation(s)
- Pei-Lun Tsai
- Department of Chemistry, National Taiwan Normal University
- Mithra Biotechnology Inc
| | - Sung-Fang Chen
- Department of Chemistry, National Taiwan Normal University
| |
Collapse
|
11
|
Zhang P, Woen S, Wang T, Liau B, Zhao S, Chen C, Yang Y, Song Z, Wormald MR, Yu C, Rudd PM. Challenges of glycosylation analysis and control: an integrated approach to producing optimal and consistent therapeutic drugs. Drug Discov Today 2016; 21:740-65. [DOI: 10.1016/j.drudis.2016.01.006] [Citation(s) in RCA: 136] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2015] [Revised: 12/22/2015] [Accepted: 01/14/2016] [Indexed: 12/18/2022]
|
12
|
Agravat SB, Song X, Rojsajjakul T, Cummings RD, Smith DF. Computational approaches to define a human milk metaglycome. Bioinformatics 2016; 32:1471-8. [PMID: 26803164 DOI: 10.1093/bioinformatics/btw048] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2015] [Accepted: 01/20/2016] [Indexed: 12/20/2022] Open
Abstract
MOTIVATION The goal of deciphering the human glycome has been hindered by the lack of high-throughput sequencing methods for glycans. Although mass spectrometry (MS) is a key technology in glycan sequencing, MS alone provides limited information about the identification of monosaccharide constituents, their anomericity and their linkages. These features of individual, purified glycans can be partly identified using well-defined glycan-binding proteins, such as lectins and antibodies that recognize specific determinants within glycan structures. RESULTS We present a novel computational approach to automate the sequencing of glycans using metadata-assisted glycan sequencing, which combines MS analyses with glycan structural information from glycan microarray technology. Success in this approach was aided by the generation of a 'virtual glycome' to represent all potential glycan structures that might exist within a metaglycomes based on a set of biosynthetic assumptions using known structural information. We exploited this approach to deduce the structures of soluble glycans within the human milk glycome by matching predicted structures based on experimental data against the virtual glycome. This represents the first meta-glycome to be defined using this method and we provide a publically available web-based application to aid in sequencing milk glycans. AVAILABILITY AND IMPLEMENTATION http://glycomeseq.emory.edu CONTACT sagravat@bidmc.harvard.edu SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Sanjay B Agravat
- Department of Surgery, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA and
| | - Xuezheng Song
- Department of Biochemistry, Emory University School of Medicine, Atlanta, GA, USA
| | - Teerapat Rojsajjakul
- Department of Surgery, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA and
| | - Richard D Cummings
- Department of Surgery, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA and
| | - David F Smith
- Department of Biochemistry, Emory University School of Medicine, Atlanta, GA, USA
| |
Collapse
|
13
|
Zhang Y, Yu CY, Song E, Li SC, Mechref Y, Tang H, Liu X. Identification of Glycopeptides with Multiple Hydroxylysine O-Glycosylation Sites by Tandem Mass Spectrometry. J Proteome Res 2015; 14:5099-108. [DOI: 10.1021/acs.jproteome.5b00299] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Yanlin Zhang
- Department
of Computer Science, City University of Hong Kong, Kowloon, Hong Kong
- Department
of BioHealth Informatics, Indiana University−Purdue University Indianapolis, Indianapolis, Indiana 46202, United States
| | - Chuan-Yih Yu
- School
of Informatics and Computing, Indiana University Bloomington, Bloomington, Indiana 47405, United States
| | - Ehwang Song
- Department
of Chemistry and Biochemistry, Texas Tech University, Lubbock, Texas 79409, United States
| | - Shuai Cheng Li
- Department
of Computer Science, City University of Hong Kong, Kowloon, Hong Kong
| | - Yehia Mechref
- Department
of Chemistry and Biochemistry, Texas Tech University, Lubbock, Texas 79409, United States
| | - Haixu Tang
- School
of Informatics and Computing, Indiana University Bloomington, Bloomington, Indiana 47405, United States
| | - Xiaowen Liu
- Department
of BioHealth Informatics, Indiana University−Purdue University Indianapolis, Indianapolis, Indiana 46202, United States
- Center
for Computational Biology and Bioinformatics, Indiana University School of Medicine, Indianapolis, Indiana 46202, United States
| |
Collapse
|
14
|
Kumozaki S, Sato K, Sakakibara Y. A Machine Learning Based Approach to de novo Sequencing of Glycans from Tandem Mass Spectrometry Spectrum. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2015; 12:1267-1274. [PMID: 26671799 DOI: 10.1109/tcbb.2015.2430317] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Recently, glycomics has been actively studied and various technologies for glycomics have been rapidly developed. Currently, tandem mass spectrometry (MS/MS) is one of the key experimental tools for identification of structures of oligosaccharides. MS/MS can observe MS/MS peaks of fragmented glycan ions including cross-ring ions resulting from internal cleavages, which provide valuable information to infer glycan structures. Thus, the aim of de novo sequencing of glycans is to find the most probable assignments of observed MS/MS peaks to glycan substructures without databases. However, there are few satisfiable algorithms for glycan de novo sequencing from MS/MS spectra. We present a machine learning based approach to de novo sequencing of glycans from MS/MS spectrum. First, we build a suitable model for the fragmentation of glycans including cross-ring ions, and implement a solver that employs Lagrangian relaxation with a dynamic programming technique. Then, to optimize scores for the algorithm, we introduce a machine learning technique called structured support vector machines that enable us to learn parameters including scores for cross-ring ions from training data, i.e., known glycan mass spectra. Furthermore, we implement additional constraints for core structures of well-known glycan types including N-linked glycans and O-linked glycans. This enables us to predict more accurate glycan structures if the glycan type of given spectra is known. Computational experiments show that our algorithm performs accurate de novo sequencing of glycans. The implementation of our algorithm and the datasets are available at http://glyfon.dna.bio.keio.ac.jp/.
Collapse
|
15
|
Meitei NS, Apte A, Snovida SI, Rogers JC, Saba J. Automating mass spectrometry-based quantitative glycomics using aminoxy tandem mass tag reagents with SimGlycan. J Proteomics 2015; 127:211-22. [PMID: 26003531 DOI: 10.1016/j.jprot.2015.05.015] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2015] [Revised: 04/08/2015] [Accepted: 05/14/2015] [Indexed: 11/26/2022]
Abstract
Protein glycosylation is a common post-translational modification, which serves critical roles in the biological processes of organisms. Monitoring of changes in the abundance and structure of glycans may be necessary to explain the correlations between protein glycosylation and various diseases. Hence, the growing importance of glycoproteomics necessitates in-depth qualitative and quantitative studies of glycans. One of the emerging trends in glycomics research is the innovation related to accurate mass spectrometry based quantitative analysis of glycans. Recently, we have introduced aminoxyTMT reagents, which enable efficient relative quantitation of carbohydrates, improved glycan ionization efficiency and increased analytical throughput. These reagents can be used for quantitative analysis of N-glycans by direct infusion or liquid chromatography (LC)-coupled to electrospray ionization mass spectrometry (ESI-MS). However, unlike in proteomics, one of the major challenges left unaddressed is the lack of informatics tools to automate the qualitative and quantitative analysis of generated data. This analysis typically includes identification/quantitation of glycans using MS/MS data and differential analysis across biological samples. We have developed software modules to streamline such protocols for quantitative analysis of aminoxyTMT labeled-glycans derived from complex mixtures. This article is part of a Special Issue entitled: Proteomics in India.
Collapse
|
16
|
Dong L, Shi B, Tian G, Li Y, Wang B, Zhou M. An Accurate de novo Algorithm for Glycan Topology Determination from Mass Spectra. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2015; 12:568-578. [PMID: 26357268 DOI: 10.1109/tcbb.2014.2368981] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Determining the glycan topology automatically from mass spectra represents a great challenge. Existing methods fall into approximate and exact ones. The former including greedy and heuristic ones can reduce the computational complexity, but suffer from information lost in the procedure of glycan interpretation. The latter including dynamic programming and exhaustive enumeration are much slower than the former. In the past years, nearly all emerging methods adopted a tree structure to represent a glycan. They share such problems as repetitive peak counting in reconstructing a candidate structure. Besides, tree-based glycan representation methods often have to give different computational formulas for binary and ternary glycans. We propose a new directed acyclic graph structure for glycan representation. Based on it, this work develops a de novo algorithm to accurately reconstruct the tree structure iteratively from mass spectra with logical constraints and some known biosynthesis rules, by a single computational formula. The experiments on multiple complex glycans extracted from human serum show that the proposed algorithm can achieve higher accuracy to determine a glycan topology than prior methods without increasing computational burden.
Collapse
|
17
|
He L, Xin L, Shan B, Lajoie GA, Ma B. GlycoMaster DB: software to assist the automated identification of N-linked glycopeptides by tandem mass spectrometry. J Proteome Res 2014; 13:3881-95. [PMID: 25113421 DOI: 10.1021/pr401115y] [Citation(s) in RCA: 74] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Glycosylation is one of the most commonly observed post-translational modifications (PTMs) in eukaryotes. It is believed that more than 50% eukaryotic proteins are glycosylated. To reveal the biological functions of protein-linked glycans involved in numerous biological processes, the high-throughput identification of both glycoproteins and the attached glycan structures becomes fundamentally important. Tandem mass spectrometry (MS/MS) is an effective method for glycoproteomic analysis because of its high sensitivity and selectivity. Two experimental approaches exist to obtain MS/MS spectral data of glycopeptides. One consists of isolating glycans from glycopeptides and generating MS/MS spectra of the glycans and peptides separately. The other approach produces spectra directly from intact glycopeptides. The latter approach has the advantage of retaining the glycosylation site information. However, the spectral data cannot be readily analyzed because of the lack of software specifically designed for the identification of intact glycopeptides. To address this need, we developed a novel software tool, GlycoMaster DB, to assist the automated and high-throughput identification of intact N-linked glycopeptides from MS/MS spectra. The software simultaneously searches a protein sequence database and a glycan structure database to find the best pair of peptide and glycan for each input spectrum. GlycoMaster DB can analyze mass spectral data produced with HCD/ETD mixed fragmentation, where HCD spectra are used to identify glycans and ETD spectra are used to determine peptide sequences. When only HCD spectra are available, GlycoMaster DB can still help to identify the glycans, and a list of possible peptide sequences are reported according to the accurate precursor mass and the N-linked glycopeptide sequon. GlycoMaster DB is freely accessible at http://www-novo.cs.uwaterloo.ca:8080/GlycoMasterDB .
Collapse
Affiliation(s)
- Lin He
- David R. Cheriton School of Computer Science, University of Waterloo , Waterloo, Ontario N2L 3G1, Canada
| | | | | | | | | |
Collapse
|
18
|
Woodin CL, Maxon M, Desaire H. Software for automated interpretation of mass spectrometry data from glycans and glycopeptides. Analyst 2013; 138:2793-803. [PMID: 23293784 DOI: 10.1039/c2an36042j] [Citation(s) in RCA: 57] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
The purpose of this review is to provide those interested in glycosylation analysis with the most updated information on the availability of automated tools for MS characterization of N-linked and O-linked glycosylation types. Specifically, this review describes software tools that facilitate elucidation of glycosylation from MS data on the basis of mass alone, as well as software designed to speed the interpretation of glycan and glycopeptide fragmentation from MS/MS data. This review focuses equally on software designed to interpret the composition of released glycans and on tools to characterize N-linked and O-linked glycopeptides. Several websites have been compiled and described that will be helpful to the reader who is interested in further exploring the described tools.
Collapse
Affiliation(s)
- Carrie L Woodin
- Department of Chemistry, University of Kansas, 2030 Becker Drive, Lawrence, KS 66047, USA
| | | | | |
Collapse
|
19
|
Abstract
We present an algorithm for counting glycan topologies of order n that improves on previously described algorithms by a factor n in both time and space. More generally, we provide such an algorithm for counting rooted or unrooted d-ary trees with labels or masses assigned to the vertices, and we give a "recipe" to estimate the asymptotic growth of the resulting sequences. We provide constants for the asymptotic growth of d-ary trees and labeled quaternary trees (glycan topologies). Finally, we show how a classical result from enumeration theory can be used to count glycan structures where edges are labeled by bond types. Our method also improves time bounds for counting alkanes.
Collapse
|
20
|
Campbell MP, Nguyen-Khuong T, Hayes CA, Flowers SA, Alagesan K, Kolarich D, Packer NH, Karlsson NG. Validation of the curation pipeline of UniCarb-DB: building a global glycan reference MS/MS repository. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2013; 1844:108-16. [PMID: 23624262 DOI: 10.1016/j.bbapap.2013.04.018] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/31/2012] [Revised: 04/01/2013] [Accepted: 04/16/2013] [Indexed: 10/26/2022]
Abstract
The UniCarb-DB database is an emerging public glycomics data repository, containing over 500 tandem mass spectra (as of March 2013) of glycans released from glycoproteins. A major challenge in glycomics research is to provide and maintain high-quality datasets that will offer the necessary diversity to support the development of accurate bioinformatics tools for data deposition and analysis. The role of UniCarb-DB, as an archival database, is to provide the glycomics community with open-access to a comprehensive LC MS/MS library of N- and O- linked glycans released from glycoproteins that have been annotated with glycosidic and cross-ring fragmentation ions, retention times, and associated experimental metadata descriptions. Here, we introduce the UniCarb-DB data submission pipeline and its practical application to construct a library of LC-MS/MS glycan standards that forms part of this database. In this context, an independent consortium of three laboratories was established to analyze the same 23 commercially available oligosaccharide standards, all by using graphitized carbon-liquid chromatography (LC) electrospray ionization (ESI) ion trap mass spectrometry in the negative ion mode. A dot product score was calculated for each spectrum in the three sets of data as a measure of the comparability that is necessary for use of such a collection in library-based spectral matching and glycan structural identification. The effects of charge state, de-isotoping and threshold levels on the quality of the input data are shown. The provision of well-characterized oligosaccharide fragmentation data provides the opportunity to identify determinants of specific glycan structures, and will contribute to the confidence level of algorithms that assign glycan structures to experimental MS/MS spectra. This article is part of a Special Issue entitled: Computational Proteomics in the Post-Identification Era. Guest Editors: Martin Eisenacher and Christian Stephan.
Collapse
Affiliation(s)
- Matthew P Campbell
- Biomolecular Frontiers Research Centre, Macquarie University, Sydney, New South Wales, Australia
| | | | | | | | | | | | | | | |
Collapse
|
21
|
Li F, Glinskii OV, Glinsky VV. Glycobioinformatics: Current strategies and tools for data mining in MS-based glycoproteomics. Proteomics 2012; 13:341-54. [DOI: 10.1002/pmic.201200149] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2012] [Revised: 10/06/2012] [Accepted: 11/06/2012] [Indexed: 12/18/2022]
|
22
|
Xu G, Liu X, Liu QY, Zhou Y, Li J. Improve accuracy and sensibility in glycan structure prediction by matching glycan isotope abundance. Anal Chim Acta 2012; 743:80-9. [DOI: 10.1016/j.aca.2012.07.009] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2012] [Revised: 06/13/2012] [Accepted: 07/10/2012] [Indexed: 11/30/2022]
|
23
|
Dallas DC, Martin WF, Hua S, German JB. Automated glycopeptide analysis--review of current state and future directions. Brief Bioinform 2012; 14:361-74. [PMID: 22843980 DOI: 10.1093/bib/bbs045] [Citation(s) in RCA: 68] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Glycosylation of proteins is involved in immune defense, cell-cell adhesion, cellular recognition and pathogen binding and is one of the most common and complex post-translational modifications. Science is still struggling to assign detailed mechanisms and functions to this form of conjugation. Even the structural analysis of glycoproteins-glycoproteomics-remains in its infancy due to the scarcity of high-throughput analytical platforms capable of determining glycopeptide composition and structure, especially platforms for complex biological mixtures. Glycopeptide composition and structure can be determined with high mass-accuracy mass spectrometry, particularly when combined with chromatographic separation, but the sheer volume of generated data necessitates computational software for interpretation. This review discusses the current state of glycopeptide assignment software-advances made to date and issues that remain to be addressed. The various software and algorithms developed so far provide important insights into glycoproteomics. However, there is currently no freely available software that can analyze spectral data in batch and unambiguously determine glycopeptide compositions for N- and O-linked glycopeptides from relevant biological sources such as human milk and serum. Few programs are capable of aiding in structural determination of the glycan component. To significantly advance the field of glycoproteomics, analytical software and algorithms are required that: (i) solve for both N- and O-linked glycopeptide compositions, structures and glycosites in biological mixtures; (ii) are high-throughput and process data in batches; (iii) can interpret mass spectral data from a variety of sources and (iv) are open source and freely available.
Collapse
|
24
|
Lütteke T. The use of glycoinformatics in glycochemistry. Beilstein J Org Chem 2012; 8:915-29. [PMID: 23015842 PMCID: PMC3388882 DOI: 10.3762/bjoc.8.104] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2012] [Accepted: 05/29/2012] [Indexed: 01/10/2023] Open
Abstract
Glycoinformatics is a small but growing branch of bioinformatics and chemoinformatics. Various resources are now available that can be of use to glycobiologists, but also to chemists who work on the synthesis or analysis of carbohydrates. This article gives an overview of existing glyco-specific databases and tools, with a focus on their application to glycochemistry: Databases can provide information on candidate glycan structures for synthesis, or on glyco-enzymes that can be used to synthesize carbohydrates. Statistical analyses of glycan databases help to plan glycan synthesis experiments. 3D-Structural data of protein-carbohydrate complexes are used in targeted drug design, and tools to support glycan structure analysis aid with quality control. Specific problems of glycoinformatics compared to bioinformatics for genomics or proteomics, especially concerning integration and long-term maintenance of the existing glycan databases, are also discussed.
Collapse
Affiliation(s)
- Thomas Lütteke
- Justus-Liebig-University Gießen, Institute of Veterinary Physiology and Biochemistry, Frankfurter Str. 100, 35392 Gießen, Germany
| |
Collapse
|
25
|
Bauer S. Mass spectrometry for characterizing plant cell wall polysaccharides. FRONTIERS IN PLANT SCIENCE 2012; 3:45. [PMID: 22645587 PMCID: PMC3355817 DOI: 10.3389/fpls.2012.00045] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/30/2012] [Accepted: 02/23/2012] [Indexed: 05/23/2023]
Abstract
Mass spectrometry is a selective and powerful technique to obtain identification and structural information on compounds present in complex mixtures. Since it requires only small sample amount it is an excellent tool for researchers interested in detecting changes in composition of complex carbohydrates of plants. This mini-review gives an overview of common mass spectrometry techniques applied to the analysis of plant cell wall carbohydrates. It presents examples in which mass spectrometry has been used to elucidate the structure of oligosaccharides derived from hemicelluloses and pectins and illustrates how information on sequence, linkages, branching, and modifications are obtained from characteristic fragmentation patterns.
Collapse
Affiliation(s)
- Stefan Bauer
- Energy Biosciences Institute, University of CaliforniaBerkeley, CA, USA
| |
Collapse
|
26
|
Robinson LN, Artpradit C, Raman R, Shriver ZH, Ruchirawat M, Sasisekharan R. Harnessing glycomics technologies: integrating structure with function for glycan characterization. Electrophoresis 2012; 33:797-814. [PMID: 22522536 PMCID: PMC3743516 DOI: 10.1002/elps.201100231] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Glycans, or complex carbohydrates, are a ubiquitous class of biological molecule which impinge on a variety of physiological processes ranging from signal transduction to tissue development and microbial pathogenesis. In comparison to DNA and proteins, glycans present unique challenges to the study of their structure and function owing to their complex and heterogeneous structures and the dominant role played by multivalency in their sequence-specific biological interactions. Arising from these challenges, there is a need to integrate information from multiple complementary methods to decode structure-function relationships. Focusing on acidic glycans, we describe here key glycomics technologies for characterizing their structural attributes, including linkage, modifications, and topology, as well as for elucidating their role in biological processes. Two cases studies, one involving sialylated branched glycans and the other sulfated glycosaminoglycans, are used to highlight how integration of orthogonal information from diverse datasets enables rapid convergence of glycan characterization for development of robust structure-function relationships.
Collapse
Affiliation(s)
- Luke N. Robinson
- Department of Biological Engineering, Harvard-MIT Division of Health Sciences & Technology and Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA
| | - Charlermchai Artpradit
- Program in Applied Biological Sciences: Environmental Health, Chulabhorn Graduate Institute, Bangkok, Thailand
| | - Rahul Raman
- Department of Biological Engineering, Harvard-MIT Division of Health Sciences & Technology and Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA
| | - Zachary H. Shriver
- Department of Biological Engineering, Harvard-MIT Division of Health Sciences & Technology and Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA
| | - Mathuros Ruchirawat
- Program in Applied Biological Sciences: Environmental Health, Chulabhorn Graduate Institute, Bangkok, Thailand
- Laboratory of Environmental Toxicology, Chulabhorn Research Institute, Bangkok, Thailand
| | - Ram Sasisekharan
- Department of Biological Engineering, Harvard-MIT Division of Health Sciences & Technology and Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA
| |
Collapse
|
27
|
An HJ, Lebrilla CB. Structure elucidation of native N- and O-linked glycans by tandem mass spectrometry (tutorial). MASS SPECTROMETRY REVIEWS 2011; 30:560-578. [PMID: 21656841 DOI: 10.1002/mas.20283] [Citation(s) in RCA: 73] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
Oligosaccharides play important roles in many biological processes. However, the structural elucidation of oligosaccharides remains a major challenge due to the complexities of their structures. Mass spectrometry provides a powerful method for determining oligosaccharide composition. Tandem mass spectrometry (MS) provides structural information with high sensitivity. Oligosaccharide structures differ from other polymers such as peptides because of the large number of linkage combinations and branching. This complexity makes the analysis of oligosaccharide unique from that of peptides. This tutorial addresses the issue of spectral interpretation of tandem MS under conditions of collision-induced dissociation (CID) and infrared multiphoton dissociation (IRMPD). The proper interpretation of tandem MS data can provide important structural information on different types of oligosaccharides including O- and N-linked.
Collapse
Affiliation(s)
- Hyun Joo An
- Department of Chemistry, University of California, Davis, USA
| | | |
Collapse
|
28
|
Böcker S, Kehr B, Rasche F. Determination of glycan structure from tandem mass spectra. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2011; 8:976-986. [PMID: 21173459 DOI: 10.1109/tcbb.2010.129] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
Glycans are molecules made from simple sugars that form complex tree structures. Glycans constitute one of the most important protein modifications and identification of glycans remains a pressing problem in biology. Unfortunately, the structure of glycans is hard to predict from the genome sequence of an organism. In this paper, we consider the problem of deriving the topology of a glycan solely from tandem mass spectrometry (MS) data. We study, how to generate glycan tree candidates that sufficiently match the sample mass spectrum, avoiding the combinatorial explosion of glycan structures. Unfortunately, the resulting problem is known to be computationally hard. We present an efficient exact algorithm for this problem based on fixed-parameter algorithmics that can process a spectrum in a matter of seconds. We also report some preliminary results of our method on experimental data, combining it with a preliminary candidate evaluation scheme. We show that our approach is fast in applications, and that we can reach very well de novo identification results. Finally, we show how to count the number of glycan topologies for a fixed size or a fixed mass. We generalize this result to count the number of (labeled) trees with bounded out degree, improving on results obtained using Pólya's enumeration theorem.
Collapse
Affiliation(s)
- Sebastian Böcker
- Faculty for Mathematics and Computer Science, Friedrich-Schiller-Universität Jena, Ernst-Abbe-Platz 2, Jena 07743, Germany.
| | | | | |
Collapse
|
29
|
Rakus JF, Mahal LK. New technologies for glycomic analysis: toward a systematic understanding of the glycome. ANNUAL REVIEW OF ANALYTICAL CHEMISTRY (PALO ALTO, CALIF.) 2011; 4:367-392. [PMID: 21456971 DOI: 10.1146/annurev-anchem-061010-113951] [Citation(s) in RCA: 77] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
Carbohydrates are the most difficult class of biological molecules to study by high-throughput methods owing to the chemical similarities between the constituent monosaccharide building blocks, template-less biosynthesis, and the lack of clearly identifiable consensus sequences for the glycan modification of cohorts of glycoproteins. These molecules are crucial for a wide variety of cellular processes ranging from cell-cell communication to immunity, and they are altered in disease states such as cancer and inflammation. Thus, there has been a dedicated effort to develop glycan analysis into a high-throughput analytical field termed glycomics. Herein we highlight major advances in applying separation, mass spectrometry, and microarray methods to the fields of glycomics and glycoproteomics. These new analytical techniques are rapidly advancing our understanding of the importance of glycosylation in biology and disease.
Collapse
Affiliation(s)
- John F Rakus
- Department of Chemistry, New York University, New York, New York 10003, USA.
| | | |
Collapse
|
30
|
Frank M, Schloissnig S. Bioinformatics and molecular modeling in glycobiology. Cell Mol Life Sci 2010; 67:2749-72. [PMID: 20364395 PMCID: PMC2912727 DOI: 10.1007/s00018-010-0352-4] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2009] [Revised: 03/08/2010] [Accepted: 03/11/2010] [Indexed: 12/11/2022]
Abstract
The field of glycobiology is concerned with the study of the structure, properties, and biological functions of the family of biomolecules called carbohydrates. Bioinformatics for glycobiology is a particularly challenging field, because carbohydrates exhibit a high structural diversity and their chains are often branched. Significant improvements in experimental analytical methods over recent years have led to a tremendous increase in the amount of carbohydrate structure data generated. Consequently, the availability of databases and tools to store, retrieve and analyze these data in an efficient way is of fundamental importance to progress in glycobiology. In this review, the various graphical representations and sequence formats of carbohydrates are introduced, and an overview of newly developed databases, the latest developments in sequence alignment and data mining, and tools to support experimental glycan analysis are presented. Finally, the field of structural glycoinformatics and molecular modeling of carbohydrates, glycoproteins, and protein-carbohydrate interaction are reviewed.
Collapse
Affiliation(s)
- Martin Frank
- Molecular Structure Analysis Core Facility-W160, Deutsches Krebsforschungszentrum (German Cancer Research Centre), 69120 Heidelberg, Germany.
| | | |
Collapse
|
31
|
Apte A, Meitei NS. Bioinformatics in glycomics: glycan characterization with mass spectrometric data using SimGlycan. Methods Mol Biol 2010; 600:269-81. [PMID: 19882135 DOI: 10.1007/978-1-60761-454-8_19] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
Mass spectrometry (MS), with its low sample requirement and high sensitivity, has been the predominantly used methodology for characterization and elucidation of glycan structures. However, manual interpretation of MS data is complex and tedious due to large number of product ions observed and also due to the variation in their m/z values under various experimental conditions. We present an automated tool, SimGlycan, for this purpose, which accepts raw/standard MS data files as input and characterizes the associated glycan structure with high accuracy using database searching and scoring techniques. Not only does it predict the glycan structure using an MS/MS database searching technique, but it also facilitates predicting novel glycans by drawing a glycan and mapping it onto an experimental spectrum to check the degree of proximity between the theoretical and the experimental glycans. It serves as a platform for developing advanced tools that may be used for glycopeptide identification using MS data and 3D structural analysis of glycans with a few improvements in the existing features.
Collapse
Affiliation(s)
- Arun Apte
- PREMIER Biosoft International, Palo Alto, CA, USA
| | | |
Collapse
|
32
|
Abstract
Carbohydrates exhibit many physiologically and pharmacologically important activities, yet their complicated structure and sequence pose major analytical challenges. Although their structural complexity makes analysis of carbohydrate difficult, mass spectrometry (MS) with high sensitivity, resolution and accuracy has become a vital tool in many applications related to analysis of carbohydrates or oligosaccharides. This application is essentially based on soft ionization technique which facilitates the ionization and vaporization of large, polar and thermally labile biomolecules. Electrospray-ionization (ESI), one of the soft ionization technique, tandem MS has been used in the sequencing of peptides, proteins, lipids, nucleic acids and more recently carbohydrates. The development of the ESI and tandem MS has begun to make carbohydrate analysis more routine. This review will focus on the application of the ESI tandem MS for the sequence analysis of native oligosaccharides, including neutral saccharides with multiple linkages, and the uronic acid polymers, alginate and glycosaminoglycans structures containing epimers.
Collapse
Affiliation(s)
- Zhenqing Zhang
- Departments of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Studies, Rensselaer Polytechnic Institute, Troy, New York 12180, USA
| | - Robert J. Linhardt
- Departments of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Studies, Rensselaer Polytechnic Institute, Troy, New York 12180, USA
| |
Collapse
|
33
|
Peltoniemi H, Joenväärä S, Renkonen R. De novo glycan structure search with the CID MS/MS spectra of native N-glycopeptides. Glycobiology 2009; 19:707-14. [PMID: 19270074 DOI: 10.1093/glycob/cwp034] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The aim of our study is to automatically analyze the glycan and peptide structures of N-glycopeptides without a need to release glycans from the glycopeptides. Our wet laboratory raw data represent a series of MS/MS mass spectra obtained from a reverse-phase liquid chromatography run of size-exclusion-enriched tryptic-digested glycopeptides from glycoproteins. The MS/MS spectra are first analyzed in order to identify glycosylated peptides and N-glycan monosaccharide compositions present on each glycopeptide. We further developed a Branch-and-Bound algorithm to search de novo N-glycan structures, i.e., monosaccharide compositions and their ordered sequences from native glycopeptides. Our de novo algorithm is based on iterative growth and selection of a population of glycan structures and it does not use databases of known glycan structures. We validate the algorithm with (i) in silico-generated spectra, with or without deteriorating deletions, (ii) with a purified glycoprotein transferrin, and (iii) with a complex mixture of N-glycopeptides enriched from human plasma. Our Branch-and-Bound algorithm depicted glycan structures from all the above-mentioned three input data types. Due to the large diversity of glycan structures, the results typically contained several proposed structures matching almost equally well to the spectra. In conclusion, this algorithm automatically identifies glycopeptides and their structures from the MS/MS spectra and thus greatly reduces the number of possible glycan structures from the vast amount of potential ones.
Collapse
Affiliation(s)
- Hannu Peltoniemi
- Transplantation Laboratory & Infection Biology Research Program, Haartman Institute, University of Helsinki, Finland
| | | | | |
Collapse
|
34
|
|
35
|
Goldberg D, Bern M, North SJ, Haslam SM, Dell A. Glycan family analysis for deducing N-glycan topology from single MS. ACTA ACUST UNITED AC 2008; 25:365-71. [PMID: 19073587 DOI: 10.1093/bioinformatics/btn636] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
MOTIVATION In the past few years, mass spectrometry (MS) has emerged as the premier tool for identification and quantification of biological molecules such as peptides and glycans. There are two basic strategies: single-MS, which uses a single round of mass analysis, and MS/MS (or higher order MS(n)), which adds one or more additional rounds of mass analysis, interspersed with fragmentation steps. Single-MS offers higher throughput, broader mass coverage and more direct quantitation, but generally much weaker identification. Single-MS, however, does work fairly well for the case of N-glycan identification, which are more constrained than other biological polymers. We previously demonstrated single-MS identification of N-glycans to the level of 'cartoons' (monosaccharide composition and topology) by a system that incorporates an expert's detailed knowledge of the biological sample. In this article, we explore the possibility of ab initio single-MS N-glycan identification, with the goal of extending single-MS, or primarily-single-MS, identification to non-expert users, novel conditions and unstudied tissues. RESULTS We propose and test three cartoon-assignment algorithms that make inferences informed by biological knowledge about glycan synthesis. To test the algorithms, we used 71 single-MS spectra from a variety of tissues and organisms, containing more than 2800 manually annotated peaks. The most successful of the algorithms computes the most richly connected subgraph within a 'cartoon graph'. This algorithm uniquely assigns the correct cartoon to more than half of the peaks in 41 out of the 71 spectra.
Collapse
Affiliation(s)
- David Goldberg
- Palo Alto Research Center, 3333 Coyote Hill Rd, Palo Alto CA 94304, USA.
| | | | | | | | | |
Collapse
|
36
|
Haslam SM, Julien S, Burchell JM, Monk CR, Ceroni A, Garden OA, Dell A. Characterizing the glycome of the mammalian immune system. Immunol Cell Biol 2008; 86:564-73. [PMID: 18725885 DOI: 10.1038/icb.2008.54] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
The outermost layer of all immune cells, the glycocalyx, is composed of a complex mixture of glycoproteins, glycolipids and lectins, which specifically recognize particular glycan epitopes. As the glycocalyx is the cell's primary interface with the external environment many biologically significant events can be attributed to glycan recognition. For this reason the rapidly expanding glycomics field is being increasingly recognized as an important component in our quest to better understand the functioning of the immune system. In this review, we highlight the current status of immune cell glycomics, with particular attention being paid to T- and B-lymphocytes and dendritic cells. We also describe the strategies and methodologies used to define immune cell glycomes.
Collapse
Affiliation(s)
- Stuart M Haslam
- Division of Molecular Biosciences, Imperial College London, and Breast Cancer Biology Group, Guy's Hospital, London, UK
| | | | | | | | | | | | | |
Collapse
|
37
|
Ceroni A, Maass K, Geyer H, Geyer R, Dell A, Haslam SM. GlycoWorkbench: a tool for the computer-assisted annotation of mass spectra of glycans. J Proteome Res 2008; 7:1650-9. [PMID: 18311910 DOI: 10.1021/pr7008252] [Citation(s) in RCA: 802] [Impact Index Per Article: 50.1] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Mass spectrometry is the main analytical technique currently used to address the challenges of glycomics as it offers unrivalled levels of sensitivity and the ability to handle complex mixtures of different glycan variations. Determination of glycan structures from analysis of MS data is a major bottleneck in high-throughput glycomics projects, and robust solutions to this problem are of critical importance. However, all the approaches currently available have inherent restrictions to the type of glycans they can identify, and none of them have proved to be a definitive tool for glycomics. GlycoWorkbench is a software tool developed by the EUROCarbDB initiative to assist the manual interpretation of MS data. The main task of GlycoWorkbench is to evaluate a set of structures proposed by the user by matching the corresponding theoretical list of fragment masses against the list of peaks derived from the spectrum. The tool provides an easy to use graphical interface, a comprehensive and increasing set of structural constituents, an exhaustive collection of fragmentation types, and a broad list of annotation options. The aim of GlycoWorkbench is to offer complete support for the routine interpretation of MS data. The software is available for download from: http://www.eurocarbdb.org/applications/ms-tools.
Collapse
Affiliation(s)
- Alessio Ceroni
- Division of Molecular Biosciences, Imperial College London, London SW72AZ, UK
| | | | | | | | | | | |
Collapse
|
38
|
Joenväärä S, Ritamo I, Peltoniemi H, Renkonen R. N-Glycoproteomics – An automated workflow approach. Glycobiology 2008; 18:339-49. [DOI: 10.1093/glycob/cwn013] [Citation(s) in RCA: 57] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
|
39
|
Deguchi K. TRENDS GLYCOSCI GLYC 2008; 20:81-95. [DOI: 10.4052/tigg.20.81] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
|
40
|
Goldberg D, Bern M, Parry S, Sutton-Smith M, Panico M, Morris HR, Dell A. Automated N-glycopeptide identification using a combination of single- and tandem-MS. J Proteome Res 2007; 6:3995-4005. [PMID: 17727280 DOI: 10.1021/pr070239f] [Citation(s) in RCA: 84] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
We describe Peptoonist, a program that can automatically identify the glycans (sugars) present at each N-glycosylation site of a protein. The input to Peptoonist is a series of mass spectra, both MS and MS/MS, obtained from a liquid chromatography (LC) run of proteolytically digested purified glycoproteins. The program uses MS/MS to identify glycosylated peptides and single-MS to identify the N-glycans present on each of these peptides, at least to the level of monosaccharide composition. We validate the program on an LC run of mouse zona pellucida proteins that had been intensively hand annotated by a human expert. Our program doubled the number of glycopeptide identifications, and also found several possible errors in the hand annotation. In addition, it automatically made most of the same glycan isomer identifications as the expert annotator.
Collapse
Affiliation(s)
- David Goldberg
- Palo Alto Research Center, Palo Alto, California 94301, USA.
| | | | | | | | | | | | | |
Collapse
|
41
|
Ren JM, Rejtar T, Li L, Karger BL. N-Glycan structure annotation of glycopeptides using a linearized glycan structure database (GlyDB). J Proteome Res 2007; 6:3162-73. [PMID: 17625816 PMCID: PMC2557434 DOI: 10.1021/pr070111y] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
While glycoproteins are abundant in nature, and changes in glycosylation occur in cancer and other diseases, glycoprotein characterization remains a challenge due to the structural complexity of the biopolymers. This paper presents a general strategy, termed GlyDB, for glycan structure annotation of N-linked glycopeptides from tandem mass spectra in the LC-MS analysis of proteolytic digests of glycoproteins. The GlyDB approach takes advantage of low-energy collision-induced dissociation of N-linked glycopeptides that preferentially cleaves the glycosidic bonds while the peptide backbone remains intact. A theoretical glycan structure database derived from biosynthetic rules for N-linked glycans was constructed employing a novel representation of branched glycan structures consisting of multiple linear sequences. The commonly used peptide identification program, Sequest, could then be utilized to assign experimental tandem mass spectra to individual glycoforms. Analysis of synthetic glycopeptides and well-characterized glycoproteins demonstrate that the GlyDB approach can be a useful tool for annotation of glycan structures and for selection of a limited number of potential glycan structure candidates for targeted validation.
Collapse
|
42
|
Dwivedi P, Bendiak B, Clowers BH, Hill HH. Rapid resolution of carbohydrate isomers by electrospray ionization ambient pressure ion mobility spectrometry-time-of-flight mass spectrometry (ESI-APIMS-TOFMS). JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2007; 18:1163-75. [PMID: 17532226 DOI: 10.1016/j.jasms.2007.04.007] [Citation(s) in RCA: 94] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/23/2006] [Revised: 04/09/2007] [Accepted: 04/11/2007] [Indexed: 05/14/2023]
Abstract
Carbohydrates are an extremely complex group of isomeric molecules that have been difficult to analyze in the gas phase by mass spectrometry because (1) precursor ions and product ions to successive stages of MS(n) are frequently mixtures of isomers, and (2) detailed information about the anomeric configuration and location of specific stereochemical variants of monosaccharides within larger molecules has not been possible to obtain in a general way. Herein, it is demonstrated that gas-phase analyses by direct combination of electrospray ionization, ambient pressure ion mobility spectrometry, and time-of-flight mass spectrometry (ESI-APIMS-TOFMS) provides sufficient resolution to separate different anomeric methyl glycosides and to separate different stereoisomeric methyl glycosides having the same anomeric configuration. Reducing sugars were typically resolved into more than one peak, which might represent separation of cyclic species having different anomeric configurations and/or ring forms. The extent of separation, both with methyl glycosides and reducing sugars, was significantly affected by the nature of the drift gas and by the nature of an adducting metal ion or ion complex. The study demonstrated that ESI-APIMS-TOFMS is a rapid and effective analytical technique for the separation of isomeric methyl glycosides and simple sugars, and can be used to differentiate glycosides having different anomeric configurations.
Collapse
Affiliation(s)
- Prabha Dwivedi
- Department of Chemistry, Washington State University, Pullman, Washington 99164-4630, USA
| | | | | | | |
Collapse
|
43
|
Lapadula AJ, Hatcher PJ, Hanneman AJ, Ashline DJ, Zhang H, Reinhold VN. Congruent strategies for carbohydrate sequencing. 3. OSCAR: an algorithm for assigning oligosaccharide topology from MSn data. Anal Chem 2007; 77:6271-9. [PMID: 16194088 PMCID: PMC1435829 DOI: 10.1021/ac050726j] [Citation(s) in RCA: 101] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
This is the third in a sequence of reports devoted to the development of congruent strategies for carbohydrate sequencing. Two previous reports outlined the strategies for observing structural detail from MSn data and introduced tools that compile, search, and compare fragment spectra in a bottom-up approach to oligosaccharide sequencing. In this third report, we introduce the operational details of an algorithm that we define as the Oligosaccharide Subtree Constraint Algorithm (OSCAR). This algorithm assimilates analyst-selected MSn ion fragmentation pathways into oligosaccharide topology (branching and linkage) using what may be considered a top-down sequencing strategy. Guided by a series of logical constraints, this de novo algorithm provides molecular topology without presumed biosynthetic constraints or external comparisons. In this introductory study, OSCAR is applied to a series of permethylated oligomers and isomeric glycans, and topologies are assigned in a few hundredths of a second.
Collapse
Affiliation(s)
- Anthony J Lapadula
- Department of Chemistry, University of New Hampshire Durham, New Hampshire 03824, USA
| | | | | | | | | | | |
Collapse
|
44
|
Deguchi K, Ito H, Baba T, Hirabayashi A, Nakagawa H, Fumoto M, Hinou H, Nishimura SI. Structural analysis of O-glycopeptides employing negative- and positive-ion multi-stage mass spectra obtained by collision-induced and electron-capture dissociations in linear ion trap time-of-flight mass spectrometry. RAPID COMMUNICATIONS IN MASS SPECTROMETRY : RCM 2007; 21:691-8. [PMID: 17279605 DOI: 10.1002/rcm.2885] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]
Abstract
Structural analyses of various glycans attached to proteins and peptides are highly desirable for elucidating their biological roles. An approach based on mass spectrometry (MS) combining both collision-induced dissociation (CID) and electron-capture dissociation (ECD) in the positive- and negative-ion modes has been proposed as a simple and direct method of assigning an O-glycan without releasing it from the peptide and of determining the amino acid sequence of the peptide and glycosylation site. The instrument used is an electrospray ionization (ESI) linear ion trap (LIT) time-of-flight (TOF) mass spectrometer with tandem LITs for CID by He gas and ECD. The proposed approach was tested with two synthetic O-glycopeptides binding a sialyl Lewis x (sLe(x)) oligosaccharide and a 3'-sialyl N-acetyllactosamine (3'-SLN) on a serine (S) residue. In the negative-ion mode, the CID MS(2) spectra of O-glycopeptides showed a relatively abundant glycoside-bond cleavage between the core N-acetylglucosamine (GlcNAc) and serine (S) that yields deprotonated C(3)-type fragment ions of O-glycan and deprotonated Z(0)-type peptide ions. The structure of the sLe(x) (3'-SLN) oligosaccharide was simply assigned by comparing the CID MS(3) spectrum derived from the C(3)-type fragment ion with the CID MS(2) spectra of the sLe(x) and sLe(a) (3'- and 6'-SLN) standards (i.e., negative-ion MS(n) spectral matching). The amino acid sequence of the peptide including the glycosylation site was determined from the ECD MS(2) spectrum in the positive-ion mode.
Collapse
Affiliation(s)
- Kisaburo Deguchi
- Graduate School of Advanced Life Science, Hokkaido University, Sapporo 001-0021, Japan.
| | | | | | | | | | | | | | | |
Collapse
|
45
|
Morelle W, Canis K, Chirat F, Faid V, Michalski JC. The use of mass spectrometry for the proteomic analysis of glycosylation. Proteomics 2006; 6:3993-4015. [PMID: 16786490 DOI: 10.1002/pmic.200600129] [Citation(s) in RCA: 180] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Of all protein PTMs, glycosylation is by far the most common, and is a target for proteomic research. Glycosylation plays key roles in controlling various cellular processes and the modifications of the glycan structures in diseases highlight the clinical importance of this PTM. Glycosylation analysis remains a difficult task. MS, in combination with modern separation methodologies, is one of the most powerful and versatile techniques for the structural analysis of glycoconjugates. This review describes methodologies based on MS for detailed characterization of glycoconjugates in complex biological samples at the sensitivity required for proteomic work.
Collapse
Affiliation(s)
- Willy Morelle
- Unité Mixte de Recherche CNRS/USTL 8576, Université des Sciences et Technologies de Lille 1, Villeneuve d'Ascq Cedex, France.
| | | | | | | | | |
Collapse
|
46
|
von der Lieth C. An Endorsement to Create Open Access Databases for Analytical Data of Complex Carbohydrates. J Carbohydr Chem 2006. [DOI: 10.1081/car-200030093] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
Affiliation(s)
- Claus‐W. von der Lieth
- a German Cancer Research Center, Central Spectroscopic Department B090 , Im Neuenheimer Feld 280, D‐69120, Heidelberg, Germany
| |
Collapse
|
47
|
Tang H, Mechref Y, Novotny MV. Automated interpretation of MS/MS spectra of oligosaccharides. Bioinformatics 2006; 21 Suppl 1:i431-9. [PMID: 15961488 PMCID: PMC1513159 DOI: 10.1093/bioinformatics/bti1038] [Citation(s) in RCA: 91] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION The emerging glycomics and glycoproteomics projects aim to characterize all forms of glycoproteins in different tissues and organisms. Tandem mass spectrometry (MS/MS) is the key experimental methodology for high-throughput glycan identification and characterization. Fragmentation of glycans from high energy collision-induced dissociation generates ions from glycosidic as well as internal cleavages. The cross-ring ions resulting from internal cleavages provide additional information that is important to reveal the type of linkage between monosaccharides. This information, however, is not incorporated into the current programs for analyzing glycan mass spectra. As a result, they can rarely distinguish from the mass spectra isomeric oligosaccharides, which have the same saccharide composition but different types of sequences, branches or linkages. RESULTS In this paper, we describe a novel algorithm for glycan characterization using MS/MS. This algorithm consists of three steps. First, we develop a scoring scheme to identify potential bond linkages between monosaccharides, based on the appearance pattern of cross-ring ions. Next, we use a dynamic programming algorithm to determine the most probable oligosaccharide structures from the mass spectrum. Finally, we re-evaluate these oligosaccharide structures, taking into account the double fragmentation ions. We also show the preliminary results of testing our algorithm on several MS/MS spectra of oligosaccharides. AVAILABILITY The program GLYCH is available upon request from the authors.
Collapse
Affiliation(s)
- Haixu Tang
- School of Informatics, Indiana University Bloomington, IN, USA.
| | | | | |
Collapse
|
48
|
Goldberg D, Bern M, Li B, Lebrilla CB. Automatic determination of O-glycan structure from fragmentation spectra. J Proteome Res 2006; 5:1429-34. [PMID: 16739994 PMCID: PMC2570313 DOI: 10.1021/pr060035j] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Glycosylation is one of the most important classes of post-translational protein modifications, but the identification of glycans is difficult because of their branched structures and numerous isomers. We describe an algorithm called CartoonistTwo that proposes structures for O-linked glycans by automatically analyzing fragmentation mass spectra. CartoonistTwo improves upon previous glycan identification software primarily in its scoring function, which can more successfully distinguish among a number of similar structures. CartoonistTwo was designed and tested with FTICR mass spectra, and includes automatic recalibration and peak selection especially tuned for such data, yet it can be easily adapted to fragmentation spectra (MS2 or MSn) from other instrument types. On a validated test set of 34 SORI-CID MSn FTICR spectra from Xenopus egg jelly, CartoonistTwo gave the manually determined structural assignment either the first or second highest score over 90% of the time. And for over 50% of these spectra, CartoonistTwo selected a unique highest scoring structure that agreed with the manually determined one.
Collapse
|
49
|
Harvey DJ. Proteomic analysis of glycosylation: structural determination of N- and O-linked glycans by mass spectrometry. Expert Rev Proteomics 2006; 2:87-101. [PMID: 15966855 DOI: 10.1586/14789450.2.1.87] [Citation(s) in RCA: 123] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
This review summarizes the methods, mainly based on mass spectrometry, for the structural determination of N- and O-linked carbohydrates that are post-translationally attached to a large number of proteins and which play a key role in determining the function and biophysical properties of these compounds. Analysis of these carbohydrates has proved difficult in the past due to their structural complexity. However, modern analytical methods such as mass spectrometry have the ability to elucidate most structural details at the concentration levels required for proteomics. This review describes methods for direct examination of glycoproteins by mass spectrometry, the release of N- and O-linked glycans from glycoproteins separated in sodium dodecyl sulfate polyacrylamide electrophoresis gels, and the analysis of these compounds by techniques such as matrix-assisted laser desorption/ionization and electrospray ionization mass spectrometry. Matrix-assisted laser desorption/ionization mass spectrometry provides the most rapid method for comparing glycan profiles and is probably most appropriate for clinical studies. One of the most promising techniques for determining the structures of N-glycans in proteomic studies is negative ion fragmentation of electrosprayed ions. This technique combines high throughput with ease of structural interpretation and provides structural details that are difficult to obtain by classical methods.
Collapse
Affiliation(s)
- David J Harvey
- Department of Biochemistry, Glycobiology Institute, University of Oxford, South Parks Road, Oxford, OX1 3QU, UK.
| |
Collapse
|
50
|
Adamson JT, Håkansson K. Infrared Multiphoton Dissociation and Electron Capture Dissociation of High-Mannose Type Glycopeptides. J Proteome Res 2006; 5:493-501. [PMID: 16512663 DOI: 10.1021/pr0504081] [Citation(s) in RCA: 72] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The combination of electron capture dissociation (ECD) and infrared multiphoton dissociation (IRMPD) for the structural characterization of high-mannose type glycopeptides is explored in depth for the first time. Contrary to previous applications to other glycan types, our analyses reveal that IRMPD does not necessarily selectively induce glycan cleavage in high-mannose type glycopeptides; rather peptide backbone cleavage can effectively compete with glycosidic cleavage. Poor glycan cleavage with IRMPD is due to a higher gas-phase stability of mannose-linking glycosidic bonds. This reasoning also explains mannose cleavage patterns observed for a xylose type glycopeptide with IRMPD. In addition, extensive peptide backbone cleavage is observed for a >6 kDa glycopeptide with ECD, to our knowledge the largest glycopeptide examined with this technique to date.
Collapse
Affiliation(s)
- Julie T Adamson
- Department of Chemistry, University of Michigan, 930 North University Avenue, Ann Arbor, Michigan, 48109-1055, USA
| | | |
Collapse
|