1
|
Pradita T, Chen YJ, Su TH, Chang KH, Chen PJ, Chen YJ. Data Independent Acquisition Mass Spectrometry Enhanced Personalized Glycosylation Profiling of Haptoglobin in Hepatocellular Carcinoma. J Proteome Res 2024; 23:3571-3584. [PMID: 38994555 DOI: 10.1021/acs.jproteome.4c00227] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/13/2024]
Abstract
Aberrant glycosylation has gained significant interest for biomarker discovery. However, low detectability, complex glycan structures, and heterogeneity present challenges in glycoprotein assay development. Using haptoglobin (Hp) as a model, we developed an integrated platform combining functionalized magnetic nanoparticles and zwitterionic hydrophilic interaction liquid chromatography (ZIC-HILIC) for highly specific glycopeptide enrichment, followed by a data-independent acquisition (DIA) strategy to establish a deep cancer-specific Hp-glycosylation profile in hepatitis B virus (HBV, n = 5) and hepatocellular carcinoma (HCC, n = 5) patients. The DIA strategy established one of the deepest Hp-glycosylation landscapes (1029 glycopeptides, 130 glycans) across serum samples, including 54 glycopeptides exclusively detected in HCC patients. Additionally, single-shot DIA searches against a DIA-based spectral library outperformed the DDA approach by 2-3-fold glycopeptide coverage across patients. Among the four N-glycan sites on Hp (N-184, N-207, N-211, N-241), the total glycan type distribution revealed significantly enhanced detection of combined fucosylated-sialylated glycans, which were the most dominant glycoforms identified in HCC patients. Quantitation analysis revealed 48 glycopeptides significantly enriched in HCC (p < 0.05), including a hybrid monosialylated triantennary glycopeptide on the N-184 site with nearly none-to-all elevation to differentiate HCC from the HBV group (HCC/HBV ratio: 2462 ± 766, p < 0.05). In summary, DIA-MS presents an unbiased and comprehensive alternative for targeted glycoproteomics to guide discovery and validation of glyco-biomarkers.
Collapse
Affiliation(s)
- Tiara Pradita
- Institute of Chemistry, Academia Sinica, Taipei 115, Taiwan
- Sustainable Chemical Science and Technology, Taiwan International Graduate Program, Academia Sinica, Taipei 115, Taiwan
- Department of Applied Chemistry, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
| | - Yi-Ju Chen
- Institute of Chemistry, Academia Sinica, Taipei 115, Taiwan
| | - Tung-Hung Su
- Division of Gastroenterology and Hepatology, Department of Internal Medicine, National Taiwan University Hospital, Taipei 100, Taiwan
- Hepatitis Research Center, National Taiwan University Hospital, Taipei 100, Taiwan
| | - Kun-Hao Chang
- Institute of Chemistry, Academia Sinica, Taipei 115, Taiwan
- Molecular Science and Technology Program, Taiwan International Graduate Program, Academia Sinica, Taipei 115, Taiwan
- Department of Chemistry, National Tsing-Hua University, Hsinchu 300, Taiwan
| | - Pei-Jer Chen
- Division of Gastroenterology and Hepatology, Department of Internal Medicine, National Taiwan University Hospital, Taipei 100, Taiwan
- Hepatitis Research Center, National Taiwan University Hospital, Taipei 100, Taiwan
- Graduate Institute of Clinical Medicine, National Taiwan University College of Medicine, Taipei 100, Taiwan
- Department of Medical Research, National Taiwan University Hospital, Taipei 100, Taiwan
| | - Yu-Ju Chen
- Institute of Chemistry, Academia Sinica, Taipei 115, Taiwan
- Sustainable Chemical Science and Technology, Taiwan International Graduate Program, Academia Sinica, Taipei 115, Taiwan
- Department of Chemistry, National Taiwan University, Taipei 106, Taiwan
| |
Collapse
|
2
|
Klein J, Carvalho L, Zaia J. Expanding N-glycopeptide identifications by modeling fragmentation, elution, and glycome connectivity. Nat Commun 2024; 15:6168. [PMID: 39039063 PMCID: PMC11263600 DOI: 10.1038/s41467-024-50338-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2021] [Accepted: 07/08/2024] [Indexed: 07/24/2024] Open
Abstract
Accurate glycopeptide identification in mass spectrometry-based glycoproteomics is a challenging problem at scale. Recent innovation has been made in increasing the scope and accuracy of glycopeptide identifications, with more precise uncertainty estimates for each part of the structure. We present a dynamically adapting relative retention time model for detecting and correcting ambiguous glycan assignments that are difficult to detect from fragmentation alone, a layered approach to glycopeptide fragmentation modeling that improves N-glycopeptide identification in samples without compromising identification quality, and a site-specific method to increase the depth of the glycoproteome confidently identifiable even further. We demonstrate our techniques on a set of previously published datasets, showing the performance gains at each stage of optimization. These techniques are provided in the open-source glycomics and glycoproteomics platform GlycReSoft available at https://github.com/mobiusklein/glycresoft .
Collapse
Affiliation(s)
- Joshua Klein
- Program for Bioinformatics, Boston University, Boston, MA, US.
| | - Luis Carvalho
- Program for Bioinformatics, Boston University, Boston, MA, US
- Department of Math and Statistics, Boston University, Boston, MA, US
| | - Joseph Zaia
- Program for Bioinformatics, Boston University, Boston, MA, US.
- Department of Biochemistry and Cell Biology, Boston University, Boston, MA, US.
| |
Collapse
|
3
|
Cao W. Advancing mass spectrometry-based glycoproteomic software tools for comprehensive site-specific glycoproteome analysis. Curr Opin Chem Biol 2024; 80:102442. [PMID: 38460452 DOI: 10.1016/j.cbpa.2024.102442] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2023] [Revised: 02/18/2024] [Accepted: 02/19/2024] [Indexed: 03/11/2024]
Abstract
Glycoproteome analysis at a site-specific level and proteome scale stands out as a highly promising approach for gaining insights into the intricate roles of glycans in biological systems. Recent years have witnessed an upsurge in the development of innovative methodologies tailored for precisely this purpose. Breakthroughs in mass spectrometry-based glycoproteomic techniques, enabling the identification, quantification, and systematic exploration of site-specific glycans, have significantly enhanced our capacity to comprehensively and thoroughly characterize glycoproteins. In this short review, we delve into novel tools in advancing site-specific glycoproteomic analysis and summarize pertinent studies published in the past two years. Lastly, we discuss the ongoing challenges and outline future prospects in the field, considering both the analytical strategies of mass spectrometry and the tools employed for data interpretation.
Collapse
Affiliation(s)
- Weiqian Cao
- Shanghai Fifth People's Hospital and Institutes of Biomedical Sciences, NHC Key Laboratory of Glycoconjugates Research, Fudan University, Shanghai, 200433, China.
| |
Collapse
|
4
|
Kellman BP, Mariethoz J, Zhang Y, Shaul S, Alteri M, Sandoval D, Jeffris M, Armingol E, Bao B, Lisacek F, Bojar D, Lewis NE. Decoding glycosylation potential from protein structure across human glycoproteins with a multi-view recurrent neural network. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.15.594334. [PMID: 38798633 PMCID: PMC11118808 DOI: 10.1101/2024.05.15.594334] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2024]
Abstract
Glycosylation is described as a non-templated biosynthesis. Yet, the template-free premise is antithetical to the observation that different N-glycans are consistently placed at specific sites. It has been proposed that glycosite-proximal protein structures could constrain glycosylation and explain the observed microheterogeneity. Using site-specific glycosylation data, we trained a hybrid neural network to parse glycosites (recurrent neural network) and match them to feasible N-glycosylation events (graph neural network). From glycosite-flanking sequences, the algorithm predicts most human N-glycosylation events documented in the GlyConnect database and proposed structures corresponding to observed monosaccharide composition of the glycans at these sites. The algorithm also recapitulated glycosylation in Enhanced Aromatic Sequons, SARS-CoV-2 spike, and IgG3 variants, thus demonstrating the ability of the algorithm to predict both glycan structure and abundance. Thus, protein structure constrains glycosylation, and the neural network enables predictive in silico glycosylation of uncharacterized or novel protein sequences and genetic variants.
Collapse
Affiliation(s)
- Benjamin P. Kellman
- Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
- Bioinformatics and Systems Biology Graduate Program, University of California, San Diego, La Jolla, CA 92093, USA
- Augment Biologics, La Jolla, CA 92092
- Ragon Institute of MGH, MIT, and Harvard, Cambridge, MA, USA
| | - Julien Mariethoz
- Proteome Informatics Group, Swiss Institute of Bioinformatics, CH-1227 Geneva, Switzerland
| | - Yujie Zhang
- Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA
| | - Sigal Shaul
- Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
| | - Mia Alteri
- Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
| | - Daniel Sandoval
- Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA 92093, USA
| | - Mia Jeffris
- Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
| | - Erick Armingol
- Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
- Bioinformatics and Systems Biology Graduate Program, University of California, San Diego, La Jolla, CA 92093, USA
| | - Bokan Bao
- Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA
- Bioinformatics and Systems Biology Graduate Program, University of California, San Diego, La Jolla, CA 92093, USA
| | - Frederique Lisacek
- Proteome Informatics Group, Swiss Institute of Bioinformatics, CH-1227 Geneva, Switzerland
- Computer Science Department & Section of Biology, University of Geneva, route de Drize 7, CH-1227, Geneva, Switzerland
| | - Daniel Bojar
- Wallenberg Centre for Molecular and Translational Medicine, University of Gothenburg, Gothenburg 41390, Sweden
- Department of Chemistry and Molecular Biology, University of Gothenburg, Gothenburg 41390, Sweden
| | - Nathan E. Lewis
- Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
- Bioinformatics and Systems Biology Graduate Program, University of California, San Diego, La Jolla, CA 92093, USA
- Ragon Institute of MGH, MIT, and Harvard, Cambridge, MA, USA
| |
Collapse
|
5
|
Yang Y, Fang Q. Prediction of glycopeptide fragment mass spectra by deep learning. Nat Commun 2024; 15:2448. [PMID: 38503734 PMCID: PMC10951270 DOI: 10.1038/s41467-024-46771-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Accepted: 03/11/2024] [Indexed: 03/21/2024] Open
Abstract
Deep learning has achieved a notable success in mass spectrometry-based proteomics and is now emerging in glycoproteomics. While various deep learning models can predict fragment mass spectra of peptides with good accuracy, they cannot cope with the non-linear glycan structure in an intact glycopeptide. Herein, we present DeepGlyco, a deep learning-based approach for the prediction of fragment spectra of intact glycopeptides. Our model adopts tree-structured long-short term memory networks to process the glycan moiety and a graph neural network architecture to incorporate potential fragmentation pathways of a specific glycan structure. This feature is beneficial to model explainability and differentiation ability of glycan structural isomers. We further demonstrate that predicted spectral libraries can be used for data-independent acquisition glycoproteomics as a supplement for library completeness. We expect that this work will provide a valuable deep learning resource for glycoproteomics.
Collapse
Affiliation(s)
- Yi Yang
- ZJU-Hangzhou Global Scientific and Technological Innovation Center, Zhejiang University, Hangzhou, 311200, China.
| | - Qun Fang
- ZJU-Hangzhou Global Scientific and Technological Innovation Center, Zhejiang University, Hangzhou, 311200, China.
- Department of Chemistry, Zhejiang University, Hangzhou, 310058, China.
| |
Collapse
|
6
|
Wang Y, Cheng W, Chen X, Cheng C, Zhang L, Huang W. Serum Proteomics Identified TAFI as a Potential Molecule Facilitating the Migration of Peripheral Monocytes to Damaged White Matter During Chronic Cerebral Hypoperfusion. Neurochem Res 2024; 49:597-616. [PMID: 37978153 DOI: 10.1007/s11064-023-04050-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2023] [Revised: 10/15/2023] [Accepted: 10/17/2023] [Indexed: 11/19/2023]
Abstract
Neuroinflammation is assumed as the critical pathophysiologic mechanism of white matter lesions (WMLs), and infiltrated peripheral monocyte-derived macrophages are implicated in the development of neuroinflammation. This study sought to explore the blood molecules that promote the migration of peripheral monocytes to the sites of WMLs. The serum protein expression profiles of patients and Sprague-Dawley rat models with WMLs were detected by data-independent acquisition (DIA) proteomics technique. Compared with corresponding control groups, we acquired 62 and 41 differentially expressed proteins (DEPs) in the serum of patients and model rats with WMLs respectively. Bioinformatics investigations demonstrated that these DEPs were linked to various Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways and Gene Ontology (GO) terms involved in neuroinflammation. Afterward, we identified thrombin-activatable fibrinolysis inhibitor (TAFI) as a shared and overexpressed protein in clinical and animal serum samples, which was further verified by enzyme-linked immunosorbent assay. Additionally, an upregulation of TAFI was also observed in the white matter of rat models, and the inhibition of TAFI impeded the migration of peripheral monocytes to the area of WMLs. In vitro experiments suggested that TAFI could enhance the migration ability of RAW264.7 cells and increase the expression of Ccr2. Our study demonstrates that neuroinflammatory signals can be detected in the peripheral blood of WMLs patients and model rats. TAFI may serve as a potential protein that promotes the migration of peripheral monocytes to WMLs regions, thereby providing a novel molecular target for further investigation into the interaction between the central and peripheral immune systems.
Collapse
Affiliation(s)
- Yuhan Wang
- Department of Neurology, Xinqiao Hospital, The Army Medical University (Third Military Medical University), Chongqing, China
| | - Wenchao Cheng
- Department of Neurology, Xinqiao Hospital, The Army Medical University (Third Military Medical University), Chongqing, China
| | - Xiuying Chen
- Department of Neurology, Chongqing Emergency Medical Center, Chongqing University Central Hospital, Chongqing University, Chongqing, China
| | - Chang Cheng
- Department of Neurology, Xinqiao Hospital, The Army Medical University (Third Military Medical University), Chongqing, China
| | - Lan Zhang
- Department of Neurology, Xinqiao Hospital, The Army Medical University (Third Military Medical University), Chongqing, China
| | - Wen Huang
- Department of Neurology, Xinqiao Hospital, The Army Medical University (Third Military Medical University), Chongqing, China.
- Department of Neurology, Chongqing Emergency Medical Center, Chongqing University Central Hospital, Chongqing University, Chongqing, China.
| |
Collapse
|
7
|
White MEH, Sinn LR, Jones DM, de Folter J, Aulakh SK, Wang Z, Flynn HR, Krüger L, Tober-Lau P, Demichev V, Kurth F, Mülleder M, Blanchard V, Messner CB, Ralser M. Oxonium ion scanning mass spectrometry for large-scale plasma glycoproteomics. Nat Biomed Eng 2024; 8:233-247. [PMID: 37474612 DOI: 10.1038/s41551-023-01067-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2022] [Accepted: 06/15/2023] [Indexed: 07/22/2023]
Abstract
Protein glycosylation, a complex and heterogeneous post-translational modification that is frequently dysregulated in disease, has been difficult to analyse at scale. Here we report a data-independent acquisition technique for the large-scale mass-spectrometric quantification of glycopeptides in plasma samples. The technique, which we named 'OxoScan-MS', identifies oxonium ions as glycopeptide fragments and exploits a sliding-quadrupole dimension to generate comprehensive and untargeted oxonium ion maps of precursor masses assigned to fragment ions from non-enriched plasma samples. By applying OxoScan-MS to quantify 1,002 glycopeptide features in the plasma glycoproteomes from patients with COVID-19 and healthy controls, we found that severe COVID-19 induces differential glycosylation in IgA, haptoglobin, transferrin and other disease-relevant plasma glycoproteins. OxoScan-MS may allow for the quantitative mapping of glycoproteomes at the scale of hundreds to thousands of samples.
Collapse
Affiliation(s)
- Matthew E H White
- Molecular Biology of Metabolism Laboratory, The Francis Crick Institute, London, UK
| | - Ludwig R Sinn
- Department of Biochemistry, Charité - Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany
| | - D Marc Jones
- Bioinformatics and Computational Biology Laboratory, The Francis Crick Institute, London, UK
- Department of Basic and Clinical Neuroscience, Maurice Wohl Clinical Neuroscience Institute, London, UK
| | - Joost de Folter
- Software Engineering and Artificial Intelligence Technology Platform, The Francis Crick Institute, London, UK
| | - Simran Kaur Aulakh
- Molecular Biology of Metabolism Laboratory, The Francis Crick Institute, London, UK
| | - Ziyue Wang
- Department of Biochemistry, Charité - Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany
| | - Helen R Flynn
- Mass Spectrometry Proteomics Science Technology Platform, The Francis Crick Institute, London, UK
| | - Lynn Krüger
- Institute of Diagnostic Laboratory Medicine, Charité - Universitätsmedizin Berlin Corporate Member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany
- Department of Human Medicine, Medical School Berlin, Berlin, Germany
| | - Pinkus Tober-Lau
- Department of Infectious Diseases and Critical Care Medicine, Charité - Universitätsmedizin Berlin Corporate Member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany
| | - Vadim Demichev
- Molecular Biology of Metabolism Laboratory, The Francis Crick Institute, London, UK
- Department of Biochemistry, Charité - Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany
| | - Florian Kurth
- Department of Infectious Diseases and Critical Care Medicine, Charité - Universitätsmedizin Berlin Corporate Member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany
| | - Michael Mülleder
- Core Facility High-throughput Mass Spectrometry, Charité - Universitätsmedizin Berlin Corporate Member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany
| | - Véronique Blanchard
- Institute of Diagnostic Laboratory Medicine, Charité - Universitätsmedizin Berlin Corporate Member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany
- Department of Human Medicine, Medical School Berlin, Berlin, Germany
| | - Christoph B Messner
- Molecular Biology of Metabolism Laboratory, The Francis Crick Institute, London, UK.
- Precision Proteomic Center, Swiss Institute of Allergy and Asthma Research (SIAF), University of Zurich, Davos, Switzerland.
| | - Markus Ralser
- Molecular Biology of Metabolism Laboratory, The Francis Crick Institute, London, UK.
- Department of Biochemistry, Charité - Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany.
- Max Planck Institute for Molecular Genetics, Berlin, Germany.
| |
Collapse
|
8
|
Bashyal A, Brodbelt JS. Uncommon posttranslational modifications in proteomics: ADP-ribosylation, tyrosine nitration, and tyrosine sulfation. MASS SPECTROMETRY REVIEWS 2024; 43:289-326. [PMID: 36165040 PMCID: PMC10040477 DOI: 10.1002/mas.21811] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/30/2022] [Revised: 09/06/2022] [Accepted: 09/07/2022] [Indexed: 06/16/2023]
Abstract
Posttranslational modifications (PTMs) are covalent modifications of proteins that modulate the structure and functions of proteins and regulate biological processes. The development of various mass spectrometry-based proteomics workflows has facilitated the identification of hundreds of PTMs and aided the understanding of biological significance in a high throughput manner. Improvements in sample preparation and PTM enrichment techniques, instrumentation for liquid chromatography-tandem mass spectrometry (LC-MS/MS), and advanced data analysis tools enhance the specificity and sensitivity of PTM identification. Highly prevalent PTMs like phosphorylation, glycosylation, acetylation, ubiquitinylation, and methylation are extensively studied. However, the functions and impact of less abundant PTMs are not as well understood and underscore the need for analytical methods that aim to characterize these PTMs. This review focuses on the advancement and analytical challenges associated with the characterization of three less common but biologically relevant PTMs, specifically, adenosine diphosphate-ribosylation, tyrosine sulfation, and tyrosine nitration. The advantages and disadvantages of various enrichment, separation, and MS/MS techniques utilized to identify and localize these PTMs are described.
Collapse
Affiliation(s)
- Aarti Bashyal
- Department of Chemistry, The University of Texas at Austin, Austin, Texas, USA
| | - Jennifer S Brodbelt
- Department of Chemistry, The University of Texas at Austin, Austin, Texas, USA
| |
Collapse
|
9
|
Lou R, Shui W. Acquisition and Analysis of DIA-Based Proteomic Data: A Comprehensive Survey in 2023. Mol Cell Proteomics 2024; 23:100712. [PMID: 38182042 PMCID: PMC10847697 DOI: 10.1016/j.mcpro.2024.100712] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Revised: 12/27/2023] [Accepted: 01/02/2024] [Indexed: 01/07/2024] Open
Abstract
Data-independent acquisition (DIA) mass spectrometry (MS) has emerged as a powerful technology for high-throughput, accurate, and reproducible quantitative proteomics. This review provides a comprehensive overview of recent advances in both the experimental and computational methods for DIA proteomics, from data acquisition schemes to analysis strategies and software tools. DIA acquisition schemes are categorized based on the design of precursor isolation windows, highlighting wide-window, overlapping-window, narrow-window, scanning quadrupole-based, and parallel accumulation-serial fragmentation-enhanced DIA methods. For DIA data analysis, major strategies are classified into spectrum reconstruction, sequence-based search, library-based search, de novo sequencing, and sequencing-independent approaches. A wide array of software tools implementing these strategies are reviewed, with details on their overall workflows and scoring approaches at different steps. The generation and optimization of spectral libraries, which are critical resources for DIA analysis, are also discussed. Publicly available benchmark datasets covering global proteomics and phosphoproteomics are summarized to facilitate performance evaluation of various software tools and analysis workflows. Continued advances and synergistic developments of versatile components in DIA workflows are expected to further enhance the power of DIA-based proteomics.
Collapse
Affiliation(s)
- Ronghui Lou
- iHuman Institute, ShanghaiTech University, Shanghai, China; School of Life Science and Technology, ShanghaiTech University, Shanghai, China.
| | - Wenqing Shui
- iHuman Institute, ShanghaiTech University, Shanghai, China; School of Life Science and Technology, ShanghaiTech University, Shanghai, China.
| |
Collapse
|
10
|
Hackett WE, Chang D, Carvalho L, Zaia J. RAMZIS: a bioinformatic toolkit for rigorous assessment of the alterations to glycoprotein composition that occur during biological processes. BIOINFORMATICS ADVANCES 2024; 4:vbae012. [PMID: 38384861 PMCID: PMC10879752 DOI: 10.1093/bioadv/vbae012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/29/2023] [Revised: 12/15/2023] [Accepted: 01/22/2024] [Indexed: 02/23/2024]
Abstract
Motivation Glycosylation elaborates the structures and functions of glycoproteins; glycoproteins are common post-translationally modified proteins and are heterogeneous and non-deterministically synthesized as an evolutionarily driven mechanism that elaborates the functions of glycosylated gene products. Glycoproteins, accounting for approximately half of all proteins, require specialized proteomics data analysis methods due to micro- and macro-heterogeneities as a given glycosite can be divided into several glycosylated forms, each of which must be quantified. Sampling of heterogeneous glycopeptides is limited by mass spectrometer speed and sensitivity, resulting in missing values. In conjunction with the low sample size inherent to glycoproteomics, a specialized toolset is needed to determine if observed changes in glycopeptide abundances are biologically significant or due to data quality limitations. Results We developed an R package, Relative Assessment of m/z Identifications by Similarity (RAMZIS), that uses similarity metrics to guide researchers to a more rigorous interpretation of glycoproteomics data. RAMZIS uses a permutation test to generate contextual similarity, which assesses the quality of mass spectral data and outputs a graphical demonstration of the likelihood of finding biologically significant differences in glycosylation abundance datasets. Investigators can assess dataset quality, holistically differentiate glycosites, and identify which glycopeptides are responsible for glycosylation pattern change. RAMZIS is validated by theoretical cases and a proof-of-concept application. RAMZIS enables comparison between datasets too stochastic, small, or sparse for interpolation while acknowledging these issues in its assessment. Using this tool, researchers will be able to rigorously define the role of glycosylation and the changes that occur during biological processes. Availability and implementation https://github.com/WillHackett22/RAMZIS.
Collapse
Affiliation(s)
| | - Deborah Chang
- Department of Biochemistry, Boston University, Boston, MA 02215, United States
| | - Luis Carvalho
- Bioinformatics Program, Boston University, Boston, MA 02215, United States
- Department of Mathematics, Boston University, Boston, MA 02215, United States
| | - Joseph Zaia
- Bioinformatics Program, Boston University, Boston, MA 02215, United States
- Department of Biochemistry, Boston University, Boston, MA 02215, United States
| |
Collapse
|
11
|
Kitata RB, Yang JC, Chen YJ. Advances in data-independent acquisition mass spectrometry towards comprehensive digital proteome landscape. MASS SPECTROMETRY REVIEWS 2023; 42:2324-2348. [PMID: 35645145 DOI: 10.1002/mas.21781] [Citation(s) in RCA: 37] [Impact Index Per Article: 37.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/09/2021] [Revised: 12/17/2021] [Accepted: 01/21/2022] [Indexed: 06/15/2023]
Abstract
The data-independent acquisition mass spectrometry (DIA-MS) has rapidly evolved as a powerful alternative for highly reproducible proteome profiling with a unique strength of generating permanent digital maps for retrospective analysis of biological systems. Recent advancements in data analysis software tools for the complex DIA-MS/MS spectra coupled to fast MS scanning speed and high mass accuracy have greatly expanded the sensitivity and coverage of DIA-based proteomics profiling. Here, we review the evolution of the DIA-MS techniques, from earlier proof-of-principle of parallel fragmentation of all-ions or ions in selected m/z range, the sequential window acquisition of all theoretical mass spectra (SWATH-MS) to latest innovations, recent development in computation algorithms for data informatics, and auxiliary tools and advanced instrumentation to enhance the performance of DIA-MS. We further summarize recent applications of DIA-MS and experimentally-derived as well as in silico spectra library resources for large-scale profiling to facilitate biomarker discovery and drug development in human diseases with emphasis on the proteomic profiling coverage. Toward next-generation DIA-MS for clinical proteomics, we outline the challenges in processing multi-dimensional DIA data set and large-scale clinical proteomics, and continuing need in higher profiling coverage and sensitivity.
Collapse
Affiliation(s)
| | - Jhih-Ci Yang
- Institute of Chemistry, Academia Sinica, Taipei, Taiwan
- Sustainable Chemical Science and Technology, Taiwan International Graduate Program, Academia Sinica and National Yang Ming Chiao Tung University, Taipei, Taiwan
- Department of Applied Chemistry, National Yang Ming Chiao Tung University, Hsinchu, Taiwan
| | - Yu-Ju Chen
- Institute of Chemistry, Academia Sinica, Taipei, Taiwan
- Sustainable Chemical Science and Technology, Taiwan International Graduate Program, Academia Sinica and National Yang Ming Chiao Tung University, Taipei, Taiwan
- Department of Chemistry, National Taiwan University, Taipei, Taiwan
| |
Collapse
|
12
|
Chau TH, Chernykh A, Kawahara R, Thaysen-Andersen M. Critical considerations in N-glycoproteomics. Curr Opin Chem Biol 2023; 73:102272. [PMID: 36758418 DOI: 10.1016/j.cbpa.2023.102272] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2022] [Revised: 12/30/2022] [Accepted: 01/05/2023] [Indexed: 02/10/2023]
Abstract
N-Glycoproteomics, the system-wide study of glycans asparagine-linked to protein carriers, holds a unique and still largely untapped potential to provide deep insights into the complexity and dynamics of the heterogeneous N-glycoproteome. Despite the advent of innovative analytical and informatics tools aiding the analysis, N-glycoproteomics remains challenging and consequently largely restricted to specialised laboratories. Aiming to stimulate discussions of method harmonisation, data standardisation and reporting guidelines to make N-glycoproteomics more reproducible and accessible to the community, we here discuss critical considerations related to the design and execution of N-glycoproteomics experiments and highlight good practices in N-glycopeptide data collection, analysis, interpretation and sharing. Giving the rapid maturation and, expectedly, a wide-spread implementation of N-glycoproteomics capabilities across the community in future years, this piece aims to point out common pitfalls, to encourage good data sharing and documentation practices, and to highlight practical solutions and strategies to enhance the insight into the N-glycoproteome.
Collapse
Affiliation(s)
- The Huong Chau
- School of Natural Sciences, Faculty of Science and Engineering, Macquarie University, Sydney, Australia; Biomolecular Discovery Research Centre, Macquarie University, Sydney, Australia
| | - Anastasia Chernykh
- School of Natural Sciences, Faculty of Science and Engineering, Macquarie University, Sydney, Australia; Biomolecular Discovery Research Centre, Macquarie University, Sydney, Australia
| | - Rebeca Kawahara
- School of Natural Sciences, Faculty of Science and Engineering, Macquarie University, Sydney, Australia; Biomolecular Discovery Research Centre, Macquarie University, Sydney, Australia
| | - Morten Thaysen-Andersen
- School of Natural Sciences, Faculty of Science and Engineering, Macquarie University, Sydney, Australia; Biomolecular Discovery Research Centre, Macquarie University, Sydney, Australia; Institute for Glyco-core Research (iGCORE), Nagoya University, Nagoya, Japan.
| |
Collapse
|
13
|
Yang Y, Qiao L. Profiling Serum Intact N-Glycopeptides Using Data-Independent Acquisition Mass Spectrometry. Methods Mol Biol 2023; 2628:365-391. [PMID: 36781798 DOI: 10.1007/978-1-0716-2978-9_24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/15/2023]
Abstract
Large-scale profiling of intact glycopeptides is critical but challenging in glycoproteomics. Data-independent acquisition (DIA) mass spectrometry is an emerging technology with deep proteome coverage as well as accurate quantitative capability for large-scale proteomics studies and has also been applied to the field of glycoproteomics. In this protocol, we describe how to analyze data from a DIA experiment for profiling serum intact N-glycopeptides. We present a comprehensive data analysis workflow using GproDIA, including glycopeptide spectral library building, chromatographic feature extraction from the DIA data, and feature scoring with appropriate statistical control of error rates. We anticipate that this method could provide a powerful tool to explore the serum glycoproteome.
Collapse
Affiliation(s)
- Yi Yang
- Department of Chemistry and Shanghai Stomatological Hospital, Fudan University, Shanghai, China.,ZJU-Hangzhou Global Scientific and Technological Innovation Center, Zhejiang University, Hangzhou, China
| | - Liang Qiao
- Department of Chemistry and Shanghai Stomatological Hospital, Fudan University, Shanghai, China.
| |
Collapse
|
14
|
Polasky DA, Nesvizhskii AI. Recent advances in computational algorithms and software for large-scale glycoproteomics. Curr Opin Chem Biol 2023; 72:102238. [PMID: 36525809 DOI: 10.1016/j.cbpa.2022.102238] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2022] [Revised: 11/12/2022] [Accepted: 11/14/2022] [Indexed: 12/15/2022]
Abstract
Glycoproteomics, or characterizing glycosylation events at a proteome scale, has seen rapid advances in methods for analyzing glycopeptides by tandem mass spectrometry in recent years. These advances have enabled acquisition of far more comprehensive and large-scale datasets, precipitating an urgent need for improved informatics methods to analyze the resulting data. A new generation of glycoproteomics search methods has recently emerged, using glycan fragmentation to split the identification of a glycopeptide into peptide and glycan components and solve each component separately. In this review, we discuss these new methods and their implications for large-scale glycoproteomics, as well as several outstanding challenges in glycoproteomics data analysis, including validation of glycan assignments and quantitation. Finally, we provide an outlook on the future of glycoproteomics from an informatics perspective, noting the key challenges to achieving widespread and reproducible glycopeptide annotation and quantitation.
Collapse
Affiliation(s)
- Daniel A Polasky
- University of Michigan Department of Pathology, Ann Arbor, MI, USA.
| | - Alexey I Nesvizhskii
- University of Michigan Department of Pathology, Ann Arbor, MI, USA; University of Michigan Department of Computational Medicine and Bioinformatics, Ann Arbor, MI, USA.
| |
Collapse
|
15
|
pGlycoQuant with a deep residual network for quantitative glycoproteomics at intact glycopeptide level. Nat Commun 2022; 13:7539. [PMID: 36477196 PMCID: PMC9729625 DOI: 10.1038/s41467-022-35172-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2022] [Accepted: 11/17/2022] [Indexed: 12/12/2022] Open
Abstract
Large-scale intact glycopeptide identification has been advanced by software tools. However, tools for quantitative analysis remain lagging behind, which hinders exploring the differential site-specific glycosylation. Here, we report pGlycoQuant, a generic tool for both primary and tandem mass spectrometry-based intact glycopeptide quantitation. pGlycoQuant advances in glycopeptide matching through applying a deep learning model that reduces missing values by 19-89% compared with Byologic, MSFragger-Glyco, Skyline, and Proteome Discoverer, as well as a Match In Run algorithm for more glycopeptide coverage, greatly expanding the quantitative function of several widely used search engines, including pGlyco 2.0, pGlyco3, Byonic and MSFragger-Glyco. Further application of pGlycoQuant to the N-glycoproteomic study in three different metastatic HCC cell lines quantifies 6435 intact N-glycopeptides and, together with in vitro molecular biology experiments, illustrates site 979-core fucosylation of L1CAM as a potential regulator of HCC metastasis. We expected further applications of the freely available pGlycoQuant in glycoproteomic studies.
Collapse
|
16
|
Ren W, Bian Q, Cai Y. Mass spectrometry-based N-glycosylation analysis in kidney disease. Front Mol Biosci 2022; 9:976298. [PMID: 36072428 PMCID: PMC9442644 DOI: 10.3389/fmolb.2022.976298] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2022] [Accepted: 07/18/2022] [Indexed: 11/14/2022] Open
Abstract
Kidney disease is a global health concern with an enormous expense. It is estimated that more than 10% of the population worldwide is affected by kidney disease and millions of patients would progress to death prematurely and unnecessarily. Although creatinine detection and renal biopsy are well-established tools for kidney disease diagnosis, they are limited by several inevitable defects. Therefore, diagnostic tools need to be upgraded, especially for the early stage of the disease and possible progression. As one of the most common post-translational modifications of proteins, N-glycosylation plays a vital role in renal structure and function. Deepening research on N-glycosylation in kidney disease provides new insights into the pathophysiology and paves the way for clinical application. In this study, we reviewed recent N-glycosylation studies on several kidney diseases. We also summarized the development of mass spectrometric methods in the field of N-glycoproteomics and N-glycomics.
Collapse
Affiliation(s)
- Weifu Ren
- Shanghai Institute of Precision Medicine, Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
- Department of Nephrology, Changhai Hospital, Naval Medical University, Shanghai, China
| | - Qi Bian
- Department of Nephrology, Changhai Hospital, Naval Medical University, Shanghai, China
| | - Yan Cai
- Shanghai Institute of Precision Medicine, Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| |
Collapse
|
17
|
Fröhlich K, Brombacher E, Fahrner M, Vogele D, Kook L, Pinter N, Bronsert P, Timme-Bronsert S, Schmidt A, Bärenfaller K, Kreutz C, Schilling O. Benchmarking of analysis strategies for data-independent acquisition proteomics using a large-scale dataset comprising inter-patient heterogeneity. Nat Commun 2022; 13:2622. [PMID: 35551187 PMCID: PMC9098472 DOI: 10.1038/s41467-022-30094-0] [Citation(s) in RCA: 28] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2021] [Accepted: 04/14/2022] [Indexed: 12/25/2022] Open
Abstract
Numerous software tools exist for data-independent acquisition (DIA) analysis of clinical samples, necessitating their comprehensive benchmarking. We present a benchmark dataset comprising real-world inter-patient heterogeneity, which we use for in-depth benchmarking of DIA data analysis workflows for clinical settings. Combining spectral libraries, DIA software, sparsity reduction, normalization, and statistical tests results in 1428 distinct data analysis workflows, which we evaluate based on their ability to correctly identify differentially abundant proteins. From our dataset, we derive bootstrap datasets of varying sample sizes and use the whole range of bootstrap datasets to robustly evaluate each workflow. We find that all DIA software suites benefit from using a gas-phase fractionated spectral library, irrespective of the library refinement used. Gas-phase fractionation-based libraries perform best against two out of three reference protein lists. Among all investigated statistical tests non-parametric permutation-based statistical tests consistently perform best. Data independent acquisition (DIA) has been gaining momentum in clinical proteomics. Here, the authors create a benchmark dataset comprising inter-patient heterogeneity to compare popular DIA data analysis workflows for identifying differentially abundant proteins.
Collapse
Affiliation(s)
- Klemens Fröhlich
- Institute for Surgical Pathology, Medical Center-University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg im Breisgau, Germany.,Faculty of Biology, University of Freiburg, Freiburg im Breisgau, Germany.,Spemann Graduate School of Biology and Medicine (SGBM), University of Freiburg, Freiburg im Breisgau, Germany
| | - Eva Brombacher
- Faculty of Biology, University of Freiburg, Freiburg im Breisgau, Germany.,Spemann Graduate School of Biology and Medicine (SGBM), University of Freiburg, Freiburg im Breisgau, Germany.,Institute of Medical Biometry and Statistics, Faculty of Medicine and Medical Center - University of Freiburg, Freiburg im Breisgau, Germany.,Centre for Integrative Biological Signaling Studies (CIBSS), University of Freiburg, Freiburg im Breisgau, Germany
| | - Matthias Fahrner
- Institute for Surgical Pathology, Medical Center-University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg im Breisgau, Germany.,Faculty of Biology, University of Freiburg, Freiburg im Breisgau, Germany.,Spemann Graduate School of Biology and Medicine (SGBM), University of Freiburg, Freiburg im Breisgau, Germany
| | - Daniel Vogele
- Institute for Surgical Pathology, Medical Center-University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg im Breisgau, Germany.,Faculty of Biology, University of Freiburg, Freiburg im Breisgau, Germany
| | - Lucas Kook
- Epidemiology, Biostatistics & Prevention Institute, University of Zurich, Zurich, Switzerland.,Institute for Data Analysis and Process Design, Zurich University of Applied Sciences, Winterthur, Switzerland
| | - Niko Pinter
- Institute for Surgical Pathology, Medical Center-University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg im Breisgau, Germany
| | - Peter Bronsert
- Institute for Surgical Pathology, Medical Center-University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg im Breisgau, Germany.,German Cancer Consortium (DKTK) and German Cancer Research Center (DKFZ), Heidelberg, Germany.,Tumorbank Comprehensive Cancer Center Freiburg, Medical Center University of Freiburg, Freiburg im Breisgau, Germany
| | - Sylvia Timme-Bronsert
- Institute for Surgical Pathology, Medical Center-University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg im Breisgau, Germany.,Tumorbank Comprehensive Cancer Center Freiburg, Medical Center University of Freiburg, Freiburg im Breisgau, Germany
| | - Alexander Schmidt
- Proteomics Core Facility, Biozentrum, University of Basel, Basel, Switzerland
| | - Katja Bärenfaller
- Swiss Institute of Allergy and Asthma Research (SIAF), University of Zurich, and Swiss Institute of Bioinformatics (SIB), Wolfgang, Switzerland
| | - Clemens Kreutz
- Institute of Medical Biometry and Statistics, Faculty of Medicine and Medical Center - University of Freiburg, Freiburg im Breisgau, Germany.,Centre for Integrative Biological Signaling Studies (CIBSS), University of Freiburg, Freiburg im Breisgau, Germany
| | - Oliver Schilling
- Institute for Surgical Pathology, Medical Center-University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg im Breisgau, Germany. .,German Cancer Consortium (DKTK) and German Cancer Research Center (DKFZ), Heidelberg, Germany. .,BIOSS Centre for Biological Signaling Studies, University of Freiburg, Freiburg im Breisgau, Germany.
| |
Collapse
|
18
|
Fang P, Ji Y, Oellerich T, Urlaub H, Pan KT. Strategies for Proteome-Wide Quantification of Glycosylation Macro- and Micro-Heterogeneity. Int J Mol Sci 2022; 23:ijms23031609. [PMID: 35163546 PMCID: PMC8835892 DOI: 10.3390/ijms23031609] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2021] [Revised: 01/26/2022] [Accepted: 01/27/2022] [Indexed: 12/03/2022] Open
Abstract
Protein glycosylation governs key physiological and pathological processes in human cells. Aberrant glycosylation is thus closely associated with disease progression. Mass spectrometry (MS)-based glycoproteomics has emerged as an indispensable tool for investigating glycosylation changes in biological samples with high sensitivity. Following rapid improvements in methodologies for reliable intact glycopeptide identification, site-specific quantification of glycopeptide macro- and micro-heterogeneity at the proteome scale has become an urgent need for exploring glycosylation regulations. Here, we summarize recent advances in N- and O-linked glycoproteomic quantification strategies and discuss their limitations. We further describe a strategy to propagate MS data for multilayered glycopeptide quantification, enabling a more comprehensive examination of global and site-specific glycosylation changes. Altogether, we show how quantitative glycoproteomics methods explore glycosylation regulation in human diseases and promote the discovery of biomarkers and therapeutic targets.
Collapse
Affiliation(s)
- Pan Fang
- Department of Biochemistry and Molecular Biology, School of Biology & Basic Medical Sciences, Suzhou Medical College of Soochow University, Suzhou 215123, China;
| | - Yanlong Ji
- Bioanalytical Mass Spectrometry Group, Max Planck Institute for Multidisciplinary Sciences, 37077 Göttingen, Germany;
- Hematology/Oncology, Department of Medicine II, Johann Wolfgang Goethe University, 60590 Frankfurt am Main, Germany;
- Frankfurt Cancer Institute, Johann Wolfgang Goethe University, 60596 Frankfurt am Main, Germany
| | - Thomas Oellerich
- Hematology/Oncology, Department of Medicine II, Johann Wolfgang Goethe University, 60590 Frankfurt am Main, Germany;
- Frankfurt Cancer Institute, Johann Wolfgang Goethe University, 60596 Frankfurt am Main, Germany
- German Cancer Consortium (DKTK), Partner Site Frankfurt/Mainz, German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany
| | - Henning Urlaub
- Bioanalytical Mass Spectrometry Group, Max Planck Institute for Multidisciplinary Sciences, 37077 Göttingen, Germany;
- Institute of Clinical Chemistry, University Medical Center Göttingen, 37075 Göttingen, Germany
- Correspondence: (H.U.); (K.-T.P.)
| | - Kuan-Ting Pan
- Hematology/Oncology, Department of Medicine II, Johann Wolfgang Goethe University, 60590 Frankfurt am Main, Germany;
- Frankfurt Cancer Institute, Johann Wolfgang Goethe University, 60596 Frankfurt am Main, Germany
- Correspondence: (H.U.); (K.-T.P.)
| |
Collapse
|
19
|
Polasky DA, Geiszler DJ, Yu F, Nesvizhskii AI. Multi-attribute Glycan Identification and FDR Control for Glycoproteomics. Mol Cell Proteomics 2022; 21:100205. [PMID: 35091091 PMCID: PMC8933705 DOI: 10.1016/j.mcpro.2022.100205] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2021] [Revised: 01/10/2022] [Accepted: 01/20/2022] [Indexed: 11/18/2022] Open
Abstract
Rapidly improving methods for glycoproteomics have enabled increasingly large-scale analyses of complex glycopeptide samples, but annotating the resulting mass spectrometry data with high confidence remains a major bottleneck. We recently introduced a fast and sensitive glycoproteomics search method in our MSFragger search engine, which reports glycopeptides as a combination of a peptide sequence and the mass of the attached glycan. In samples with complex glycosylation patterns, converting this mass to a specific glycan composition is not straightforward; however, as many glycans have similar or identical masses. Here, we have developed a new method for determining the glycan composition of N-linked glycopeptides fragmented by collisional or hybrid activation that uses multiple sources of information from the spectrum, including observed glycan B-type (oxonium) and Y-type ions and mass and precursor monoisotopic selection errors to discriminate between possible glycan candidates. Combined with false discovery rate estimation for the glycan assignment, we show that this method is capable of specifically and sensitively identifying glycans in complex glycopeptide analyses and effectively controls the rate of false glycan assignments. The new method has been incorporated into the PTM-Shepherd modification analysis tool to work directly with the MSFragger glyco search in the FragPipe graphical user interface, providing a complete computational pipeline for annotation of N-glycopeptide spectra with false discovery rate control of both peptide and glycan components that is both sensitive and robust against false identifications. Identifying the glycan on intact glycopeptides remains difficult in glycoproteomics. We developed a method to assign glycan compositions in N-glycoproteomics searches. We demonstrate well-controlled glycan FDR in multiple sample types. The method annotates more glycopeptide spectra than competing tools. The method is included PTM-Shepherd for a full glycoproteomics workflow in FragPipe.
Collapse
Affiliation(s)
- Daniel A Polasky
- Department of Pathology, University of Michigan, Ann Arbor, Michigan, USA
| | - Daniel J Geiszler
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, USA
| | - Fengchao Yu
- Department of Pathology, University of Michigan, Ann Arbor, Michigan, USA
| | - Alexey I Nesvizhskii
- Department of Pathology, University of Michigan, Ann Arbor, Michigan, USA; Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, USA.
| |
Collapse
|
20
|
Yang Y, Lin L, Qiao L. Deep learning approaches for data-independent acquisition proteomics. Expert Rev Proteomics 2021; 18:1031-1043. [PMID: 34918987 DOI: 10.1080/14789450.2021.2020654] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Abstract
INTRODUCTION Data-independent acquisition (DIA) is an emerging technology for large-scale proteomic studies. DIA data analysis methods are evolving rapidly, and deep learning has cut a conspicuous figure in this field. AREAS COVERED This review discusses and provides an overview of the deep learning methods that are used for DIA data analysis, including spectral library prediction, feature scoring, and statistical control in peptide-centric analysis, as well as de novo peptide sequencing. Literature searches were performed for articles, including preprints, up to December 2021 from PubMed, Scopus, and Web of Science databases. EXPERT OPINION While spectral library prediction has broken through the limitation on proteome coverage of experimental libraries, the statistical burden due to the large query space is the remaining challenge of utilizing proteome-wide predicted libraries. Analysis of post-translational modifications is another promising direction of deep learning-based DIA methods.
Collapse
Affiliation(s)
- Yi Yang
- Department of Chemistry, Shanghai Stomatological Hospital, and Minhang Hospital, Fudan University, Shanghai China
| | - Ling Lin
- Department of Chemistry, Shanghai Stomatological Hospital, and Minhang Hospital, Fudan University, Shanghai China
| | - Liang Qiao
- Department of Chemistry, Shanghai Stomatological Hospital, and Minhang Hospital, Fudan University, Shanghai China
| |
Collapse
|
21
|
Gong Y, Qin S, Dai L, Tian Z. The glycosylation in SARS-CoV-2 and its receptor ACE2. Signal Transduct Target Ther 2021; 6:396. [PMID: 34782609 PMCID: PMC8591162 DOI: 10.1038/s41392-021-00809-8] [Citation(s) in RCA: 102] [Impact Index Per Article: 34.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2021] [Revised: 10/10/2021] [Accepted: 10/24/2021] [Indexed: 02/05/2023] Open
Abstract
Coronavirus disease 2019 (COVID-19), a highly infectious disease caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), has infected more than 235 million individuals and led to more than 4.8 million deaths worldwide as of October 5 2021. Cryo-electron microscopy and topology show that the SARS-CoV-2 genome encodes lots of highly glycosylated proteins, such as spike (S), envelope (E), membrane (M), and ORF3a proteins, which are responsible for host recognition, penetration, binding, recycling and pathogenesis. Here we reviewed the detections, substrates, biological functions of the glycosylation in SARS-CoV-2 proteins as well as the human receptor ACE2, and also summarized the approved and undergoing SARS-CoV-2 therapeutics associated with glycosylation. This review may not only broad the understanding of viral glycobiology, but also provide key clues for the development of new preventive and therapeutic methodologies against SARS-CoV-2 and its variants.
Collapse
Affiliation(s)
- Yanqiu Gong
- National Clinical Research Center for Geriatrics and Department of General Practice, State Key Laboratory of Biotherapy, West China Hospital, Sichuan University, and Collaborative Innovation Center of Biotherapy, 610041, Chengdu, China
| | - Suideng Qin
- School of Chemical Science & Engineering, Shanghai Key Laboratory of Chemical Assessment and Sustainability, Tongji University, 200092, Shanghai, China
| | - Lunzhi Dai
- National Clinical Research Center for Geriatrics and Department of General Practice, State Key Laboratory of Biotherapy, West China Hospital, Sichuan University, and Collaborative Innovation Center of Biotherapy, 610041, Chengdu, China.
| | - Zhixin Tian
- School of Chemical Science & Engineering, Shanghai Key Laboratory of Chemical Assessment and Sustainability, Tongji University, 200092, Shanghai, China.
| |
Collapse
|