1
|
Akmal MA, Hassan MA, Muhammad S, Khurshid KS, Mohamed A. An analytical study on the identification of N-linked glycosylation sites using machine learning model. PeerJ Comput Sci 2022; 8:e1069. [PMID: 36262138 PMCID: PMC9575850 DOI: 10.7717/peerj-cs.1069] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2022] [Accepted: 07/25/2022] [Indexed: 06/16/2023]
Abstract
N-linked is the most common type of glycosylation which plays a significant role in identifying various diseases such as type I diabetes and cancer and helps in drug development. Most of the proteins cannot perform their biological and psychological functionalities without undergoing such modification. Therefore, it is essential to identify such sites by computational techniques because of experimental limitations. This study aims to analyze and synthesize the progress to discover N-linked places using machine learning methods. It also explores the performance of currently available tools to predict such sites. Almost seventy research articles published in recognized journals of the N-linked glycosylation field have shortlisted after the rigorous filtering process. The findings of the studies have been reported based on multiple aspects: publication channel, feature set construction method, training algorithm, and performance evaluation. Moreover, a literature survey has developed a taxonomy of N-linked sequence identification. Our study focuses on the performance evaluation criteria, and the importance of N-linked glycosylation motivates us to discover resources that use computational methods instead of the experimental method due to its limitations.
Collapse
Affiliation(s)
- Muhammad Aizaz Akmal
- Department of Computer Science, University of Engineering and Technology, KSK, Lahore, Punjab, Pakistan
| | - Muhammad Awais Hassan
- Department of Computer Science, University of Engineering and Technology, Lahore, Punjab, Pakistan
| | - Shoaib Muhammad
- Department of Computer Science, University of Engineering and Technology, Lahore, Punjab, Pakistan
| | - Khaldoon S. Khurshid
- Department of Computer Science, University of Engineering and Technology, Lahore, Punjab, Pakistan
| | | |
Collapse
|
2
|
Prediction of Intact N-Glycopeptide Retention Time Windows in Hydrophilic Interaction Liquid Chromatography. MOLECULES (BASEL, SWITZERLAND) 2022; 27:molecules27123723. [PMID: 35744847 PMCID: PMC9228347 DOI: 10.3390/molecules27123723] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/09/2022] [Revised: 05/28/2022] [Accepted: 06/07/2022] [Indexed: 11/18/2022]
Abstract
Analysis of protein glycosylation is challenging due to micro- and macro-heterogeneity of the attached glycans. Hydrophilic interaction liquid chromatography (HILIC) is a mode of choice for separation of intact glycopeptides, which are inadequately resolved by reversed phase chromatography. In this work, we propose an easy-to-use model to predict retention time windows of glycopeptides in HILIC. We constructed this model based on the parameters derived from chromatographic separation of six differently glycosylated peptides obtained from tryptic digests of three plasma proteins: haptoglobin, hemopexin, and sex hormone-binding globulin. We calculated relative retention times of different glycoforms attached to the same peptide to the bi-antennary form and showed that the character of the peptide moiety did not significantly change the relative retention time differences between the glycoforms. To challenge the model, we assessed chromatographic behavior of fetuin glycopeptides experimentally, and their retention times all fell within the calculated retention time windows, which suggests that the retention time window prediction model in HILIC is sufficiently accurate. Relative retention time windows provide complementary information to mass spectrometric data, and we consider them useful for reliable determination of protein glycosylation in a site-specific manner.
Collapse
|
3
|
Pujić I, Perreault H. Recent advancements in glycoproteomic studies: Glycopeptide enrichment and derivatization, characterization of glycosylation in SARS CoV2, and interacting glycoproteins. MASS SPECTROMETRY REVIEWS 2022; 41:488-507. [PMID: 33393161 DOI: 10.1002/mas.21679] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/22/2020] [Revised: 12/13/2020] [Accepted: 12/16/2020] [Indexed: 06/12/2023]
Abstract
Proteomics studies allow for the determination of the identity, amount, and interactions of proteins under specific conditions that allow the biological state of an organism to ultimately change. These conditions can be either beneficial or detrimental. Diseases are due to detrimental changes caused by either protein overexpression or underexpression caused by as a result of a mutation or posttranslational modifications (PTM), among other factors. Identification of disease biomarkers through proteomics can be potentially used as clinical information for diagnostics. Common biomarkers to look for include PTM. For example, aberrant glycosylation of proteins is a common marker and will be a focus of interest in this review. A common way to analyze glycoproteins is by glycoproteomics involving mass spectrometry. Due to factors such as micro- and macroheterogeneity which result in a lower abundance of each version of a glycoprotein, it is difficult to obtain meaningful results unless rigorous sample preparation procedures are in place. Microheterogeneity represents the diversity of glycans at a single site, whereas macroheterogeneity depicts glycosylation levels at each site of a protein. Enrichment and derivatization of glycopeptides help to overcome these limitations. Over the time range of 2016 to 2020, several methods have been proposed in the literature and have contributed to drastically improve the outcome of glycosylation analysis, as presented in the sampling surveyed in this review. As a current topic in 2020, glycoproteins carried by pathogens can also cause disease and this is seen with SARS CoV2, causing the COVID-19 pandemic. This review will discuss glycoproteomic studies of the spike glycoprotein and interacting proteins such as the ACE2 receptor.
Collapse
Affiliation(s)
- Ivona Pujić
- Chemistry Department, University of Manitoba, Winnipeg, Manitoba, Canada
| | - Hélène Perreault
- Chemistry Department, University of Manitoba, Winnipeg, Manitoba, Canada
| |
Collapse
|
4
|
Xie Y, Butler M. Construction of InstantPC derivatized glycan GU database: A foundation work for high-throughput and high-sensitivity glycomic analysis. Glycobiology 2021; 32:289-303. [PMID: 34972858 DOI: 10.1093/glycob/cwab128] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2021] [Revised: 11/08/2021] [Accepted: 11/30/2021] [Indexed: 11/12/2022] Open
Abstract
Glycosylation is well-recognized as a critical quality attribute of biotherapeutics being routinely monitored to ensure desired product quality, safety, and efficacy. Additionally, as one of the most prominent and complex post-translational modifications, glycosylation plays a key role in disease manifestation. Changes in glycosylation may serve as a specific and sensitive biomarker for disease diagnostics and prognostics. However, the conventional 2-aminobenzamide based N-glycosylation analysis procedure is time-consuming and insensitive, with poor reproducibility. We have evaluated an innovative streamlined 96-well-plate-based platform utilizing InstantPC label for high-throughput, high-sensitivity glycan profiling, which is user-friendly, robust, and ready for automation. However, the limited availability of InstantPC labelled glycan standards has significantly hampered the applicability and transferability of this platform for expedited glycan structural profiling. To address this challenge, we have constructed a detailed InstantPC labelled glycan glucose unit database through analysis of human serum and a variety of other glycoproteins from various sources. Following preliminary hydrophilic interaction liquid chromatography with fluorescence detection separation and analysis, glycoproteins with complex glycan profiles were subjected to further fractionation by weak anion exchange hydrophilic interaction liquid chromatography and exoglycosidase sequential digestion for cross-validation of the glycan assignment. Hydrophilic interaction ultra-performance liquid chromatography coupled with electrospray ionization mass spectrometry was subsequently utilised for glycan fragmentation and accurate glycan mass confirmation. The constructed InstantPC glycan GU database is accurate and robust. It is believed that this database will enhance the application of the developed platform for high-throughput, high-sensitivity glycan profiling, and eventually advance glycan-based biopharmaceutical production and disease biomarker discovery.
Collapse
Affiliation(s)
- Yongjing Xie
- National Institute for Bioprocessing Research and Training, Foster Avenue, Mount Merrion, Blackrock, Co. Dublin, Ireland
| | - Michael Butler
- National Institute for Bioprocessing Research and Training, Foster Avenue, Mount Merrion, Blackrock, Co. Dublin, Ireland
| |
Collapse
|
5
|
High-throughput rat immunoglobulin G N-glycosylation profiling revealed subclass-specific changes associated with chronic stress. J Proteomics 2021; 245:104293. [PMID: 34118474 DOI: 10.1016/j.jprot.2021.104293] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2020] [Revised: 05/03/2021] [Accepted: 05/26/2021] [Indexed: 12/31/2022]
Abstract
Immunoglobulin G (IgG) glycosylation corresponds well with immune system changes, so it can potentially be used as a biomarker for the consequences of chronic stress such as low-grade inflammation and enhanced immunosenescence in older animals. Here we present a high-throughput glycoproteomic workflow, including IgG enrichment, HILIC glycopeptide purification, and nano-LC-MS analysis of tryptic glycopeptides applied for the analysis of rat IgG. A cohort of 80 animals was exposed to seven stressors in a customized chronic stress protocol with blood and tissue sampling in three timepoints. Young female rats experienced an increase in agalactosylated glycoforms on IgG2a and IgG2c accompanied by a decrease in monogalactosylation. Among old females, increased galactosylation was observed in the IgG2b subclass, pointing to an anti-inflammatory activity of IgG. Additionally, IgG Fc N-glycosylation patterns in Sprague Dawley rats were analyzed, quantified, and reported for the first time. Our findings emphasize age-, sex- and subclass-dependent differences in IgG glycosylation related to chronic stress exposure, confirming the relevance of newly developed methods for further research in glycobiology of rodent immune response. SIGNIFICANCE: In this study, we showed that a high-throughput streamlined methodology based on protein L 96-well monolithic plates for efficient rat IgG immunoaffinity enrichment from blood plasma, paired with appropriate tryptic glycopeptide preparation, HILIC-SPE enrichment, and nano-LC-MS methods was suitable for quick processing of large sample sets. We report a subclass-specific profiling and changes in rat IgG Fc galactosylation and adrenal gland immunohistochemistry of male and female animals exposed to a customized chronic stress protocol.
Collapse
|
6
|
Molnarova K, Duris A, Jecmen T, Kozlik P. Comparison of human IgG glycopeptides separation using mixed-mode hydrophilic interaction/ion-exchange liquid chromatography and reversed-phase mode. Anal Bioanal Chem 2021; 413:4321-4328. [PMID: 34002272 DOI: 10.1007/s00216-021-03388-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2021] [Revised: 04/12/2021] [Accepted: 05/03/2021] [Indexed: 12/24/2022]
Abstract
Glycoproteomics is a challenging branch of proteomics because of the micro- and macro-heterogeneity of protein glycosylation. Hydrophilic interaction liquid chromatography (HILIC) is an advantageous alternative to reversed-phase chromatography for intact glycopeptide separation prior to their identification by mass spectrometry. Nowadays, several HILIC columns differing in used chemistries are commercially available. However, there is a lack of comparative studies assessing their performance, and thus providing guidance for the selection of an adequate stationary phase for different glycoproteomics applications. Here, we compare three HILIC columns recently developed by Advanced Chromatography Technologies (ACE)- with unfunctionalized (HILIC-A), polyhydroxy functionalized (HILIC-N), and aminopropyl functionalized (HILIC-B) silica- with a C18 reversed-phase column in the separation of human immunoglobulin G glycopeptides. HILIC-A and HILIC-B exhibit mixed-mode separation combining hydrophilic and ion-exchange interactions for analyte retention. Expectably, reversed-phase mode successfully separated clusters of immunoglobulin G1 and immunoglobulin G2 glycopeptides, which differ in amino acid sequence, but was not able to adequately separate different glycoforms of the same peptide. All ACE HILIC columns showed higher separation power for different glycoforms, and we show that each column separates a different group of glycopeptides more effectively than the others. Moreover, HILIC-A and HILIC-N columns separated the isobaric A2G1F1 glycopeptides of immunoglobulin G, and thus showed the potential for the elucidation of the structure of isomeric glycoforms. Furthermore, the possible retention mechanism for the HILIC columns is discussed on the basis of the determined chromatographic parameters.
Collapse
Affiliation(s)
- Katarina Molnarova
- Department of Analytical Chemistry, Faculty of Science, Charles University, Hlavova 8, 128 43, Prague 2, Czech Republic
| | - Ales Duris
- Department of Analytical Chemistry, Faculty of Science, Charles University, Hlavova 8, 128 43, Prague 2, Czech Republic
| | - Tomas Jecmen
- Department of Biochemistry, Faculty of Science, Charles University, 128 00, Prague 2, Czech Republic
| | - Petr Kozlik
- Department of Analytical Chemistry, Faculty of Science, Charles University, Hlavova 8, 128 43, Prague 2, Czech Republic.
| |
Collapse
|
7
|
Zhao Y, Raidas S, Mao Y, Li N. Glycine additive facilitates site-specific glycosylation profiling of biopharmaceuticals by ion-pairing hydrophilic interaction chromatography mass spectrometry. Anal Bioanal Chem 2020; 413:1267-1277. [PMID: 33244686 DOI: 10.1007/s00216-020-03089-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2020] [Revised: 11/19/2020] [Accepted: 11/21/2020] [Indexed: 11/28/2022]
Abstract
Many biotherapeutics such as monoclonal antibodies (mAb) and Fc-domain fusion proteins contain heterogeneous glycan contents at one or multiple glycosylation site(s). Site-specific glycan profile characterization is critical for monitoring the quality of these molecules during different stages of drug development. Hydrophilic interaction chromatography (HILIC) as an orthogonal separation method to reversed-phase liquid chromatography (RPLC) can achieve better glycopeptide identification due to the effective separation between individual glycoforms as well as the separation of glycopeptides from high-abundance non-glycosylated peptides, which can be further improved by modifying the mobile phases with ion-pairing agents (IP-HILIC). However, an online IP-HILIC coupled to mass spectrometry (MS) detection may suffer from the suppression of mass spectrometry signal during electrospray ionization due to the trifluoroacetic acid (TFA), commonly used as an ion-pairing agent. Here, we reported an optimized experimental condition for IP-HILIC-MS where glycine is added in the TFA-containing mobile phases to enhance the MS detection sensitivity for glycopeptides up to ~ 50-fold by eliminating the ion-suppression effect of an ion-pairing agent while still retaining excellent separation capacity. We demonstrated that with enhanced detection sensitivity, IP-HILIC-MS can confidently identify an increased number of site-specific N-linked glycans for IgG1, and IgG4 mAbs as well as an Fc-domain fusion protein (containing five N-glycosylation sites) through MS/MS-based search in the data-dependent acquisition mode, meanwhile, achieve comparable quantitative results compared with the traditional methods. We also demonstrated that IP-HILIC-MS can be used to identify low-level O-glycosylation and non-consensus N-glycosylation on mAbs without any enrichment prior to LC-MS analysis. Graphical abstract.
Collapse
Affiliation(s)
- Yunlong Zhao
- Analytical Chemistry, Regeneron Pharmaceuticals, Inc., 777 Old Saw Mill River Road, Tarrytown, NY, 10591, USA
| | - Shivkumar Raidas
- Analytical Chemistry, Regeneron Pharmaceuticals, Inc., 777 Old Saw Mill River Road, Tarrytown, NY, 10591, USA
| | - Yuan Mao
- Analytical Chemistry, Regeneron Pharmaceuticals, Inc., 777 Old Saw Mill River Road, Tarrytown, NY, 10591, USA.
| | - Ning Li
- Analytical Chemistry, Regeneron Pharmaceuticals, Inc., 777 Old Saw Mill River Road, Tarrytown, NY, 10591, USA
| |
Collapse
|