1
|
Remoroza CA, Burke MC, Mak TD, Sheetlin SL, Mirokhin YA, Cooper BT, Goecker ZC, Lowenthal MS, Yang X, Wang G, Tchekhovskoi DV, Stein SE. Comparison of N-Glycopeptide to Released N-Glycan Abundances and the Influence of Glycopeptide Mass and Charge States on N-Linked Glycosylation of IgG Antibodies. J Proteome Res 2024; 23:1443-1457. [PMID: 38450643 PMCID: PMC10997438 DOI: 10.1021/acs.jproteome.3c00904] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/08/2024]
Abstract
We report the comparison of mass-spectral-based abundances of tryptic glycopeptides to fluorescence abundances of released labeled glycans and the effects of mass and charge state and in-source fragmentation on glycopeptide abundances. The primary glycoforms derived from Rituximab, NISTmAb, Evolocumab, and Infliximab were high-mannose and biantennary complex galactosylated and fucosylated N-glycans. Except for Evolocumab, in-source ions derived from the loss of HexNAc or HexNAc-Hex sugars are prominent for other therapeutic IgGs. After excluding in-source fragmentation of glycopeptide ions from the results, a linear correlation was observed between fluorescently labeled N-glycan and glycopeptide abundances over a dynamic range of 500. Different charge states of human IgG-derived glycopeptides containing a wider variety of abundant attached glycans were also investigated to examine the effects of the charge state on ion abundances. These revealed a linear dependence of glycopeptide abundance on the mass of the glycan with higher charge states favoring higher-mass glycans. Findings indicate that the mass spectrometry-based bottom-up approach can provide results as accurate as those of glycan release studies while revealing the origin of each attached glycan. These site-specific relative abundances are conveniently displayed and compared using previously described glycopeptide abundance distribution spectra "GADS" representations. Mass spectrometry data are available from the MAssIVE repository (MSV000093562).
Collapse
Affiliation(s)
| | | | | | | | | | - Brian T. Cooper
- Mass Spectrometry Data Center
- Department of Chemistry, University of North Carolina at Charlotte, Charlotte, North Carolina 28223, United States
| | | | - Mark S. Lowenthal
- Bioanalytical Science Group, Biomolecular Measurement Division, National Institute of Standards and Technology, 100 Bureau Drive Gaithersburg, MD 20899, US
| | | | | | | | | |
Collapse
|
2
|
Sun Z, Xiao W, Li N, Chang L, Xu P, Li Y. Large-Scale Profiling of Unexpected Tryptic Cleaved Sites at Ubiquitinated Lysines. J Proteome Res 2023; 22:1245-1254. [PMID: 36877145 DOI: 10.1021/acs.jproteome.2c00748] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/07/2023]
Abstract
Trypsin specifically cleaves the C-terminus of lysine and arginine residues but often fails to cleave modified lysines, such as ubiquitination, therefore resulting in the uncleaved K-ε-GG peptides. Therefore, the cleaved ubiquitinated peptide identification was often regarded as false positives and discarded. Interestingly, unexpected cleavage at the K48-linked ubiquitin chain has been reported, suggesting the latent ability of trypsin to cleave ubiquitinated lysine residues. However, it remains unclear whether other trypsin-cleavable ubiquitinated sites are present. In this study, we verified the ability of trypsin in cleaving K6 and K63 besides K48 chains. The uncleaved K-ε-GG peptide was quickly and efficiently generated during trypsin digestion, whereas cleaved ones were produced with much lower efficiency. Then, the K-ε-GG antibody was proved to efficiently enrich the cleaved K-ε-GG peptides and several published large-scale ubiquitylation datasets were re-analyzed to interrogate the cleaved sequence features. In total, more than 2400 cleaved ubiquitinated peptides were identified in the K-ε-GG and UbiSite antibody-based datasets. The frequency of lysine upstream of the cleaved modified K was significantly enriched. The kinetic activity of trypsin in cleaving ubiquitinated peptides was further elucidated. We suggest that the cleaved K-ε-GG sites with high post-translational modification probability (≥0.75) should be considered as true positives in future ubiquitome analyses.
Collapse
Affiliation(s)
- Zhen Sun
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Institute of Lifeomics, Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Beijing 102206, China.,State Key Laboratory of Pathogen and Biosecurity, Beijing Institute of Microbiology and Epidemiology, Beijing 100850, P. R. China
| | - Weidi Xiao
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Institute of Lifeomics, Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Beijing 102206, China
| | - Naikang Li
- Hebei Province Key Lab of Research and Application on Microbial Diversity, College of Life Sciences, Hebei University, Baoding, Hebei 071002, China
| | - Lei Chang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Institute of Lifeomics, Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Beijing 102206, China
| | - Ping Xu
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Institute of Lifeomics, Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Beijing 102206, China.,Hebei Province Key Lab of Research and Application on Microbial Diversity, College of Life Sciences, Hebei University, Baoding, Hebei 071002, China.,Anhui Medical University, Hefei 230032, China.,School of Basic Medical Science, Key Laboratory of Combinatorial Biosynthesis and Drug Discovery of Ministry of Education, School of Pharmaceutical Sciences, Wuhan University, Wuhan 430071, P. R. China
| | - Yanchang Li
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Institute of Lifeomics, Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Beijing 102206, China.,Hebei Province Key Lab of Research and Application on Microbial Diversity, College of Life Sciences, Hebei University, Baoding, Hebei 071002, China.,Anhui Medical University, Hefei 230032, China
| |
Collapse
|
3
|
Neely BA, Dorfer V, Martens L, Bludau I, Bouwmeester R, Degroeve S, Deutsch EW, Gessulat S, Käll L, Palczynski P, Payne SH, Rehfeldt TG, Schmidt T, Schwämmle V, Uszkoreit J, Vizcaíno JA, Wilhelm M, Palmblad M. Toward an Integrated Machine Learning Model of a Proteomics Experiment. J Proteome Res 2023; 22:681-696. [PMID: 36744821 PMCID: PMC9990124 DOI: 10.1021/acs.jproteome.2c00711] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
In recent years machine learning has made extensive progress in modeling many aspects of mass spectrometry data. We brought together proteomics data generators, repository managers, and machine learning experts in a workshop with the goals to evaluate and explore machine learning applications for realistic modeling of data from multidimensional mass spectrometry-based proteomics analysis of any sample or organism. Following this sample-to-data roadmap helped identify knowledge gaps and define needs. Being able to generate bespoke and realistic synthetic data has legitimate and important uses in system suitability, method development, and algorithm benchmarking, while also posing critical ethical questions. The interdisciplinary nature of the workshop informed discussions of what is currently possible and future opportunities and challenges. In the following perspective we summarize these discussions in the hope of conveying our excitement about the potential of machine learning in proteomics and to inspire future research.
Collapse
Affiliation(s)
- Benjamin A Neely
- National Institute of Standards and Technology, Charleston, South Carolina 29412, United States
| | - Viktoria Dorfer
- Bioinformatics Research Group, University of Applied Sciences Upper Austria, Softwarepark 11, 4232 Hagenberg, Austria
| | - Lennart Martens
- VIB-UGent Center for Medical Biotechnology, VIB, 9000 Ghent, Belgium.,Department of Biomolecular Medicine, Faculty of Health Sciences and Medicine, Ghent University, 9000 Ghent, Belgium
| | - Isabell Bludau
- Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, 82152 Martinsried, Germany
| | - Robbin Bouwmeester
- VIB-UGent Center for Medical Biotechnology, VIB, 9000 Ghent, Belgium.,Department of Biomolecular Medicine, Faculty of Health Sciences and Medicine, Ghent University, 9000 Ghent, Belgium
| | - Sven Degroeve
- VIB-UGent Center for Medical Biotechnology, VIB, 9000 Ghent, Belgium.,Department of Biomolecular Medicine, Faculty of Health Sciences and Medicine, Ghent University, 9000 Ghent, Belgium
| | - Eric W Deutsch
- Institute for Systems Biology, Seattle, Washington 98109, United States
| | | | - Lukas Käll
- Science for Life Laboratory, KTH - Royal Institute of Technology, 171 21 Solna, Sweden
| | - Pawel Palczynski
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, 5230 Odense, Denmark
| | - Samuel H Payne
- Department of Biology, Brigham Young University, Provo, Utah 84602, United States
| | - Tobias Greisager Rehfeldt
- Institute for Mathematics and Computer Science, University of Southern Denmark, 5230 Odense, Denmark
| | | | - Veit Schwämmle
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, 5230 Odense, Denmark
| | - Julian Uszkoreit
- Medical Proteome Analysis, Center for Protein Diagnostics (ProDi), Ruhr University Bochum, 44801 Bochum, Germany.,Medizinisches Proteom-Center, Medical Faculty, Ruhr University Bochum, 44801 Bochum, Germany
| | - Juan Antonio Vizcaíno
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Mathias Wilhelm
- Computational Mass Spectrometry, Technical University of Munich (TUM), 85354 Freising, Germany
| | - Magnus Palmblad
- Leiden University Medical Center, Postbus 9600, 2300 RC Leiden, The Netherlands
| |
Collapse
|
4
|
Liu Z, Simayijiang H, Wang Q, Yang J, Sun H, Wu R, Yan J. DNA and protein analyses of hair in forensic genetics. Int J Legal Med 2023; 137:613-633. [PMID: 36732435 DOI: 10.1007/s00414-023-02955-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Accepted: 01/20/2023] [Indexed: 02/04/2023]
Abstract
Hair is one of the most common pieces of biological evidence found at a crime scene and plays an essential role in forensic investigation. Hairs, especially non-follicular hairs, are usually found at various crime scenes, either by natural shedding or by forcible shedding. However, the genetic material in hairs is usually highly degraded, which makes forensic analysis difficult. As a result, the value of hair has not been fully exploited in forensic investigations and trials. In recent years, with advances in molecular biology, forensic analysis of hair has achieved remarkable strides and provided crucial clues in numerous cases. This article reviews recent developments in DNA and protein analysis of hair and attempts to provide a comprehensive solution to improve forensic hair analysis.
Collapse
Affiliation(s)
- Zhiyong Liu
- Faculty of Forensic Medicine, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, Guangdong, 510080, People's Republic of China
| | - Halimureti Simayijiang
- School of Forensic Medicine, Shanxi Medical University, Jinzhong, Shanxi, 030600, People's Republic of China
| | - Qiangwei Wang
- Faculty of Forensic Medicine, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, Guangdong, 510080, People's Republic of China
| | - Jingyi Yang
- Faculty of Forensic Medicine, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, Guangdong, 510080, People's Republic of China
| | - Hongyu Sun
- Faculty of Forensic Medicine, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, Guangdong, 510080, People's Republic of China.,Guangdong Province Translational Forensic Medicine Engineering Technology Research Center, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, Guangdong, 510080, People's Republic of China
| | - Riga Wu
- Faculty of Forensic Medicine, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, Guangdong, 510080, People's Republic of China. .,Guangdong Province Translational Forensic Medicine Engineering Technology Research Center, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, Guangdong, 510080, People's Republic of China.
| | - Jiangwei Yan
- School of Forensic Medicine, Shanxi Medical University, Jinzhong, Shanxi, 030600, People's Republic of China.
| |
Collapse
|
5
|
Sun B, Smialowski P, Aftab W, Schmidt A, Forne I, Straub T, Imhof A. Improving SWATH-MS analysis by deep-learning. Proteomics 2022; 23:e2200179. [PMID: 36571325 DOI: 10.1002/pmic.202200179] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2022] [Revised: 11/22/2022] [Accepted: 12/21/2022] [Indexed: 12/27/2022]
Abstract
Data-independent acquisition (DIA) of tandem mass spectrometry spectra has emerged as a promising technology to improve coverage and quantification of proteins in complex mixtures. The success of DIA experiments is dependent on the quality of spectral libraries used for data base searching. Frequently, these libraries need to be generated by labor and time intensive data dependent acquisition (DDA) experiments. Recently, several algorithms have been published that allow the generation of theoretical libraries by an efficient prediction of retention time and intensity of the fragment ions. Sequential windowed acquisition of all theoretical fragment ion spectra mass spectrometry (SWATH-MS) is a DIA method that can be applied at an unprecedented speed, but the fragmentation spectra suffer from a lower quality than data acquired on Orbitrap instruments. To reliably generate theoretical libraries that can be used in SWATH experiments, we developed deep-learning for SWATH analysis (dpSWATH), to improve the sensitivity and specificity of data generated by Q-TOF mass spectrometers. The theoretical library built by dpSWATH allowed us to increase the identification rate of proteins compared to traditional or library-free methods. Based on our analysis we conclude that dpSWATH is a superior prediction framework for SWATH-MS measurements than other algorithms based on Orbitrap data.
Collapse
Affiliation(s)
- Bo Sun
- Faculty of Medicine, Biomedical Center, Protein Analysis Unit, Ludwig-Maximilians-Universität München, Planegg-Martinsried, Germany
| | - Pawel Smialowski
- Institute of Stem Cell Research, Helmholtz Center Munich, German Research Center for Environmental Health, Germany.,Faculty of Medicine, Biomedical Center, Computational Biology Unit, Ludwig-Maximilians-Universität München, Planegg-Martinsried, Germany
| | - Wasim Aftab
- Faculty of Medicine, Biomedical Center, Protein Analysis Unit, Ludwig-Maximilians-Universität München, Planegg-Martinsried, Germany
| | - Andreas Schmidt
- Faculty of Medicine, Biomedical Center, Protein Analysis Unit, Ludwig-Maximilians-Universität München, Planegg-Martinsried, Germany
| | - Ignasi Forne
- Faculty of Medicine, Biomedical Center, Protein Analysis Unit, Ludwig-Maximilians-Universität München, Planegg-Martinsried, Germany
| | - Tobias Straub
- Faculty of Medicine, Biomedical Center, Computational Biology Unit, Ludwig-Maximilians-Universität München, Planegg-Martinsried, Germany
| | - Axel Imhof
- Faculty of Medicine, Biomedical Center, Protein Analysis Unit, Ludwig-Maximilians-Universität München, Planegg-Martinsried, Germany
| |
Collapse
|
6
|
Tng SS, Le NQK, Yeh HY, Chua MCH. Improved Prediction Model of Protein Lysine Crotonylation Sites Using Bidirectional Recurrent Neural Networks. J Proteome Res 2021; 21:265-273. [PMID: 34812044 DOI: 10.1021/acs.jproteome.1c00848] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
Abstract
Histone lysine crotonylation (Kcr) is a post-translational modification of histone proteins that is involved in the regulation of gene transcription, acute and chronic kidney injury, spermatogenesis, depression, cancer, and so forth. The identification of Kcr sites in proteins is important for characterizing and regulating primary biological mechanisms. The use of computational approaches such as machine learning and deep learning algorithms have emerged in recent years as the traditional wet-lab experiments are time-consuming and costly. We propose as part of this study a deep learning model based on a recurrent neural network (RNN) termed as Sohoko-Kcr for the prediction of Kcr sites. Through the embedded encoding of the peptide sequences, we investigate the efficiency of RNN-based models such as long short-term memory (LSTM), bidirectional LSTM (BiLSTM), and bidirectional gated recurrent unit (BiGRU) networks using cross-validation and independent tests. We also established the comparison between Sohoko-Kcr and other published tools to verify the efficiency of our model based on 3-fold, 5-fold, and 10-fold cross-validations using independent set tests. The results then show that the BiGRU model has consistently displayed outstanding performance and computational efficiency. Based on the proposed model, a webserver called Sohoko-Kcr was deployed for free use and is accessible at https://sohoko-research-9uu23.ondigitalocean.app.
Collapse
Affiliation(s)
- Sian Soo Tng
- Institute of Systems Science, National University of Singapore, 29 Heng Mui Keng Terrace, Singapore 119620, Singapore
| | - Nguyen Quoc Khanh Le
- Professional Master Program in Artificial Intelligence in Medicine, College of Medicine, Taipei Medical University, Taipei 106, Taiwan.,Research Center for Artificial Intelligence in Medicine, Taipei Medical University, Taipei 106, Taiwan.,Translational Imaging Research Center, Taipei Medical University Hospital, Taipei 110, Taiwan
| | - Hui-Yuan Yeh
- Medical Humanities Research Cluster, School of Humanities, Nanyang Technological University, 48 Nanyang Avenue, Singapore 639818, Singapore
| | - Matthew Chin Heng Chua
- Institute of Systems Science, National University of Singapore, 29 Heng Mui Keng Terrace, Singapore 119620, Singapore
| |
Collapse
|