1
|
Karpov OA, Stotland A, Raedschelders K, Chazarin B, Ai L, Murray CI, Van Eyk JE. Proteomics of the heart. Physiol Rev 2024; 104:931-982. [PMID: 38300522 DOI: 10.1152/physrev.00026.2023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2023] [Revised: 12/25/2023] [Accepted: 01/14/2024] [Indexed: 02/02/2024] Open
Abstract
Mass spectrometry-based proteomics is a sophisticated identification tool specializing in portraying protein dynamics at a molecular level. Proteomics provides biologists with a snapshot of context-dependent protein and proteoform expression, structural conformations, dynamic turnover, and protein-protein interactions. Cardiac proteomics can offer a broader and deeper understanding of the molecular mechanisms that underscore cardiovascular disease, and it is foundational to the development of future therapeutic interventions. This review encapsulates the evolution, current technologies, and future perspectives of proteomic-based mass spectrometry as it applies to the study of the heart. Key technological advancements have allowed researchers to study proteomes at a single-cell level and employ robot-assisted automation systems for enhanced sample preparation techniques, and the increase in fidelity of the mass spectrometers has allowed for the unambiguous identification of numerous dynamic posttranslational modifications. Animal models of cardiovascular disease, ranging from early animal experiments to current sophisticated models of heart failure with preserved ejection fraction, have provided the tools to study a challenging organ in the laboratory. Further technological development will pave the way for the implementation of proteomics even closer within the clinical setting, allowing not only scientists but also patients to benefit from an understanding of protein interplay as it relates to cardiac disease physiology.
Collapse
Affiliation(s)
- Oleg A Karpov
- Smidt Heart Institute, Advanced Clinical Biosystems Research Institute, Cedars-Sinai Medical Center, Los Angeles, California, United States
| | - Aleksandr Stotland
- Smidt Heart Institute, Advanced Clinical Biosystems Research Institute, Cedars-Sinai Medical Center, Los Angeles, California, United States
| | - Koen Raedschelders
- Smidt Heart Institute, Advanced Clinical Biosystems Research Institute, Cedars-Sinai Medical Center, Los Angeles, California, United States
| | - Blandine Chazarin
- Smidt Heart Institute, Advanced Clinical Biosystems Research Institute, Cedars-Sinai Medical Center, Los Angeles, California, United States
| | - Lizhuo Ai
- Smidt Heart Institute, Advanced Clinical Biosystems Research Institute, Cedars-Sinai Medical Center, Los Angeles, California, United States
| | - Christopher I Murray
- Smidt Heart Institute, Advanced Clinical Biosystems Research Institute, Cedars-Sinai Medical Center, Los Angeles, California, United States
| | - Jennifer E Van Eyk
- Smidt Heart Institute, Advanced Clinical Biosystems Research Institute, Cedars-Sinai Medical Center, Los Angeles, California, United States
| |
Collapse
|
2
|
Fröhlich K, Fahrner M, Brombacher E, Seredynska A, Maldacker M, Kreutz C, Schmidt A, Schilling O. Data-independent acquisition: A milestone and prospect in clinical mass spectrometry-based proteomics. Mol Cell Proteomics 2024:100800. [PMID: 38880244 DOI: 10.1016/j.mcpro.2024.100800] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2024] [Revised: 06/08/2024] [Accepted: 06/13/2024] [Indexed: 06/18/2024] Open
Abstract
Data-independent acquisition (DIA) has revolutionized the field of mass spectrometry (MS)-based proteomics over the past few years. DIA stands out for its ability to systematically sample all peptides in a given mass-to-charge range, allowing an unbiased acquisition of proteomics data. This greatly mitigates the issue of missing values and significantly enhances quantitative accuracy, precision, and reproducibility compared to many traditional methods. This review focuses on the critical role of DIA analysis software tools, primarily focusing on their capabilities and the challenges they address in proteomic research. Advances in MS technology, such as trapped ion mobility spectrometry, or high field asymmetric waveform ion mobility spectrometry require sophisticated analysis software capable of handling the increased data complexity and exploiting the full potential of DIA. We identify and critically evaluate leading software tools in the DIA landscape, discussing their unique features, and the reliability of their quantitative and qualitative outputs. We present the biological and clinical relevance of DIA-MS and discuss crucial publications that paved the way for in-depth proteomic characterization in patient-derived specimens. Furthermore, we provide a perspective on emerging trends in clinical applications and present upcoming challenges including standardization and certification of MS-based acquisition strategies in molecular diagnostics. While we emphasize the need for continuous development of software tools to keep pace with evolving technologies, we advise researchers against uncritically accepting the results from DIA software tools. Each tool may have its own biases, and some may not be as sensitive or reliable as others. Our overarching recommendation for both researchers and clinicians is to employ multiple DIA analysis tools, utilizing orthogonal analysis approaches to enhance the robustness and reliability of their findings.
Collapse
Affiliation(s)
- Klemens Fröhlich
- Proteomics Core Facility, Biozentrum Basel, University of Basel, Basel, Switzerland
| | - Matthias Fahrner
- Institute for Surgical Pathology, Medical Center - University of Freiburg, Faculty of Medicine, University of Freiburg, Germany; German Cancer Consortium (DKTK) and Cancer Research Center (DKFZ), Freiburg, Germany
| | - Eva Brombacher
- Institute of Medical Biometry and Statistics, Faculty of Medicine and Medical Center-University of Freiburg, Germany; Centre for Integrative Biological Signaling Studies (CIBSS), University of Freiburg, Germany; Spemann Graduate School of Biology and Medicine (SGBM), University of Freiburg, Germany; Faculty of Biology, University of Freiburg, Germany
| | - Adrianna Seredynska
- Institute for Surgical Pathology, Medical Center - University of Freiburg, Faculty of Medicine, University of Freiburg, Germany; German Cancer Consortium (DKTK) and Cancer Research Center (DKFZ), Freiburg, Germany; Faculty of Biology, University of Freiburg, Germany
| | - Maximilian Maldacker
- Institute for Surgical Pathology, Medical Center - University of Freiburg, Faculty of Medicine, University of Freiburg, Germany; Faculty of Biology, University of Freiburg, Germany
| | - Clemens Kreutz
- Institute of Medical Biometry and Statistics, Faculty of Medicine and Medical Center-University of Freiburg, Germany; Centre for Integrative Biological Signaling Studies (CIBSS), University of Freiburg, Germany
| | - Alexander Schmidt
- Proteomics Core Facility, Biozentrum Basel, University of Basel, Basel, Switzerland
| | - Oliver Schilling
- Institute for Surgical Pathology, Medical Center - University of Freiburg, Faculty of Medicine, University of Freiburg, Germany; German Cancer Consortium (DKTK) and Cancer Research Center (DKFZ), Freiburg, Germany
| |
Collapse
|
3
|
Plubell DL, Remes PM, Wu CC, Jacob CC, Merrihew GE, Hsu C, Shulman N, MacLean BX, Heil L, Poston K, Montine T, MacCoss MJ. Development of highly multiplex targeted proteomics assays in biofluids using the Stellar mass spectrometer. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.04.597431. [PMID: 38895256 PMCID: PMC11185584 DOI: 10.1101/2024.06.04.597431] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/21/2024]
Abstract
The development of targeted assays that monitor biomedically relevant proteins is an important step in bridging discovery experiments to large scale clinical studies. Targeted assays are currently unable to scale to hundreds or thousands of targets. We demonstrate the generation of large-scale assays using a novel hybrid nominal mass instrument. The scale of these assays is achievable with the Stellar™ mass spectrometer through the accommodation of shifting retention times by real-time alignment, while being sensitive and fast enough to handle many concurrent targets. Assays were constructed using precursor information from gas-phase fractionated (GPF) data-independent acquisition (DIA). We demonstrate the ability to schedule methods from an orbitrap and linear ion trap acquired GPF DIA library and compare the quantification of a matrix-matched calibration curve from orbitrap DIA and linear ion trap parallel reaction monitoring (PRM). Two applications of these proposed workflows are shown with a cerebrospinal fluid (CSF) neurodegenerative disease protein PRM assay and with a Mag-Net enriched plasma extracellular vesicle (EV) protein survey PRM assay.
Collapse
Affiliation(s)
- Deanna L Plubell
- Department of Genome Sciences, University of Washington, Seattle WA
| | | | - Christine C Wu
- Department of Genome Sciences, University of Washington, Seattle WA
| | | | | | - Chris Hsu
- Department of Genome Sciences, University of Washington, Seattle WA
| | - Nick Shulman
- Department of Genome Sciences, University of Washington, Seattle WA
| | | | | | - Kathleen Poston
- Department of Neurology & Neurological Sciences, Stanford University, Palo Alto CA, USA
| | - Tom Montine
- Department of Pathology, Stanford University, Palo Alto CA, USA
| | | |
Collapse
|
4
|
Lewis JM, Jebeli L, Coulon PML, Lay CE, Scott NE. Glycoproteomic and proteomic analysis of Burkholderia cenocepacia reveals glycosylation events within FliF and MotB are dispensable for motility. Microbiol Spectr 2024; 12:e0034624. [PMID: 38709084 PMCID: PMC11237607 DOI: 10.1128/spectrum.00346-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2024] [Accepted: 04/16/2024] [Indexed: 05/07/2024] Open
Abstract
Across the Burkholderia genus O-linked protein glycosylation is highly conserved. While the inhibition of glycosylation has been shown to be detrimental for virulence in Burkholderia cepacia complex species, such as Burkholderia cenocepacia, little is known about how specific glycosylation sites impact protein functionality. Within this study, we sought to improve our understanding of the breadth, dynamics, and requirement for glycosylation across the B. cenocepacia O-glycoproteome. Assessing the B. cenocepacia glycoproteome across different culture media using complementary glycoproteomic approaches, we increase the known glycoproteome to 141 glycoproteins. Leveraging this repertoire of glycoproteins, we quantitively assessed the glycoproteome of B. cenocepacia using Data-Independent Acquisition (DIA) revealing the B. cenocepacia glycoproteome is largely stable across conditions with most glycoproteins constitutively expressed. Examination of how the absence of glycosylation impacts the glycoproteome reveals that the protein abundance of only five glycoproteins (BCAL1086, BCAL2974, BCAL0525, BCAM0505, and BCAL0127) are altered by the loss of glycosylation. Assessing ΔfliF (ΔBCAL0525), ΔmotB (ΔBCAL0127), and ΔBCAM0505 strains, we demonstrate the loss of FliF, and to a lesser extent MotB, mirror the proteomic effects observed in the absence of glycosylation in ΔpglL. While both MotB and FliF are essential for motility, we find loss of glycosylation sites in MotB or FliF does not impact motility supporting these sites are dispensable for function. Combined this work broadens our understanding of the B. cenocepacia glycoproteome supporting that the loss of glycoproteins in the absence of glycosylation is not an indicator of the requirement for glycosylation for protein function. IMPORTANCE Burkholderia cenocepacia is an opportunistic pathogen of concern within the Cystic Fibrosis community. Despite a greater appreciation of the unique physiology of B. cenocepacia gained over the last 20 years a complete understanding of the proteome and especially the O-glycoproteome, is lacking. In this study, we utilize systems biology approaches to expand the known B. cenocepacia glycoproteome as well as track the dynamics of glycoproteins across growth phases, culturing media and in response to the loss of glycosylation. We show that the glycoproteome of B. cenocepacia is largely stable across conditions and that the loss of glycosylation only impacts five glycoproteins including the motility associated proteins FliF and MotB. Examination of MotB and FliF shows, while these proteins are essential for motility, glycosylation is dispensable. Combined this work supports that B. cenocepacia glycosylation can be dispensable for protein function and may influence protein properties beyond stability.
Collapse
Affiliation(s)
- Jessica M Lewis
- Department of Microbiology and Immunology, University of Melbourne at the Peter Doherty Institute for Infection and Immunity, Melbourne, Australia
| | - Leila Jebeli
- Department of Microbiology and Immunology, University of Melbourne at the Peter Doherty Institute for Infection and Immunity, Melbourne, Australia
| | - Pauline M L Coulon
- Department of Microbiology and Immunology, University of Melbourne at the Peter Doherty Institute for Infection and Immunity, Melbourne, Australia
| | - Catrina E Lay
- Department of Microbiology and Immunology, University of Melbourne at the Peter Doherty Institute for Infection and Immunity, Melbourne, Australia
| | - Nichollas E Scott
- Department of Microbiology and Immunology, University of Melbourne at the Peter Doherty Institute for Infection and Immunity, Melbourne, Australia
| |
Collapse
|
5
|
Li K, Teo GC, Yang KL, Yu F, Nesvizhskii AI. diaTracer enables spectrum-centric analysis of diaPASEF proteomics data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.25.595875. [PMID: 38854051 PMCID: PMC11160675 DOI: 10.1101/2024.05.25.595875] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2024]
Abstract
Data-independent acquisition (DIA) has become a widely used strategy for peptide and protein quantification in mass spectrometry-based proteomics studies. The integration of ion mobility separation into DIA analysis, such as the diaPASEF technology available on Bruker's timsTOF platform, further improves the quantification accuracy and protein depth achievable using DIA. We introduce diaTracer, a new spectrum-centric computational tool optimized for diaPASEF data. diaTracer performs three-dimensional (m/z, retention time, ion mobility) peak tracing and feature detection to generate precursor-resolved "pseudo-MS/MS" spectra, facilitating direct ("spectral-library free") peptide identification and quantification from diaPASEF data. diaTracer is available as a stand-alone tool and is fully integrated into the widely used FragPipe computational platform. We demonstrate the performance of diaTracer and FragPipe using diaPASEF data from cerebrospinal fluid (CSF) and plasma samples, data from phosphoproteomics and HLA immunopeptidomics experiments, and low-input data from a spatial proteomics study. We also show that diaTracer enables unrestricted identification of post-translational modifications from diaPASEF data using open/mass offset searches.
Collapse
Affiliation(s)
- Kai Li
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Guo Ci Teo
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA
| | - Kevin L. Yang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Fengchao Yu
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA
| | - Alexey I. Nesvizhskii
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA
| |
Collapse
|
6
|
Plubell DL, Huang E, Spencer SE, Poston K, Montine TJ, MacCoss MJ. Data Independent Acquisition to Inform the Development of Targeted Proteomics Assays Using a Triple Quadrupole Mass Spectrometer. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.29.596554. [PMID: 38853953 PMCID: PMC11160738 DOI: 10.1101/2024.05.29.596554] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2024]
Abstract
Mass spectrometry based targeted proteomics methods provide sensitive and high-throughput analysis of selected proteins. To develop a targeted bottom-up proteomics assay, peptides must be evaluated as proxies for the measurement of a protein or proteoform in a biological matrix. Candidate peptide selection typically relies on predetermined biochemical properties, data from semi-stochastic sampling, or by empirical measurements. These strategies require extensive testing and method refinement due to the difficulties associated with prediction of peptide response in the biological matrix of interest. Gas-phase fractionated (GPF) narrow window data-independent acquisition (DIA) aids in the development of reproducible selected reaction monitoring (SRM) assays by providing matrix-specific information on peptide detectability and quantification by mass spectrometry. To demonstrate the suitability of DIA data for selecting peptide targets, we reimplement a portion of an existing assay to measure 98 Alzheimer's disease proteins in cerebrospinal fluid (CSF). Peptides were selected from GPF-DIA based on signal intensity and reproducibility. The resulting SRM assay exhibits similar quantitative precision to published data, despite the inclusion of different peptides between the assays. This workflow enables development of new assays without additional up-front data acquisition, demonstrated here through generation of a separate assay for an unrelated set of proteins in CSF from the same dataset.
Collapse
Affiliation(s)
- Deanna L Plubell
- University of Washington, Department of Genome Sciences, Seattle, WA, 98195, USA
| | - Eric Huang
- University of Washington, Department of Genome Sciences, Seattle, WA, 98195, USA
| | - Sandra E Spencer
- University of Washington, Department of Genome Sciences, Seattle, WA, 98195, USA
| | - Kathleen Poston
- Stanford University, Department of Neurology & Neurological Sciences, Stanford, CA, 94305, USA
| | - Thomas J Montine
- Stanford University, Department of Pathology, Stanford, CA, 94305, USA
| | - Michael J MacCoss
- University of Washington, Department of Genome Sciences, Seattle, WA, 98195, USA
| |
Collapse
|
7
|
Peters-Clarke TM, Coon JJ, Riley NM. Instrumentation at the Leading Edge of Proteomics. Anal Chem 2024; 96:7976-8010. [PMID: 38738990 DOI: 10.1021/acs.analchem.3c04497] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/14/2024]
Affiliation(s)
- Trenton M Peters-Clarke
- Department of Chemistry, University of Wisconsin─Madison, Madison, Wisconsin 53706, United States
- Department of Biomolecular Chemistry, University of Wisconsin─Madison, Madison, Wisconsin 53706, United States
| | - Joshua J Coon
- Department of Chemistry, University of Wisconsin─Madison, Madison, Wisconsin 53706, United States
- Department of Biomolecular Chemistry, University of Wisconsin─Madison, Madison, Wisconsin 53706, United States
- Morgridge Institute for Research, Madison, Wisconsin 53715, United States
| | - Nicholas M Riley
- Department of Chemistry, University of Washington, Seattle, Washington 98195, United States
| |
Collapse
|
8
|
Sing JC, Charkow J, AlHigaylan M, Horecka I, Xu L, Röst HL. MassDash: A Web-Based Dashboard for Data-Independent Acquisition Mass Spectrometry Visualization. J Proteome Res 2024. [PMID: 38684072 DOI: 10.1021/acs.jproteome.4c00026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/02/2024]
Abstract
With the increased usage and diversity of methods and instruments being applied to analyze Data-Independent Acquisition (DIA) data, visualization is becoming increasingly important to validate automated software results. Here we present MassDash, a cross-platform DIA mass spectrometry visualization and validation software for comparing features and results across popular tools. MassDash provides a web-based interface and Python package for interactive feature visualizations and summary report plots across multiple automated DIA feature detection tools, including OpenSwath, DIA-NN, and dreamDIA. Furthermore, MassDash processes peptides on the fly, enabling interactive visualization of peptides across dozens of runs simultaneously on a personal computer. MassDash supports various multidimensional visualizations across retention time, ion mobility, m/z, and intensity, providing additional insights into the data. The modular framework is easily extendable, enabling rapid algorithm development of novel peak-picker techniques, such as deep-learning-based approaches and refinement of existing tools. MassDash is open-source under a BSD 3-Clause license and freely available at https://github.com/Roestlab/massdash, and a demo version can be accessed at https://massdash.streamlit.app.
Collapse
Affiliation(s)
- Justin C Sing
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5G 1A8, Canada
| | - Joshua Charkow
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5G 1A8, Canada
| | - Mohammed AlHigaylan
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5G 1A8, Canada
| | - Ira Horecka
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5G 1A8, Canada
| | - Leon Xu
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5G 1A8, Canada
| | - Hannes L Röst
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5G 1A8, Canada
- Department of Computer Science, University of Toronto, Toronto, Ontario M5G 1A8, Canada
| |
Collapse
|
9
|
Basharat AR, Xiong X, Xu T, Zang Y, Sun L, Liu X. TopDIA: A Software Tool for Top-Down Data-Independent Acquisition Proteomics. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.05.588302. [PMID: 38645171 PMCID: PMC11030422 DOI: 10.1101/2024.04.05.588302] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/23/2024]
Abstract
Top-down mass spectrometry is widely used for proteoform identification, characterization, and quantification owing to its ability to analyze intact proteoforms. In the last decade, top-down proteomics has been dominated by top-down data-dependent acquisition mass spectrometry (TD-DDA-MS), and top-down data-independent acquisition mass spectrometry (TD-DIA-MS) has not been well studied. While TD-DIA-MS produces complex multiplexed tandem mass spectrometry (MS/MS) spectra, which are challenging to confidently identify, it selects more precursor ions for MS/MS analysis and has the potential to increase proteoform identifications compared with TD-DDA-MS. Here we present TopDIA, the first software tool for proteoform identification by TD-DIA-MS. It generates demultiplexed pseudo MS/MS spectra from TD-DIA-MS data and then searches the pseudo MS/MS spectra against a protein sequence database for proteoform identification. We compared the performance of TD-DDA-MS and TD-DIA-MS using Escherichia coli K-12 MG1655 cells and demonstrated that TD-DIA-MS with TopDIA increased proteoform and protein identifications compared with TD-DDA-MS.
Collapse
Affiliation(s)
- Abdul Rehman Basharat
- Department of BioHealth Informatics, Luddy School of Informatics, Computing, and Engineering, Indiana University-Purdue University Indianapolis, Indianapolis, IN, 46202, USA
| | - Xingzhao Xiong
- Deming Department of Medicine, Tulane University School of Medicine, New Orleans, LA, 70112, USA
| | - Tian Xu
- Department of Chemistry, Michigan State University, East Lansing, MI, 48824, USA
| | - Yong Zang
- Department of Biostatistics and Health Data Sciences, Indiana University School of Medicine, Indianapolis, IN, 46202, USA
| | - Liangliang Sun
- Department of Chemistry, Michigan State University, East Lansing, MI, 48824, USA
| | - Xiaowen Liu
- Deming Department of Medicine, Tulane University School of Medicine, New Orleans, LA, 70112, USA
| |
Collapse
|
10
|
He Q, Guo H, Li Y, He G, Li X, Shuai J. SeFilter-DIA: Squeeze-and-Excitation Network for Filtering High-Confidence Peptides of Data-Independent Acquisition Proteomics. Interdiscip Sci 2024:10.1007/s12539-024-00611-4. [PMID: 38472692 DOI: 10.1007/s12539-024-00611-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Revised: 01/12/2024] [Accepted: 01/21/2024] [Indexed: 03/14/2024]
Abstract
Mass spectrometry is crucial in proteomics analysis, particularly using Data Independent Acquisition (DIA) for reliable and reproducible mass spectrometry data acquisition, enabling broad mass-to-charge ratio coverage and high throughput. DIA-NN, a prominent deep learning software in DIA proteome analysis, generates peptide results but may include low-confidence peptides. Conventionally, biologists have to manually screen peptide fragment ion chromatogram peaks (XIC) for identifying high-confidence peptides, a time-consuming and subjective process prone to variability. In this study, we introduce SeFilter-DIA, a deep learning algorithm, aiming at automating the identification of high-confidence peptides. Leveraging compressed excitation neural network and residual network models, SeFilter-DIA extracts XIC features and effectively discerns between high and low-confidence peptides. Evaluation of the benchmark datasets demonstrates SeFilter-DIA achieving 99.6% AUC on the test set and 97% for other performance indicators. Furthermore, SeFilter-DIA is applicable for screening peptides with phosphorylation modifications. These results demonstrate the potential of SeFilter-DIA to replace manual screening, providing an efficient and objective approach for high-confidence peptide identification while mitigating associated limitations.
Collapse
Affiliation(s)
- Qingzu He
- Department of Physics, National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen, 361005, China
- Wenzhou Key Laboratory of Biophysics, Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou, 325001, China
| | - Huan Guo
- Department of Physics, National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen, 361005, China
| | - Yulin Li
- Department of Physics, National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen, 361005, China
| | - Guoqiang He
- Wenzhou Key Laboratory of Biophysics, Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou, 325001, China
| | - Xiang Li
- Department of Physics, National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen, 361005, China.
| | - Jianwei Shuai
- Wenzhou Key Laboratory of Biophysics, Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou, 325001, China.
- Oujiang Laboratory (Zhejiang Lab for Regenerative Medicine, Vision and Brain Health), Wenzhou, 325001, China.
| |
Collapse
|
11
|
Lou R, Shui W. Acquisition and Analysis of DIA-Based Proteomic Data: A Comprehensive Survey in 2023. Mol Cell Proteomics 2024; 23:100712. [PMID: 38182042 PMCID: PMC10847697 DOI: 10.1016/j.mcpro.2024.100712] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Revised: 12/27/2023] [Accepted: 01/02/2024] [Indexed: 01/07/2024] Open
Abstract
Data-independent acquisition (DIA) mass spectrometry (MS) has emerged as a powerful technology for high-throughput, accurate, and reproducible quantitative proteomics. This review provides a comprehensive overview of recent advances in both the experimental and computational methods for DIA proteomics, from data acquisition schemes to analysis strategies and software tools. DIA acquisition schemes are categorized based on the design of precursor isolation windows, highlighting wide-window, overlapping-window, narrow-window, scanning quadrupole-based, and parallel accumulation-serial fragmentation-enhanced DIA methods. For DIA data analysis, major strategies are classified into spectrum reconstruction, sequence-based search, library-based search, de novo sequencing, and sequencing-independent approaches. A wide array of software tools implementing these strategies are reviewed, with details on their overall workflows and scoring approaches at different steps. The generation and optimization of spectral libraries, which are critical resources for DIA analysis, are also discussed. Publicly available benchmark datasets covering global proteomics and phosphoproteomics are summarized to facilitate performance evaluation of various software tools and analysis workflows. Continued advances and synergistic developments of versatile components in DIA workflows are expected to further enhance the power of DIA-based proteomics.
Collapse
Affiliation(s)
- Ronghui Lou
- iHuman Institute, ShanghaiTech University, Shanghai, China; School of Life Science and Technology, ShanghaiTech University, Shanghai, China.
| | - Wenqing Shui
- iHuman Institute, ShanghaiTech University, Shanghai, China; School of Life Science and Technology, ShanghaiTech University, Shanghai, China.
| |
Collapse
|
12
|
Wu E, Yang Y, Zhao J, Zheng J, Wang X, Shen C, Qiao L. High-Abundance Protein-Guided Hybrid Spectral Library for Data-Independent Acquisition Metaproteomics. Anal Chem 2024; 96:1029-1037. [PMID: 38180447 DOI: 10.1021/acs.analchem.3c03255] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2024]
Abstract
Metaproteomics offers a direct avenue to identify microbial proteins in microbiota, enabling the compositional and functional characterization of microbiota. Due to the complexity and heterogeneity of microbial communities, in-depth and accurate metaproteomics faces tremendous limitations. One challenge in metaproteomics is the construction of a suitable protein sequence database to interpret the highly complex metaproteomic data, especially in the absence of metagenomic sequencing data. Herein, we present a high-abundance protein-guided hybrid spectral library strategy for in-depth data independent acquisition (DIA) metaproteomic analysis (HAPs-hyblibDIA). A dedicated high-abundance protein database of gut microbial species is constructed and used to mine the taxonomic information on microbiota samples. Then, a sample-specific protein sequence database is built based on the taxonomic information using Uniprot protein sequence for subsequent analysis of the DIA data using hybrid spectral library-based DIA analysis. We evaluated the accuracy and sensitivity of the method using synthetic microbial community samples and human gut microbiome samples. It was demonstrated that the strategy can successfully identify taxonomic compositions of microbiota samples and that the peptides identified by HAPs-hyblibDIA overlapped greatly with the peptides identified using a metagenomic sequencing-derived database. At the peptide and species level, our results can serve as a complement to the results obtained using a metagenomic sequencing-derived database. Furthermore, we validated the applicability of the HAPs-hyblibDIA strategy in a cohort of human gut microbiota samples of colorectal cancer patients and controls, highlighting its usability in biomedical research.
Collapse
Affiliation(s)
- Enhui Wu
- Department of Chemistry, and Shanghai Stomatological Hospital, Fudan University, Shanghai 200000, China
| | - Yi Yang
- Department of Chemistry, and Shanghai Stomatological Hospital, Fudan University, Shanghai 200000, China
- ZJU-Hangzhou Global Scientific and Technological Innovation Center, Zhejiang University, Hangzhou 310000, China
| | - Jinzhi Zhao
- Department of Chemistry, and Shanghai Stomatological Hospital, Fudan University, Shanghai 200000, China
| | - Jianxujie Zheng
- Department of Chemistry, and Shanghai Stomatological Hospital, Fudan University, Shanghai 200000, China
| | - Xiaoqing Wang
- Shanghai Omicsolution Co., Ltd., Shanghai 200000, China
| | - Chengpin Shen
- Shanghai Omicsolution Co., Ltd., Shanghai 200000, China
| | - Liang Qiao
- Department of Chemistry, and Shanghai Stomatological Hospital, Fudan University, Shanghai 200000, China
| |
Collapse
|
13
|
Kitata RB, Yang JC, Chen YJ. Advances in data-independent acquisition mass spectrometry towards comprehensive digital proteome landscape. MASS SPECTROMETRY REVIEWS 2023; 42:2324-2348. [PMID: 35645145 DOI: 10.1002/mas.21781] [Citation(s) in RCA: 37] [Impact Index Per Article: 37.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/09/2021] [Revised: 12/17/2021] [Accepted: 01/21/2022] [Indexed: 06/15/2023]
Abstract
The data-independent acquisition mass spectrometry (DIA-MS) has rapidly evolved as a powerful alternative for highly reproducible proteome profiling with a unique strength of generating permanent digital maps for retrospective analysis of biological systems. Recent advancements in data analysis software tools for the complex DIA-MS/MS spectra coupled to fast MS scanning speed and high mass accuracy have greatly expanded the sensitivity and coverage of DIA-based proteomics profiling. Here, we review the evolution of the DIA-MS techniques, from earlier proof-of-principle of parallel fragmentation of all-ions or ions in selected m/z range, the sequential window acquisition of all theoretical mass spectra (SWATH-MS) to latest innovations, recent development in computation algorithms for data informatics, and auxiliary tools and advanced instrumentation to enhance the performance of DIA-MS. We further summarize recent applications of DIA-MS and experimentally-derived as well as in silico spectra library resources for large-scale profiling to facilitate biomarker discovery and drug development in human diseases with emphasis on the proteomic profiling coverage. Toward next-generation DIA-MS for clinical proteomics, we outline the challenges in processing multi-dimensional DIA data set and large-scale clinical proteomics, and continuing need in higher profiling coverage and sensitivity.
Collapse
Affiliation(s)
| | - Jhih-Ci Yang
- Institute of Chemistry, Academia Sinica, Taipei, Taiwan
- Sustainable Chemical Science and Technology, Taiwan International Graduate Program, Academia Sinica and National Yang Ming Chiao Tung University, Taipei, Taiwan
- Department of Applied Chemistry, National Yang Ming Chiao Tung University, Hsinchu, Taiwan
| | - Yu-Ju Chen
- Institute of Chemistry, Academia Sinica, Taipei, Taiwan
- Sustainable Chemical Science and Technology, Taiwan International Graduate Program, Academia Sinica and National Yang Ming Chiao Tung University, Taipei, Taiwan
- Department of Chemistry, National Taiwan University, Taipei, Taiwan
| |
Collapse
|
14
|
Ahn G, Riley NM, Kamber RA, Wisnovsky S, Moncayo von Hase S, Bassik MC, Banik SM, Bertozzi CR. Elucidating the cellular determinants of targeted membrane protein degradation by lysosome-targeting chimeras. Science 2023; 382:eadf6249. [PMID: 37856615 PMCID: PMC10766146 DOI: 10.1126/science.adf6249] [Citation(s) in RCA: 15] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2022] [Accepted: 08/31/2023] [Indexed: 10/21/2023]
Abstract
Targeted protein degradation can provide advantages over inhibition approaches in the development of therapeutic strategies. Lysosome-targeting chimeras (LYTACs) harness receptors, such as the cation-independent mannose 6-phosphate receptor (CI-M6PR), to direct extracellular proteins to lysosomes. In this work, we used a genome-wide CRISPR knockout approach to identify modulators of LYTAC-mediated membrane protein degradation in human cells. We found that disrupting retromer genes improved target degradation by reducing LYTAC recycling to the plasma membrane. Neddylated cullin-3 facilitated LYTAC-complex lysosomal maturation and was a predictive marker for LYTAC efficacy. A substantial fraction of cell surface CI-M6PR remains occupied by endogenous M6P-modified glycoproteins. Thus, inhibition of M6P biosynthesis increased the internalization of LYTAC-target complexes. Our findings inform design strategies for next-generation LYTACs and elucidate aspects of cell surface receptor occupancy and trafficking.
Collapse
Affiliation(s)
- Green Ahn
- Department of Chemistry, Stanford University, Stanford, CA 94305, USA
- Sarafan ChEM-H, Stanford University, Stanford, CA 94305, USA
| | - Nicholas M. Riley
- Department of Chemistry, Stanford University, Stanford, CA 94305, USA
- Sarafan ChEM-H, Stanford University, Stanford, CA 94305, USA
| | - Roarke A. Kamber
- Department of Genetics, Stanford University, Stanford, CA 94305, USA
| | - Simon Wisnovsky
- Pharmaceutical Sciences, University of British Columbia, Vancouver, BC, Canada
| | - Salvador Moncayo von Hase
- Department of Chemistry, Stanford University, Stanford, CA 94305, USA
- Sarafan ChEM-H, Stanford University, Stanford, CA 94305, USA
| | - Michael C. Bassik
- Sarafan ChEM-H, Stanford University, Stanford, CA 94305, USA
- Department of Genetics, Stanford University, Stanford, CA 94305, USA
| | - Steven M. Banik
- Department of Chemistry, Stanford University, Stanford, CA 94305, USA
- Sarafan ChEM-H, Stanford University, Stanford, CA 94305, USA
| | - Carolyn R. Bertozzi
- Department of Chemistry, Stanford University, Stanford, CA 94305, USA
- Sarafan ChEM-H, Stanford University, Stanford, CA 94305, USA
- Howard Hughes Medical Institute, Stanford University, Stanford, CA 94305, USA
| |
Collapse
|
15
|
Searle BC, Chien A, Koller A, Hawke D, Herren AW, Kim Kim J, Lee KA, Leib RD, Nelson AJ, Patel P, Ren JM, Stemmer PM, Zhu Y, Neely BA, Patel B. A Multipathway Phosphopeptide Standard for Rapid Phosphoproteomics Assay Development. Mol Cell Proteomics 2023; 22:100639. [PMID: 37657519 PMCID: PMC10561125 DOI: 10.1016/j.mcpro.2023.100639] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2023] [Revised: 08/22/2023] [Accepted: 08/24/2023] [Indexed: 09/03/2023] Open
Abstract
Recent advances in methodology have made phosphopeptide analysis a tractable problem for many proteomics researchers. There are now a wide variety of robust and accessible enrichment strategies to generate phosphoproteomes while free or inexpensive software tools for quantitation and site localization have simplified phosphoproteome analysis workflow tremendously. As a research group under the Association for Biomolecular Resource Facilities umbrella, the Proteomics Standards Research Group has worked to develop a multipathway phosphopeptide standard based on a mixture of heavy-labeled phosphopeptides designed to enable researchers to rapidly develop assays. This mixture contains 131 mass spectrometry vetted phosphopeptides specifically chosen to cover as many known biologically interesting phosphosites as possible from seven different signaling networks: AMPK signaling, death and apoptosis signaling, ErbB signaling, insulin/insulin-like growth factor-1 signaling, mTOR signaling, PI3K/AKT signaling, and stress (p38/SAPK/JNK) signaling. Here, we describe a characterization of this mixture spiked into a HeLa tryptic digest stimulated with both epidermal growth factor and insulin-like growth factor-1 to activate the MAPK and PI3K/AKT/mTOR pathways. We further demonstrate a comparison of phosphoproteomic profiling of HeLa performed independently in five labs using this phosphopeptide mixture with data-independent acquisition. Despite different experimental and instrumentation processes, we found that labs could produce reproducible, harmonized datasets by reporting measurements as ratios to the standard, while intensity measurements showed lower consistency between labs even after normalization. Our results suggest that widely available, biologically relevant phosphopeptide standards can act as a quantitative "yardstick" across laboratories and sample preparations enabling experimental designs larger than a single laboratory can perform. Raw data files are publicly available in the MassIVE dataset MSV000090564.
Collapse
Affiliation(s)
- Brian C Searle
- Department of Biomedical Informatics, The Ohio State University, Columbus, Ohio, USA; Pelotonia Institute for Immuno-Oncology, The Ohio State University Comprehensive Cancer Center, Columbus, Ohio, USA.
| | - Allis Chien
- Mass Spectrometry Center, Stanford University, Stanford, California, USA
| | | | | | - Anthony W Herren
- UC Davis Genome Center, Proteomics Core, University of California Davis, Davis California, USA
| | - Jenny Kim Kim
- Herbert Irving Comprehensive Cancer Center, Columbia University Medical Center, New York, New York, USA
| | - Kimberly A Lee
- Cell Signaling Technology, Inc, Danvers, Massachusetts, USA
| | - Ryan D Leib
- Mass Spectrometry Center, Stanford University, Stanford, California, USA
| | | | - Purvi Patel
- Herbert Irving Comprehensive Cancer Center, Columbia University Medical Center, New York, New York, USA
| | - Jian Min Ren
- Cell Signaling Technology, Inc, Danvers, Massachusetts, USA
| | - Paul M Stemmer
- Department of Pharmaceutical Sciences, Wayne State University, Detroit, Michigan, USA
| | - Yiying Zhu
- Cell Signaling Technology, Inc, Danvers, Massachusetts, USA
| | - Benjamin A Neely
- National Institute of Standards and Technology, Charleston, South Carolina, USA
| | - Bhavin Patel
- Thermo Fisher Scientific, Rockford, Illinois, USA
| |
Collapse
|
16
|
Koenig C, Martinez-Val A, Naicker P, Stoychev S, Jordaan J, Olsen JV. Protocol for high-throughput semi-automated label-free- or TMT-based phosphoproteome profiling. STAR Protoc 2023; 4:102536. [PMID: 37659085 PMCID: PMC10491724 DOI: 10.1016/j.xpro.2023.102536] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2023] [Revised: 07/01/2023] [Accepted: 08/02/2023] [Indexed: 09/04/2023] Open
Abstract
Tandem mass tags data-dependent acquisition (TMT-DDA) as well as data-independent acquisition-based label-free quantification (LFQ-DIA) have become the leading workflows to achieve deep proteome and phosphoproteome profiles. We present a modular pipeline for TMT-DDA and LFQ-DIA that integrates steps to perform scalable phosphoproteome profiling, including protein lysate extraction, clean-up, digestion, phosphopeptide enrichment, and TMT-labeling. We also detail peptide and/or phosphopeptide fractionation and pre-mass spectrometry desalting and provide researchers guidance on choosing the best workflow based on sample number and input. For complete details on the use and execution of this protocol, please refer to Koenig et al.1 and Martínez-Val et al.2.
Collapse
Affiliation(s)
- Claire Koenig
- Novo Nordisk Foundation Center for Protein Research, Copenhagen, Denmark
| | - Ana Martinez-Val
- Novo Nordisk Foundation Center for Protein Research, Copenhagen, Denmark.
| | - Previn Naicker
- Council for Scientific and Industrial Research (CSIR), Pretoria, South Africa; ReSyn Biosciences, Pretoria, South Africa
| | - Stoyan Stoychev
- ReSyn Biosciences, Pretoria, South Africa; Evosep Biosystems, Odense, Denmark.
| | | | - Jesper V Olsen
- Novo Nordisk Foundation Center for Protein Research, Copenhagen, Denmark
| |
Collapse
|
17
|
Ng CCA, Zhou Y, Yao ZP. Algorithms for de-novo sequencing of peptides by tandem mass spectrometry: A review. Anal Chim Acta 2023; 1268:341330. [PMID: 37268337 DOI: 10.1016/j.aca.2023.341330] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2022] [Revised: 05/04/2023] [Accepted: 05/06/2023] [Indexed: 06/04/2023]
Abstract
Peptide sequencing is of great significance to fundamental and applied research in the fields such as chemical, biological, medicinal and pharmaceutical sciences. With the rapid development of mass spectrometry and sequencing algorithms, de-novo peptide sequencing using tandem mass spectrometry (MS/MS) has become the main method for determining amino acid sequences of novel and unknown peptides. Advanced algorithms allow the amino acid sequence information to be accurately obtained from MS/MS spectra in short time. In this review, algorithms from exhaustive search to the state-of-art machine learning and neural network for high-throughput and automated de-novo sequencing are introduced and compared. Impacts of datasets on algorithm performance are highlighted. The current limitations and promising direction of de-novo peptide sequencing are also discussed in this review.
Collapse
Affiliation(s)
- Cheuk Chi A Ng
- State Key Laboratory of Chemical Biology and Drug Discovery, and Department of Applied Biology and Chemical Technology, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong Special Administrative Region of China; Research Institute for Future Food, and Research Center for Chinese Medicine Innovation, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong Special Administrative Region of China; State Key Laboratory of Chinese Medicine and Molecular Pharmacology (Incubation), and Shenzhen Key Laboratory of Food Biological Safety Control, The Hong Kong Polytechnic University Shenzhen Research Institute, Shenzhen, 518057, China
| | - Yin Zhou
- State Key Laboratory of Chemical Biology and Drug Discovery, and Department of Applied Biology and Chemical Technology, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong Special Administrative Region of China; Research Institute for Future Food, and Research Center for Chinese Medicine Innovation, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong Special Administrative Region of China; State Key Laboratory of Chinese Medicine and Molecular Pharmacology (Incubation), and Shenzhen Key Laboratory of Food Biological Safety Control, The Hong Kong Polytechnic University Shenzhen Research Institute, Shenzhen, 518057, China
| | - Zhong-Ping Yao
- State Key Laboratory of Chemical Biology and Drug Discovery, and Department of Applied Biology and Chemical Technology, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong Special Administrative Region of China; Research Institute for Future Food, and Research Center for Chinese Medicine Innovation, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong Special Administrative Region of China; State Key Laboratory of Chinese Medicine and Molecular Pharmacology (Incubation), and Shenzhen Key Laboratory of Food Biological Safety Control, The Hong Kong Polytechnic University Shenzhen Research Institute, Shenzhen, 518057, China.
| |
Collapse
|
18
|
Allen C, Meinl R, Paez JS, Searle BC, Just S, Pino LK, Fondrie WE. nf-encyclopedia: A Cloud-Ready Pipeline for Chromatogram Library Data-Independent Acquisition Proteomics Workflows. J Proteome Res 2023; 22:2743-2749. [PMID: 37417926 DOI: 10.1021/acs.jproteome.2c00613] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/08/2023]
Abstract
Data-independent acquisition (DIA) mass spectrometry methods provide systematic and comprehensive quantification of the proteome; yet, relatively few open-source tools are available to analyze DIA proteomics experiments. Fewer still are tools that can leverage gas phase fractionated (GPF) chromatogram libraries to enhance the detection and quantification of peptides in these experiments. Here, we present nf-encyclopedia, an open-source NextFlow pipeline that connects three open-source tools, MSConvert, EncyclopeDIA, and MSstats, to analyze DIA proteomics experiments with or without chromatogram libraries. We demonstrate that nf-encyclopedia is reproducible when run on either a cloud platform or a local workstation and provides robust peptide and protein quantification. Additionally, we found that MSstats enhances protein-level quantitative performance over EncyclopeDIA alone. Finally, we benchmarked the ability of nf-encyclopedia to scale to large experiments in the cloud by leveraging the parallelization of compute resources. The nf-encyclopedia pipeline is available under a permissive Apache 2.0 license; run it on your desktop, cluster, or in the cloud: https://github.com/TalusBio/nf-encyclopedia.
Collapse
Affiliation(s)
- Carolyn Allen
- Talus Bioscience, Seattle, Washington 98122, United States
| | - Rico Meinl
- Talus Bioscience, Seattle, Washington 98122, United States
| | | | - Brian C Searle
- Department of Biomedical Informatics, The Ohio State University, Columbus, Ohio 43210, United States
- Pelotonia Institute for Immuno-Oncology, The Ohio State University Comprehensive Cancer Center, Columbus, Ohio 43210, United States
- Proteome Software, Inc., Portland, Oregon 97219, United States
| | - Seth Just
- Proteome Software, Inc., Portland, Oregon 97219, United States
| | - Lindsay K Pino
- Talus Bioscience, Seattle, Washington 98122, United States
| | | |
Collapse
|
19
|
Fels U, Willems P, De Meyer M, Gevaert K, Van Damme P. Shift in vacuolar to cytosolic regime of infecting Salmonella from a dual proteome perspective. PLoS Pathog 2023; 19:e1011183. [PMID: 37535689 PMCID: PMC10426988 DOI: 10.1371/journal.ppat.1011183] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2023] [Revised: 08/15/2023] [Accepted: 06/19/2023] [Indexed: 08/05/2023] Open
Abstract
By applying dual proteome profiling to Salmonella enterica serovar Typhimurium (S. Typhimurium) encounters with its epithelial host (here, S. Typhimurium infected human HeLa cells), a detailed interdependent and holistic proteomic perspective on host-pathogen interactions over the time course of infection was obtained. Data-independent acquisition (DIA)-based proteomics was found to outperform data-dependent acquisition (DDA) workflows, especially in identifying the downregulated bacterial proteome response during infection progression by permitting quantification of low abundant bacterial proteins at early times of infection when bacterial infection load is low. S. Typhimurium invasion and replication specific proteomic signatures in epithelial cells revealed interdependent host/pathogen specific responses besides pointing to putative novel infection markers and signalling responses, including regulated host proteins associated with Salmonella-modified membranes.
Collapse
Affiliation(s)
- Ursula Fels
- iRIP unit, Laboratory of Microbiology, Department of Biochemistry and Microbiology, Ghent University, Ghent, Belgium
- VIB-UGent Center for Medical Biotechnology, Ghent, Belgium
| | - Patrick Willems
- iRIP unit, Laboratory of Microbiology, Department of Biochemistry and Microbiology, Ghent University, Ghent, Belgium
| | - Margaux De Meyer
- iRIP unit, Laboratory of Microbiology, Department of Biochemistry and Microbiology, Ghent University, Ghent, Belgium
- VIB-UGent Center for Medical Biotechnology, Ghent, Belgium
| | - Kris Gevaert
- VIB-UGent Center for Medical Biotechnology, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
| | - Petra Van Damme
- iRIP unit, Laboratory of Microbiology, Department of Biochemistry and Microbiology, Ghent University, Ghent, Belgium
| |
Collapse
|
20
|
Yu F, Teo GC, Kong AT, Fröhlich K, Li GX, Demichev V, Nesvizhskii AI. Analysis of DIA proteomics data using MSFragger-DIA and FragPipe computational platform. Nat Commun 2023; 14:4154. [PMID: 37438352 PMCID: PMC10338508 DOI: 10.1038/s41467-023-39869-5] [Citation(s) in RCA: 30] [Impact Index Per Article: 30.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Accepted: 06/28/2023] [Indexed: 07/14/2023] Open
Abstract
Liquid chromatography (LC) coupled with data-independent acquisition (DIA) mass spectrometry (MS) has been increasingly used in quantitative proteomics studies. Here, we present a fast and sensitive approach for direct peptide identification from DIA data, MSFragger-DIA, which leverages the unmatched speed of the fragment ion indexing-based search engine MSFragger. Different from most existing methods, MSFragger-DIA conducts a database search of the DIA tandem mass (MS/MS) spectra prior to spectral feature detection and peak tracing across the LC dimension. To streamline the analysis of DIA data and enable easy reproducibility, we integrate MSFragger-DIA into the FragPipe computational platform for seamless support of peptide identification and spectral library building from DIA, data-dependent acquisition (DDA), or both data types combined. We compare MSFragger-DIA with other DIA tools, such as DIA-Umpire based workflow in FragPipe, Spectronaut, DIA-NN library-free, and MaxDIA. We demonstrate the fast, sensitive, and accurate performance of MSFragger-DIA across a variety of sample types and data acquisition schemes, including single-cell proteomics, phosphoproteomics, and large-scale tumor proteome profiling studies.
Collapse
Affiliation(s)
- Fengchao Yu
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA.
| | - Guo Ci Teo
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA
| | - Andy T Kong
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Klemens Fröhlich
- Proteomics Core Facility, Biozentrum, University of Basel, Basel, Switzerland
| | - Ginny Xiaohe Li
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA
| | - Vadim Demichev
- Department of Biochemistry, Charité - Universitätsmedizin Berlin, Berlin, Germany
- Department of Biochemistry, University of Cambridge, Cambridge, UK
| | - Alexey I Nesvizhskii
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA.
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA.
| |
Collapse
|
21
|
Wei W, Riley NM, Lyu X, Shen X, Guo J, Raun SH, Zhao M, Moya-Garzon MD, Basu H, Sheng-Hwa Tung A, Li VL, Huang W, Wiggenhorn AL, Svensson KJ, Snyder MP, Bertozzi CR, Long JZ. Organism-wide, cell-type-specific secretome mapping of exercise training in mice. Cell Metab 2023; 35:1261-1279.e11. [PMID: 37141889 PMCID: PMC10524249 DOI: 10.1016/j.cmet.2023.04.011] [Citation(s) in RCA: 23] [Impact Index Per Article: 23.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/14/2022] [Revised: 02/21/2023] [Accepted: 04/05/2023] [Indexed: 05/06/2023]
Abstract
There is a significant interest in identifying blood-borne factors that mediate tissue crosstalk and function as molecular effectors of physical activity. Although past studies have focused on an individual molecule or cell type, the organism-wide secretome response to physical activity has not been evaluated. Here, we use a cell-type-specific proteomic approach to generate a 21-cell-type, 10-tissue map of exercise training-regulated secretomes in mice. Our dataset identifies >200 exercise training-regulated cell-type-secreted protein pairs, the majority of which have not been previously reported. Pdgfra-cre-labeled secretomes were the most responsive to exercise training. Finally, we show anti-obesity, anti-diabetic, and exercise performance-enhancing activities for proteoforms of intracellular carboxylesterases whose secretion from the liver is induced by exercise training.
Collapse
Affiliation(s)
- Wei Wei
- Department of Pathology, Stanford University School of Medicine, Stanford, CA 94305, USA; Department of Biology, Stanford University, Stanford, CA 94305, USA; Sarafan ChEM-H, Stanford University, Stanford, CA 94305, USA
| | - Nicholas M Riley
- Sarafan ChEM-H, Stanford University, Stanford, CA 94305, USA; Department of Chemistry, Stanford University, Stanford, CA 94305, USA; Howard Hughes Medical Institute, Stanford University, Stanford, CA 94305, USA
| | - Xuchao Lyu
- Department of Pathology, Stanford University School of Medicine, Stanford, CA 94305, USA; Sarafan ChEM-H, Stanford University, Stanford, CA 94305, USA; Wu Tsai Human Performance Alliance, Stanford University, Stanford, CA 94305, USA
| | - Xiaotao Shen
- Department of Genetics, Stanford University School of Medicine, Stanford, CA 94035, USA
| | - Jing Guo
- Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Steffen H Raun
- Department of Biomedical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Meng Zhao
- Department of Pathology, Stanford University School of Medicine, Stanford, CA 94305, USA; Stanford Diabetes Research Center, Stanford University School of Medicine, Stanford, CA 94305, USA; Cardiovascular Institute, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Maria Dolores Moya-Garzon
- Department of Pathology, Stanford University School of Medicine, Stanford, CA 94305, USA; Sarafan ChEM-H, Stanford University, Stanford, CA 94305, USA
| | - Himanish Basu
- Department of Immunology, Harvard Medical School, Boston, MA 02115, USA
| | - Alan Sheng-Hwa Tung
- Department of Pathology, Stanford University School of Medicine, Stanford, CA 94305, USA; Sarafan ChEM-H, Stanford University, Stanford, CA 94305, USA
| | - Veronica L Li
- Department of Pathology, Stanford University School of Medicine, Stanford, CA 94305, USA; Sarafan ChEM-H, Stanford University, Stanford, CA 94305, USA; Department of Chemistry, Stanford University, Stanford, CA 94305, USA
| | - Wentao Huang
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA; Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
| | - Amanda L Wiggenhorn
- Department of Pathology, Stanford University School of Medicine, Stanford, CA 94305, USA; Sarafan ChEM-H, Stanford University, Stanford, CA 94305, USA; Department of Chemistry, Stanford University, Stanford, CA 94305, USA
| | - Katrin J Svensson
- Department of Pathology, Stanford University School of Medicine, Stanford, CA 94305, USA; Stanford Diabetes Research Center, Stanford University School of Medicine, Stanford, CA 94305, USA; Cardiovascular Institute, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Michael P Snyder
- Department of Genetics, Stanford University School of Medicine, Stanford, CA 94035, USA; Stanford Diabetes Research Center, Stanford University School of Medicine, Stanford, CA 94305, USA; Cardiovascular Institute, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Carolyn R Bertozzi
- Sarafan ChEM-H, Stanford University, Stanford, CA 94305, USA; Department of Chemistry, Stanford University, Stanford, CA 94305, USA; Howard Hughes Medical Institute, Stanford University, Stanford, CA 94305, USA
| | - Jonathan Z Long
- Department of Pathology, Stanford University School of Medicine, Stanford, CA 94305, USA; Sarafan ChEM-H, Stanford University, Stanford, CA 94305, USA; Stanford Diabetes Research Center, Stanford University School of Medicine, Stanford, CA 94305, USA; Cardiovascular Institute, Stanford University School of Medicine, Stanford, CA 94305, USA; Wu Tsai Human Performance Alliance, Stanford University, Stanford, CA 94305, USA.
| |
Collapse
|
22
|
Kirkpatrick J, Stemmer PM, Searle BC, Herring LE, Martin L, Midha MK, Phinney BS, Shan B, Palmblad M, Wang Y, Jagtap PD, Neely BA. 2019 Association of Biomolecular Resource Facilities Multi-Laboratory Data-Independent Acquisition Proteomics Study. J Biomol Tech 2023; 34:3fc1f5fe.9b78d780. [PMID: 37435391 PMCID: PMC10332336 DOI: 10.7171/3fc1f5fe.9b78d780] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/13/2023]
Abstract
Despite the advantages of fewer missing values by collecting fragment ion data on all analytes in the sample as well as the potential for deeper coverage, the adoption of data-independent acquisition (DIA) in proteomics core facility settings has been slow. The Association of Biomolecular Resource Facilities conducted a large interlaboratory study to evaluate DIA performance in proteomics laboratories with various instrumentation. Participants were supplied with generic methods and a uniform set of test samples. The resulting 49 DIA datasets act as benchmarks and have utility in education and tool development. The sample set consisted of a tryptic HeLa digest spiked with high or low levels of 4 exogenous proteins. Data are available in MassIVE MSV000086479. Additionally, we demonstrate how the data can be analyzed by focusing on 2 datasets using different library approaches and show the utility of select summary statistics. These data can be used by DIA newcomers, software developers, or DIA experts evaluating performance with different platforms, acquisition settings, and skill levels.
Collapse
Affiliation(s)
- Joanna Kirkpatrick
- Leibniz Institute on AgingFritz Lipmann Institute07745JenaGermany
- The Francis Crick InstituteLondonNW1 1ATUnited Kingdom
| | | | - Brian C. Searle
- Department of Biomedical InformaticsThe Ohio State UniversityColumbusOhio43210USA
- Pelotonia Institute for Immuno-OncologyThe Ohio State University Comprehensive Cancer CenterColumbusOhio43210USA
| | - Laura E. Herring
- UNC Proteomics Core FacilityDepartment of PharmacologyUniversity of North Carolina at Chapel HillChapel HillNorth Carolina27514USA
| | | | | | | | - Baozhen Shan
- Bioinformatics Solutions Inc.WaterlooON N2L 3K8Canada
| | - Magnus Palmblad
- Center for Proteomics and MetabolomicsLeiden University Medical Center2333 ZC LeidenThe Netherlands
| | - Yan Wang
- National Institute of Dental and Craniofacial ResearchNational Institutes of HealthBethesdaMaryland20892USA
| | - Pratik D. Jagtap
- Department of BiochemistryMolecular Biology and BiophysicsUniversity of MinnesotaMinneapolisMinnesota55455USA
| | - Benjamin A. Neely
- National Institute of Standards and TechnologyCharlestonSouth Carolina29412USA
| |
Collapse
|
23
|
He Q, Zhong CQ, Li X, Guo H, Li Y, Gao M, Yu R, Liu X, Zhang F, Guo D, Ye F, Guo T, Shuai J, Han J. Dear-DIA XMBD: Deep Autoencoder Enables Deconvolution of Data-Independent Acquisition Proteomics. RESEARCH (WASHINGTON, D.C.) 2023; 6:0179. [PMID: 37377457 PMCID: PMC10292580 DOI: 10.34133/research.0179] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/08/2022] [Accepted: 06/01/2023] [Indexed: 06/29/2023]
Abstract
Data-independent acquisition (DIA) technology for protein identification from mass spectrometry and related algorithms is developing rapidly. The spectrum-centric analysis of DIA data without the use of spectra library from data-dependent acquisition data represents a promising direction. In this paper, we proposed an untargeted analysis method, Dear-DIAXMBD, for direct analysis of DIA data. Dear-DIAXMBD first integrates the deep variational autoencoder and triplet loss to learn the representations of the extracted fragment ion chromatograms, then uses the k-means clustering algorithm to aggregate fragments with similar representations into the same classes, and finally establishes the inverted index tables to determine the precursors of fragment clusters between precursors and peptides and between fragments and peptides. We show that Dear-DIAXMBD performs superiorly with the highly complicated DIA data of different species obtained by different instrument platforms. Dear-DIAXMBD is publicly available at https://github.com/jianweishuai/Dear-DIA-XMBD.
Collapse
Affiliation(s)
- Qingzu He
- Department of Physics, and Fujian Provincial Key Laboratory for Soft Functional Materials Research,
Xiamen University, Xiamen 361005, China
- Oujiang Laboratory (Zhejiang Lab for Regenerative Medicine, Vision and Brain Health) and Wenzhou Institute,
University of Chinese Academy of Sciences, Wenzhou, Zhejiang 325001, China
| | - Chuan-Qi Zhong
- School of Life Sciences,
Xiamen University, Xiamen 361102, China
- State Key Laboratory of Cellular Stress Biology,
Innovation Center for Cell Signaling Network, Xiamen 361102, China
| | - Xiang Li
- Department of Physics, and Fujian Provincial Key Laboratory for Soft Functional Materials Research,
Xiamen University, Xiamen 361005, China
- State Key Laboratory of Cellular Stress Biology,
Innovation Center for Cell Signaling Network, Xiamen 361102, China
| | - Huan Guo
- Department of Physics, and Fujian Provincial Key Laboratory for Soft Functional Materials Research,
Xiamen University, Xiamen 361005, China
| | - Yiming Li
- Department of Physics, and Fujian Provincial Key Laboratory for Soft Functional Materials Research,
Xiamen University, Xiamen 361005, China
| | - Mingxuan Gao
- Department of Computer Science,
Xiamen University, Xiamen 361005, China
| | - Rongshan Yu
- Department of Computer Science,
Xiamen University, Xiamen 361005, China
- National Institute for Data Science in Health and Medicine, School of Medicine,
Xiamen University, Xiamen 361102, China
| | - Xianming Liu
- Bruker (Beijing) Scientific Technology Co. Ltd., Beijing, China
| | - Fangfei Zhang
- Westlake Laboratory of Life Sciences and Biomedicine, Key Laboratory of Structural Biology of Zhejiang Province, School of Life Sciences,
Westlake University, 18 Shilongshan Road, Hangzhou 310024, China
- Institute of Basic Medical Sciences, Westlake Institute for Advanced Study, 18 Shilongshan Road, Hangzhou 310024, China
| | - Donghui Guo
- Department of Electronic Engineering,
Xiamen University, Xiamen 361005, China
| | - Fangfu Ye
- Oujiang Laboratory (Zhejiang Lab for Regenerative Medicine, Vision and Brain Health) and Wenzhou Institute,
University of Chinese Academy of Sciences, Wenzhou, Zhejiang 325001, China
| | - Tiannan Guo
- Westlake Laboratory of Life Sciences and Biomedicine, Key Laboratory of Structural Biology of Zhejiang Province, School of Life Sciences,
Westlake University, 18 Shilongshan Road, Hangzhou 310024, China
- Institute of Basic Medical Sciences, Westlake Institute for Advanced Study, 18 Shilongshan Road, Hangzhou 310024, China
- Westlake Omics Ltd., Yunmeng Road 1, Hangzhou, China
| | - Jianwei Shuai
- Department of Physics, and Fujian Provincial Key Laboratory for Soft Functional Materials Research,
Xiamen University, Xiamen 361005, China
- Oujiang Laboratory (Zhejiang Lab for Regenerative Medicine, Vision and Brain Health) and Wenzhou Institute,
University of Chinese Academy of Sciences, Wenzhou, Zhejiang 325001, China
- State Key Laboratory of Cellular Stress Biology,
Innovation Center for Cell Signaling Network, Xiamen 361102, China
- National Institute for Data Science in Health and Medicine, School of Medicine,
Xiamen University, Xiamen 361102, China
| | - Jiahuai Han
- School of Life Sciences,
Xiamen University, Xiamen 361102, China
- State Key Laboratory of Cellular Stress Biology,
Innovation Center for Cell Signaling Network, Xiamen 361102, China
- National Institute for Data Science in Health and Medicine, School of Medicine,
Xiamen University, Xiamen 361102, China
| |
Collapse
|
24
|
Souza Junior DR, Silva ARM, Ronsein GE. Strategies for consistent and automated quantification of HDL proteome using data-independent acquisition (DIA). J Lipid Res 2023:100397. [PMID: 37286042 PMCID: PMC10339053 DOI: 10.1016/j.jlr.2023.100397] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2023] [Revised: 05/11/2023] [Accepted: 05/31/2023] [Indexed: 06/09/2023] Open
Abstract
The introduction of mass spectrometry-based proteomics has revolutionized HDL field, with the description, characterization and implication of HDL-associated proteins in an array of pathologies. However, acquiring robust, reproducible data is still a challenge in the quantitative assessment of HDL proteome. Data-independent acquisition (DIA) is a mass spectrometry methodology that allows the acquisition of reproducible data, but data analysis remains a challenge in the field. Up to date, there is no consensus in how to process DIA-derived data for HDL proteomics. Here, we developed a pipeline aiming to standardize HDL proteome quantification. We optimized instrument parameters, and compared the performance of four freely available, user-friendly software tools (DIA-NN, EncyclopeDIA, MaxDIA and Skyline) in processing DIA data. Importantly, pooled samples were used as quality controls throughout our experimental setup. A carefully evaluation of precision, linearity, and detection limits, first using E. coli background for HDL proteomics, and second using HDL proteome and synthetic peptides, was undertaken. Finally, as a proof of concept, we employed our optimized and automated pipeline to quantify the proteome of HDL and apolipoprotein B (APOB)-containing lipoproteins. Our results show that determination of precision is key to confidently and consistently quantify HDL proteins. Taking this precaution, any of the available software tested here would be appropriate for quantification of HDL proteome, although their performance varied considerably.
Collapse
Affiliation(s)
| | | | - Graziella Eliza Ronsein
- Department of Biochemistry, Institute of Chemistry, University of São Paulo, São Paulo, Brazil.
| |
Collapse
|
25
|
Merrihew GE, Park J, Plubell D, Searle BC, Keene CD, Larson EB, Bateman R, Perrin RJ, Chhatwal JP, Farlow MR, McLean CA, Ghetti B, Newell KL, Frosch MP, Montine TJ, MacCoss MJ. A peptide-centric quantitative proteomics dataset for the phenotypic assessment of Alzheimer's disease. Sci Data 2023; 10:206. [PMID: 37059743 PMCID: PMC10104800 DOI: 10.1038/s41597-023-02057-7] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Accepted: 03/08/2023] [Indexed: 04/16/2023] Open
Abstract
Alzheimer's disease (AD) is a looming public health disaster with limited interventions. Alzheimer's is a complex disease that can present with or without causative mutations and can be accompanied by a range of age-related comorbidities. This diverse presentation makes it difficult to study molecular changes specific to AD. To better understand the molecular signatures of disease we constructed a unique human brain sample cohort inclusive of autosomal dominant AD dementia (ADD), sporadic ADD, and those without dementia but with high AD histopathologic burden, and cognitively normal individuals with no/minimal AD histopathologic burden. All samples are clinically well characterized, and brain tissue was preserved postmortem by rapid autopsy. Samples from four brain regions were processed and analyzed by data-independent acquisition LC-MS/MS. Here we present a high-quality quantitative dataset at the peptide and protein level for each brain region. Multiple internal and external control strategies were included in this experiment to ensure data quality. All data are deposited in the ProteomeXchange repositories and available from each step of our processing.
Collapse
Affiliation(s)
- Gennifer E Merrihew
- Department of Genome Sciences, University of Washington, Seattle, Washington, 98195, USA
| | - Jea Park
- Department of Genome Sciences, University of Washington, Seattle, Washington, 98195, USA
| | - Deanna Plubell
- Department of Genome Sciences, University of Washington, Seattle, Washington, 98195, USA
| | - Brian C Searle
- Department of Biomedical Informatics, The Ohio State University, Columbus, Ohio, 43210, USA
| | - C Dirk Keene
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, Washington, 98195, USA
| | - Eric B Larson
- Department of Medicine, University of Washington, Seattle, Washington, 98195, USA
| | - Randall Bateman
- Department of Neurology, Washington University School of Medicine, 660 South Euclid Avenue, Box 8111, St. Louis, Missouri, 63110, USA
| | - Richard J Perrin
- Department of Pathology, Washington University School of Medicine, 660 South Euclid Avenue, Box 8111, St. Louis, Missouri, 63110, USA
| | - Jasmeer P Chhatwal
- Massachusetts General Hospital, Department of Neurology, Harvard Medical School, 15 Parkman St, Suite 835, Boston, Massachusetts, 02114, USA
| | - Martin R Farlow
- Department of Neurology, Indiana University School of Medicine, Indianapolis, Indiana, 46202, USA
| | - Catriona A McLean
- Department of Anatomical Pathology, Alfred Health, Melbourne, VIC, 3004, Australia
| | - Bernardino Ghetti
- Department of Pathology and Laboratory Medicine, Indiana University School of Medicine, Indianapolis, Indiana, 46202, USA
| | - Kathy L Newell
- Department of Pathology and Laboratory Medicine, Indiana University School of Medicine, Indianapolis, Indiana, 46202, USA
| | - Matthew P Frosch
- C.S. Kubik Laboratory for Neuropathology, and Massachusetts Alzheimer Disease Research Center, Massachusetts General Hospital, Boston, Massachusetts, 02114, USA
| | - Thomas J Montine
- Department of Pathology, Stanford University, Stanford, CA, 94305, USA.
| | - Michael J MacCoss
- Department of Genome Sciences, University of Washington, Seattle, Washington, 98195, USA.
| |
Collapse
|
26
|
Phlairaharn T, Ye Z, Krismer E, Pedersen AK, Pietzner M, Olsen JV, Schoof EM, Searle BC. Optimizing linear ion trap data independent acquisition towards single cell proteomics. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.21.529444. [PMID: 36865114 PMCID: PMC9980145 DOI: 10.1101/2023.02.21.529444] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/23/2023]
Abstract
A linear ion trap (LIT) is an affordable, robust mass spectrometer that proves fast scanning speed and high sensitivity, where its primary disadvantage is inferior mass accuracy compared to more commonly used time-of-flight (TOF) or orbitrap (OT) mass analyzers. Previous efforts to utilize the LIT for low-input proteomics analysis still rely on either built-in OTs for collecting precursor data or OT-based library generation. Here, we demonstrate the potential versatility of the LIT for low-input proteomics as a stand-alone mass analyzer for all mass spectrometry measurements, including library generation. To test this approach, we first optimized LIT data acquisition methods and performed library-free searches with and without entrapment peptides to evaluate both the detection and quantification accuracy. We then generated matrix-matched calibration curves to estimate the lower limit of quantification using only 10 ng of starting material. While LIT-MS1 measurements provided poor quantitative accuracy, LIT-MS2 measurements were quantitatively accurate down to 0.5 ng on column. Finally, we optimized a suitable strategy for spectral library generation from low-input material, which we used to analyze single-cell samples by LIT-DIA using LIT-based libraries generated from as few as 40 cells.
Collapse
|
27
|
Kertesz-Farkas A, Nii Adoquaye Acquaye FL, Bhimani K, Eng JK, Fondrie WE, Grant C, Hoopmann MR, Lin A, Lu YY, Moritz RL, MacCoss MJ, Noble WS. The Crux Toolkit for Analysis of Bottom-Up Tandem Mass Spectrometry Proteomics Data. J Proteome Res 2023; 22:561-569. [PMID: 36598107 DOI: 10.1021/acs.jproteome.2c00615] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
The Crux tandem mass spectrometry data analysis toolkit provides a collection of algorithms for analyzing bottom-up proteomics tandem mass spectrometry data. Many publications have described various individual components of Crux, but a comprehensive summary has not been published since 2014. The goal of this work is to summarize the functionality of Crux, focusing on developments since 2014. We begin with empirical results demonstrating our recently implemented speedups to the Tide search engine. Other new features include a new score function in Tide, two new confidence estimation procedures, as well as three new tools: Param-medic for estimating search parameters directly from mass spectrometry data, Kojak for searching cross-linked mass spectra, and DIAmeter for searching data independent acquisition data against a sequence database.
Collapse
Affiliation(s)
- Attila Kertesz-Farkas
- Department of Data Analysis and Artificial Intelligence and Laboratory on AI for Computational Biology, Faculty of Computer Science, HSE University, 20 Myasnitskaya ulitsa, Moscow 101000, Russia
| | - Frank Lawrence Nii Adoquaye Acquaye
- Department of Data Analysis and Artificial Intelligence and Laboratory on AI for Computational Biology, Faculty of Computer Science, HSE University, 20 Myasnitskaya ulitsa, Moscow 101000, Russia
| | - Kishankumar Bhimani
- Department of Data Analysis and Artificial Intelligence and Laboratory on AI for Computational Biology, Faculty of Computer Science, HSE University, 20 Myasnitskaya ulitsa, Moscow 101000, Russia
| | - Jimmy K Eng
- Proteomics Resource, University of Washington, 850 Republican Street, Seattle, Washington 98109-4725, United States
| | - William E Fondrie
- Talus Bioscience550 17th Avenue, Seattle, Washington 98122, United States
| | - Charles Grant
- Department of Genome Sciences, University of Washington3720 15th Avenue NE, Seattle, Washington 98195, United States
| | - Michael R Hoopmann
- Insititute for Systems Biology, 401 Terry Avenue N, Seattle, Washington 98109, United States
| | - Andy Lin
- Department of Genome Sciences, University of Washington3720 15th Avenue NE, Seattle, Washington 98195, United States
| | - Yang Y Lu
- Department of Genome Sciences, University of Washington3720 15th Avenue NE, Seattle, Washington 98195, United States
| | - Robert L Moritz
- Insititute for Systems Biology, 401 Terry Avenue N, Seattle, Washington 98109, United States
| | - Michael J MacCoss
- Department of Genome Sciences, University of Washington3720 15th Avenue NE, Seattle, Washington 98195, United States
| | - William Stafford Noble
- Department of Genome Sciences, University of Washington3720 15th Avenue NE, Seattle, Washington 98195, United States.,Paul G. Allen School of Computer Science and Engineering, University of Washington185 E Stevens Way NE, Seattle, Washington 98195-2350, United States
| |
Collapse
|
28
|
Danko K, Lukasheva E, Zhukov VA, Zgoda V, Frolov A. Detergent-Assisted Protein Digestion-On the Way to Avoid the Key Bottleneck of Shotgun Bottom-Up Proteomics. Int J Mol Sci 2022; 23:13903. [PMID: 36430380 PMCID: PMC9695859 DOI: 10.3390/ijms232213903] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Revised: 11/02/2022] [Accepted: 11/05/2022] [Indexed: 11/16/2022] Open
Abstract
Gel-free bottom-up shotgun proteomics is the principal methodological platform for the state-of-the-art proteome research. This methodology assumes quantitative isolation of the total protein fraction from a complex biological sample, its limited proteolysis with site-specific proteases, analysis of the resulted peptides with nanoscaled reversed-phase high-performance liquid chromatography-(tandem) mass spectrometry (nanoRP-HPLC-MS and MS/MS), protein identification by sequence database search and peptide-based quantitative analysis. The most critical steps of this workflow are protein reconstitution and digestion; therefore, detergents and chaotropic agents are strongly mandatory to ensure complete solubilization of complex protein isolates and to achieve accessibility of all protease cleavage sites. However, detergents are incompatible with both RP separation and electrospray ionization (ESI). Therefore, to make LC-MS analysis possible, several strategies were implemented in the shotgun proteomics workflow. These techniques rely either on enzymatic digestion in centrifugal filters with subsequent evacuation of the detergent, or employment of MS-compatible surfactants, which can be degraded upon the digestion. In this review we comprehensively address all currently available strategies for the detergent-assisted proteolysis in respect of their relative efficiency when applied to different biological matrices. We critically discuss the current progress and the further perspectives of these technologies in the context of its advances and gaps.
Collapse
Affiliation(s)
- Katerina Danko
- Department of Biochemistry, St. Petersburg State University, 199034 St. Petersburg, Russia
| | - Elena Lukasheva
- Department of Biochemistry, St. Petersburg State University, 199034 St. Petersburg, Russia
| | - Vladimir A. Zhukov
- All-Russia Research Institute for Agricultural Microbiology, Podbelsky Chaussee 3, Pushkin, 196608 St. Petersburg, Russia
| | - Viktor Zgoda
- Institute of Biomedical Chemistry, 119121 Moscow, Russia
| | - Andrej Frolov
- K.A. Timiryazev Institute of Plant Physiology RAS, 127276 Moscow, Russia
| |
Collapse
|
29
|
Jones AR, Deutsch EW, Vizcaíno JA. Is DIA proteomics data FAIR? Current data sharing practices, available bioinformatics infrastructure and recommendations for the future. Proteomics 2022; 23:e2200014. [PMID: 36074795 PMCID: PMC10155627 DOI: 10.1002/pmic.202200014] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Revised: 08/27/2022] [Accepted: 08/29/2022] [Indexed: 11/06/2022]
Abstract
Data independent acquisition (DIA) proteomics techniques have matured enormously in recent years, thanks to multiple technical developments in e.g. instrumentation and data analysis approaches. However, there are many improvements that are still possible for DIA data in the area of the FAIR (Findability, Accessibility, Interoperability and Reusability) data principles. These include more tailored data sharing practices and open data standards, since public databases and data standards for proteomics were mostly designed with DDA data in mind. Here we first describe the current state of the art in the context of FAIR data for proteomics in general, and for DIA approaches in particular. For improving the current situation for DIA data, we make the following recommendations for the future: (i) development of an open data standard for spectral libraries; (ii) make mandatory the availability of the spectral libraries used in DIA experiments in ProteomeXchange resources; (iii) improve the support for DIA data in the data standards developed by the Proteomics Standards Initiative; and (iv) improve the support for DIA datasets in ProteomeXchange resources, including more tailored metadata requirements. This article is protected by copyright. All rights reserved.
Collapse
Affiliation(s)
- Andrew R Jones
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, L69 3BX, UK
| | - Eric W Deutsch
- Institute for Systems Biology, Seattle, Washington, 98109, USA
| | - Juan Antonio Vizcaíno
- European Molecular Biology Laboratory, EMBL-European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, UK
| |
Collapse
|
30
|
Frankenfield AM, Ni J, Ahmed M, Hao L. Protein Contaminants Matter: Building Universal Protein Contaminant Libraries for DDA and DIA Proteomics. J Proteome Res 2022; 21:2104-2113. [PMID: 35793413 PMCID: PMC10040255 DOI: 10.1021/acs.jproteome.2c00145] [Citation(s) in RCA: 37] [Impact Index Per Article: 18.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Mass spectrometry-based proteomics is constantly challenged by the presence of contaminant background signals. In particular, protein contaminants from reagents and sample handling are almost impossible to avoid. For data-dependent acquisition (DDA) proteomics, an exclusion list can be used to reduce the influence of protein contaminants. However, protein contamination has not been evaluated and is rarely addressed in data-independent acquisition (DIA). How protein contaminants influence proteomic data is also unclear. In this study, we established new protein contaminant FASTA and spectral libraries that are applicable to all proteomic workflows and evaluated the impact of protein contaminants on both DDA and DIA proteomics. We demonstrated that including our contaminant libraries can reduce false discoveries and increase protein identifications, without influencing the quantification accuracy in various proteomic software platforms. With the pressing need to standardize proteomic workflow in the research community, we highly recommend including our contaminant FASTA and spectral libraries in all bottom-up proteomic data analysis. Our contaminant libraries and a step-by-step tutorial to incorporate these libraries in various DDA and DIA data analysis platforms can be valuable resources for proteomic researchers, freely accessible at https://github.com/HaoGroup-ProtContLib.
Collapse
Affiliation(s)
- Ashley M Frankenfield
- Department of Chemistry, The George Washington University, Science and Engineering Hall 4000, 800, 22nd St., Northwest, Washington, DC 20052, United States
| | - Jiawei Ni
- Department of Chemistry, The George Washington University, Science and Engineering Hall 4000, 800, 22nd St., Northwest, Washington, DC 20052, United States
| | - Mustafa Ahmed
- Department of Chemistry, The George Washington University, Science and Engineering Hall 4000, 800, 22nd St., Northwest, Washington, DC 20052, United States
| | - Ling Hao
- Department of Chemistry, The George Washington University, Science and Engineering Hall 4000, 800, 22nd St., Northwest, Washington, DC 20052, United States
| |
Collapse
|
31
|
Saxton MW, Perry BW, Evans Hutzenbiler BD, Trojahn S, Gee A, Brown AP, Merrihew GE, Park J, Cornejo OE, MacCoss MJ, Robbins CT, Jansen HT, Kelley JL. Serum plays an important role in reprogramming the seasonal transcriptional profile of brown bear adipocytes. iScience 2022; 25:105084. [PMID: 36317158 PMCID: PMC9617460 DOI: 10.1016/j.isci.2022.105084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2022] [Revised: 06/30/2022] [Accepted: 09/01/2022] [Indexed: 11/19/2022] Open
Abstract
Understanding how metabolic reprogramming happens in cells will aid the progress in the treatment of a variety of metabolic disorders. Brown bears undergo seasonal shifts in insulin sensitivity, including reversible insulin resistance in hibernation. We performed RNA-sequencing on brown bear adipocytes and proteomics on serum to identify changes possibly responsible for reversible insulin resistance. We observed dramatic transcriptional changes, which depended on both the cell and serum season of origin. Despite large changes in adipocyte gene expression, only changes in eight circulating proteins were identified as related to the seasonal shifts in insulin sensitivity, including some that have not previously been associated with glucose homeostasis. The identified serum proteins may be sufficient for shifting hibernation adipocytes to an active-like state. Hibernation in grizzly bears is marked by insulin resistance Bear adipocytes were stimulated with active and hibernating bear blood serum Serum elicited dramatic gene expression responses related to insulin signaling Eight serum proteins were implicated in driving this transcriptional response
Collapse
Affiliation(s)
- Michael W. Saxton
- School of Biological Sciences, Washington State University, Pullman, WA 99163, USA
| | - Blair W. Perry
- School of Biological Sciences, Washington State University, Pullman, WA 99163, USA
| | | | - Shawn Trojahn
- School of Biological Sciences, Washington State University, Pullman, WA 99163, USA
| | - Alexia Gee
- School of Biological Sciences, Washington State University, Pullman, WA 99163, USA
| | - Anthony P. Brown
- School of Biological Sciences, Washington State University, Pullman, WA 99163, USA
| | | | - Jea Park
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Omar E. Cornejo
- School of Biological Sciences, Washington State University, Pullman, WA 99163, USA
| | - Michael J. MacCoss
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Charles T. Robbins
- School of Biological Sciences, Washington State University, Pullman, WA 99163, USA
- School of the Environment, Washington State University, Pullman, WA 99163, USA
| | - Heiko T. Jansen
- Department of Integrative Physiology and Neuroscience, Washington State University, Pullman, WA 99163, USA
| | - Joanna L. Kelley
- School of Biological Sciences, Washington State University, Pullman, WA 99163, USA
- Corresponding author
| |
Collapse
|
32
|
Mendes ML, Dittmar G. Targeted proteomics on its way to discovery. Proteomics 2022; 22:e2100330. [PMID: 35816345 DOI: 10.1002/pmic.202100330] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Revised: 06/20/2022] [Accepted: 06/27/2022] [Indexed: 11/06/2022]
Abstract
For a long time, targeted and discovery proteomics covered different corners of the detection spectrum, with targeted proteomics focused on small target sets. This changed with the recent advances in highly multiplexed analysis. While discovery proteomics still pushes higher numbers of identified and quantified proteins, the advances in targeted proteomics rose to cover large parts of less complex proteomes or proteomes with low protein detection counts due to dynamic range restrictions, like the blood proteome. These new developments will impact, especially on the field of biomarker discovery and the possibility of using targeted proteomics for diagnostic purposes. This article is protected by copyright. All rights reserved.
Collapse
Affiliation(s)
- Marta L Mendes
- Proteomics of cellular signalling, Department of Infection and Immunity, Luxembourg Institute of Health, L-1445, Strassen, Luxembourg
| | - Gunnar Dittmar
- Proteomics of cellular signalling, Department of Infection and Immunity, Luxembourg Institute of Health, L-1445, Strassen, Luxembourg.,Department of Life Sciences and Medicine, University of Luxembourg, L-4367, Belvaux, Luxembourg
| |
Collapse
|
33
|
Xiao J, Lu S, Wang X, Liang M, Dong C, Zhang X, Qiu M, Ou C, Zeng X, Lan Y, Hu L, Tan L, Peng T, Zhang Q, Long F. Serum Proteomic Analysis Identifies SAA1, FGA, SAP, and CETP as New Biomarkers for Eosinophilic Granulomatosis With Polyangiitis. Front Immunol 2022; 13:866035. [PMID: 35757752 PMCID: PMC9226334 DOI: 10.3389/fimmu.2022.866035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2022] [Accepted: 05/06/2022] [Indexed: 11/13/2022] Open
Abstract
Background Eosinophilic granulomatosis with polyangiitis (EGPA) is characterized by asthma-like attacks in its early stage, which is easily misdiagnosed as severe asthma. Therefore, new biomarkers for the early diagnosis of EGPA are needed, especially for differentiating the diagnosis of asthma. Objectives To identify serum biomarkers that can be used for early diagnosis of EGPA and to distinguish EGPA from severe asthma. Method Data-independent acquisition (DIA) analysis was performed to identify 45 healthy controls (HC), severe asthma (S-A), and EGPA patients in a cohort to screen biomarkers for early diagnosis of EGPA and to differentiate asthma diagnosis. Subsequently, parallel reaction monitoring (PRM) analysis was applied to a validation cohort of 71 HC, S-A, and EGPA patients. Result Four candidate biomarkers were identified from DIA and PRM analysis-i.e., serum amyloid A1 (SAA1), fibrinogen-α (FGA), and serum amyloid P component (SAP)-and were upregulated in the EGPA group, while cholesteryl ester transfer protein (CETP) was downregulated in the EGPA group compared with the S-A group. Receiver operating characteristics analysis shows that, as biomarkers for early diagnosis of EGPA, the combination of SAA1, FGA, and SAP has an area under the curve (AUC) of 0.947, a sensitivity of 82.35%, and a specificity of 100%. The combination of SAA1, FGA, SAP, and CETP as biomarkers for differential diagnosis of asthma had an AUC of 0.921, a sensitivity of 78.13%, and a specificity of 100%, which were all larger than single markers. Moreover, SAA1, FGA, and SAP were positively and CETP was negatively correlated with eosinophil count. Conclusion DIA-PRM combined analysis screened and validated four previously unexplored but potentially useful biomarkers for early diagnosis of EGPA and differential diagnosis of asthma.
Collapse
Affiliation(s)
- Jing Xiao
- Sino-French Hoffmann Institute, State Key Laboratory of Respiratory Disease, School of Basic Medical Sciences, Guangzhou Medical University, Guangzhou, China
| | - Shaohua Lu
- Sino-French Hoffmann Institute, State Key Laboratory of Respiratory Disease, School of Basic Medical Sciences, Guangzhou Medical University, Guangzhou, China
| | - Xufei Wang
- Sino-French Hoffmann Institute, State Key Laboratory of Respiratory Disease, School of Basic Medical Sciences, Guangzhou Medical University, Guangzhou, China
| | - Mengdi Liang
- Sino-French Hoffmann Institute, State Key Laboratory of Respiratory Disease, School of Basic Medical Sciences, Guangzhou Medical University, Guangzhou, China
| | - Cong Dong
- Pulmonary and Critical Care Medicine, Guangzhou Institute of Respiratory Health, National Clinical Research Center for Respiratory Disease, National Center for Respiratory Medicine, State Key Laboratory of Respiratory Diseases, The First Affiliated Hospital of Guangzhou Medical University, Guangzhou, China
| | - Xiaoxian Zhang
- Pulmonary and Critical Care Medicine, Guangzhou Institute of Respiratory Health, National Clinical Research Center for Respiratory Disease, National Center for Respiratory Medicine, State Key Laboratory of Respiratory Diseases, The First Affiliated Hospital of Guangzhou Medical University, Guangzhou, China
| | - Minzhi Qiu
- Health Management Center, Shenzhen People's Hospital (The Second Clinical Medical College, Jinan University; the First Affiliated Hospital, Southern University of Science and Technology), Shenzhen, China
| | - Changxing Ou
- Pulmonary and Critical Care Medicine, Guangzhou Institute of Respiratory Health, National Clinical Research Center for Respiratory Disease, National Center for Respiratory Medicine, State Key Laboratory of Respiratory Diseases, The First Affiliated Hospital of Guangzhou Medical University, Guangzhou, China
| | - Xiaoyin Zeng
- Sino-French Hoffmann Institute, State Key Laboratory of Respiratory Disease, School of Basic Medical Sciences, Guangzhou Medical University, Guangzhou, China
| | - Yanting Lan
- Sino-French Hoffmann Institute, State Key Laboratory of Respiratory Disease, School of Basic Medical Sciences, Guangzhou Medical University, Guangzhou, China
| | - Longbo Hu
- Sino-French Hoffmann Institute, State Key Laboratory of Respiratory Disease, School of Basic Medical Sciences, Guangzhou Medical University, Guangzhou, China
| | - Long Tan
- Sino-French Hoffmann Institute, State Key Laboratory of Respiratory Disease, School of Basic Medical Sciences, Guangzhou Medical University, Guangzhou, China
| | - Tao Peng
- Sino-French Hoffmann Institute, State Key Laboratory of Respiratory Disease, School of Basic Medical Sciences, Guangzhou Medical University, Guangzhou, China.,Guangdong South China Vaccine Co., Ltd, Guangzhou, China
| | - Qingling Zhang
- Pulmonary and Critical Care Medicine, Guangzhou Institute of Respiratory Health, National Clinical Research Center for Respiratory Disease, National Center for Respiratory Medicine, State Key Laboratory of Respiratory Diseases, The First Affiliated Hospital of Guangzhou Medical University, Guangzhou, China
| | - Fei Long
- Sino-French Hoffmann Institute, State Key Laboratory of Respiratory Disease, School of Basic Medical Sciences, Guangzhou Medical University, Guangzhou, China
| |
Collapse
|
34
|
Rice SJ, Belani CP. Optimizing data-independent acquisition (DIA) spectral library workflows for plasma proteomics studies. Proteomics 2022; 22:e2200125. [PMID: 35708973 DOI: 10.1002/pmic.202200125] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2022] [Revised: 06/10/2022] [Accepted: 06/13/2022] [Indexed: 11/11/2022]
Abstract
Traditional data-independent acquisition (DIA) workflows employ off-column fractionation with data-dependent acquisition (DDA) to generate spectral libraries for data extraction. Recent advances have led to the establishment of library-independent approaches for DIA analyses. The selection of a DIA workflow may affect the outcome of plasma proteomics studies. Here, we establish a gas-phase fractionation (GPF) workflow to create DIA libraries for DIA with parallel accumulation and serial fragmentation (diaPASEF). This workflow along with three other workflows, fractionated DDA libraries, fractionated DIA libraries, and predicted spectra libraries, were evaluated on 20 plasma samples from nonsmall cell lung cancer patients with low or high levels of IL-6. We sought to optimize protein identification and total experiment time. The novel GPF workflow for diaPASEF outperformed the traditional ddaPASEF workflow in the number of identified and quantified proteins. A library-independent workflow based on predicted spectra identified and quantified the most proteins in our experiment at the cost of computational power. Overall, the choice of DIA library workflow seemed to have a limited effect on the overall outcome of a plasma proteomics experiment, but it can affect the number of proteins identified and the total experiment time.
Collapse
Affiliation(s)
- Shawn J Rice
- Penn State Cancer Institute, Hershey, Pennsylvania, USA
| | - Chandra P Belani
- Penn State Cancer Institute, Hershey, Pennsylvania, USA.,Department of Medicine, Milton S. Hershey Medical Center, Hershey, Pennsylvania, USA
| |
Collapse
|
35
|
Heil LR, Fondrie WE, McGann CD, Federation AJ, Noble WS, MacCoss MJ, Keich U. Building Spectral Libraries from Narrow-Window Data-Independent Acquisition Mass Spectrometry Data. J Proteome Res 2022; 21:1382-1391. [PMID: 35549345 DOI: 10.1021/acs.jproteome.1c00895] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Advances in library-based methods for peptide detection from data-independent acquisition (DIA) mass spectrometry have made it possible to detect and quantify tens of thousands of peptides in a single mass spectrometry run. However, many of these methods rely on a comprehensive, high-quality spectral library containing information about the expected retention time and fragmentation patterns of peptides in the sample. Empirical spectral libraries are often generated through data-dependent acquisition and may suffer from biases as a result. Spectral libraries can be generated in silico, but these models are not trained to handle all possible post-translational modifications. Here, we propose a false discovery rate-controlled spectrum-centric search workflow to generate spectral libraries directly from gas-phase fractionated DIA tandem mass spectrometry data. We demonstrate that this strategy is able to detect phosphorylated peptides and can be used to generate a spectral library for accurate peptide detection and quantitation in wide-window DIA data. We compare the results of this search workflow to other library-free approaches and demonstrate that our search is competitive in terms of accuracy and sensitivity. These results demonstrate that the proposed workflow has the capacity to generate spectral libraries while avoiding the limitations of other methods.
Collapse
Affiliation(s)
- Lilian R Heil
- Department of Genome Sciences, University of Washington, Seattle, Washington 98105, United States
| | - William E Fondrie
- Department of Genome Sciences, University of Washington, Seattle, Washington 98105, United States
| | - Christopher D McGann
- Department of Genome Sciences, University of Washington, Seattle, Washington 98105, United States
| | - Alexander J Federation
- Department of Genome Sciences, University of Washington, Seattle, Washington 98105, United States
| | - William S Noble
- Department of Genome Sciences, University of Washington, Seattle, Washington 98105, United States.,Paul G. Allen School for Computer Science and Engineering, University of Washington, Seattle, Washington 98105, United States
| | - Michael J MacCoss
- Department of Genome Sciences, University of Washington, Seattle, Washington 98105, United States
| | - Uri Keich
- School of Mathematics and Statistics, University of Sydney, Sydney, NSW 2006, Australia
| |
Collapse
|
36
|
Jiang N, Gao Y, Xu J, Luo F, Zhang X, Chen R. A data-independent acquisition (DIA)-based quantification workflow for proteome analysis of 5000 cells. J Pharm Biomed Anal 2022; 216:114795. [PMID: 35489320 DOI: 10.1016/j.jpba.2022.114795] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2021] [Revised: 04/18/2022] [Accepted: 04/21/2022] [Indexed: 11/18/2022]
Abstract
Data independent acquisition (DIA) has emerged as a powerful proteomic technique with exceptional reproducibility and throughput, and has been widely applied to clinical sample analysis. DIA approaches normally rely on project-specific spectral libraries generated by data dependent acquisition (DDA), requiring extensive off-line fractionation and large amounts of input material. In this study, we aimed to explore the utility of DIA for the analysis of samples with limited quantities. We employed three software tools (DIA-NN, Spectronaut, and EncyclopeDIA) for data analysis and generated three types of libraries, including an experiment-specific library built by DDA analysis of off-line fractions, a FASTA sequence database, and a library generated by gas-phase fractionation (GPF), resulting in eight analysis pipelines. Then we systematically compared the performance of the eight strategies by analyzing the DIA data from HEK293T cell tryptic peptides with sample loads of 500 ng, 100 ng, 20 ng, and 4 ng. The results showed that DIA-NN with GPF-based libraries outperformed the others in protein identification and retention time calibration. Next, we further evaluated the optimized workflow by analyzing the proteome alteration in 5000 peripheral blood mononuclear cells (PBMCs) induced by lipopolysaccharide (LPS) stimulation. As a result, 3179 protein groups were quantified, and functional analysis revealed activation of multiple signaling pathways, e. g., endocytosis, NF-kappa B signaling, and T cell receptor signaling. The results showed the practicability of using DIA for scarce samples, and the established workflow of PBMC analysis could be easily adapted for biomarker discovery, immune status evaluation, and drug response monitoring, especially for diseases involved with dysfunction of the immune system.
Collapse
Affiliation(s)
- Na Jiang
- School of Pharmaceutical Science and Technology, Tianjin University, Tianjin 300072, China
| | - Yan Gao
- School of Pharmaceutical Science and Technology, Tianjin University, Tianjin 300072, China
| | - Jia Xu
- School of Pharmaceutical Science and Technology, Tianjin University, Tianjin 300072, China
| | - Fengting Luo
- Department of Clinical Laboratory, Tianjin Hospital, Tianjin 300142, China
| | - Xiangyang Zhang
- School of Pharmaceutical Science and Technology, Tianjin University, Tianjin 300072, China
| | - Ruibing Chen
- School of Pharmaceutical Science and Technology, Tianjin University, Tianjin 300072, China.
| |
Collapse
|
37
|
Abstract
Proteins are the molecular effectors of the information encoded in the genome. Proteomics aims at understanding the molecular functions of proteins in their biological context. In contrast to transcriptomics and genomics, the study of proteomes provides deeper insight into the dynamic regulatory layers encoded at the protein level, such as posttranslational modifications, subcellular localization, cell signaling, and protein-protein interactions. Currently, mass spectrometry (MS)-based proteomics is the technology of choice for studying proteomes at a system-wide scale, contributing to clinical biomarker discovery and fundamental molecular biology. MS technologies are continuously being developed to fulfill the requirements of speed, resolution, and quantitative accuracy, enabling the acquisition of comprehensive proteomes. In this review, we present how MS technology and acquisition methods have evolved to meet the requirements of cutting-edge proteomics research, which is describing the human proteome and its dynamic posttranslational modifications with unprecedented depth. Finally, we provide a perspective on studying proteomes at single-cell resolution. Expected final online publication date for the Annual Review of Genomics and Human Genetics, Volume 23 is October 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
Collapse
Affiliation(s)
- Ana Martinez-Val
- Novo Nordisk Foundation Center for Protein Research, Proteomics Program, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark;
| | - Ulises H Guzmán
- Novo Nordisk Foundation Center for Protein Research, Proteomics Program, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark;
| | - Jesper V Olsen
- Novo Nordisk Foundation Center for Protein Research, Proteomics Program, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark;
| |
Collapse
|
38
|
Richards AL, Chen KH, Wilburn DB, Stevenson E, Polacco BJ, Searle BC, Swaney DL. Data-Independent Acquisition Protease-Multiplexing Enables Increased Proteome Sequence Coverage Across Multiple Fragmentation Modes. J Proteome Res 2022; 21:1124-1136. [PMID: 35234472 PMCID: PMC9035370 DOI: 10.1021/acs.jproteome.1c00960] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
The use of multiple proteases has been shown to increase protein sequence coverage in proteomics experiments; however, due to the additional analysis time required, it has not been widely adopted in routine data-dependent acquisition (DDA) proteomic workflows. Alternatively, data-independent acquisition (DIA) has the potential to analyze multiplexed samples from different protease digests, but has been primarily optimized for fragmenting tryptic peptides. Here we evaluate a DIA multiplexing approach that combines three proteolytic digests (Trypsin, AspN, and GluC) into a single sample. We first optimize data acquisition conditions for each protease individually with both the canonical DIA fragmentation mode (beam type CID), as well as resonance excitation CID, to determine optimal consensus conditions across proteases. Next, we demonstrate that application of these conditions to a protease-multiplexed sample of human peptides results in similar protein identifications and quantitative performance as compared to trypsin alone, but enables up to a 63% increase in peptide detections, and a 45% increase in nonredundant amino acid detections. Nontryptic peptides enabled noncanonical protein isoform determination and resulted in 100% sequence coverage for numerous proteins, suggesting the utility of this approach in applications where sequence coverage is critical, such as protein isoform analysis.
Collapse
Affiliation(s)
- Alicia L Richards
- Quantitative Biosciences Institute (QBI), University of California San Francisco, San Francisco, California 94158, United States.,J. David Gladstone Institutes, San Francisco, California 94158, United States.,Department of Cellular and Molecular Pharmacology, University of California San Francisco, San Francisco, California 94158, United States
| | - Kuei-Ho Chen
- Quantitative Biosciences Institute (QBI), University of California San Francisco, San Francisco, California 94158, United States.,J. David Gladstone Institutes, San Francisco, California 94158, United States.,Department of Cellular and Molecular Pharmacology, University of California San Francisco, San Francisco, California 94158, United States
| | - Damien B Wilburn
- Pelotonia Institute for Immuno-Oncology, The Ohio State University Comprehensive Cancer Center, Columbus, Ohio 43210, United States.,Department of Biomedical Informatics, The Ohio State University, Columbus, Ohio 43210, United States.,Department of Genome Sciences, University of Washington, Seattle, Washington 98195, United States
| | - Erica Stevenson
- Quantitative Biosciences Institute (QBI), University of California San Francisco, San Francisco, California 94158, United States.,J. David Gladstone Institutes, San Francisco, California 94158, United States.,Department of Cellular and Molecular Pharmacology, University of California San Francisco, San Francisco, California 94158, United States
| | - Benjamin J Polacco
- Quantitative Biosciences Institute (QBI), University of California San Francisco, San Francisco, California 94158, United States.,J. David Gladstone Institutes, San Francisco, California 94158, United States.,Department of Cellular and Molecular Pharmacology, University of California San Francisco, San Francisco, California 94158, United States
| | - Brian C Searle
- Pelotonia Institute for Immuno-Oncology, The Ohio State University Comprehensive Cancer Center, Columbus, Ohio 43210, United States.,Department of Biomedical Informatics, The Ohio State University, Columbus, Ohio 43210, United States
| | - Danielle L Swaney
- Quantitative Biosciences Institute (QBI), University of California San Francisco, San Francisco, California 94158, United States.,J. David Gladstone Institutes, San Francisco, California 94158, United States.,Department of Cellular and Molecular Pharmacology, University of California San Francisco, San Francisco, California 94158, United States
| |
Collapse
|
39
|
Weke K, Kote S, Faktor J, Al Shboul S, Uwugiaren N, Brennan PM, Goodlett DR, Hupp TR, Dapic I. DIA-MS proteome analysis of formalin-fixed paraffin-embedded glioblastoma tissues. Anal Chim Acta 2022; 1204:339695. [DOI: 10.1016/j.aca.2022.339695] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2021] [Revised: 03/04/2022] [Accepted: 03/05/2022] [Indexed: 12/11/2022]
|
40
|
Ahamad N, Gupta S, Parashar D. Using Omics to Study Leprosy, Tuberculosis, and Other Mycobacterial Diseases. Front Cell Infect Microbiol 2022; 12:792617. [PMID: 35281437 PMCID: PMC8908319 DOI: 10.3389/fcimb.2022.792617] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2021] [Accepted: 02/01/2022] [Indexed: 12/12/2022] Open
Abstract
Mycobacteria are members of the Actinomycetales order, and they are classified into one family, Mycobacteriaceae. More than 20 mycobacterial species cause disease in humans. The Mycobacterium group, called the Mycobacterium tuberculosis complex (MTBC), has nine closely related species that cause tuberculosis in animals and humans. TB can be detected worldwide and one-fourth of the world’s population is contaminated with tuberculosis. According to the WHO, about two million dies from it, and more than nine million people are newly infected with TB each year. Mycobacterium tuberculosis (M. tuberculosis) is the most potential causative agent of tuberculosis and prompts enormous mortality and morbidity worldwide due to the incompletely understood pathogenesis of human tuberculosis. Moreover, modern diagnostic approaches for human tuberculosis are inefficient and have many lacks, while MTBC species can modulate host immune response and escape host immune attacks to sustain in the human body. “Multi-omics” strategies such as genomics, transcriptomics, proteomics, metabolomics, and deep sequencing technologies could be a comprehensive strategy to investigate the pathogenesis of mycobacterial species in humans and offer significant discovery to find out biomarkers at the early stage of disease in the host. Thus, in this review, we attempt to understand an overview of the mission of “omics” approaches in mycobacterial pathogenesis, including tuberculosis, leprosy, and other mycobacterial diseases.
Collapse
Affiliation(s)
- Naseem Ahamad
- Department of Oral and Maxillofacial Diagnostic Sciences, College of Dentistry, University of Florida, Gainesville, FL, United States
- *Correspondence: Naseem Ahamad,
| | - Saurabh Gupta
- Department of Biotechnology, GLA University, Mathura, India
| | - Deepak Parashar
- Department of Obstetrics and Gynecology, Medical College of Wisconsin, Milwaukee, WI, United States
| |
Collapse
|
41
|
Sun Y, Li L, Zhou Y, Ge W, Wang H, Wu R, Liu W, Chen H, Xiao Q, Cai X, Dong Z, Zhang F, Xiao J, Wang G, He Y, Gao J, Kon OL, Iyer NG, Guan H, Teng X, Zhu Y, Zhao Y, Guo T. Stratification of follicular thyroid tumors using data-independent acquisition proteomics and a comprehensive thyroid tissue spectral library. Mol Oncol 2022; 16:1611-1624. [PMID: 35194950 PMCID: PMC9019893 DOI: 10.1002/1878-0261.13198] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2021] [Revised: 12/17/2021] [Accepted: 02/21/2022] [Indexed: 11/23/2022] Open
Abstract
Thyroid nodules occur in about 60% of the population. A major challenge in thyroid nodule diagnosis is to distinguish between follicular adenoma (FA) and carcinoma (FTC). Here, we present a comprehensive thyroid spectral library covering five types of thyroid tissues. This library includes 121 960 peptides and 9941 protein groups. This spectral library can be used to quantify up to 7863 proteins from thyroid tissues, and can also be used to develop parallel reaction monitoring (PRM) assays for targeted protein quantification. Next, to stratify follicular thyroid tumours, we compared the proteomes of 24 FA and 22 FTC samples, and identified 204 differentially expressed proteins (DEPs). Our data suggest altered ferroptosis pathways in malignant follicular carcinoma. In all, 31 selected proteins effectively distinguished follicular tumours. Of those DEPs, nine proteins were further verified by PRM in an independent cohort of 18 FA and 19 FTC. Together, we present a comprehensive spectral library for DIA and targeted proteomics analysis of thyroid tissue specimens, and identified nine proteins that could potentially distinguish FA and FTC.
Collapse
Affiliation(s)
- Yaoting Sun
- Zhejiang University, 866 Yuhangtang Road, Hangzhou, 310058, China.,Westlake Laboratory of Life Sciences and Biomedicine, No.18 Shilongshan Road, Hangzhou, 310024, China.,School of Life Sciences, Westlake University, Hangzhou, Zhejiang, China.,Key Laboratory of Structural Biology of Zhejiang Province, School of Life Sciences, Westlake University, No.18 Shilongshan Road, Hangzhou, 310024, China.,Institute of Basic Medical Sciences, Westlake Institute for Advanced Study, No.18 Shilongshan Road, Hangzhou, 310024, China
| | - Lu Li
- Zhejiang University, 866 Yuhangtang Road, Hangzhou, 310058, China.,Westlake Laboratory of Life Sciences and Biomedicine, No.18 Shilongshan Road, Hangzhou, 310024, China.,School of Life Sciences, Westlake University, Hangzhou, Zhejiang, China.,Key Laboratory of Structural Biology of Zhejiang Province, School of Life Sciences, Westlake University, No.18 Shilongshan Road, Hangzhou, 310024, China.,Institute of Basic Medical Sciences, Westlake Institute for Advanced Study, No.18 Shilongshan Road, Hangzhou, 310024, China
| | - Yan Zhou
- Zhejiang University, 866 Yuhangtang Road, Hangzhou, 310058, China.,Westlake Laboratory of Life Sciences and Biomedicine, No.18 Shilongshan Road, Hangzhou, 310024, China.,School of Life Sciences, Westlake University, Hangzhou, Zhejiang, China.,Key Laboratory of Structural Biology of Zhejiang Province, School of Life Sciences, Westlake University, No.18 Shilongshan Road, Hangzhou, 310024, China.,Institute of Basic Medical Sciences, Westlake Institute for Advanced Study, No.18 Shilongshan Road, Hangzhou, 310024, China
| | - Weigang Ge
- Westlake Omics (Hangzhou) Biotechnology Co., Ltd., No.1 Yunmeng Road, Hangzhou, 310024, China
| | - He Wang
- Westlake Laboratory of Life Sciences and Biomedicine, No.18 Shilongshan Road, Hangzhou, 310024, China.,School of Life Sciences, Westlake University, Hangzhou, Zhejiang, China.,Key Laboratory of Structural Biology of Zhejiang Province, School of Life Sciences, Westlake University, No.18 Shilongshan Road, Hangzhou, 310024, China.,Institute of Basic Medical Sciences, Westlake Institute for Advanced Study, No.18 Shilongshan Road, Hangzhou, 310024, China
| | - Runxin Wu
- Westlake Laboratory of Life Sciences and Biomedicine, No.18 Shilongshan Road, Hangzhou, 310024, China.,School of Life Sciences, Westlake University, Hangzhou, Zhejiang, China.,Key Laboratory of Structural Biology of Zhejiang Province, School of Life Sciences, Westlake University, No.18 Shilongshan Road, Hangzhou, 310024, China.,Institute of Basic Medical Sciences, Westlake Institute for Advanced Study, No.18 Shilongshan Road, Hangzhou, 310024, China.,Whiting School of Engineering, Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, 21218-2625, USA
| | - Wei Liu
- Westlake Omics (Hangzhou) Biotechnology Co., Ltd., No.1 Yunmeng Road, Hangzhou, 310024, China
| | - Hao Chen
- Westlake Omics (Hangzhou) Biotechnology Co., Ltd., No.1 Yunmeng Road, Hangzhou, 310024, China
| | - Qi Xiao
- Zhejiang University, 866 Yuhangtang Road, Hangzhou, 310058, China.,Westlake Laboratory of Life Sciences and Biomedicine, No.18 Shilongshan Road, Hangzhou, 310024, China.,School of Life Sciences, Westlake University, Hangzhou, Zhejiang, China.,Key Laboratory of Structural Biology of Zhejiang Province, School of Life Sciences, Westlake University, No.18 Shilongshan Road, Hangzhou, 310024, China.,Institute of Basic Medical Sciences, Westlake Institute for Advanced Study, No.18 Shilongshan Road, Hangzhou, 310024, China
| | - Xue Cai
- Zhejiang University, 866 Yuhangtang Road, Hangzhou, 310058, China.,Westlake Laboratory of Life Sciences and Biomedicine, No.18 Shilongshan Road, Hangzhou, 310024, China.,School of Life Sciences, Westlake University, Hangzhou, Zhejiang, China.,Key Laboratory of Structural Biology of Zhejiang Province, School of Life Sciences, Westlake University, No.18 Shilongshan Road, Hangzhou, 310024, China.,Institute of Basic Medical Sciences, Westlake Institute for Advanced Study, No.18 Shilongshan Road, Hangzhou, 310024, China
| | - Zhen Dong
- Westlake Laboratory of Life Sciences and Biomedicine, No.18 Shilongshan Road, Hangzhou, 310024, China.,School of Life Sciences, Westlake University, Hangzhou, Zhejiang, China.,Key Laboratory of Structural Biology of Zhejiang Province, School of Life Sciences, Westlake University, No.18 Shilongshan Road, Hangzhou, 310024, China.,Institute of Basic Medical Sciences, Westlake Institute for Advanced Study, No.18 Shilongshan Road, Hangzhou, 310024, China
| | - Fangfei Zhang
- Westlake Laboratory of Life Sciences and Biomedicine, No.18 Shilongshan Road, Hangzhou, 310024, China.,School of Life Sciences, Westlake University, Hangzhou, Zhejiang, China.,Key Laboratory of Structural Biology of Zhejiang Province, School of Life Sciences, Westlake University, No.18 Shilongshan Road, Hangzhou, 310024, China.,Institute of Basic Medical Sciences, Westlake Institute for Advanced Study, No.18 Shilongshan Road, Hangzhou, 310024, China
| | - Junhong Xiao
- Department of General Surgery, The Second Hospital of Dalian Medical University, Dalian, 116023, China
| | - Guangzhi Wang
- Department of General Surgery, The Second Hospital of Dalian Medical University, Dalian, 116023, China
| | - Yi He
- Department of Urology, The Second Hospital of Dalian Medical University, Dalian, 116023, China
| | - Jinlong Gao
- Westlake Laboratory of Life Sciences and Biomedicine, No.18 Shilongshan Road, Hangzhou, 310024, China.,School of Life Sciences, Westlake University, Hangzhou, Zhejiang, China.,Key Laboratory of Structural Biology of Zhejiang Province, School of Life Sciences, Westlake University, No.18 Shilongshan Road, Hangzhou, 310024, China.,Whiting School of Engineering, Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, 21218-2625, USA
| | - Oi Lian Kon
- Division of Medical Sciences, National Cancer Centre Singapore, Singapore, 169610, Republic of Singapore
| | - N Gopalakrishna Iyer
- Division of Medical Sciences, National Cancer Centre Singapore, Singapore, 169610, Republic of Singapore.,Department of Head and Neck Surgery, National Cancer Centre Singapore, Republic of Singapore
| | - Haixia Guan
- Department of Endocrinology, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, 106 Zhongshan erlu, Guangzhou, 510080, China
| | - Xiaodong Teng
- Department of Pathology, the First Affiliated Hospital, Zhejiang University School of Medicine, 79 Qingchun Road, Hangzhou, 310063, China
| | - Yi Zhu
- Westlake Laboratory of Life Sciences and Biomedicine, No.18 Shilongshan Road, Hangzhou, 310024, China.,School of Life Sciences, Westlake University, Hangzhou, Zhejiang, China.,Key Laboratory of Structural Biology of Zhejiang Province, School of Life Sciences, Westlake University, No.18 Shilongshan Road, Hangzhou, 310024, China.,Institute of Basic Medical Sciences, Westlake Institute for Advanced Study, No.18 Shilongshan Road, Hangzhou, 310024, China
| | - Yongfu Zhao
- Department of General Surgery, The Second Hospital of Dalian Medical University, Dalian, 116023, China
| | - Tiannan Guo
- Zhejiang University, 866 Yuhangtang Road, Hangzhou, 310058, China.,Westlake Laboratory of Life Sciences and Biomedicine, No.18 Shilongshan Road, Hangzhou, 310024, China.,School of Life Sciences, Westlake University, Hangzhou, Zhejiang, China.,Key Laboratory of Structural Biology of Zhejiang Province, School of Life Sciences, Westlake University, No.18 Shilongshan Road, Hangzhou, 310024, China.,Institute of Basic Medical Sciences, Westlake Institute for Advanced Study, No.18 Shilongshan Road, Hangzhou, 310024, China
| |
Collapse
|
42
|
Zhu C, Yang S, Li H, Wang Y, Xiong Y, Shen F, Zhang L, Yang P, Liu X. Rapid sample preparation workflow based on enzymatic nanoreactors for potential serum biomarker discovery in pancreatic cancer. Talanta 2022; 238:123018. [PMID: 34808569 DOI: 10.1016/j.talanta.2021.123018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2021] [Revised: 10/27/2021] [Accepted: 10/29/2021] [Indexed: 11/16/2022]
Abstract
Mass spectrometry (MS)-based proteomics have been extensively applied in clinical practice to discover potential protein and peptide biomarkers. However, the traditional sample pretreatment workflow remains labor-intensive and time-consuming, which limits the application of MS-based proteomic biomarker discovery studies in a high throughput manner. In the current work, we improved the previously reported procedure of the simple and rapid sample preparation methods (RSP) by introducing macroporous ordered siliceous foams (MOSF), namely RSP-MOSF. With the aid of MOSF, we further reduced the digestion time to 10 min, facilitating the whole sample handling process within 30 min. Combining with 30 min direct data independent acquisition (DIA) of LC-MS/MS, we accomplished a serum sample analysis in 1 h. Comparing with the RSP method, the performance of protein and peptide identification, quantitation, as well as the reproducibility of RSP-MOSF is comparable or even outperformed the RSP method. We further applied this workflow to analyze serum samples for potential candidate biomarker discovery of pancreatic cancer. Overall, 576 serum proteins were detected with 41 proteins significantly changed, which could serve as potential biomarkers for pancreatic cancer. Additionally, we evaluated the performance of RSP-MOSF method in a 96-well plate format which demonstrated an excellent reproducibility of the analysis. These results indicated that RSP-MOSF method had the potential to be applied on an automatic platform for further scaled analysis.
Collapse
Affiliation(s)
- Chenxin Zhu
- The Fifth People Hospital, Fudan University, And the Shanghai Key Laboratory of Medical Epigenetics, The International Co-laboratory of Medical Epigenetics and Metabolism, Ministry of Science and Technology, Institute of Biomedical Science, Fudan University, Shanghai, 200433, China
| | - Shuang Yang
- The Fifth People Hospital, Fudan University, And the Shanghai Key Laboratory of Medical Epigenetics, The International Co-laboratory of Medical Epigenetics and Metabolism, Ministry of Science and Technology, Institute of Biomedical Science, Fudan University, Shanghai, 200433, China
| | - Hengchao Li
- Department of Pancreatic Surgery, Huashan Hospital, Fudan University, Shanghai, 200040, China
| | - Yuning Wang
- Department of Chemistry, Fudan University, Shanghai, 200433, China
| | - Yueting Xiong
- The Fifth People Hospital, Fudan University, And the Shanghai Key Laboratory of Medical Epigenetics, The International Co-laboratory of Medical Epigenetics and Metabolism, Ministry of Science and Technology, Institute of Biomedical Science, Fudan University, Shanghai, 200433, China
| | - Fenglin Shen
- The Fifth People Hospital, Fudan University, And the Shanghai Key Laboratory of Medical Epigenetics, The International Co-laboratory of Medical Epigenetics and Metabolism, Ministry of Science and Technology, Institute of Biomedical Science, Fudan University, Shanghai, 200433, China
| | - Lei Zhang
- The Fifth People Hospital, Fudan University, And the Shanghai Key Laboratory of Medical Epigenetics, The International Co-laboratory of Medical Epigenetics and Metabolism, Ministry of Science and Technology, Institute of Biomedical Science, Fudan University, Shanghai, 200433, China
| | - Pengyuan Yang
- The Fifth People Hospital, Fudan University, And the Shanghai Key Laboratory of Medical Epigenetics, The International Co-laboratory of Medical Epigenetics and Metabolism, Ministry of Science and Technology, Institute of Biomedical Science, Fudan University, Shanghai, 200433, China
| | - Xiaohui Liu
- The Fifth People Hospital, Fudan University, And the Shanghai Key Laboratory of Medical Epigenetics, The International Co-laboratory of Medical Epigenetics and Metabolism, Ministry of Science and Technology, Institute of Biomedical Science, Fudan University, Shanghai, 200433, China.
| |
Collapse
|
43
|
Hubbard EE, Heil LR, Merrihew GE, Chhatwal JP, Farlow MR, McLean CA, Ghetti B, Newell KL, Frosch MP, Bateman RJ, Larson EB, Keene CD, Perrin RJ, Montine TJ, MacCoss MJ, Julian RR. Does Data-Independent Acquisition Data Contain Hidden Gems? A Case Study Related to Alzheimer's Disease. J Proteome Res 2022; 21:118-131. [PMID: 34818016 PMCID: PMC8741752 DOI: 10.1021/acs.jproteome.1c00558] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
One of the potential benefits of using data-independent acquisition (DIA) proteomics protocols is that information not originally targeted by the study may be present and discovered by subsequent analysis. Herein, we reanalyzed DIA data originally recorded for global proteomic analysis to look for isomerized peptides, which occur as a result of spontaneous chemical modifications to long-lived proteins. Examination of a large set of human brain samples revealed a striking relationship between Alzheimer's disease (AD) status and isomerization of aspartic acid in a peptide from tau. Relative to controls, a surprising increase in isomer abundance was found in both autosomal dominant and sporadic AD samples. To explore potential mechanisms that might account for these observations, quantitative analysis of proteins related to isomerization repair and autophagy was performed. Differences consistent with reduced autophagic flux in AD-related samples relative to controls were found for numerous proteins, including most notably p62, a recognized indicator of autophagic inhibition. These results suggest, but do not conclusively demonstrate, that lower autophagic flux may be strongly associated with loss of function in AD brains. This study illustrates that DIA data may contain unforeseen results of interest and may be particularly useful for pilot studies investigating new research directions. In this case, a promising target for future investigations into the therapy and prevention of AD has been identified.
Collapse
Affiliation(s)
- Evan E. Hubbard
- Department of Chemistry, University of California, Riverside, California 92521, United States
| | - Lilian R. Heil
- Department of Genome Sciences, University of Washington, Seattle, Washington, 98195, United States
| | - Gennifer E. Merrihew
- Department of Genome Sciences, University of Washington, Seattle, Washington, 98195, United States
| | - Jasmeer P. Chhatwal
- Harvard Medical School, Massachusetts General Hospital, Department of Neurology, 15 Parkman St, Suite 835, Boston MA 02114
| | - Martin R. Farlow
- Department of Neurology, Indiana University School of Medicine, Indianapolis, Indiana, 46202
| | | | - Bernardino Ghetti
- Department of Pathology and Laboratory Medicine, Indiana University School of Medicine, Indianapolis, IN, 46202
| | - Kathy L. Newell
- Department of Pathology and Laboratory Medicine, Indiana University School of Medicine, Indianapolis, IN, 46202
| | - Matthew P. Frosch
- C.S. Kubik Laboratory for Neuropathology, and Massachusetts Alzheimer Disease Research Center, Massachusetts General Hospital, Boston, MA 02114
| | - Randall J. Bateman
- Department of Neurology, Washington University School of Medicine, 660 South Euclid Avenue, Box 8111, St. Louis, 63110, Missouri, USA
| | - Eric B. Larson
- Kaiser Permanente Washington Health Research Institute and Department of Medicine, University of Washington, Seattle WA
| | - C. Dirk Keene
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, Washington, 98195, United States
| | - Richard J. Perrin
- Department of Pathology and Immunology, Department of Neurology, Washington University School of Medicine, Saint Louis, Missouri 63110, United States
| | - Thomas J. Montine
- Department of Pathology, Stanford University, Stanford, CA, 94305, United States
| | - Michael J. MacCoss
- Department of Genome Sciences, University of Washington, Seattle, Washington, 98195, United States
| | - Ryan R. Julian
- Department of Chemistry, University of California, Riverside, California 92521, United States,corresponding author:
| |
Collapse
|
44
|
Li H, Uittenbogaard M, Navarro R, Ahmed M, Gropman A, Chiaramello A, Hao L. Integrated proteomic and metabolomic analyses of the mitochondrial neurodegenerative disease MELAS. Mol Omics 2022; 18:196-205. [PMID: 34982085 DOI: 10.1039/d1mo00416f] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
MELAS (mitochondrial encephalomyopathy, lactic acidosis, stroke-like episodes) is a progressive neurodegenerative disease caused by pathogenic mitochondrial DNA variants. The pathogenic mechanism of MELAS remains enigmatic due to the exceptional clinical heterogeneity and the obscure genotype-phenotype correlation among MELAS patients. To gain insights into the pathogenic signature of MELAS, we designed a comprehensive strategy integrating proteomics and metabolomics in patient-derived dermal fibroblasts harboring the ultra-rare MELAS pathogenic variant m.14453G>A, specifically affecting the mitochondrial respiratory complex I. Global proteomics was achieved by data-dependent acquisition (DDA) and verified by data-independent acquisition (DIA) using both Spectronaut and the recently launched MaxDIA platforms. Comprehensive metabolite coverage was achieved for both polar and nonpolar metabolites in both reverse phase and HILIC LC-MS/MS analyses. Our proof-of-principle MELAS study with multi-omics integration revealed OXPHOS dysregulation with a predominant deficiency of complex I subunits, as well as alterations in key bioenergetic pathways, glycolysis, tricarboxylic acid cycle, and fatty acid β-oxidation. The most clinically relevant discovery is the downregulation of the arginine biosynthesis pathway, likely due to blocked argininosuccinate synthase, which is congruent with the MELAS cardinal symptom of stroke-like episodes and its current treatment by arginine infusion. In conclusion, we demonstrated an integrated proteomic and metabolomic strategy for patient-derived fibroblasts, which has great clinical potential to discover therapeutic targets and design personalized interventions after validation with a larger patient cohort in the future.
Collapse
Affiliation(s)
- Haorong Li
- Department of Chemistry, The George Washington University, Science and Engineering Hall, 800 22nd St., NW, Washington, DC 20052, USA.
| | - Martine Uittenbogaard
- Department of Anatomy and Cell Biology, George Washington University School of Medicine and Health Sciences, Washington, DC 20037, USA
| | - Ryan Navarro
- Department of Chemistry, The George Washington University, Science and Engineering Hall, 800 22nd St., NW, Washington, DC 20052, USA.
| | - Mustafa Ahmed
- Department of Chemistry, The George Washington University, Science and Engineering Hall, 800 22nd St., NW, Washington, DC 20052, USA.
| | - Andrea Gropman
- Division of Neurogenetics and Neurodevelopmental Pediatrics, Children's National Medical Center, Washington, DC 20010, USA
| | - Anne Chiaramello
- Department of Anatomy and Cell Biology, George Washington University School of Medicine and Health Sciences, Washington, DC 20037, USA
| | - Ling Hao
- Department of Chemistry, The George Washington University, Science and Engineering Hall, 800 22nd St., NW, Washington, DC 20052, USA.
| |
Collapse
|
45
|
Ulhaq ZS, Tse WKF. A Brief Analysis of Proteomic Profile Changes during Zebrafish Regeneration. Biomolecules 2021; 12:biom12010035. [PMID: 35053182 PMCID: PMC8773715 DOI: 10.3390/biom12010035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2021] [Revised: 12/21/2021] [Accepted: 12/21/2021] [Indexed: 11/16/2022] Open
Abstract
Unlike mammals, zebrafish are capable to regenerate many of their organs, however, the response of tissue damage varies across tissues. Understanding the molecular mechanism behind the robust regenerative capacity in a model organism may help to identify and develop novel treatment strategies for mammals (including humans). Hence, we systematically analyzed the current literature on the proteome profile collected from different regenerated zebrafish tissues. Our analyses underlining that several proteins and protein families responsible as a component of cytoskeleton and structure, protein synthesis and degradation, cell cycle control, and energy metabolism were frequently identified. Moreover, target proteins responsible for the initiation of the regeneration process, such as inflammation and immune response were less frequently detected. This highlights the limitation of previous proteomic analysis and suggested a more sensitive modern proteomics analysis is needed to unfold the mechanism. This brief report provides a list of target proteins with predicted functions that could be useful for further biological studies.
Collapse
Affiliation(s)
- Zulvikar Syambani Ulhaq
- Department of Biochemistry, Faculty of Medicine and Health Sciences, Maulana Malik Ibrahim State Islamic University of Malang, Batu 65144, Indonesia;
- National Research and Innovation Agency, Central Jakarta 10340, Indonesia
| | - William Ka Fai Tse
- Laboratory of Developmental Disorders and Toxicology, Center for Promotion of International Education and Research, Faculty of Agriculture, Kyushu University, Fukuoka 819-0395, Japan
- Correspondence:
| |
Collapse
|
46
|
Halder A, Verma A, Biswas D, Srivastava S. Recent advances in mass-spectrometry based proteomics software, tools and databases. DRUG DISCOVERY TODAY. TECHNOLOGIES 2021; 39:69-79. [PMID: 34906327 DOI: 10.1016/j.ddtec.2021.06.007] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/10/2021] [Revised: 05/08/2021] [Accepted: 06/21/2021] [Indexed: 01/12/2023]
Abstract
The field of proteomics immensely depends on data generation and data analysis which are thoroughly supported by software and databases. There has been a massive advancement in mass spectrometry-based proteomics over the last 10 years which has compelled the scientific community to upgrade or develop algorithms, tools, and repository databases in the field of proteomics. Several standalone software, and comprehensive databases have aided the establishment of integrated omics pipeline and meta-analysis workflow which has contributed to understand the disease pathobiology, biomarker discovery and predicting new therapeutic modalities. For shotgun proteomics where Data Dependent Acquisition is performed, several user-friendly software are developed that can analyse the pre-processed data to provide mechanistic insights of the disease. Likewise, in Data Independent Acquisition, pipelines are emerged which can accomplish the task from building the spectral library to identify the therapeutic targets. Furthermore, in the age of big data analysis the implications of machine learning and cloud computing are appending robustness, rapidness and in-depth proteomics data analysis. The current review talks about the recent advancement, and development of software, tools, and database in the field of mass-spectrometry based proteomics.
Collapse
Affiliation(s)
- Ankit Halder
- Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay, Powai, Mumbai 400076, India
| | - Ayushi Verma
- Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay, Powai, Mumbai 400076, India
| | - Deeptarup Biswas
- Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay, Powai, Mumbai 400076, India
| | - Sanjeeva Srivastava
- Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay, Powai, Mumbai 400076, India.
| |
Collapse
|
47
|
Data-Independent Acquisition Mass Spectrometry-Based Deep Proteome Analysis for Hydrophobic Proteins from Dried Blood Spots Enriched by Sodium Carbonate Precipitation. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2021; 2420:39-52. [PMID: 34905164 DOI: 10.1007/978-1-0716-1936-0_4] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Abstract
Dried blood spots (DBS) are widely used for screening molecular profiles, including enzymatic activity. However, hydrophilic proteins present in large amounts in blood inhibit detection of other proteins in DBS by liquid chromatography-mass spectrometry (LC-MS/MS) without preenrichment. Sodium carbonate precipitation (SCP) can concentrate hydrophobic proteins from DBS and effectively remove soluble hydrophilic proteins. Furthermore, SCP combination with data-independent acquisition (DIA) for quantitative LC-MS/MS enhanced the proteome analysis sensitivity and quantification limits. In this protocol, we have described in detail a simple preenrichment method using SCP and a deep proteome analysis method for LC-MS/MS data using DIA.
Collapse
|
48
|
Deep representation features from DreamDIA XMBD improve the analysis of data-independent acquisition proteomics. Commun Biol 2021; 4:1190. [PMID: 34650228 PMCID: PMC8517002 DOI: 10.1038/s42003-021-02726-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2021] [Accepted: 09/27/2021] [Indexed: 12/24/2022] Open
Abstract
We developed DreamDIAXMBD (denoted as DreamDIA), a software suite based on a deep representation model for data-independent acquisition (DIA) data analysis. DreamDIA adopts a data-driven strategy to capture comprehensive information from elution patterns of peptides in DIA data and achieves considerable improvements on both identification and quantification performance compared with other state-of-the-art methods such as OpenSWATH, Skyline and DIA-NN. Specifically, in contrast to existing methods which use only 6 to 10 selected fragment ions from spectral libraries, DreamDIA extracts additional features from hundreds of theoretical elution profiles originated from different ions of each precursor using a deep representation network. To achieve higher coverage of target peptides without sacrificing specificity, the extracted features are further processed by nonlinear discriminative models under the framework of positive-unlabeled learning with decoy peptides as affirmative negative controls. DreamDIA is publicly available at https://github.com/xmuyulab/DreamDIA-XMBD for high coverage and accuracy DIA data analysis.
Collapse
|
49
|
Santiago-Rodriguez TM, Hollister EB. Multi 'omic data integration: A review of concepts, considerations, and approaches. Semin Perinatol 2021; 45:151456. [PMID: 34256961 DOI: 10.1016/j.semperi.2021.151456] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
The application of 'omic techniques including, but not limited to genomics/metagenomics, transcriptomics/meta-transcriptomics, proteomics/meta-proteomics, and metabolomics to generate multiple datasets from a single sample have facilitated hypothesis generation leading to the identification of biological, molecular and ecological functions and mechanisms, as well as associations and correlations. Despite their power and promise, a variety of challenges must be considered in the successful design and execution of a multi-omics study. In this review, various 'omic technologies applicable to single- and meta-organisms (i.e., host + microbiome) are described, and considerations for sample collection, storage and processing prior to data generation and analysis, as well as approaches to data storage, dissemination and analysis are discussed. Finally, case studies are included as examples of multi-omic applications providing novel insights and a more holistic understanding of biological processes.
Collapse
Affiliation(s)
| | - Emily B Hollister
- Diversigen, Inc, 3 Greenway Plaza, Suite 1575, Houston, TX 77046, USA.
| |
Collapse
|
50
|
Abstract
Direct infusion shotgun proteome analysis (DISPA) is a new paradigm for expedited mass spectrometry-based proteomics, but the original data analysis workflow was onerous. Here, we introduce CsoDIAq, a user-friendly software package for the identification and quantification of peptides and proteins from DISPA data. In addition to establishing a complete and automated analysis workflow with a graphical user interface, CsoDIAq introduces algorithmic concepts to spectrum-spectrum matching to improve peptide identification speed and sensitivity. These include spectra pooling to reduce search time complexity and a new spectrum-spectrum match score called match count and cosine, which improves target discrimination in a target-decoy analysis. Fragment mass tolerance correction also increased the number of peptide identifications. Finally, we adapt CsoDIAq to standard LC-MS DIA and show that it outperforms other spectrum-spectrum matching software.
Collapse
Affiliation(s)
- Caleb W Cranney
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, Wisconsin 53226, United States
| | - Jesse G Meyer
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, Wisconsin 53226, United States
| |
Collapse
|