1
|
Jiang Y, DeBord D, Vitrac H, Stewart J, Haghani A, Van Eyk JE, Fert-Bober J, Meyer JG. The Future of Proteomics is Up in the Air: Can Ion Mobility Replace Liquid Chromatography for High Throughput Proteomics? J Proteome Res 2024; 23:1871-1882. [PMID: 38713528 PMCID: PMC11161313 DOI: 10.1021/acs.jproteome.4c00248] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/09/2024]
Abstract
The coevolution of liquid chromatography (LC) with mass spectrometry (MS) has shaped contemporary proteomics. LC hyphenated to MS now enables quantification of more than 10,000 proteins in a single injection, a number that likely represents most proteins in specific human cells or tissues. Separations by ion mobility spectrometry (IMS) have recently emerged to complement LC and further improve the depth of proteomics. Given the theoretical advantages in speed and robustness of IMS in comparison to LC, we envision that ongoing improvements to IMS paired with MS may eventually make LC obsolete, especially when combined with targeted or simplified analyses, such as rapid clinical proteomics analysis of defined biomarker panels. In this perspective, we describe the need for faster analysis that might drive this transition, the current state of direct infusion proteomics, and discuss some technical challenges that must be overcome to fully complete the transition to entirely gas phase proteomics.
Collapse
Affiliation(s)
- Yuming Jiang
- Department of Computational Biomedicine, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
- The Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
| | - Daniel DeBord
- MOBILion Systems Inc., Chadds Ford, Pennsylvania 19317, United States
| | - Heidi Vitrac
- MOBILion Systems Inc., Chadds Ford, Pennsylvania 19317, United States
| | - Jordan Stewart
- MOBILion Systems Inc., Chadds Ford, Pennsylvania 19317, United States
| | - Ali Haghani
- The Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
- Department of Biomedical Sciences, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
| | - Jennifer E Van Eyk
- The Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
- Department of Biomedical Sciences, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
| | - Justyna Fert-Bober
- The Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
- Department of Biomedical Sciences, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
| | - Jesse G Meyer
- Department of Computational Biomedicine, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
- The Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
| |
Collapse
|
2
|
Ross DH, Bhotika H, Zheng X, Smith RD, Burnum-Johnson KE, Bilbao A. Computational tools and algorithms for ion mobility spectrometry-mass spectrometry. Proteomics 2024; 24:e2200436. [PMID: 38438732 DOI: 10.1002/pmic.202200436] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2023] [Revised: 02/12/2024] [Accepted: 02/14/2024] [Indexed: 03/06/2024]
Abstract
Ion mobility spectrometry-mass spectrometry (IMS-MS or IM-MS) is a powerful analytical technique that combines the gas-phase separation capabilities of IM with the identification and quantification capabilities of MS. IM-MS can differentiate molecules with indistinguishable masses but different structures (e.g., isomers, isobars, molecular classes, and contaminant ions). The importance of this analytical technique is reflected by a staged increase in the number of applications for molecular characterization across a variety of fields, from different MS-based omics (proteomics, metabolomics, lipidomics, etc.) to the structural characterization of glycans, organic matter, proteins, and macromolecular complexes. With the increasing application of IM-MS there is a pressing need for effective and accessible computational tools. This article presents an overview of the most recent free and open-source software tools specifically tailored for the analysis and interpretation of data derived from IM-MS instrumentation. This review enumerates these tools and outlines their main algorithmic approaches, while highlighting representative applications across different fields. Finally, a discussion of current limitations and expectable improvements is presented.
Collapse
Affiliation(s)
- Dylan H Ross
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, USA
| | - Harsh Bhotika
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington, USA
| | - Xueyun Zheng
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, USA
| | - Richard D Smith
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, USA
| | - Kristin E Burnum-Johnson
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington, USA
| | - Aivett Bilbao
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington, USA
| |
Collapse
|
3
|
Oostrom MT, Colby SM, Metz TO. DEIMoS GUI: An Open-Source User Interface for a High-Dimensional Mass Spectrometry Data Processing Tool. J Chem Inf Model 2024; 64:1419-1424. [PMID: 38412257 DOI: 10.1021/acs.jcim.3c01222] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/29/2024]
Abstract
We report here the creation of a graphical user interface (GUI) for the Data Extraction for Integrated Multidimensional Spectrometry (DEIMoS) tool. DEIMoS is a Python package that processes data from high-dimensional mass spectrometry measurements. It is divided into several modules, each representing a data processing step such as peak detection, alignment, and tandem mass spectra extraction and deconvolution. The inputs for and outputs from DEIMoS can include millions of N-dimensional data points, which can be challenging to visualize in a way that is interactive, informative, and responsive. Here, we used the HoloViz Python data visualization stack, including DataShader and Param, to create an interactive visualization of the mass spectrometry data. We believe the GUI will increase the accessibility of DEIMoS and that the visualization methods could be useful for other open-source mass spectrometry tools.
Collapse
Affiliation(s)
- Marjolein T Oostrom
- National Security Directorate, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Sean M Colby
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Thomas O Metz
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| |
Collapse
|
4
|
Strauss MT, Bludau I, Zeng WF, Voytik E, Ammar C, Schessner JP, Ilango R, Gill M, Meier F, Willems S, Mann M. AlphaPept: a modern and open framework for MS-based proteomics. Nat Commun 2024; 15:2168. [PMID: 38461149 PMCID: PMC10924963 DOI: 10.1038/s41467-024-46485-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2022] [Accepted: 02/20/2024] [Indexed: 03/11/2024] Open
Abstract
In common with other omics technologies, mass spectrometry (MS)-based proteomics produces ever-increasing amounts of raw data, making efficient analysis a principal challenge. A plethora of different computational tools can process the MS data to derive peptide and protein identification and quantification. However, during the last years there has been dramatic progress in computer science, including collaboration tools that have transformed research and industry. To leverage these advances, we develop AlphaPept, a Python-based open-source framework for efficient processing of large high-resolution MS data sets. Numba for just-in-time compilation on CPU and GPU achieves hundred-fold speed improvements. AlphaPept uses the Python scientific stack of highly optimized packages, reducing the code base to domain-specific tasks while accessing the latest advances. We provide an easy on-ramp for community contributions through the concept of literate programming, implemented in Jupyter Notebooks. Large datasets can rapidly be processed as shown by the analysis of hundreds of proteomes in minutes per file, many-fold faster than acquisition. AlphaPept can be used to build automated processing pipelines with web-serving functionality and compatibility with downstream analysis tools. It provides easy access via one-click installation, a modular Python library for advanced users, and via an open GitHub repository for developers.
Collapse
Affiliation(s)
- Maximilian T Strauss
- Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany.
- NNF Center for Protein Research, Faculty of Health Sciences, University of Copenhagen, Copenhagen, Denmark.
| | - Isabell Bludau
- Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany
| | - Wen-Feng Zeng
- Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany
| | - Eugenia Voytik
- Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany
| | - Constantin Ammar
- Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany
| | - Julia P Schessner
- Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany
| | | | | | - Florian Meier
- Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany
- Functional Proteomics, Jena University Hospital, Jena, Germany
| | - Sander Willems
- Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany
| | - Matthias Mann
- Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany.
- NNF Center for Protein Research, Faculty of Health Sciences, University of Copenhagen, Copenhagen, Denmark.
| |
Collapse
|
5
|
Xu A, Tang LC, Jovanovic M, Regev O. Uncovering Distinct Peptide Charging Behaviors in Electrospray Ionization Mass Spectrometry Using a Large-Scale Dataset. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2024; 35:90-99. [PMID: 38095561 PMCID: PMC10767741 DOI: 10.1021/jasms.3c00325] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Revised: 11/20/2023] [Accepted: 11/22/2023] [Indexed: 12/26/2023]
Abstract
Electrospray ionization is a powerful and prevalent technique used to ionize analytes in mass spectrometry. The distribution of charges that an analyte receives (charge state distribution, CSD) is an important consideration for interpreting mass spectra. However, due to an incomplete understanding of the ionization mechanism, the analyte properties that influence CSDs are not fully understood. Here, we employ a machine learning-based approach and analyze CSDs of hundreds of thousands of peptides. Interestingly, half of the peptides exhibit charges that differ from what one would naively expect (the number of basic sites). We find that these peptides can be classified into two regimes (undercharging and overcharging) and that these two regimes display markedly different charging characteristics. Notably, peptides in the overcharging regime show minimal dependence on basic site count, and more generally, the two regimes exhibit distinct sequence determinants. These findings highlight the rich ionization behavior of peptides and the potential of CSDs for enhancing peptide identification.
Collapse
Affiliation(s)
- Allyn
M. Xu
- Department
of Mathematics, Courant Institute of Mathematical Sciences, New York University, New York, New York 10012, United States
| | - Lauren C. Tang
- Department
of Biological Sciences, Columbia University, New York, New York 10027, United States
| | - Marko Jovanovic
- Department
of Biological Sciences, Columbia University, New York, New York 10027, United States
| | - Oded Regev
- Computer
Science Department, Courant Institute of Mathematical Sciences, New York University, New York, New York 10012, United States
| |
Collapse
|
6
|
Jiang LX, Hernly E, Hu H, Hilger RT, Neuweger H, Yang M, Laskin J. Nanospray Desorption Electrospray Ionization (Nano-DESI) Mass Spectrometry Imaging with High Ion Mobility Resolution. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2023; 34:1798-1804. [PMID: 37463098 PMCID: PMC10513741 DOI: 10.1021/jasms.3c00199] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/20/2023]
Abstract
Untargeted separation of isomeric and isobaric species in mass spectrometry imaging (MSI) is challenging. The combination of ion mobility spectrometry (IMS) with MSI has emerged as an effective strategy for differentiating isomeric and isobaric species, which substantially enhances the molecular coverage and specificity of MSI experiments. In this study, we have implemented nanospray desorption electrospray ionization (nano-DESI) MSI on a trapped ion mobility spectrometry (TIMS) mass spectrometer. A new nano-DESI source was constructed, and a specially designed inlet extension was fabricated to accommodate the new source. The nano-DESI-TIMS-MSI platform was evaluated by imaging mouse brain tissue sections. We achieved high ion mobility resolution by utilizing three narrow mobility scan windows that covered the majority of the lipid molecules. Notably, the mobility resolution reaching up to 300 in this study is much higher than the resolution obtained in our previous study using drift tube IMS. High-resolution TIMS successfully separated lipid isomers and isobars, revealing their distinct localizations in tissue samples. Our results further demonstrate the power of high-mobility-resolution IMS for unraveling the complexity of biomolecular mixtures analyzed in MSI experiments.
Collapse
Affiliation(s)
- Li-Xue Jiang
- Department of Chemistry, Purdue University, West Lafayette, IN, 47907, United States
| | - Emerson Hernly
- Department of Chemistry, Purdue University, West Lafayette, IN, 47907, United States
| | - Hang Hu
- Department of Chemistry, Purdue University, West Lafayette, IN, 47907, United States
| | - Ryan T. Hilger
- Department of Chemistry, Purdue University, West Lafayette, IN, 47907, United States
| | | | - Manxi Yang
- Department of Chemistry, Purdue University, West Lafayette, IN, 47907, United States
| | - Julia Laskin
- Department of Chemistry, Purdue University, West Lafayette, IN, 47907, United States
| |
Collapse
|
7
|
Xu AM, Tang LC, Jovanovic M, Regev O. A high-throughput approach reveals distinct peptide charging behaviors in electrospray ionization mass spectrometry. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.31.535171. [PMID: 37066236 PMCID: PMC10103939 DOI: 10.1101/2023.03.31.535171] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Electrospray ionization is a powerful and prevalent technique used to ionize analytes in mass spectrometry. The distribution of charges that an analyte receives (charge state distribution, CSD) is an important consideration for interpreting mass spectra. However, due to an incomplete understanding of the ionization mechanism, the analyte properties that influence CSDs are not fully understood. Here, we employ a machine learning-based high-throughput approach and analyze CSDs of hundreds of thousands of peptides. Interestingly, half of the peptides exhibit charges that differ from what one would naively expect (number of basic sites). We find that these peptides can be classified into two regimes-undercharging and overcharging-and that these two regimes display markedly different charging characteristics. Strikingly, peptides in the overcharging regime show minimal dependence on basic site count, and more generally, the two regimes exhibit distinct sequence determinants. These findings highlight the rich ionization behavior of peptides and the potential of CSDs for enhancing peptide identification.
Collapse
Affiliation(s)
- Allyn M. Xu
- Department of Mathematics, Courant Institute of Mathematical Sciences, New York University, NY, USA
| | - Lauren C. Tang
- Department of Biological Sciences, Columbia University, New York, NY, USA
| | - Marko Jovanovic
- Department of Biological Sciences, Columbia University, New York, NY, USA
| | - Oded Regev
- Computer Science Department, Courant Institute of Mathematical Sciences, New York University, NY, USA
| |
Collapse
|
8
|
Skowronek P, Krohs F, Lubeck M, Wallmann G, Itang ECM, Koval P, Wahle M, Thielert M, Meier F, Willems S, Raether O, Mann M. Synchro-PASEF Allows Precursor-Specific Fragment Ion Extraction and Interference Removal in Data-Independent Acquisition. Mol Cell Proteomics 2023; 22:100489. [PMID: 36566012 PMCID: PMC9868879 DOI: 10.1016/j.mcpro.2022.100489] [Citation(s) in RCA: 19] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Revised: 12/12/2022] [Accepted: 12/19/2022] [Indexed: 12/24/2022] Open
Abstract
Data-independent acquisition (DIA) methods have become increasingly popular in mass spectrometry-based proteomics because they enable continuous acquisition of fragment spectra for all precursors simultaneously. However, these advantages come with the challenge of correctly reconstructing the precursor-fragment relationships in these highly convoluted spectra for reliable identification and quantification. Here, we introduce a scan mode for the combination of trapped ion mobility spectrometry with parallel accumulation-serial fragmentation (PASEF) that seamlessly and continuously follows the natural shape of the ion cloud in ion mobility and peptide precursor mass dimensions. Termed synchro-PASEF, it increases the detected fragment ion current several-fold at sub-second cycle times. Consecutive quadrupole selection windows move synchronously through the mass and ion mobility range. In this process, the quadrupole slices through the peptide precursors, which separates fragment ion signals of each precursor into adjacent synchro-PASEF scans. This precisely defines precursor-fragment relationships in ion mobility and mass dimensions and effectively deconvolutes the DIA fragment space. Importantly, the partitioned parts of the fragment ion transitions provide a further dimension of specificity via a lock-and-key mechanism. This is also advantageous for quantification, where signals from interfering precursors in the DIA selection window do not affect all partitions of the fragment ion, allowing to retain only the specific parts for quantification. Overall, we establish the defining features of synchro-PASEF and explore its potential for proteomic analyses.
Collapse
Affiliation(s)
- Patricia Skowronek
- Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany
| | - Florian Krohs
- Research and Development, Bruker Daltonics GmbH & Co KG, Bremen, Germany
| | - Markus Lubeck
- Research and Development, Bruker Daltonics GmbH & Co KG, Bremen, Germany
| | - Georg Wallmann
- Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany
| | - Ericka C M Itang
- Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany
| | - Polina Koval
- Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany; Biomedicine and Neuroscience, Kyiv Academic University, Kyiv, Ukraine
| | - Maria Wahle
- Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany
| | - Marvin Thielert
- Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany
| | - Florian Meier
- Functional Proteomics, Jena University Hospital, Jena, Germany
| | - Sander Willems
- Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany; Research and Development, Bruker Belgium nv., Kontich, Belgium.
| | - Oliver Raether
- Research and Development, Bruker Daltonics GmbH & Co KG, Bremen, Germany.
| | - Matthias Mann
- Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany; Proteomics, NNF Center for Protein Research, Copenhagen, Denmark.
| |
Collapse
|
9
|
Hu H, Laskin J. Emerging Computational Methods in Mass Spectrometry Imaging. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2022; 9:e2203339. [PMID: 36253139 PMCID: PMC9731724 DOI: 10.1002/advs.202203339] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/07/2022] [Revised: 09/17/2022] [Indexed: 05/10/2023]
Abstract
Mass spectrometry imaging (MSI) is a powerful analytical technique that generates maps of hundreds of molecules in biological samples with high sensitivity and molecular specificity. Advanced MSI platforms with capability of high-spatial resolution and high-throughput acquisition generate vast amount of data, which necessitates the development of computational tools for MSI data analysis. In addition, computation-driven MSI experiments have recently emerged as enabling technologies for further improving the MSI capabilities with little or no hardware modification. This review provides a critical summary of computational methods and resources developed for MSI data analysis and interpretation along with computational approaches for improving throughput and molecular coverage in MSI experiments. This review is focused on the recently developed artificial intelligence methods and provides an outlook for a future paradigm shift in MSI with transformative computational methods.
Collapse
Affiliation(s)
- Hang Hu
- Department of ChemistryPurdue University560 Oval DriveWest LafayetteIN47907USA
| | - Julia Laskin
- Department of ChemistryPurdue University560 Oval DriveWest LafayetteIN47907USA
| |
Collapse
|
10
|
Luu GT, Freitas MA, Lizama-Chamu I, McCaughey CS, Sanchez LM, Wang M. TIMSCONVERT: a workflow to convert trapped ion mobility data to open data formats. Bioinformatics 2022; 38:4046-4047. [PMID: 35758608 PMCID: PMC9991885 DOI: 10.1093/bioinformatics/btac419] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2021] [Revised: 05/10/2022] [Accepted: 06/26/2022] [Indexed: 12/24/2022] Open
Abstract
MOTIVATION Advances in mass spectrometry have led to the development of mass spectrometers with ion mobility spectrometry capabilities and dual-source instrumentation; however, the current software ecosystem lacks interoperability with downstream data analysis using open-source software and pipelines. RESULTS Here, we present TIMSCONVERT, a data conversion high-throughput workflow from timsTOF Pro/fleX mass spectrometer raw data files to mzML and imzML formats that incorporates ion mobility data while maintaining compatibility with data analysis tools. We showcase several examples using data acquired across different experiments and acquisition modalities on the timsTOF fleX MS. AVAILABILITY AND IMPLEMENTATION TIMSCONVERT and its documentation can be found at https://github.com/gtluu/timsconvert and is available as a standalone command-line interface tool for Windows and Linux, NextFlow workflow and online in the Global Natural Products Social (GNPS) platform. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Gordon T Luu
- Department of Chemistry and Biochemistry, University of California Santa Cruz, Santa Cruz, CA, 95064, USA
| | - Michael A Freitas
- Department of Cancer Biology and Genetics, Ohio State University, Columbus, OH, 43210, USA
| | - Itzel Lizama-Chamu
- Department of Chemistry and Biochemistry, University of California Santa Cruz, Santa Cruz, CA, 95064, USA
| | - Catherine S McCaughey
- Department of Chemistry and Biochemistry, University of California Santa Cruz, Santa Cruz, CA, 95064, USA
| | - Laura M Sanchez
- Department of Chemistry and Biochemistry, University of California Santa Cruz, Santa Cruz, CA, 95064, USA
| | - Mingxun Wang
- Department of Computer Science and Engineering, University of California Riverside, Riverside, CA 92521, USA
| |
Collapse
|
11
|
Skowronek P, Thielert M, Voytik E, Tanzer MC, Hansen FM, Willems S, Karayel Ö, Brunner AD, Meier F, Mann M. Rapid and in-depth coverage of the (phospho-)proteome with deep libraries and optimal window design for dia-PASEF. Mol Cell Proteomics 2022; 21:100279. [PMID: 35944843 PMCID: PMC9465115 DOI: 10.1016/j.mcpro.2022.100279] [Citation(s) in RCA: 30] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2022] [Revised: 07/31/2022] [Accepted: 08/02/2022] [Indexed: 11/05/2022] Open
Abstract
Data-independent acquisition (DIA) methods have become increasingly attractive in mass spectrometry–based proteomics because they enable high data completeness and a wide dynamic range. Recently, we combined DIA with parallel accumulation–serial fragmentation (dia-PASEF) on a Bruker trapped ion mobility (IM) separated quadrupole time-of-flight mass spectrometer. This requires alignment of the IM separation with the downstream mass selective quadrupole, leading to a more complex scheme for dia-PASEF window placement compared with DIA. To achieve high data completeness and deep proteome coverage, here we employ variable isolation windows that are placed optimally depending on precursor density in the m/z and IM plane. This is implemented in the freely available py_diAID (Python package for DIA with an automated isolation design) package. In combination with in-depth project-specific proteomics libraries and the Evosep liquid chromatography system, we reproducibly identified over 7700 proteins in a human cancer cell line in 44 min with quadruplicate single-shot injections at high sensitivity. Even at a throughput of 100 samples per day (11 min liquid chromatography gradients), we consistently quantified more than 6000 proteins in mammalian cell lysates by injecting four replicates. We found that optimal dia-PASEF window placement facilitates in-depth phosphoproteomics with very high sensitivity, quantifying more than 35,000 phosphosites in a human cancer cell line stimulated with an epidermal growth factor in triplicate 21 min runs. This covers a substantial part of the regulated phosphoproteome with high sensitivity, opening up for extensive systems-biological studies. Optimal dia-PASEF window design with py_diAID combined with deep libraries. Quantification of the HeLa cell proteome to a depth of >7700 in only 44 min. Ion mobility–resolved phosphoproteomics determines >35,000 class I phosphosites. py_diAID is freely available as GUI, CLI, and Python modules.
Collapse
|
12
|
Simplifying MS1 and MS2 spectra to achieve lower mass error, more dynamic range, and higher peptide identification confidence on the Bruker timsTOF Pro. PLoS One 2022; 17:e0271025. [PMID: 35797390 PMCID: PMC9262215 DOI: 10.1371/journal.pone.0271025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2022] [Accepted: 06/19/2022] [Indexed: 11/24/2022] Open
Abstract
For bottom-up proteomic analysis, the goal of analytical pipelines that process the raw output of mass spectrometers is to detect, characterise, identify, and quantify peptides. The initial steps of detecting and characterising features in raw data must overcome some considerable challenges. The data presents as a sparse array, sometimes containing billions of intensity readings over time. These points represent both signal and chemical or electrical noise. Depending on the biological sample’s complexity, tens to hundreds of thousands of peptides may be present in this vast data landscape. For ion mobility-based LC-MS analysis, each peptide is comprised of a grouping of hundreds of single intensity readings in three dimensions: mass-over-charge (m/z), mobility, and retention time. There is no inherent information about any associations between individual points; whether they represent a peptide or noise must be inferred from their structure. Peptides each have multiple isotopes, different charge states, and a dynamic range of intensity of over six orders of magnitude. Due to the high complexity of most biological samples, peptides often overlap in time and mobility, making it very difficult to tease apart isotopic peaks, to apportion the intensity of each and the contribution of each isotope to the determination of the peptide’s monoisotopic mass, which is critical for the peptide’s identification. Here we describe four algorithms for the Bruker timsTOF Pro that each play an important role in finding peptide features and determining their characteristics. These algorithms focus on separate characteristics that determine how candidate features are detected in the raw data. The first two algorithms deal with the complexity of the raw data, rapidly clustering raw data into spectra that allows isotopic peaks to be resolved. The third algorithm compensates for saturation of the instrument’s detector thereby recovering lost dynamic range, and lastly, the fourth algorithm increases confidence of peptide identifications by simplification of the fragment spectra. These algorithms are effective in processing raw data to detect features and extracting the attributes required for peptide identification, and make an important contribution to an analytical pipeline by detecting features that are higher quality and better segmented from other peptides in close proximity. The software has been developed in Python using Numpy and Pandas and made freely available with an open-source MIT license to facilitate experimentation and further improvement (DOI 10.5281/zenodo.6513126). Data are available via ProteomeXchange with identifier PXD030706. The primary goal of mass spectrometry data processing pipelines in the proteomic analysis of complex biological samples is to identify peptides accurately and comprehensively with abundance across a broad dynamic range. It has been reported that detection of low-abundance peptides for early-disease biomarkers in complex fluids is limited by the sensitivity of biomarker discovery platforms [1], the dynamic range of plasma abundance, which can exceed ten orders of magnitude [2], and the fact that lower abundance proteins provide the most insight in disease processes [3]. As mass spectrometry hardware improves, the corresponding increase in amounts of data for analysis pushes legacy software analysis methods out of their designed specification. Additionally, experimentation with new algorithms to analyse raw data produced by instruments such as the Bruker timsTOF Pro has been hampered by the paucity of modular, open-source software pipelines written in languages accessible by the large community of data scientists. Here we present several algorithms for simplifying MS1 and MS2 spectra that are written in Python. We show that these algorithms are effective to help improve the quality and accuracy of peptide identifications.
Collapse
|
13
|
De La Toba EA, Bell SE, Romanova EV, Sweedler JV. Mass Spectrometry Measurements of Neuropeptides: From Identification to Quantitation. ANNUAL REVIEW OF ANALYTICAL CHEMISTRY (PALO ALTO, CALIF.) 2022; 15:83-106. [PMID: 35324254 DOI: 10.1146/annurev-anchem-061020-022048] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Neuropeptides (NPs), a unique class of neuronal signaling molecules, participate in a variety of physiological processes and diseases. Quantitative measurements of NPs provide valuable information regarding how these molecules are differentially regulated in a multitude of neurological, metabolic, and mental disorders. Mass spectrometry (MS) has evolved to become a powerful technique for measuring trace levels of NPs in complex biological tissues and individual cells using both targeted and exploratory approaches. There are inherent challenges to measuring NPs, including their wide endogenous concentration range, transport and postmortem degradation, complex sample matrices, and statistical processing of MS data required for accurate NP quantitation. This review highlights techniques developed to address these challenges and presents an overview of quantitative MS-based measurement approaches for NPs, including the incorporation of separation methods for high-throughput analysis, MS imaging for spatial measurements, and methods for NP quantitation in single neurons.
Collapse
Affiliation(s)
- Eduardo A De La Toba
- Department of Chemistry, University of Illinois Urbana-Champaign, Urbana, Illinois, USA;
- Beckman Institute for Advanced Science and Technology, University of Illinois Urbana-Champaign, Urbana, Illinois, USA
| | - Sara E Bell
- Department of Chemistry, University of Illinois Urbana-Champaign, Urbana, Illinois, USA;
- Beckman Institute for Advanced Science and Technology, University of Illinois Urbana-Champaign, Urbana, Illinois, USA
| | - Elena V Romanova
- Department of Chemistry, University of Illinois Urbana-Champaign, Urbana, Illinois, USA;
- Beckman Institute for Advanced Science and Technology, University of Illinois Urbana-Champaign, Urbana, Illinois, USA
| | - Jonathan V Sweedler
- Department of Chemistry, University of Illinois Urbana-Champaign, Urbana, Illinois, USA;
- Beckman Institute for Advanced Science and Technology, University of Illinois Urbana-Champaign, Urbana, Illinois, USA
| |
Collapse
|
14
|
Schessner JP, Voytik E, Bludau I. A practical guide to interpreting and generating bottom-up proteomics data visualizations. Proteomics 2022; 22:e2100103. [PMID: 35107884 DOI: 10.1002/pmic.202100103] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2021] [Revised: 12/22/2021] [Accepted: 01/20/2022] [Indexed: 11/10/2022]
Abstract
Mass-spectrometry based bottom-up proteomics is the main method to analyze proteomes comprehensively and the rapid evolution of instrumentation and data analysis has made the technology widely available. Data visualization is an integral part of the analysis process and it is crucial for the communication of results. This is a major challenge due to the immense complexity of MS data. In this review, we provide an overview of commonly used visualizations, starting with raw data of traditional and novel MS technologies, then basic peptide and protein level analyses, and finally visualization of highly complex datasets and networks. We specifically provide guidance on how to critically interpret and discuss the multitude of different proteomics data visualizations. Furthermore, we highlight Python-based libraries and other open science tools that can be applied for independent and transparent generation of customized visualizations. To further encourage programmatic data visualization, we provide the Python code used to generate all data Figures in this review on GitHub (https://github.com/MannLabs/ProteomicsVisualization). This article is protected by copyright. All rights reserved.
Collapse
Affiliation(s)
- Julia Patricia Schessner
- Max-Planck-Institute of Biochemistry, Department of Proteomics and Signal Transduction, Planegg, Germany
| | - Eugenia Voytik
- Max-Planck-Institute of Biochemistry, Department of Proteomics and Signal Transduction, Planegg, Germany
| | - Isabell Bludau
- Max-Planck-Institute of Biochemistry, Department of Proteomics and Signal Transduction, Planegg, Germany
| |
Collapse
|
15
|
Voytik E, Bludau I, Willems S, Hansen FM, Brunner AD, Strauss MT, Mann M. AlphaMap: an open-source Python package for the visual annotation of proteomics data with sequence-specific knowledge. Bioinformatics 2021; 38:849-852. [PMID: 34586352 PMCID: PMC8756201 DOI: 10.1093/bioinformatics/btab674] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2021] [Revised: 09/02/2021] [Accepted: 09/22/2021] [Indexed: 02/03/2023] Open
Abstract
SUMMARY Integrating experimental information across proteomic datasets with the wealth of publicly available sequence annotations is a crucial part in many proteomic studies that currently lacks an automated analysis platform. Here, we present AlphaMap, a Python package that facilitates the visual exploration of peptide-level proteomics data. Identified peptides and post-translational modifications in proteomic datasets are mapped to their corresponding protein sequence and visualized together with prior knowledge from UniProt and with expected proteolytic cleavage sites. The functionality of AlphaMap can be accessed via an intuitive graphical user interface or-more flexibly-as a Python package that allows its integration into common analysis workflows for data visualization. AlphaMap produces publication-quality illustrations and can easily be customized to address a given research question. AVAILABILITY AND IMPLEMENTATION AlphaMap is implemented in Python and released under an Apache license. The source code and one-click installers are freely available at https://github.com/MannLabs/alphamap. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | | | - Sander Willems
- Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, 82152 Martinsried, Germany
| | - Fynn M Hansen
- Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, 82152 Martinsried, Germany
| | - Andreas-David Brunner
- Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, 82152 Martinsried, Germany
| | - Maximilian T Strauss
- Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, 82152 Martinsried, Germany
| | | |
Collapse
|