1
Wang CR, McFarlane LO, Pukala TL. Exploring snake venoms beyond the primary sequence: From proteoforms to protein-protein interactions. Toxicon 2024; 247:107841. [PMID: 38950738 DOI: 10.1016/j.toxicon.2024.107841] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/22/2024] [Revised: 06/26/2024] [Accepted: 06/28/2024] [Indexed: 07/03/2024]
Abstract
Snakebite envenomation has been a long-standing global issue that is difficult to treat, largely owing to the flawed nature of current immunoglobulin-based antivenom therapy and the complexity of snake venoms as sophisticated mixtures of bioactive proteins and peptides. Comprehensive characterisation of venom compositions is essential to better understand snake venom toxicity and to inform effective and rationally designed antivenoms. Additionally, a greater understanding of snake venom composition will likely unearth novel biologically active proteins and peptides that have promising therapeutic or biotechnological applications. While a bottom-up proteomic workflow has been the main approach for cataloguing snake venom compositions at the toxin family level, it is unable to capture snake venom heterogeneity in the form of protein isoforms and higher-order protein interactions that are important in driving venom toxicity but remain underexplored. This review aims to highlight the importance of understanding snake venom heterogeneity beyond the primary sequence, in the form of post-translational modifications that give rise to different proteoforms and the myriad of higher-order protein complexes in snake venoms. We focus on current top-down proteomic workflows to identify snake venom proteoforms and further discuss alternative or novel separation, instrumentation, and data processing strategies that may improve proteoform identification. The current higher-order structural characterisation techniques implemented for snake venom proteins are also discussed; we emphasise the need for complementary and higher resolution structural bioanalytical techniques such as mass spectrometry-based approaches, X-ray crystallography and cryogenic electron microscopy, to elucidate poorly characterised tertiary and quaternary protein structures. We envisage that the expansion of the snake venom characterisation "toolbox" with top-down proteomics and high-resolution protein structure determination techniques will be pivotal in advancing structural understanding of snake venoms towards the development of improved therapeutic and biotechnology applications.
Affiliation(s)
- C Ruth Wang
- Discipline of Chemistry, School of Physics, Chemistry and Earth Sciences, The University of Adelaide, Adelaide, 5005, Australia
- Lewis O McFarlane
- Discipline of Chemistry, School of Physics, Chemistry and Earth Sciences, The University of Adelaide, Adelaide, 5005, Australia
- Tara L Pukala
- Discipline of Chemistry, School of Physics, Chemistry and Earth Sciences, The University of Adelaide, Adelaide, 5005, Australia
2
Popova L, Carr RA, Carabetta VJ. Recent Contributions of Proteomics to Our Understanding of Reversible Nε-Lysine Acylation in Bacteria. J Proteome Res 2024; 23:2733-2749. [PMID: 38442041 PMCID: PMC11296938 DOI: 10.1021/acs.jproteome.3c00912] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 03/07/2024]
Abstract
Post-translational modifications (PTMs) have been extensively studied in both eukaryotes and prokaryotes. Lysine acetylation, originally thought to be a rare occurrence in bacteria, is now recognized as a prevalent and important PTM in more than 50 species. This expansion in interest in bacterial PTMs became possible with the advancement of mass spectrometry technology and improved reagents such as acyl-modification specific antibodies. In this Review, we discuss how mass spectrometry-based proteomic studies of lysine acetylation and other acyl modifications have contributed to our understanding of bacterial physiology, focusing on recently published studies from 2018 to 2023. We begin with a discussion of approaches used to study bacterial PTMs. Next, we discuss newly characterized acylomes, including acetylomes, succinylomes, and malonylomes, in different bacterial species. In addition, we examine proteomic contributions to our understanding of bacterial virulence and biofilm formation. Finally, we discuss the contributions of mass spectrometry to our understanding of the mechanisms of acetylation, both enzymatic and nonenzymatic. We end with a discussion of the current state of the field and possible future research avenues to explore.
Affiliation(s)
- Liya Popova
- Department of Biomedical Sciences, Cooper Medical School of Rowan University, Camden, New Jersey 08103, United States
- Rachel A Carr
- Department of Biomedical Sciences, Cooper Medical School of Rowan University, Camden, New Jersey 08103, United States
- Valerie J Carabetta
- Department of Biomedical Sciences, Cooper Medical School of Rowan University, Camden, New Jersey 08103, United States
3
Xu T, Wang Q, Wang Q, Sun L. Mass spectrometry-intensive top-down proteomics: an update on technology advancements and biomedical applications. Analytical Methods 2024; 16:4664-4682. [PMID: 38973469 PMCID: PMC11257149 DOI: 10.1039/d4ay00651h] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/09/2024] [Accepted: 06/25/2024] [Indexed: 07/09/2024]
Abstract
Proteoforms are all forms of protein molecules from the same gene because of variations at the DNA, RNA, and protein levels, e.g., alternative splicing and post-translational modifications (PTMs). Delineation of proteins in a proteoform-specific manner is crucial for understanding their biological functions. Mass spectrometry (MS)-intensive top-down proteomics (TDP) is promising for comprehensively characterizing intact proteoforms in complex biological systems. It has achieved substantial progress in technological development, including sample preparation, proteoform separations, MS instrumentation, and bioinformatics tools. In a single TDP study, thousands of proteoforms can be identified and quantified from a cell lysate. It has also been applied to various biomedical research to better our understanding of protein function in regulating cellular processes and to discover novel proteoform biomarkers of diseases for early diagnosis and therapeutic development. This review covers the most recent technological development and biomedical applications of MS-intensive TDP.
Affiliation(s)
- Tian Xu
- Department of Chemistry, Michigan State University, 578 S Shaw Lane, East Lansing, MI 48824, USA
- Qianjie Wang
- Department of Chemistry, Michigan State University, 578 S Shaw Lane, East Lansing, MI 48824, USA
- Qianyi Wang
- Department of Chemistry, Michigan State University, 578 S Shaw Lane, East Lansing, MI 48824, USA
- Liangliang Sun
- Department of Chemistry, Michigan State University, 578 S Shaw Lane, East Lansing, MI 48824, USA
4
Pavek JG, Frey BL, Frost DC, Gu TJ, Li L, Smith LM. Cysteine Counting via Isotopic Chemical Labeling for Intact Mass Proteoform Identifications in Tissue. Anal Chem 2023; 95:15245-15253. [PMID: 37791746 PMCID: PMC10637319 DOI: 10.1021/acs.analchem.3c02473] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/05/2023]
Abstract
Top-down proteomics, the tandem mass spectrometric analysis of intact proteoforms, is the dominant method for proteoform characterization in complex mixtures. While this strategy produces detailed molecular information, it also requires extensive instrument time per mass spectrum obtained and thus compromises the depth of proteoform coverage that is accessible on liquid chromatography time scales. Such a top-down analysis is necessary for making original proteoform identifications, but once a proteoform has been confidently identified, the extensive characterization it provides may no longer be required for a subsequent identification of the same proteoform. We present a strategy to identify proteoforms in tissue samples on the basis of the combination of an intact mass determination with a measured count of the number of cysteine residues present in each proteoform. We developed and characterized a cysteine tagging chemistry suitable for the efficient and specific labeling of cysteine residues within intact proteoforms and for providing a count of the cysteine amino acids present. On simple protein mixtures, the tagging chemistry yields greater than 98% labeling of all cysteine residues, with a labeling specificity of greater than 95%. Similar results are observed on more complex samples. In a proof-of-principle study, proteoforms present in a human prostate tumor biopsy were characterized. Observed proteoforms, each characterized by an intact mass and a cysteine count, were grouped into proteoform families (groups of proteoforms originating from the same gene). We observed 2190 unique experimental proteoforms, 703 of which were grouped into 275 proteoform families.
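As a rough illustration of the idea in this abstract, the following Python sketch pairs an intact mass with a cysteine count to narrow down candidate proteoforms. The tag masses, the ppm tolerance, the function names and the candidate list are hypothetical placeholders chosen for the example; the abstract does not give the reagent masses, so this should not be read as the authors' implementation.

```python
# Hypothetical values for illustration only; the abstract does not state
# the actual reagent masses used by the authors.
LIGHT_TAG_DA = 57.02146          # assumed mass added per labeled cysteine (light tag)
HEAVY_LIGHT_DELTA_DA = 6.02013   # assumed heavy-minus-light difference per cysteine


def cysteine_count(light_mass, heavy_mass, delta=HEAVY_LIGHT_DELTA_DA):
    """Infer the number of labeled cysteines from the heavy/light mass gap."""
    return round((heavy_mass - light_mass) / delta)


def match_candidates(light_mass, heavy_mass, candidates, ppm_tol=10.0):
    """Return names of candidates that agree in unlabeled mass AND cysteine count.

    candidates: iterable of (name, theoretical_mass_da, n_cysteines).
    """
    n_cys = cysteine_count(light_mass, heavy_mass)
    # Remove the assumed tag mass to recover the unlabeled intact mass.
    unlabeled = light_mass - n_cys * LIGHT_TAG_DA
    hits = []
    for name, theo_mass, theo_cys in candidates:
        ppm_error = abs(unlabeled - theo_mass) / theo_mass * 1e6
        if ppm_error <= ppm_tol and theo_cys == n_cys:
            hits.append(name)
    return hits


# Made-up candidate database for the example.
candidates = [
    ("proteoform_A", 10000.02, 3),
    ("proteoform_B", 10000.02, 2),   # same mass, different cysteine count
    ("proteoform_C", 10050.00, 3),   # same cysteine count, different mass
]
light = 10000.0 + 3 * LIGHT_TAG_DA
heavy = light + 3 * HEAVY_LIGHT_DELTA_DA
print(match_candidates(light, heavy, candidates))
```

The cysteine count acts as a second, independent filter: two candidates with indistinguishable intact masses can still be separated when their cysteine counts differ.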
Affiliation(s)
- John G. Pavek
- Department of Chemistry, University of Wisconsin-Madison, 1101 University Ave. Madison, WI 53706
- Brian L. Frey
- Department of Chemistry, University of Wisconsin-Madison, 1101 University Ave. Madison, WI 53706
- Dustin C. Frost
- School of Pharmacy, University of Wisconsin-Madison, 777 Highland Ave, Madison, WI 53705
- Ting-Jia Gu
- School of Pharmacy, University of Wisconsin-Madison, 777 Highland Ave, Madison, WI 53705
- Lingjun Li
- Department of Chemistry, University of Wisconsin-Madison, 1101 University Ave. Madison, WI 53706
- School of Pharmacy, University of Wisconsin-Madison, 777 Highland Ave, Madison, WI 53705
- Lloyd M. Smith
- Department of Chemistry, University of Wisconsin-Madison, 1101 University Ave. Madison, WI 53706
5
Robey MT, Utley D, Greer JB, Fellers RT, Kelleher NL, Durbin KR. Advancing Intact Protein Quantitation with Updated Deconvolution Routines. Anal Chem 2023; 95:14954-14962. [PMID: 37750863 PMCID: PMC10840078 DOI: 10.1021/acs.analchem.3c02345] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 09/27/2023]
Abstract
Analysis of intact proteins by mass spectrometry enables direct quantitation of the specific proteoforms present in a sample and is an increasingly important tool for biopharmaceutical and academic research. Interpreting and quantifying intact protein species from mass spectra typically involves many challenges including mass deconvolution and peak processing as well as determining optimal spectral averaging parameters and matching masses to theoretical proteoforms. Each of these steps can present informatic hurdles, as parameters often need to be tailored specifically to the data sets. To reduce intact mass deconvolution data analysis burdens, we built upon the widely used "sliding window" mass deconvolution technique with several additional concepts. First, we found that how spectra are averaged and the overlap in spectral windows can be tuned to favor either sensitivity or speed. A multiple window averaging approach was found to be the most effective way to increase mass detection and yielded a >2-fold increase in the number of masses detected. We also developed a targeted feature-finding routine that boosted sensitivity by >2-fold, decreased coefficient of variation across replicates by 50%, and increased the quality of mass elution profiles through 3-fold more detected time points. Lastly, we furthered existing approaches for annotating detected masses with potential proteoforms through spectral fitting for possible proteoform family modifications and network viewing. These proteoform annotation approaches ultimately produced a more accurate way of finding related, but previously unknown proteoforms from intact mass-only data. Together, these quantitation workflow improvements advance the information obtainable from intact protein mass spectrometry analyses.
Affiliation(s)
- Matthew T Robey
- Proteinaceous, Inc., Evanston, Illinois 60201, United States
- Northwestern University, Evanston, Illinois 60208, United States
- Daisha Utley
- Proteinaceous, Inc., Evanston, Illinois 60201, United States
- Joseph B Greer
- Proteinaceous, Inc., Evanston, Illinois 60201, United States
- Northwestern University, Evanston, Illinois 60208, United States
- Ryan T Fellers
- Proteinaceous, Inc., Evanston, Illinois 60201, United States
- Northwestern University, Evanston, Illinois 60208, United States
- Neil L Kelleher
- Proteinaceous, Inc., Evanston, Illinois 60201, United States
- Northwestern University, Evanston, Illinois 60208, United States
- Kenneth R Durbin
- Proteinaceous, Inc., Evanston, Illinois 60201, United States
- Northwestern University, Evanston, Illinois 60208, United States
6
Lignieres L, Legros V, Khelil M, Senecaut N, Lauber MA, Camadro JM, Chevreux G. Capillary liquid chromatography coupled with mass spectrometry for analysis of nanogram protein quantities on a wide-pore superficially porous particle column in top-down proteomics. J Chromatogr B Analyt Technol Biomed Life Sci 2023; 1214:123566. [PMID: 36516651 DOI: 10.1016/j.jchromb.2022.123566] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/06/2022] [Revised: 12/01/2022] [Accepted: 12/02/2022] [Indexed: 12/13/2022]
Abstract
In top-down proteomics experiments, intact protein ions are subjected to gas-phase fragmentation for MS analysis without prior digestion. This approach is used to characterize post-translational modifications and clipped forms of proteins, avoids several "inference" problems associated with bottom-up proteomics, and is well suited to the study of proteoforms. In the past decade, top-down proteomics has progressed rapidly, taking advantage of MS instrumentation improvements and the efforts of pioneering groups working to improve sample handling and data processing. The potential of this technology has been established through its successful use in a number of important biological studies. However, many challenges remain to be addressed like improving protein separation capabilities such that it might become possible to expand the dynamic range of whole proteome analysis, address co-elution and convoluted mass spectral data, and aid final data processing from peak identification to quantification. In this study, we investigated the use of a wide-pore silica-based superficially porous media with a high coverage phenyl bonding, commercially packed into customized capillary columns for the purpose of top-down proteomics. Protein samples of increasing complexity were tested, namely subunit digests of a monoclonal antibody, components of purified histones and proteins extracted from eukaryotic ribosomes. High quality mass spectra were obtained from only 100 ng of protein sample while using difluoroacetic acid as an ion pairing agent to improve peak shape and chromatographic resolution. A peak width at half height of about 15 s for a 45 min gradient time was observed on a complex mixture giving an estimated peak capacity close to 100. Most importantly, efficient separations were obtained for highly diverse proteins and there was no need to make method specific adjustments, suggesting this is a highly versatile and easy-to-use setup for top-down proteomics.
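The peak capacity figure quoted in this abstract can be sanity-checked with a common textbook estimate for gradient separations, n = 1 + t_g / w, where w is the 4σ baseline peak width. The sketch below assumes Gaussian peaks and this particular formula; the abstract does not say which expression the authors used, so the function name and approach are illustrative only.

```python
import math


def peak_capacity(gradient_time_s, fwhm_s):
    """Estimate gradient peak capacity from the peak width at half height.

    Assumes Gaussian peaks, for which FWHM = 2*sqrt(2*ln 2)*sigma
    (about 2.355*sigma), and uses the 4*sigma baseline width.
    """
    sigma = fwhm_s / (2.0 * math.sqrt(2.0 * math.log(2.0)))
    base_width_s = 4.0 * sigma
    return 1.0 + gradient_time_s / base_width_s


# Values from the abstract: about 15 s width at half height, 45 min gradient.
estimate = peak_capacity(gradient_time_s=45 * 60, fwhm_s=15)
print(f"estimated peak capacity: {estimate:.0f}")
```

Under these assumptions the estimate comes out slightly above 100, which is consistent with the "close to 100" reported in the abstract.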
Affiliation(s)
- Laurent Lignieres
- Université Paris Cité, CNRS, Institut Jacques Monod, F-75013 Paris, France
- Véronique Legros
- Université Paris Cité, CNRS, Institut Jacques Monod, F-75013 Paris, France
- Manel Khelil
- Université Paris Cité, CNRS, Institut Jacques Monod, F-75013 Paris, France
- Nicolas Senecaut
- Université Paris Cité, CNRS, Institut Jacques Monod, F-75013 Paris, France
- Matthew A Lauber
- Waters Corporation, 34, Maple Street, Milford, MA 01757-3696, United States
- Guillaume Chevreux
- Université Paris Cité, CNRS, Institut Jacques Monod, F-75013 Paris, France
7
Seeing the complete picture: proteins in top-down mass spectrometry. Essays Biochem 2022; 67:283-300. [PMID: 36468679 DOI: 10.1042/ebc20220098] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Received: 10/15/2022] [Revised: 11/11/2022] [Accepted: 11/14/2022] [Indexed: 12/12/2022]
Abstract
Top-down protein mass spectrometry can provide unique insights into protein sequence and structure, including precise proteoform identification and study of protein–ligand and protein–protein interactions. In contrast with the commonly applied bottom-up approach, top-down approaches do not include digestion of the protein of interest into small peptides, but instead rely on the ionization and subsequent fragmentation of intact proteins. As such, it is fundamentally the only way to fully characterize the composition of a proteoform. Here, we provide an overview of how a top-down protein mass spectrometry experiment is performed and point out recent applications from the literature to the reader. While some parts of the top-down workflow are broadly applicable, different research questions are best addressed with specific experimental designs. The most important divide is between studies that prioritize sequence information (i.e., proteoform identification) versus structural information (e.g., conformational studies, or mapping protein–protein or protein–ligand interactions). Another important consideration is whether to work under native or denaturing solution conditions, and the overall complexity of the sample also needs to be taken into account, as it determines whether (chromatographic) separation is required prior to MS analysis. In this review, we aim to provide enough information to support both newcomers and more experienced readers in the decision process of how to answer a potential research question most efficiently and to provide an overview of the methods that exist to answer these questions.
8
Schaffer LV, Shortreed MR, Smith LM. Proteoform Analysis and Construction of Proteoform Families in Proteoform Suite. Methods Mol Biol 2022; 2500:67-81. [PMID: 35657588 PMCID: PMC9694099 DOI: 10.1007/978-1-0716-2325-1_7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/03/2023]
Abstract
Proteoform Suite is an interactive software program for the identification and quantification of intact proteoforms from mass spectrometry data. Proteoform Suite identifies proteoforms observed by intact-mass (MS1) analysis. In intact-mass analysis, unfragmented experimental proteoforms are compared to a database of known proteoform sequences and to one another, searching for mass differences corresponding to well-known post-translational modifications or amino acids. Intact-mass analysis enables proteoforms observed in the MS1 data without MS/MS (MS2) fragmentation to be identified. Proteoform Suite further facilitates the construction and visualization of proteoform families, which are the sets of proteoforms derived from individual genes. Bottom-up peptide identifications and top-down (MS2) proteoform identifications can be integrated into the Proteoform Suite analysis to increase the sensitivity and accuracy of the analysis. Proteoform Suite is open source and freely available at https://github.com/smith-chem-wisc/proteoform-suite .
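The intact-mass comparison described in this abstract can be pictured with a short Python sketch: an experimental mass is compared to a theoretical mass, and the difference is looked up in a table of well-known modification mass shifts. The monoisotopic shifts are standard values, but the tolerance, function name and example masses are assumptions made for illustration; this is not Proteoform Suite's actual code or API.

```python
# Standard monoisotopic mass shifts (Da) for a few common modifications.
PTM_SHIFTS = {
    "acetylation": 42.010565,
    "phosphorylation": 79.966331,
    "methylation": 14.015650,
    "oxidation": 15.994915,
}


def annotate_mass_difference(experimental_mass, theoretical_mass, tol_da=0.02):
    """Explain an experimental intact mass relative to a theoretical one.

    Returns "unmodified", the name of a matching modification, or None
    when the difference matches nothing in PTM_SHIFTS within tol_da
    (an assumed tolerance for this example).
    """
    delta = experimental_mass - theoretical_mass
    if abs(delta) <= tol_da:
        return "unmodified"
    for name, shift in PTM_SHIFTS.items():
        if abs(delta - shift) <= tol_da:
            return name
    return None


# Made-up masses: one theoretical proteoform and three observed masses.
theoretical = 11348.20
for observed in (11348.21, 11390.2106, 11400.00):
    print(observed, annotate_mass_difference(observed, theoretical))
```

Applying the same lookup between pairs of experimental masses, rather than against a database entry, would link related proteoforms to one another, which is the basis of grouping them into families.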
Affiliation(s)
- Leah V Schaffer
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, USA
- Lloyd M Smith
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, USA
9
Proteoforms and Proteoform Families: Past, Present, and Future. Methods Mol Biol 2022; 2500:1-4. [PMID: 35657582 PMCID: PMC9676067 DOI: 10.1007/978-1-0716-2325-1_1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/03/2023]
Abstract
The Human Proteoform Project is an ambitious international effort to accelerate the development of technologies for proteoform analysis and to establish comprehensive atlases of proteoforms for humans and model organisms. Proteoforms are the ultimate molecular effectors of function in biology and are thus central to understanding that function. Proteoform analysis as it is practiced today is almost exclusively accomplished by mass spectrometry (MS) and is rapidly advancing in its capabilities. This volume presents a beautiful snapshot of emerging technologies at the exciting frontier of MS-based proteoform analysis.
10
Tiambeng TN, Wu Z, Melby JA, Ge Y. Size Exclusion Chromatography Strategies and MASH Explorer for Large Proteoform Characterization. Methods Mol Biol 2022; 2500:15-30. [PMID: 35657584 PMCID: PMC9703982 DOI: 10.1007/978-1-0716-2325-1_3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Indexed: 01/03/2023]
Abstract
Top-down mass spectrometry (MS)-based analysis of larger proteoforms (>50 kDa) is typically challenging due to an exponential decay in the signal-to-noise ratio with increasing protein molecular weight (MW) and coelution with low-MW proteoforms. Size exclusion chromatography (SEC) fractionates proteins based on their size, separating larger proteoforms from those of smaller size in the proteome. In this protocol, we initially describe the use of SEC to fractionate high-MW proteoforms from low-MW proteoforms. Subsequently, the SEC fractions containing the proteoforms of interest are subjected to reverse-phase liquid chromatography (RPLC) coupled online with high-resolution MS. Finally, proteoforms are characterized using MASH Explorer, a user-friendly software environment for in-depth proteoform characterization.
Affiliation(s)
- Timothy N. Tiambeng
- Department of Chemistry, University of Wisconsin – Madison, Madison, WI 53706
- Zhijie Wu
- Department of Chemistry, University of Wisconsin – Madison, Madison, WI 53706
- Jake A. Melby
- Department of Chemistry, University of Wisconsin – Madison, Madison, WI 53706
- Ying Ge
- Department of Chemistry, University of Wisconsin – Madison, Madison, WI 53706
- Department of Cell and Regenerative Biology, University of Wisconsin – Madison, Madison, WI 53705
- Human Proteomic Program, University of Wisconsin – Madison, Madison, WI 53705
- To whom correspondence may be addressed: Dr. Ying Ge, 8551 WIMR-II, 1111 Highland Ave., Madison, Wisconsin 53705, USA; Tel: 608-265-4744
11
Wilson JW, Zhou M. Discovery of Unknown Posttranslational Modifications by Top-Down Mass Spectrometry. Methods Mol Biol 2022; 2500:181-199. [PMID: 35657594 DOI: 10.1007/978-1-0716-2325-1_13] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Indexed: 06/15/2023]
Abstract
Protein encoding genes can undergo modifications posttranscriptionally and posttranslationally, yielding many different "proteoforms." These chemically diverse modifications are known to be important biomarkers of function within biological systems but are not completely understood. Top-down mass spectrometry is a valuable tool for the characterization of proteoforms, especially for histones that have complex combinations of posttranslational modifications (PTMs). In this chapter, we present a top-down liquid chromatography-mass spectrometry experimental and data analysis workflow for the identification of novel, unexpected modifications on histones. Proteoforms of interest are first discovered using the "open" modification search in TopPIC. Then target proteoforms are manually confirmed using the data visualization tool LcMsSpectator, part of the Informed-Proteomics package. The workflow can be very helpful in targeted PTM analysis and can be expanded to other types of proteins for discovery of unknown PTMs.
Affiliation(s)
- Jesse W Wilson
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, WA, USA
- Mowei Zhou
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, WA, USA
12
Kaulich PT, Winkels K, Kaulich TB, Treitz C, Cassidy L, Tholey A. MSTopDiff: A Tool for the Visualization of Mass Shifts in Deconvoluted Top-Down Proteomics Data for the Database-Independent Detection of Protein Modifications. J Proteome Res 2021; 21:20-29. [PMID: 34818005 DOI: 10.1021/acs.jproteome.1c00766] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Indexed: 11/29/2022]
Abstract
Top-down proteomics analyzes intact proteoforms with all of their post-translational modifications and genetic and RNA splice variants. In addition, modifications introduced either deliberately or inadvertently during sample preparation, that is, via oxidation, alkylation, or labeling reagents, or through the formation of noncovalent adducts (e.g., detergents) further increase the sample complexity. To facilitate the recognition of protein modifications introduced during top-down analysis, we developed MSTopDiff, a software tool with a graphical user interface written in Python, which allows one to detect protein modifications by calculating and visualizing mass differences in top-down data without the prerequisite of a database search. We demonstrate the successful application of MSTopDiff for the detection of artifacts originating from oxidation, formylation, overlabeling during isobaric labeling, and adduct formation with cations or sodium dodecyl sulfate. MSTopDiff offers several modes of data representation using deconvoluted MS1 or MS2 spectra. In addition to artificial modifications, the tool enables the visualization of biological modifications such as phosphorylation and acetylation. MSTopDiff provides an overview of the artificial and biological modifications in top-down proteomics samples, which makes it a valuable tool in quality control of standard workflows and for parameter evaluation during method development.
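The database-independent principle in this abstract, calculating mass differences within deconvoluted data and looking for recurring shifts, can be sketched in a few lines of Python. The function name, bin width, delta cut-off and example masses below are assumptions for illustration and do not reproduce MSTopDiff itself.

```python
from collections import Counter
from itertools import combinations


def delta_mass_histogram(masses, max_delta_da=100.0, bin_width_da=0.01):
    """Count pairwise mass differences, binned to bin_width_da.

    Keys are integer bin indices (delta / bin_width_da, rounded); multiply
    a key by bin_width_da to recover the approximate mass shift in Da.
    """
    counts = Counter()
    for low, high in combinations(sorted(masses), 2):
        delta = high - low
        if delta <= max_delta_da:
            counts[round(delta / bin_width_da)] += 1
    return counts


# Made-up deconvoluted masses: two species, each with a +15.99 Da
# satellite (an oxidation-like shift), one of them shifted twice.
masses = [10000.00, 10015.99, 10031.98, 20000.00, 20015.99]
hist = delta_mass_histogram(masses)
top_bin, top_count = hist.most_common(1)[0]
print(f"most frequent shift: {top_bin * 0.01:.2f} Da seen {top_count} times")
```

A shift that recurs across unrelated masses stands out in such a histogram without any sequence database, which is what makes this kind of view useful for spotting sample-preparation artifacts.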
Affiliation(s)
- Philipp T Kaulich
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, 24105 Kiel, Germany
- Konrad Winkels
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, 24105 Kiel, Germany
- Tobias B Kaulich
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, 24105 Kiel, Germany
- Christian Treitz
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, 24105 Kiel, Germany
- Liam Cassidy
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, 24105 Kiel, Germany
- Andreas Tholey
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, 24105 Kiel, Germany
13
Carbonara K, Andonovski M, Coorssen JR. Proteomes Are of Proteoforms: Embracing the Complexity. Proteomes 2021; 9:38. [PMID: 34564541 PMCID: PMC8482110 DOI: 10.3390/proteomes9030038] [Citation(s) in RCA: 47] [Impact Index Per Article: 15.7] [Received: 08/01/2021] [Revised: 08/24/2021] [Accepted: 08/29/2021] [Indexed: 12/17/2022] Open
Abstract
Proteomes are complex, much more so than genomes or transcriptomes. Thus, simplifying their analysis does not simplify the issue. Proteomes are of proteoforms, not canonical proteins. While having a catalogue of amino acid sequences provides invaluable information, this is the Proteome-lite. To dissect biological mechanisms and identify critical biomarkers/drug targets, we must assess the myriad of proteoforms that arise at any point before, after, and between translation and transcription (e.g., isoforms, splice variants, and post-translational modifications [PTM]), as well as newly defined species. There are numerous analytical methods currently used to address proteome depth and here we critically evaluate these in terms of the current 'state-of-the-field'. We thus discuss both pros and cons of available approaches and where improvements or refinements are needed to quantitatively characterize proteomes. To enable a next-generation approach, we suggest that advances lie in transdisciplinarity via integration of current proteomic methods to yield a unified discipline that capitalizes on the strongest qualities of each. Such a necessary (if not revolutionary) shift cannot be accomplished by a continued primary focus on proteo-genomics/-transcriptomics. We must embrace the complexity. Yes, these are the hard questions, and this will not be easy… but where is the fun in easy?
Affiliation(s)
- Jens R. Coorssen
- Faculties of Applied Health Sciences and Mathematics & Science, Departments of Health Sciences and Biological Sciences, Brock University, 1812 Sir Isaac Brock Way, St. Catharines, ON L2S 3A1, Canada; (K.C.); (M.A.)
14
Winkels K, Koudelka T, Tholey A. Quantitative Top-Down Proteomics by Isobaric Labeling with Thiol-Directed Tandem Mass Tags. J Proteome Res 2021; 20:4495-4506. [PMID: 34338531 DOI: 10.1021/acs.jproteome.1c00460] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Indexed: 11/28/2022]
Abstract
While identification-centric (qualitative) top-down proteomics (TDP) has seen rapid progress in the recent past, the quantification of intact proteoforms within complex proteomes is still challenging. By far the most widely applied approach is label-free quantification, which, however, provides limited multiplexing capacity and encounters a number of problems when used in combination with multidimensional separation. Isobaric labeling, which is a standard quantification approach in bottom-up proteomics, circumvents these limitations. Here, we introduce the application of thiol-directed isobaric labeling for quantitative TDP. For this purpose, we analyzed the labeling efficiency and optimized tandem mass spectrometry parameters for optimal backbone fragmentation for identification and reporter ion formation for quantification. Two different separation schemes, gel-eluted liquid fraction entrapment electrophoresis × liquid chromatography-mass spectrometry (LC-MS) and high/low-pH LC-MS, were employed for the analyses of either Escherichia coli (E. coli) proteomes or combined E. coli/yeast samples (two-proteome interference model) to study potential ratio compression. While the thiol-directed labeling introduces a bias in the quantifiable proteoforms, being restricted to Cys-containing proteoforms, our approach showed excellent accuracy in quantification, which is similar to that achievable in bottom-up proteomics. For example, 876 proteoforms could be quantified with high accuracy in an E. coli lysate. The LC-MS data were deposited to the ProteomeXchange with the dataset identifier PXD026310.
Affiliation(s)
- Konrad Winkels
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, Kiel 24105, Germany
| | - Tomas Koudelka
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, Kiel 24105, Germany
| | - Andreas Tholey
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, Kiel 24105, Germany
|
15
|
Abstract
Proteoform identification is required to fully understand the biological diversity present in a sample. However, these identifications are often ambiguous because of the challenges in analyzing full length proteins by mass spectrometry. A five-level proteoform classification system was recently developed to delineate the ambiguity of proteoform identifications and to allow for comparisons across software platforms and acquisition methods. Widespread adoption of this system requires software tools to provide classification of the proteoform identifications. We describe here an implementation of the five-level classification system in the software program MetaMorpheus, which provides both bottom-up and top-down identifications. Additionally, we developed a stand-alone program called ProteoformClassifier that allows users to classify proteoform results from any search program, provided that the program writes output that includes the information necessary to evaluate proteoform ambiguity. This stand-alone program includes a small test file and database to evaluate if a given program provides sufficient information to evaluate ambiguity. If the program does not, then ProteoformClassifier provides meaningful feedback to assist developers with implementing the classification system. We tested currently available top-down software programs and found that none of them (other than MetaMorpheus) provided sufficient information regarding identification ambiguity to permit classification.
Affiliation(s)
- Zach Rolfs
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| | - Lloyd M Smith
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
|
16
|
Weisbrod CR, Anderson LC, Hendrickson CL, Schaffer LV, Shortreed MR, Smith LM, Shabanowitz J, Hunt DF. Advanced Strategies for Proton-Transfer Reactions Coupled with Parallel Ion Parking on a 21 T FT-ICR MS for Intact Protein Analysis. Anal Chem 2021; 93:9119-9128. [PMID: 34165955 DOI: 10.1021/acs.analchem.1c00847] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Indexed: 11/30/2022]
Abstract
Proton-transfer reactions (PTRs) have emerged as a powerful tool for the study of intact proteins. When coupled with m/z-selective kinetic excitation, such as parallel ion parking (PIP), one can exert exquisite control over rates of reaction with a high degree of specificity. This allows one to "concentrate", in the gas phase, nearly all the signals from an intact protein charge state envelope into a single charge state, improving the signal-to-noise ratio (S/N) by 10× or more. While this approach has been previously reported, here we show that implementing these technologies on a 21 T FT-ICR MS provides a tremendous advantage for intact protein analysis. Advanced strategies for performing PTR with PIP were developed to complement this unique instrument, including subjecting all analyte ions entering the mass spectrometer to PTR and PIP. This experiment, which we call "PTR-MS1-PIP", generates a pseudo-MS1 spectrum derived from ions that are exposed to the PTR reagent and PIP waveforms but have not undergone any prior true mass filtering or ion isolation. The result is an extremely rapid and significant improvement in the spectral S/N of intact proteins. This permits the observation of many more proteoforms and reduces ion injection periods for subsequent tandem mass spectrometry characterization. Additionally, the product ion parking waveform has been optimized to enhance the PTR rate without compromise to the parking efficiency. We demonstrate that this process, called "rapid park", can improve reaction rates by 5-10× and explore critical factors discovered to influence this process. Finally, we demonstrate how coupling PTR-MS1 and rapid park provides a 10-fold reduction in ion injection time, improving the rate of tandem MS sequencing.
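As a rough illustration of the "charge state envelope" mentioned in this abstract, the sketch below computes the m/z positions at which one intact protein would appear across a range of positive charge states, using the standard relation m/z = (M + z·m_p)/z. The 20 kDa mass and the charge range are hypothetical values chosen only for illustration, and the code does not model PTR or parallel ion parking itself.

```python
# Minimal sketch: where the charge states of one intact protein fall in m/z.
# The neutral mass and charge range used below are hypothetical, for illustration only.

PROTON_MASS = 1.007276  # Da, proton mass (rounded)


def charge_state_mz(neutral_mass: float, charge: int) -> float:
    """Return m/z of an [M + zH]z+ ion for a given neutral mass and charge."""
    if charge <= 0:
        raise ValueError("charge must be a positive integer")
    return (neutral_mass + charge * PROTON_MASS) / charge


def envelope(neutral_mass: float, charges) -> dict:
    """Map each charge state to its m/z for one protein."""
    return {z: charge_state_mz(neutral_mass, z) for z in charges}


if __name__ == "__main__":
    # Hypothetical 20 kDa protein observed in charge states 10+ to 20+.
    for z, mz in envelope(20000.0, range(10, 21)).items():
        print(f"{z:>2}+  m/z {mz:.3f}")
```

The same ion population is spread over every line printed here, which is why collecting it into a single charge state can raise the signal-to-noise ratio of that one peak.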
Affiliation(s)
- Chad R Weisbrod
- Ion Cyclotron Resonance Program, National High Magnetic Field Laboratory, 1800 E. Paul Dirac Dr., Tallahassee, Florida 32310, United States
| | - Lissa C Anderson
- Ion Cyclotron Resonance Program, National High Magnetic Field Laboratory, 1800 E. Paul Dirac Dr., Tallahassee, Florida 32310, United States
| | - Christopher L Hendrickson
- Ion Cyclotron Resonance Program, National High Magnetic Field Laboratory, 1800 E. Paul Dirac Dr., Tallahassee, Florida 32310, United States
| | - Leah V Schaffer
- Department of Chemistry, University of Wisconsin-Madison, 1101 University Avenue, Madison, Wisconsin 53706, United States
| | - Michael R Shortreed
- Department of Chemistry, University of Wisconsin-Madison, 1101 University Avenue, Madison, Wisconsin 53706, United States
| | - Lloyd M Smith
- Department of Chemistry, University of Wisconsin-Madison, 1101 University Avenue, Madison, Wisconsin 53706, United States
| | - Jeffrey Shabanowitz
- Department of Chemistry, University of Virginia, Charlottesville, Virginia 22904, United States
| | - Donald F Hunt
- Department of Chemistry, University of Virginia, Charlottesville, Virginia 22904, United States
|
17
|
Melby JA, Roberts DS, Larson EJ, Brown KA, Bayne EF, Jin S, Ge Y. Novel Strategies to Address the Challenges in Top-Down Proteomics. J Am Soc Mass Spectrom 2021; 32:1278-1294. [PMID: 33983025 PMCID: PMC8310706 DOI: 10.1021/jasms.1c00099] [Citation(s) in RCA: 93] [Impact Index Per Article: 31.0] [Indexed: 05/03/2023]
Abstract
Top-down mass spectrometry (MS)-based proteomics is a powerful technology for comprehensively characterizing proteoforms to decipher post-translational modifications (PTMs) together with genetic variations and alternative splicing isoforms toward a proteome-wide understanding of protein functions. In the past decade, top-down proteomics has experienced rapid growth benefiting from groundbreaking technological advances, which have begun to reveal the potential of top-down proteomics for understanding basic biological functions, unraveling disease mechanisms, and discovering new biomarkers. However, many challenges remain to be comprehensively addressed. In this Account & Perspective, we discuss the major challenges currently facing the top-down proteomics field, particularly in protein solubility, proteome dynamic range, proteome complexity, data analysis, proteoform-function relationship, and analytical throughput for precision medicine. We specifically review the major technology developments addressing these challenges with an emphasis on our research group's efforts, including the development of top-down MS-compatible surfactants for protein solubilization, functionalized nanoparticles for the enrichment of low-abundance proteoforms, strategies for multidimensional chromatography separation of proteins, and a new comprehensive user-friendly software package for top-down proteomics. We have also made efforts to connect proteoforms with biological functions and provide our visions on what the future holds for top-down proteomics.
Affiliation(s)
- Jake A Melby
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| | - David S Roberts
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| | - Eli J Larson
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| | - Kyle A Brown
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
- Department of Surgery, University of Wisconsin-Madison, Madison, Wisconsin 53705, United States
| | - Elizabeth F Bayne
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| | - Song Jin
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| | - Ying Ge
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
- Department of Cell and Regenerative Biology, University of Wisconsin-Madison, Madison, Wisconsin 53705, United States
- Human Proteomics Program, University of Wisconsin-Madison, Madison, Wisconsin 53705, United States
|
18
|
Schaffer LV, Anderson LC, Butcher DS, Shortreed MR, Miller RM, Pavelec C, Smith LM. Construction of Human Proteoform Families from 21 Tesla Fourier Transform Ion Cyclotron Resonance Mass Spectrometry Top-Down Proteomic Data. J Proteome Res 2020; 20:317-325. [PMID: 33074679 DOI: 10.1021/acs.jproteome.0c00403] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Indexed: 12/22/2022]
Abstract
Identification of proteoforms, the different forms of a protein, is important to understand biological processes. A proteoform family is the set of different proteoforms from the same gene. We previously developed the software program Proteoform Suite, which constructs proteoform families and identifies proteoforms by intact-mass analysis. Here, we have applied this approach to top-down proteomic data acquired at the National High Magnetic Field Laboratory 21 tesla Fourier transform ion cyclotron resonance mass spectrometer (data available on the MassIVE platform with identifier MSV000085978). We explored the ability to construct proteoform families and identify proteoforms from the high mass accuracy data that this instrument provides for a complex cell lysate sample from the MCF-7 human breast cancer cell line. There were 2830 observed experimental proteoforms, of which 932 were identified, 44 were ambiguous, and 1854 were unidentified. Of the 932 unique identified proteoforms, 766 were identified by top-down MS2 analysis at 1% false discovery rate (FDR) using TDPortal, and 166 were additional intact-mass identifications (∼4.7% calculated global FDR) made using Proteoform Suite. We recently published a proteoform level schema to represent ambiguity in proteoform identifications. We implemented this proteoform level classification in Proteoform Suite for intact-mass identifications, which enables users to determine the ambiguity levels and sources of ambiguity for each intact-mass proteoform identification.
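The counts reported in this abstract can be cross-checked with simple arithmetic. The short sketch below uses only the numbers quoted above; the function and key names are illustrative and are not taken from Proteoform Suite or TDPortal.

```python
# Minimal sketch: cross-check the proteoform counts quoted in the abstract.
# All numbers come from the abstract; names are illustrative only.

OBSERVED = 2830          # observed experimental proteoforms
IDENTIFIED = 932         # identified proteoforms
AMBIGUOUS = 44           # ambiguous proteoforms
UNIDENTIFIED = 1854      # unidentified proteoforms
TOP_DOWN_IDS = 766       # identified by top-down MS2 (1% FDR)
INTACT_MASS_IDS = 166    # additional intact-mass identifications


def check_tallies() -> dict:
    """Verify that the reported categories add up and derive simple fractions."""
    return {
        "categories_sum_to_observed": IDENTIFIED + AMBIGUOUS + UNIDENTIFIED == OBSERVED,
        "id_routes_sum_to_identified": TOP_DOWN_IDS + INTACT_MASS_IDS == IDENTIFIED,
        "fraction_identified": IDENTIFIED / OBSERVED,
        "intact_mass_share_of_ids": INTACT_MASS_IDS / IDENTIFIED,
    }


if __name__ == "__main__":
    for key, value in check_tallies().items():
        print(key, value)
```

Both sums are consistent, and roughly a third of the observed proteoforms were identified, with intact-mass analysis contributing a little under a fifth of those identifications.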
Affiliation(s)
- Leah V Schaffer
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| | - Lissa C Anderson
- Ion Cyclotron Resonance Program, National High Magnetic Field Laboratory, Tallahassee, Florida 32310, United States
| | - David S Butcher
- Ion Cyclotron Resonance Program, National High Magnetic Field Laboratory, Tallahassee, Florida 32310, United States
| | - Michael R Shortreed
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| | - Rachel M Miller
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| | - Caitlin Pavelec
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| | - Lloyd M Smith
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
|
19
|
Brown KA, Melby JA, Roberts DS, Ge Y. Top-down proteomics: challenges, innovations, and applications in basic and clinical research. Expert Rev Proteomics 2020; 17:719-733. [PMID: 33232185 PMCID: PMC7864889 DOI: 10.1080/14789450.2020.1855982] [Citation(s) in RCA: 62] [Impact Index Per Article: 15.5] [Received: 09/04/2020] [Accepted: 11/23/2020] [Indexed: 12/14/2022]
Abstract
Introduction: A better understanding of the underlying molecular mechanisms of diseases is critical for developing more effective diagnostic tools and therapeutics toward precision medicine. However, many challenges remain to unravel the complex nature of diseases. Areas covered: Changes in protein isoform expression and post-translational modifications (PTMs) have gained recognition for their role in underlying disease mechanisms. Top-down mass spectrometry (MS)-based proteomics is increasingly recognized as an important method for the comprehensive characterization of proteoforms that arise from alternative splicing events and/or PTMs for basic and clinical research. Here, we review the challenges, technological innovations, and recent studies that utilize top-down proteomics to elucidate changes in the proteome with an emphasis on its use to study heart diseases. Expert opinion: Proteoform-resolved information can substantially contribute to the understanding of the molecular mechanisms underlying various diseases and to the identification of novel proteoform targets for better therapeutic development. Despite the challenges of sequencing intact proteins, top-down proteomics has yielded a wealth of information regarding protein isoform switching and changes in PTMs. Continuous developments in sample preparation, intact protein separation, and instrumentation for top-down MS have broadened its capabilities to characterize proteoforms from a range of samples on an increasingly global scale.
Affiliation(s)
- Kyle A. Brown
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin, United States
| | - Jake A. Melby
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin, United States
| | - David S. Roberts
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin, United States
| | - Ying Ge
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin, United States
- Department of Cell and Regenerative Biology, University of Wisconsin-Madison, Madison, Wisconsin, United States
- Human Proteomics Program, University of Wisconsin-Madison, Madison, Wisconsin, United States
|
20
|
Zhou M, Uwugiaren N, Williams SM, Moore RJ, Zhao R, Goodlett D, Dapic I, Paša-Tolić L, Zhu Y. Sensitive Top-Down Proteomics Analysis of a Low Number of Mammalian Cells Using a Nanodroplet Sample Processing Platform. Anal Chem 2020; 92:7087-7095. [PMID: 32374172 DOI: 10.1021/acs.analchem.0c00467] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Indexed: 12/14/2022]
Abstract
Top-down proteomics is a powerful tool for characterizing genetic variations and post-translational modifications at the intact protein level. However, one significant technical gap of top-down proteomics is the inability to analyze small amounts of biological sample, which limits its access to isolated rare cells, fine needle aspiration biopsies, and tissue substructures. Herein, we developed an ultrasensitive top-down platform by incorporating a microfluidic sample preparation system, termed nanoPOTS (nanodroplet processing in one pot for trace samples), into a top-down proteomic workflow. A unique combination of a nonionic detergent dodecyl-β-d-maltopyranoside (DDM) with urea as protein extraction buffer significantly improved both protein extraction efficiency and sample recovery. We hypothesize that the DDM detergent improves protein recovery by efficiently reducing nonspecific adsorption of intact proteins on container surfaces, while urea serves as a strong denaturant to disrupt noncovalent complexes and release intact proteins for downstream analysis. The nanoPOTS-based top-down platform reproducibly and quantitatively identified ∼170 to ∼620 proteoforms from ∼70 to ∼770 HeLa cells containing ∼10 to ∼115 ng of total protein. A variety of post-translational modifications including acetylation, myristoylation, and iron binding were identified using fewer than 800 cells. We anticipate the nanoPOTS top-down proteomics platform will be broadly applicable in biomedical research, particularly where clinical specimens are not available in amounts amenable to standard workflows.
Affiliation(s)
- Mowei Zhou
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - Naomi Uwugiaren
- International Centre for Cancer Vaccine Science, University of Gdansk, Gdansk, Poland
| | - Sarah M Williams
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - Ronald J Moore
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - Rui Zhao
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - David Goodlett
- International Centre for Cancer Vaccine Science, University of Gdansk, Gdansk, Poland
- Department of Microbial Pathogenesis, School of Dentistry, University of Maryland, Baltimore, Maryland 21201, United States
| | - Irena Dapic
- International Centre for Cancer Vaccine Science, University of Gdansk, Gdansk, Poland
| | - Ljiljana Paša-Tolić
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - Ying Zhu
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington 99354, United States
|
21
|
Jeong K, Kim J, Gaikwad M, Hidayah SN, Heikaus L, Schlüter H, Kohlbacher O. FLASHDeconv: Ultrafast, High-Quality Feature Deconvolution for Top-Down Proteomics. Cell Syst 2020; 10:213-218.e6. [DOI: 10.1016/j.cels.2020.01.003] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Received: 09/11/2019] [Revised: 12/19/2019] [Accepted: 01/27/2020] [Indexed: 02/06/2023]
|
22
|
Abstract
Top-down mass spectrometry (MS) analyzes intact proteins at the proteoform level, which allows researchers to better understand the functions of protein modifications. Recently, top-down proteomics has increased in popularity due to advancements in high-resolution mass spectrometers, increased efficiency in liquid chromatography (LC) separation, and advances in data analysis software. Some unique proteoforms, which have been distinguished using top-down MS, have even been shown to exhibit marked variation in biological function compared to similar proteoforms. However, the qualitative identification of a particular proteoform may not be enough to determine the biological relevance of that proteoform. Quantitative top-down MS methods have been notably applied to the study of the differing biological functions of proteoforms and have allowed researchers to explore proteomes at the proteoform, rather than the peptide, level. Here, we review the top-down MS methods that have been used to quantify intact proteins, discuss current applications of quantitative top-down MS analysis, and present new areas where quantitative top-down MS analysis may be implemented.
Affiliation(s)
- Kellye A Cupp-Sutton
- Department of Chemistry and Biochemistry, University of Oklahoma, 101 Stephenson Parkway, Room 2210, Norman, OK 73019-5251, USA.
|
23
|
Shen X, Yang Z, McCool EN, Lubeckyj RA, Chen D, Sun L. Capillary zone electrophoresis-mass spectrometry for top-down proteomics. Trends Analyt Chem 2019; 120:115644. [PMID: 31537953 PMCID: PMC6752746 DOI: 10.1016/j.trac.2019.115644] [Citation(s) in RCA: 48] [Impact Index Per Article: 9.6] [Indexed: 02/06/2023]
Abstract
Mass spectrometry (MS)-based top-down proteomics characterizes complex proteomes at the intact proteoform level and provides an accurate picture of protein isoforms and protein post-translational modifications in the cell. The progress of top-down proteomics requires novel analytical tools with high peak capacity for proteoform separation and high sensitivity for proteoform detection. The requirements have made capillary zone electrophoresis (CZE)-MS an attractive approach for advancing large-scale top-down proteomics. CZE has achieved a peak capacity of 300 for separation of complex proteoform mixtures. CZE-MS has shown drastically better sensitivity than commonly used reversed-phase liquid chromatography (RPLC)-MS for proteoform detection. The advanced CZE-MS identified 6,000 proteoforms of nearly 1,000 proteoform families from a complex proteome sample, which represents one of the largest top-down proteomic datasets so far. In this review, we focus on the recent progress in CZE-MS-based top-down proteomics and provide our perspectives about its future directions.
Affiliation(s)
- Xiaojing Shen
- Department of Chemistry, Michigan State University, 578 S Shaw Lane, East Lansing, Michigan 48824, United States
| | - Zhichang Yang
- Department of Chemistry, Michigan State University, 578 S Shaw Lane, East Lansing, Michigan 48824, United States
| | - Elijah N. McCool
- Department of Chemistry, Michigan State University, 578 S Shaw Lane, East Lansing, Michigan 48824, United States
| | - Rachele A. Lubeckyj
- Department of Chemistry, Michigan State University, 578 S Shaw Lane, East Lansing, Michigan 48824, United States
| | - Daoyang Chen
- Department of Chemistry, Michigan State University, 578 S Shaw Lane, East Lansing, Michigan 48824, United States
| | - Liangliang Sun
- Department of Chemistry, Michigan State University, 578 S Shaw Lane, East Lansing, Michigan 48824, United States
|
24
|
Dai Y, Buxton KE, Schaffer LV, Miller RM, Millikin RJ, Scalf M, Frey BL, Shortreed MR, Smith LM. Constructing Human Proteoform Families Using Intact-Mass and Top-Down Proteomics with a Multi-Protease Global Post-Translational Modification Discovery Database. J Proteome Res 2019; 18:3671-3680. [PMID: 31479276 DOI: 10.1021/acs.jproteome.9b00339] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Indexed: 01/22/2023]
Abstract
Complex human biomolecular processes are made possible by the diversity of human proteoforms. Constructing proteoform families, groups of proteoforms derived from the same gene, is one way to represent this diversity. Comprehensive, high-confidence identification of human proteoforms remains a central challenge in mass spectrometry-based proteomics. We have previously reported a strategy for proteoform identification using intact-mass measurements, and we have since improved that strategy by mass calibration based on search results, the use of a global post-translational modification discovery database, and the integration of top-down proteomics results with intact-mass analysis. In the present study, we combine these strategies for enhanced proteoform identification in total cell lysate from the Jurkat human T lymphocyte cell line. We collected, processed, and integrated three types of proteomics data (NeuCode-labeled intact-mass, label-free top-down, and multi-protease bottom-up) to maximize the number of confident proteoform identifications. The integrated analysis revealed 5950 unique experimentally observed proteoforms, which were assembled into 848 proteoform families. Twenty percent of the observed proteoforms were confidently identified at a 3.9% false discovery rate, representing 1207 unique proteoforms derived from 484 genes.
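To make the idea of identification by intact-mass measurement more concrete, the sketch below matches one observed mass against candidate theoretical masses (an unmodified sequence mass plus a common PTM mass shift) within a ppm tolerance. This is only a simplified illustration of the general principle, not the algorithm used by the authors' software; the protein names, base masses, observed mass, and the 10 ppm tolerance are all hypothetical.

```python
# Minimal sketch of intact-mass candidate matching under a ppm tolerance.
# Protein names, base masses and the tolerance are hypothetical illustration
# values; this is NOT the published Proteoform Suite implementation.

# Monoisotopic mass shifts (Da) for a few common modifications.
PTM_SHIFTS = {
    "unmodified": 0.0,
    "acetylation": 42.010565,
    "phosphorylation": 79.966331,
}


def ppm_error(observed: float, theoretical: float) -> float:
    """Signed mass error in parts per million."""
    return (observed - theoretical) / theoretical * 1e6


def match_candidates(observed_mass: float, base_masses: dict, tolerance_ppm: float = 10.0) -> list:
    """Return (protein, ptm, ppm_error) tuples within tolerance, best match first."""
    hits = []
    for protein, base in base_masses.items():
        for ptm, shift in PTM_SHIFTS.items():
            error = ppm_error(observed_mass, base + shift)
            if abs(error) <= tolerance_ppm:
                hits.append((protein, ptm, error))
    return sorted(hits, key=lambda hit: abs(hit[2]))


if __name__ == "__main__":
    # Hypothetical database of unmodified monoisotopic masses (Da).
    database = {"protein_A": 11000.0, "protein_B": 11042.5}
    for protein, ptm, error in match_candidates(11042.0106, database):
        print(f"{protein} + {ptm}: {error:+.3f} ppm")
```

With these made-up values only the acetylated form of `protein_A` falls inside the tolerance, while the unmodified `protein_B`, about 44 ppm away, is rejected. In practice a single mass can match several candidates, which is one source of the identification ambiguity discussed elsewhere in this list.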
Affiliation(s)
- Yunxiang Dai
- Department of Chemistry, University of Wisconsin, 1101 University Avenue, Madison, Wisconsin 53706, United States
- Biophysics Graduate Program, University of Wisconsin, 413 Bock Laboratories, 1525 Linden Drive, Madison, Wisconsin 53706, United States
| | - Katherine E Buxton
- Department of Chemistry, University of Wisconsin, 1101 University Avenue, Madison, Wisconsin 53706, United States
| | - Leah V Schaffer
- Department of Chemistry, University of Wisconsin, 1101 University Avenue, Madison, Wisconsin 53706, United States
| | - Rachel M Miller
- Department of Chemistry, University of Wisconsin, 1101 University Avenue, Madison, Wisconsin 53706, United States
| | - Robert J Millikin
- Department of Chemistry, University of Wisconsin, 1101 University Avenue, Madison, Wisconsin 53706, United States
| | - Mark Scalf
- Department of Chemistry, University of Wisconsin, 1101 University Avenue, Madison, Wisconsin 53706, United States
| | - Brian L Frey
- Department of Chemistry, University of Wisconsin, 1101 University Avenue, Madison, Wisconsin 53706, United States
| | - Michael R Shortreed
- Department of Chemistry, University of Wisconsin, 1101 University Avenue, Madison, Wisconsin 53706, United States
| | - Lloyd M Smith
- Department of Chemistry, University of Wisconsin, 1101 University Avenue, Madison, Wisconsin 53706, United States
|
25
|
Schaffer LV, Tucholski T, Shortreed MR, Ge Y, Smith LM. Intact-Mass Analysis Facilitating the Identification of Large Human Heart Proteoforms. Anal Chem 2019; 91:10937-10942. [PMID: 31393705 DOI: 10.1021/acs.analchem.9b02343] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Indexed: 01/05/2023]
Abstract
Proteoforms, the primary effectors of biological processes, are the different forms of proteins that arise from molecular processing events such as alternative splicing and post-translational modifications. Heart diseases exhibit changes in proteoform levels, motivating the development of a deeper understanding of the heart proteoform landscape. Our recently developed two-dimensional top-down proteomics platform coupling serial size exclusion chromatography (sSEC) to reversed-phase chromatography (RPC) expanded coverage of the human heart proteome and allowed observation of high-molecular weight proteoforms. However, most of these observed proteoforms were not identified due to the difficulty in obtaining quality tandem mass spectrometry (MS2) fragmentation data for large proteoforms from complex biological mixtures on a chromatographic time scale. Herein, we sought to identify human heart proteoforms in this data set using an enhanced version of Proteoform Suite, which identifies proteoforms by intact mass alone. Specifically, we added a new feature to Proteoform Suite to determine candidate identifications for isotopically unresolved proteoforms larger than 50 kDa, enabling subsequent MS2 identification of important high-molecular weight human heart proteoforms such as lamin A (72 kDa) and trifunctional enzyme subunit α (79 kDa). With this new workflow for large proteoform identification, endogenous human cardiac myosin binding protein C (140 kDa) was identified for the first time. This study demonstrates the integration of our sSEC-RPC-MS proteomics platform with intact-mass analysis through Proteoform Suite to create a catalog of human heart proteoforms and facilitate the identification of large proteoforms in complex systems.
Affiliation(s)
- Leah V Schaffer
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| | - Trisha Tucholski
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| | - Michael R Shortreed
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| | - Ying Ge
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
- Department of Cell and Regenerative Biology, University of Wisconsin-Madison, Madison, Wisconsin 53705, United States
- Human Proteomics Program, University of Wisconsin-Madison, Madison, Wisconsin 53705, United States
| | - Lloyd M Smith
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
|
26
|
Schaffer LV, Millikin RJ, Miller RM, Anderson LC, Fellers RT, Ge Y, Kelleher NL, LeDuc RD, Liu X, Payne SH, Sun L, Thomas PM, Tucholski T, Wang Z, Wu S, Wu Z, Yu D, Shortreed MR, Smith LM. Identification and Quantification of Proteoforms by Mass Spectrometry. Proteomics 2019; 19:e1800361. [PMID: 31050378 PMCID: PMC6602557 DOI: 10.1002/pmic.201800361] [Citation(s) in RCA: 128] [Impact Index Per Article: 25.6] [Received: 01/28/2019] [Revised: 04/07/2019] [Indexed: 12/29/2022]
Abstract
A proteoform is a defined form of a protein derived from a given gene with a specific amino acid sequence and localized post-translational modifications. In top-down proteomic analyses, proteoforms are identified and quantified through mass spectrometric analysis of intact proteins. Recent technological developments have enabled comprehensive proteoform analyses in complex samples, and an increasing number of laboratories are adopting top-down proteomic workflows. In this review, some recent advances are outlined and current challenges and future directions for the field are discussed.
Affiliation(s)
- Leah V Schaffer
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Robert J Millikin
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Rachel M Miller
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Lissa C Anderson
- Ion Cyclotron Resonance Program, National High Magnetic Field Laboratory, Tallahassee, FL, 32310, USA
| | - Ryan T Fellers
- Proteomics Center of Excellence, Northwestern University, Evanston, IL, 60208, USA
| | - Ying Ge
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, 53706, USA
- Department of Cell and Regenerative Biology and Human Proteomics Program, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Neil L Kelleher
- Proteomics Center of Excellence, Northwestern University, Evanston, IL, 60208, USA
- Department of Chemistry and Molecular Biosciences and the Division of Hematology and Oncology, Northwestern University, Evanston, IL, 60208, USA
| | - Richard D LeDuc
- Proteomics Center of Excellence, Northwestern University, Evanston, IL, 60208, USA
| | - Xiaowen Liu
- Department of BioHealth Informatics, Indiana University-Purdue University, Indianapolis, IN, 46202, USA
- Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, Indianapolis, IN, 46202, USA
| | - Samuel H Payne
- Department of Biology, Brigham Young University, Provo, UT, 84602, USA
| | - Liangliang Sun
- Department of Chemistry, Michigan State University, East Lansing, MI, 48824, USA
| | - Paul M Thomas
- Proteomics Center of Excellence, Northwestern University, Evanston, IL, 60208, USA
| | - Trisha Tucholski
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Zhe Wang
- Department of Chemistry and Biochemistry, University of Oklahoma, Norman, OK, 73019, USA
| | - Si Wu
- Department of Chemistry and Biochemistry, University of Oklahoma, Norman, OK, 73019, USA
| | - Zhijie Wu
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Dahang Yu
- Department of Chemistry and Biochemistry, University of Oklahoma, Norman, OK, 73019, USA
| | - Michael R Shortreed
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Lloyd M Smith
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, 53706, USA
27
Liu Z, Wang R, Liu J, Sun R, Wang F. Global Quantification of Intact Proteins via Chemical Isotope Labeling and Mass Spectrometry. J Proteome Res 2019; 18:2185-2194. [PMID: 30990045 DOI: 10.1021/acs.jproteome.9b00071] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Indexed: 12/14/2022]
Abstract
Although thousands of intact proteins have been identified in recent years, global quantification of intact proteins remains challenging. Herein, we develop a high-throughput strategy for global intact protein quantification based on chemical isotope labeling. The isotope incorporation efficiency is as high as 99.2% for complex intact protein samples extracted from HeLa cells. Further, the pTop 2.0 software is developed for automated, high-throughput quantification of intact proteoforms. The high quantification accuracy and reproducibility of this strategy are demonstrated for both standard and complex cellular protein samples. A total of 2283 intact proteoforms originating from 660 protein accessions are successfully quantified under anaerobic and aerobic conditions, and the differentially expressed proteins are observed to be involved in important biological processes such as stress response.
Affiliation(s)
- Zheyi Liu
  - CAS Key Laboratory of Separation Sciences for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian, 116023, China
- Ruimin Wang
  - Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100190, China
- Jing Liu
  - College of Pharmacy, Dalian Medical University, Dalian, 116044, China
- Ruixiang Sun
  - Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100190, China
- Fangjun Wang
  - CAS Key Laboratory of Separation Sciences for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian, 116023, China
28
Schaffer LV, Rensvold JW, Shortreed MR, Cesnik AJ, Jochem A, Scalf M, Frey BL, Pagliarini DJ, Smith LM. Identification and Quantification of Murine Mitochondrial Proteoforms Using an Integrated Top-Down and Intact-Mass Strategy. J Proteome Res 2018; 17:3526-3536. [PMID: 30180576 PMCID: PMC6201694 DOI: 10.1021/acs.jproteome.8b00469] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Indexed: 12/16/2022]
Abstract
The development of effective strategies for the comprehensive identification and quantification of proteoforms in complex systems is a critical challenge in proteomics. Proteoforms, the specific molecular forms in which proteins are present in biological systems, are the key effectors of biological function. Thus, knowledge of proteoform identities and abundances is essential to unraveling the mechanisms that underlie protein function. We recently reported a strategy that integrates conventional top-down mass spectrometry with intact-mass determinations for enhanced proteoform identifications and the elucidation of proteoform families and applied it to the analysis of yeast cell lysate. In the present work, we extend this strategy to enable quantification of proteoforms, and we examine changes in the abundance of murine mitochondrial proteoforms upon differentiation of mouse myoblasts to myotubes. The integrated top-down and intact-mass strategy provided an increase of ∼37% in the number of identified proteoforms compared to top-down alone, which is in agreement with our previous work in yeast; 1779 unique proteoforms were identified using the integrated strategy compared to 1301 using top-down analysis alone. Quantitative comparison of proteoform differences between the myoblast and myotube cell types showed 129 observed proteoforms exhibiting statistically significant abundance changes (fold change >2 and false discovery rate <5%).
Affiliation(s)
- Leah V. Schaffer
  - Department of Chemistry, University of Wisconsin-Madison, Madison, WI 53706, USA
- Michael R. Shortreed
  - Department of Chemistry, University of Wisconsin-Madison, Madison, WI 53706, USA
- Anthony J. Cesnik
  - Department of Chemistry, University of Wisconsin-Madison, Madison, WI 53706, USA
- Adam Jochem
  - Morgridge Institute for Research, Madison, WI 53715, USA
- Mark Scalf
  - Department of Chemistry, University of Wisconsin-Madison, Madison, WI 53706, USA
- Brian L. Frey
  - Department of Chemistry, University of Wisconsin-Madison, Madison, WI 53706, USA
- David J. Pagliarini
  - Morgridge Institute for Research, Madison, WI 53715, USA
  - Department of Biochemistry, University of Wisconsin-Madison, Madison, WI 53706, USA
- Lloyd M. Smith
  - Department of Chemistry, University of Wisconsin-Madison, Madison, WI 53706, USA
  - Genome Center of Wisconsin, University of Wisconsin-Madison, Madison, WI 53706, USA
29
Schaffer LV, Shortreed MR, Cesnik AJ, Frey BL, Solntsev SK, Scalf M, Smith LM. Expanding Proteoform Identifications in Top-Down Proteomic Analyses by Constructing Proteoform Families. Anal Chem 2017; 90:1325-1333. [PMID: 29227670 DOI: 10.1021/acs.analchem.7b04221] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Indexed: 12/15/2022]
Abstract
In top-down proteomics, intact proteins are analyzed by tandem mass spectrometry and proteoforms, which are defined forms of a protein with specific sequences of amino acids and localized post-translational modifications, are identified using precursor mass and fragmentation data. Many proteoforms that are detected in the precursor scan (MS1) are not selected for fragmentation by the instrument and therefore remain unidentified in typical top-down proteomic workflows. Our laboratory has developed the open source software program Proteoform Suite to analyze MS1-only intact proteoform data. Here, we have adapted it to provide identifications of proteoform masses in precursor MS1 spectra of top-down data, supplementing the top-down identifications obtained using the MS2 fragmentation data. Proteoform Suite performs mass calibration using high-scoring top-down identifications and identifies additional proteoforms using calibrated, accurate intact masses. Proteoform families, the set of proteoforms from a given gene, are constructed and visualized from proteoforms identified by both top-down and intact-mass analyses. Using this strategy, we constructed proteoform families and identified 1861 proteoforms in yeast lysate, yielding an approximately 40% increase over the original 1291 proteoform identifications observed using traditional top-down analysis alone.
Affiliation(s)
- Leah V Schaffer
  - Department of Chemistry, University of Wisconsin, 1101 University Avenue, Madison, Wisconsin 53706, United States
- Michael R Shortreed
  - Department of Chemistry, University of Wisconsin, 1101 University Avenue, Madison, Wisconsin 53706, United States
- Anthony J Cesnik
  - Department of Chemistry, University of Wisconsin, 1101 University Avenue, Madison, Wisconsin 53706, United States
- Brian L Frey
  - Department of Chemistry, University of Wisconsin, 1101 University Avenue, Madison, Wisconsin 53706, United States
- Stefan K Solntsev
  - Department of Chemistry, University of Wisconsin, 1101 University Avenue, Madison, Wisconsin 53706, United States
- Mark Scalf
  - Department of Chemistry, University of Wisconsin, 1101 University Avenue, Madison, Wisconsin 53706, United States
- Lloyd M Smith
  - Department of Chemistry, University of Wisconsin, 1101 University Avenue, Madison, Wisconsin 53706, United States
  - Genome Center of Wisconsin, University of Wisconsin, 425G Henry Mall, Room 3420, Madison, Wisconsin 53706, United States