1
|
Nimer RM, Abdel Rahman AM. Recent advances in proteomic-based diagnostics of cystic fibrosis. Expert Rev Proteomics 2023; 20:151-169. [PMID: 37766616 DOI: 10.1080/14789450.2023.2258282] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2023] [Accepted: 07/06/2023] [Indexed: 09/29/2023]
Abstract
INTRODUCTION Cystic fibrosis (CF) is a genetic disease characterized by thick and sticky mucus accumulation, which may harm numerous internal organs. Various variables such as gene modifiers, environmental factors, age of diagnosis, and CF transmembrane conductance regulator (CFTR) gene mutations influence phenotypic disease diversity. Biomarkers that are based on genomic information may not accurately represent the underlying mechanism of the disease as well as its lethal complications. Therefore, recent advancements in mass spectrometry (MS)-based proteomics may provide deep insights into CF mechanisms and cellular functions by examining alterations in the protein expression patterns from various samples of individuals with CF. AREAS COVERED We present current developments in MS-based proteomics, its application, and findings in CF. In addition, the future roles of proteomics in finding diagnostic and prognostic novel biomarkers. EXPERT OPINION Despite significant advances in MS-based proteomics, extensive research in a large cohort for identifying and validating diagnostic, prognostic, predictive, and therapeutic biomarkers for CF disease is highly needed.
Collapse
Affiliation(s)
- Refat M Nimer
- Department of Medical Laboratory Sciences, Jordan University of Science and Technology, Irbid, Jordan
| | - Anas M Abdel Rahman
- Metabolomics Section, Department of Clinical Genomics, Center for Genome Medicine, King Faisal Specialist Hospital and Research Centre (KFSHRC), Riyadh, Saudi Arabia
- Department of Biochemistry and Molecular Medicine, College of Medicine, Alfaisal University, Riyadh, Saudi Arabia
| |
Collapse
|
2
|
Son A, Pankow S, Bamberger TC, Yates JR. Quantitative structural proteomics in living cells by covalent protein painting. Methods Enzymol 2023; 679:33-63. [PMID: 36682868 PMCID: PMC10262296 DOI: 10.1016/bs.mie.2022.08.046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
The fold and conformation of proteins are key to successful cellular function, but all techniques for protein structure determination are performed in an artificial environment with highly purified proteins. While protein conformations have been solved to atomic resolution and modern protein structure prediction tools rapidly generate near accurate models of proteins, there is an unmet need to uncover the conformations of proteins in living cells. Here, we describe Covalent Protein Painting (CPP), a simple and fast method to infer structural information on protein conformation in cells with a quantitative protein footprinting technology. CPP monitors the conformational landscape of the 3D proteome in cells with high sensitivity and throughput. A key advantage of CPP is its' ability to quantitatively compare the 3D proteomes between different experimental conditions and to discover significant changes in the protein conformations. We detail how to perform a successful CPP experiment, the factors to consider before performing the experiment, and how to interpret the results.
Collapse
Affiliation(s)
- Ahrum Son
- Department of Molecular Medicine, The Scripps Research Institute, La Jolla, CA, United States
| | - Sandra Pankow
- Department of Molecular Medicine, The Scripps Research Institute, La Jolla, CA, United States
| | - Tom Casimir Bamberger
- Department of Molecular Medicine, The Scripps Research Institute, La Jolla, CA, United States
| | - John R Yates
- Department of Molecular Medicine, The Scripps Research Institute, La Jolla, CA, United States.
| |
Collapse
|
3
|
Cunsolo V, Di Francesco A, Pittalà MGG, Saletti R, Foti S. The TriMet_DB: A Manually Curated Database of the Metabolic Proteins of Triticum aestivum. Nutrients 2022; 14:nu14245377. [PMID: 36558536 PMCID: PMC9781733 DOI: 10.3390/nu14245377] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2022] [Revised: 12/07/2022] [Accepted: 12/15/2022] [Indexed: 12/23/2022] Open
Abstract
Mass-spectrometry-based wheat proteomics is challenging because the current interpretation of mass spectrometry data relies on public databases that are not exhaustive (UniProtKB/Swiss-Prot) or contain many redundant and poor or un-annotated entries (UniProtKB/TrEMBL). Here, we report the development of a manually curated database of the metabolic proteins of Triticum aestivum (hexaploid wheat), named TriMet_DB (Triticum aestivum Metabolic Proteins DataBase). The manually curated TriMet_DB was generated in FASTA format so that it can be read directly by programs used to interpret the mass spectrometry data. Furthermore, the complete list of entries included in the TriMet_DB is reported in a freely available resource, which includes for each protein the description, the gene code, the protein family, and the allergen name (if any). To evaluate its performance, the TriMet_DB was used to interpret the MS data acquired on the metabolic protein fraction extracted from the cultivar MEC of Triticum aestivum. Data are available via ProteomeXchange with identifier PXD037709.
Collapse
|
4
|
Characterization of peptide-protein relationships in protein ambiguity groups via bipartite graphs. PLoS One 2022; 17:e0276401. [PMID: 36269744 PMCID: PMC9586388 DOI: 10.1371/journal.pone.0276401] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2022] [Accepted: 10/05/2022] [Indexed: 11/06/2022] Open
Abstract
In bottom-up proteomics, proteins are enzymatically digested into peptides before measurement with mass spectrometry. The relationship between proteins and their corresponding peptides can be represented by bipartite graphs. We conduct a comprehensive analysis of bipartite graphs using quantified peptides from measured data sets as well as theoretical peptides from an in silico digestion of the corresponding complete taxonomic protein sequence databases. The aim of this study is to characterize and structure the different types of graphs that occur and to compare them between data sets. We observed a large influence of the accepted minimum peptide length during in silico digestion. When changing from theoretical peptides to measured ones, the graph structures are subject to two opposite effects. On the one hand, the graphs based on measured peptides are on average smaller and less complex compared to graphs using theoretical peptides. On the other hand, the proportion of protein nodes without unique peptides, which are a complicated case for protein inference and quantification, is considerably larger for measured data. Additionally, the proportion of graphs containing at least one protein node without unique peptides rises when going from database to quantitative level. The fraction of shared peptides and proteins without unique peptides as well as the complexity and size of the graphs highly depends on the data set and organism. Large differences between the structures of bipartite peptide-protein graphs have been observed between database and quantitative level as well as between analyzed species. In the analyzed measured data sets, the proportion of protein nodes without unique peptides ranged from 6.4% to 55.0%. This highlights the need for novel methods that can quantify proteins without unique peptides. The knowledge about the structure of the bipartite peptide-protein graphs gained in this study will be useful for the development of such algorithms.
Collapse
|
5
|
Fancello L, Burger T. An analysis of proteogenomics and how and when transcriptome-informed reduction of protein databases can enhance eukaryotic proteomics. Genome Biol 2022; 23:132. [PMID: 35725496 PMCID: PMC9208142 DOI: 10.1186/s13059-022-02701-2] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2021] [Accepted: 06/09/2022] [Indexed: 12/03/2022] Open
Abstract
Background Proteogenomics aims to identify variant or unknown proteins in bottom-up proteomics, by searching transcriptome- or genome-derived custom protein databases. However, empirical observations reveal that these large proteogenomic databases produce lower-sensitivity peptide identifications. Various strategies have been proposed to avoid this, including the generation of reduced transcriptome-informed protein databases, which only contain proteins whose transcripts are detected in the sample-matched transcriptome. These were found to increase peptide identification sensitivity. Here, we present a detailed evaluation of this approach. Results We establish that the increased sensitivity in peptide identification is in fact a statistical artifact, directly resulting from the limited capability of target-decoy competition to accurately model incorrect target matches when using excessively small databases. As anti-conservative false discovery rates (FDRs) are likely to hamper the robustness of the resulting biological conclusions, we advocate for alternative FDR control methods that are less sensitive to database size. Nevertheless, reduced transcriptome-informed databases are useful, as they reduce the ambiguity of protein identifications, yielding fewer shared peptides. Furthermore, searching the reference database and subsequently filtering proteins whose transcripts are not expressed reduces protein identification ambiguity to a similar extent, but is more transparent and reproducible. Conclusions In summary, using transcriptome information is an interesting strategy that has not been promoted for the right reasons. While the increase in peptide identifications from searching reduced transcriptome-informed databases is an artifact caused by the use of an FDR control method unsuitable to excessively small databases, transcriptome information can reduce the ambiguity of protein identifications. Supplementary Information The online version contains supplementary material available at 10.1186/s13059-022-02701-2.
Collapse
Affiliation(s)
- Laura Fancello
- CNRS, CEA, Inserm, BioSanté U1292, Profi FR2048, Université Grenoble Alpes, Grenoble, France
| | - Thomas Burger
- CNRS, CEA, Inserm, BioSanté U1292, Profi FR2048, Université Grenoble Alpes, Grenoble, France.
| |
Collapse
|
6
|
Plubell DL, Käll L, Webb-Robertson BJM, Bramer LM, Ives A, Kelleher NL, Smith LM, Montine TJ, Wu CC, MacCoss MJ. Putting Humpty Dumpty Back Together Again: What Does Protein Quantification Mean in Bottom-Up Proteomics? J Proteome Res 2022; 21:891-898. [PMID: 35220718 PMCID: PMC8976764 DOI: 10.1021/acs.jproteome.1c00894] [Citation(s) in RCA: 35] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]
Abstract
Bottom-up proteomics provides peptide measurements and has been invaluable for moving proteomics into large-scale analyses. Commonly, a single quantitative value is reported for each protein-coding gene by aggregating peptide quantities into protein groups following protein inference or parsimony. However, given the complexity of both RNA splicing and post-translational protein modification, it is overly simplistic to assume that all peptides that map to a singular protein-coding gene will demonstrate the same quantitative response. By assuming that all peptides from a protein-coding sequence are representative of the same protein, we may miss the discovery of important biological differences. To capture the contributions of existing proteoforms, we need to reconsider the practice of aggregating protein values to a single quantity per protein-coding gene.
Collapse
Affiliation(s)
- Deanna L. Plubell
- Department of Genome Sciences, University of Washington, Seattle, WA, 98195 USA
| | - Lukas Käll
- Science for Life Laboratory, KTH - Royal Institute of Technology, Box 1031, 17121, Solna, Sweden
| | | | - Lisa M. Bramer
- Pacific Northwest National Laboratory, Richland, WA 99352
| | - Ashley Ives
- Proteomics Center of Excellence & Departments of Chemistry and Molecular Biosciences, Northwestern University, Evanston, IL 60208
| | - Neil L. Kelleher
- Proteomics Center of Excellence & Departments of Chemistry and Molecular Biosciences, Northwestern University, Evanston, IL 60208
| | - Lloyd M. Smith
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, 53706
| | | | - Christine C. Wu
- Department of Genome Sciences, University of Washington, Seattle, WA, 98195 USA
| | - Michael J. MacCoss
- Department of Genome Sciences, University of Washington, Seattle, WA, 98195 USA
| |
Collapse
|
7
|
Bamberger C, Diedrich J, Martìnez-Bartholomé S, Yates JR. Cancer Conformational Landscape Shapes Tumorigenesis. J Proteome Res 2022; 21:1017-1028. [PMID: 35271278 PMCID: PMC9653087 DOI: 10.1021/acs.jproteome.1c00906] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
During tumorigenesis, DNA mutations in protein coding sequences can alter amino acid sequences which can change the structures of proteins. While the 3D structure of mutated proteins has been studied with atomic resolution, the precise impact of somatic mutations on the 3D proteome during malignant transformation remains unknown because methods to reveal in vivo protein structures in high throughput are limited. Here, we measured the accessibility of the lysine ε-amine for chemical modification across proteomes using covalent protein painting (CPP) to indirectly determine alterations in the 3D proteome. CPP is a novel, high-throughput quantitative mass spectrometric method that surveyed a total of 8052 lysine sites across the 60 cell lines of the well-studied anticancer cell line panel (NCI60). Overall, 5.2 structural alterations differentiated any cancer cell line from the other 59. Structural aberrations in 98 effector proteins correlated with the selected presence of 90 commonly mutated proteins in the NCI60 cell line panel, suggesting that different tumor genotypes reshape a limited set of effector proteins. We searched our dataset for druggable conformational aberrations and identified 49 changes in the cancer conformational landscape that correlated with the growth inhibition profiles of 300 drug candidates out of 50,000 small molecules. We found that alterations in heat shock proteins are key predictors of anticancer drug efficacy, which implies that the proteostasis network may have a general but hitherto unrecognized role in maintaining malignancy. Individual lysine sites may serve as biomarkers to guide drug selection or may be directly targeted for anticancer drug development.
Collapse
Affiliation(s)
- Casimir Bamberger
- Department of Molecular Medicine, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, CA 92037, USA
| | - Jolene Diedrich
- Department of Molecular Medicine, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, CA 92037, USA
| | - Salvador Martìnez-Bartholomé
- Department of Molecular Medicine, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, CA 92037, USA
| | - John R. Yates
- Department of Molecular Medicine, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, CA 92037, USA
| |
Collapse
|
8
|
Bludau I, Frank M, Dörig C, Cai Y, Heusel M, Rosenberger G, Picotti P, Collins BC, Röst H, Aebersold R. Systematic detection of functional proteoform groups from bottom-up proteomic datasets. Nat Commun 2021; 12:3810. [PMID: 34155216 PMCID: PMC8217233 DOI: 10.1038/s41467-021-24030-x] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2020] [Accepted: 05/25/2021] [Indexed: 02/05/2023] Open
Abstract
To a large extent functional diversity in cells is achieved by the expansion of molecular complexity beyond that of the coding genome. Various processes create multiple distinct but related proteins per coding gene - so-called proteoforms - that expand the functional capacity of a cell. Evaluating proteoforms from classical bottom-up proteomics datasets, where peptides instead of intact proteoforms are measured, has remained difficult. Here we present COPF, a tool for COrrelation-based functional ProteoForm assessment in bottom-up proteomics data. It leverages the concept of peptide correlation analysis to systematically assign peptides to co-varying proteoform groups. We show applications of COPF to protein complex co-fractionation data as well as to more typical protein abundance vs. sample data matrices, demonstrating the systematic detection of assembly- and tissue-specific proteoform groups, respectively, in either dataset. We envision that the presented approach lays the foundation for a systematic assessment of proteoforms and their functional implications directly from bottom-up proteomic datasets.
Collapse
Affiliation(s)
- Isabell Bludau
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
- Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany
| | - Max Frank
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
- Department of Molecular Genetics, University of Toronto, Toronto, Canada
- The Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Canada
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
| | - Christian Dörig
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
| | - Yujia Cai
- Department of Molecular Genetics, University of Toronto, Toronto, Canada
- The Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Canada
| | - Moritz Heusel
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
- Division of Infection Medicine (BMC), Department of Clinical Sciences, Lund University, Lund, Sweden
| | - George Rosenberger
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
- Columbia University, New York, NY, USA
| | - Paola Picotti
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
| | - Ben C Collins
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
- School of Biological Sciences, Queen's University Belfast, Belfast, UK
| | - Hannes Röst
- Department of Molecular Genetics, University of Toronto, Toronto, Canada.
- The Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Canada.
| | - Ruedi Aebersold
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland.
- Faculty of Science, University of Zurich, Zurich, Switzerland.
| |
Collapse
|
9
|
Bamberger C, Pankow S, Martínez-Bartolomé S, Ma M, Diedrich J, Rissman RA, Yates JR. Protein Footprinting via Covalent Protein Painting Reveals Structural Changes of the Proteome in Alzheimer's Disease. J Proteome Res 2021; 20:2762-2771. [PMID: 33872013 PMCID: PMC8477671 DOI: 10.1021/acs.jproteome.0c00912] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
Misfolding and aggregation of amyloid-β peptide and hyperphosphorylated tau are molecular markers of Alzheimer's disease (AD), and although the 3D structures of these aberrantly folded proteins have been visualized in exquisite detail, no method has been able to survey protein folding across the proteome in AD. Here, we present covalent protein painting (CPP), a mass spectrometry-based protein footprinting approach to quantify the accessibility of lysine ε-amines for covalent modification at the surface of natively folded proteins. We used CPP to survey the reactivity of 2645 lysine residues and therewith the structural proteome of HEK293T cells and found that reactivity increased upon mild heat shock. CPP revealed that the accessibility of lysine residues for covalent modification in tubulin-β (TUBB), in succinate dehydrogenase (SHDB), and in amyloid-β peptide (Aβ) is altered in human postmortem brain samples of patients with neurodegenerative diseases. The structural alterations of TUBB and SHDB in patients with AD, dementia with Lewy bodies (DLB), or both point to broader perturbations of the 3D proteome beyond Aβ and hyperphosphorylated tau.
Collapse
Affiliation(s)
- Casimir Bamberger
- Department of Molecular Medicine, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, CA 92037, USA
| | - Sandra Pankow
- Department of Molecular Medicine, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, CA 92037, USA
| | - Salvador Martínez-Bartolomé
- Department of Molecular Medicine, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, CA 92037, USA
| | - Michelle Ma
- Department of Molecular Medicine, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, CA 92037, USA
| | - Jolene Diedrich
- Department of Molecular Medicine, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, CA 92037, USA
| | - Robert A. Rissman
- Department of Neurosciences, University of California San Diego, 9500 Gilman Drive, La Jolla, CA 92093, USA
- Veterans Affairs San Diego Healthcare System, San Diego, CA, 92161, USA
| | - John R. Yates
- Department of Molecular Medicine, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, CA 92037, USA
| |
Collapse
|
10
|
Tsang O, Wong JWH. Proteogenomic interrogation of cancer cell lines: an overview of the field. Expert Rev Proteomics 2021; 18:221-232. [PMID: 33877947 DOI: 10.1080/14789450.2021.1914594] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Introduction: Cancer cell lines (CCLs) have been a major resource for cancer research. Over the past couple of decades, they have been instrumental in omic profiling method development and as model systems to generate new knowledge in cell and cancer biology. More recently, with the increasing amount of genomic, transcriptomic and proteomic data being generated in hundreds of CCLs, there is growing potential for integrative proteogenomic data analyses to be performed.Areas covered: In this review, we first describe the most commonly used proteome profiling methods in CCLs. We then discuss how these proteomics data can be integrated with genomics data for proteogenomics analyses. Finally, we highlight some of the recent biological discoveries that have arisen from proteogenomics analyses of CCLs.Expert opinion: Protegeonomics analyses of CCLs have so far enabled the discovery of novel proteins and proteoforms. It has also improved our understanding of biological processes including post-transcriptional regulation of protein abundance and the presentation of antigens by major histocompatibility complex alleles. With proteomics data to be generated in hundreds to thousands of CCLs in coming years, there will be further potential for large-scale proteogenomics analyses and data integration with the phenotypically well-characterized CCLs.
Collapse
Affiliation(s)
- Olson Tsang
- Centre for PanorOmic Sciences, The University of Hong Kong, Pokfulam, Hong Kong SAR
| | - Jason W H Wong
- Centre for PanorOmic Sciences, The University of Hong Kong, Pokfulam, Hong Kong SAR.,School of Biomedical Sciences, The University of Hong Kong, Pokfulam, Hong Kong SAR
| |
Collapse
|
11
|
Schaffer LV, Millikin RJ, Shortreed MR, Scalf M, Smith LM. Improving Proteoform Identifications in Complex Systems Through Integration of Bottom-Up and Top-Down Data. J Proteome Res 2020; 19:3510-3517. [PMID: 32584579 DOI: 10.1021/acs.jproteome.0c00332] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Cellular functions are performed by a vast and diverse set of proteoforms. Proteoforms are the specific forms of proteins produced as a result of genetic variations, RNA splicing, and post-translational modifications (PTMs). Top-down mass spectrometric analysis of intact proteins enables proteoform identification, including proteoforms derived from sequence cleavage events or harboring multiple PTMs. In contrast, bottom-up proteomics identifies peptides, which necessitates protein inference and does not yield proteoform identifications. We seek here to exploit the synergies between these two data types to improve the quality and depth of the overall proteomic analysis. To this end, we automated the large-scale integration of results from multiprotease bottom-up and top-down analyses in the software program Proteoform Suite and applied it to the analysis of proteoforms from the human Jurkat T lymphocyte cell line. We implemented the recently developed proteoform-level classification scheme for top-down tandem mass spectrometry (MS/MS) identifications in Proteoform Suite, which enables users to observe the level and type of ambiguity for each proteoform identification, including which of the ambiguous proteoform identifications are supported by bottom-up-level evidence. We used Proteoform Suite to find instances where top-down identifications aid in protein inference from bottom-up analysis and conversely where bottom-up peptide identifications aid in proteoform PTM localization. We also show the use of bottom-up data to infer proteoform candidates potentially present in the sample, allowing confirmation of such proteoform candidates by intact-mass analysis of MS1 spectra. The implementation of these capabilities in the freely available software program Proteoform Suite enables users to integrate large-scale top-down and bottom-up data sets and to utilize the synergies between them to improve and extend the proteomic analysis.
Collapse
Affiliation(s)
- Leah V Schaffer
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| | - Robert J Millikin
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| | - Michael R Shortreed
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| | - Mark Scalf
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| | - Lloyd M Smith
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| |
Collapse
|
12
|
Budayeva HG, Kirkpatrick DS. Monitoring protein communities and their responses to therapeutics. Nat Rev Drug Discov 2020; 19:414-426. [PMID: 32139903 DOI: 10.1038/s41573-020-0063-y] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/24/2020] [Indexed: 12/19/2022]
Abstract
Most therapeutics are designed to alter the activities of proteins. From metabolic enzymes to cell surface receptors, connecting the function of a protein to a cellular phenotype, to the activity of a drug and to a clinical outcome represents key mechanistic milestones during drug development. Yet, even for therapeutics with exquisite specificity, the sequence of events following target engagement can be complex. Interconnected communities of structural, metabolic and signalling proteins modulate diverse downstream effects that manifest as interindividual differences in efficacy, adverse effects and resistance to therapy. Recent advances in mass spectrometry proteomics have made it possible to decipher these complex relationships and to understand how factors such as genotype, cell type, local environment and external perturbations influence them. In this Review, we explore how proteomic technologies are expanding our understanding of protein communities and their responses to large- and small-molecule therapeutics.
Collapse
Affiliation(s)
- Hanna G Budayeva
- Department of Microchemistry, Proteomics and Lipidomics, Genentech, South San Francisco, CA, USA
| | - Donald S Kirkpatrick
- Department of Microchemistry, Proteomics and Lipidomics, Genentech, South San Francisco, CA, USA.
| |
Collapse
|
13
|
Lorentzian A, Uzozie A, Lange PF. Origins and clinical relevance of proteoforms in pediatric malignancies. Expert Rev Proteomics 2019; 16:185-200. [PMID: 30700156 DOI: 10.1080/14789450.2019.1575206] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
INTRODUCTION Cancer changes the proteome in complex ways that reach well beyond simple changes in protein abundance. Genomic and transcriptional variations and post-translational protein modification create functional variants of a protein, known as proteoforms. Childhood cancers have fewer genomic alterations but show equally dramatic phenotypic changes as malignant cells in adults. Therefore, unraveling the complexities of the proteome is even more important in pediatric malignancies. Areas covered: In this review, the biological origins of proteoforms and technological advancements in the study of proteoforms are discussed. Particular emphasis is given to their implication in childhood malignancies and the critical role of cancer-specific proteoforms for the next generation of cancer therapies and diagnostics. Expert opinion: Recent advancements in technology have led to a better understanding of the underlying mechanisms of tumorigenesis. This has been critical for the development of more effective and less harmful treatments that are based on direct targeting of altered proteins and deregulated pathways. As proteome coverage and the ability to detect complex proteoforms increase, the most need for change is in data compilation and database availability to mediate high-level data analysis and allow for better functional annotation of proteoforms.
Collapse
Affiliation(s)
- Amanda Lorentzian
- a Department of Cell and Developmental Biology , University of British Columbia , Vancouver , BC , Canada.,b Michael Cuccione Childhood Cancer Research Program , BC Children's Hospital Research Institute , Vancouver , BC , Canada
| | - Anuli Uzozie
- b Michael Cuccione Childhood Cancer Research Program , BC Children's Hospital Research Institute , Vancouver , BC , Canada.,c Department of Pathology and Laboratory Medicine , University of British Columbia , Vancouver , BC , Canada
| | - Philipp F Lange
- a Department of Cell and Developmental Biology , University of British Columbia , Vancouver , BC , Canada.,b Michael Cuccione Childhood Cancer Research Program , BC Children's Hospital Research Institute , Vancouver , BC , Canada.,c Department of Pathology and Laboratory Medicine , University of British Columbia , Vancouver , BC , Canada
| |
Collapse
|
14
|
Pankow S, Martínez-Bartolomé S, Bamberger C, Yates JR. Understanding molecular mechanisms of disease through spatial proteomics. Curr Opin Chem Biol 2018; 48:19-25. [PMID: 30308467 DOI: 10.1016/j.cbpa.2018.09.016] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2018] [Revised: 09/17/2018] [Accepted: 09/19/2018] [Indexed: 02/07/2023]
Abstract
Mammalian cells are organized into different compartments that separate and facilitate physiological processes by providing specialized local environments and allowing different, otherwise incompatible biological processes to be carried out simultaneously. Proteins are targeted to these subcellular locations where they fulfill specialized, compartment-specific functions. Spatial proteomics aims to localize and quantify proteins within subcellular structures.
Collapse
Affiliation(s)
- Sandra Pankow
- Department of Molecular Medicine, The Scripps Research Institute, La Jolla, CA, 92037, United States
| | | | - Casimir Bamberger
- Department of Molecular Medicine, The Scripps Research Institute, La Jolla, CA, 92037, United States
| | - John R Yates
- Department of Molecular Medicine, The Scripps Research Institute, La Jolla, CA, 92037, United States.
| |
Collapse
|