1
|
Santos LGC, Parreira VDSC, da Silva EMG, Santos MDM, Fernandes ADF, Neves-Ferreira AGDC, Carvalho PC, Freitas FCDP, Passetti F. SpliceProt 2.0: A Sequence Repository of Human, Mouse, and Rat Proteoforms. Int J Mol Sci 2024; 25:1183. [PMID: 38256255 PMCID: PMC10816255 DOI: 10.3390/ijms25021183] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Revised: 12/15/2023] [Accepted: 01/03/2024] [Indexed: 01/24/2024] Open
Abstract
SpliceProt 2.0 is a public proteogenomics database that aims to list the sequence of known proteins and potential new proteoforms in human, mouse, and rat proteomes. This updated repository provides an even broader range of computationally translated proteins and serves, for example, to aid with proteomic validation of splice variants absent from the reference UniProtKB/SwissProt database. We demonstrate the value of SpliceProt 2.0 to predict orthologous proteins between humans and murines based on transcript reconstruction, sequence annotation and detection at the transcriptome and proteome levels. In this release, the annotation data used in the reconstruction of transcripts based on the methodology of ternary matrices were acquired from new databases such as Ensembl, UniProt, and APPRIS. Another innovation implemented in the pipeline is the exclusion of transcripts predicted to be susceptible to degradation through the NMD pathway. Taken together, our repository and its applications represent a valuable resource for the proteogenomics community.
Collapse
Affiliation(s)
- Letícia Graziela Costa Santos
- Instituto Carlos Chagas, Fundação Oswaldo Cruz (FIOCRUZ), Rua Professor Algacyr Munhoz Mader 3775, Cidade Industrial De Curitiba, Curitiba 81310-020, PR, Brazil
| | - Vinícius da Silva Coutinho Parreira
- Instituto Carlos Chagas, Fundação Oswaldo Cruz (FIOCRUZ), Rua Professor Algacyr Munhoz Mader 3775, Cidade Industrial De Curitiba, Curitiba 81310-020, PR, Brazil
| | - Esdras Matheus Gomes da Silva
- Instituto Carlos Chagas, Fundação Oswaldo Cruz (FIOCRUZ), Rua Professor Algacyr Munhoz Mader 3775, Cidade Industrial De Curitiba, Curitiba 81310-020, PR, Brazil
- Laboratory of Toxinology, Oswaldo Cruz Institute, Fundação Oswaldo Cruz (FIOCRUZ), Av. Brazil 4036, Campus Maré, Rio de Janeiro 21040-361, RJ, Brazil
| | - Marlon Dias Mariano Santos
- Instituto Carlos Chagas, Fundação Oswaldo Cruz (FIOCRUZ), Rua Professor Algacyr Munhoz Mader 3775, Cidade Industrial De Curitiba, Curitiba 81310-020, PR, Brazil
| | - Alexander da Franca Fernandes
- Instituto Carlos Chagas, Fundação Oswaldo Cruz (FIOCRUZ), Rua Professor Algacyr Munhoz Mader 3775, Cidade Industrial De Curitiba, Curitiba 81310-020, PR, Brazil
| | - Ana Gisele da Costa Neves-Ferreira
- Laboratory of Toxinology, Oswaldo Cruz Institute, Fundação Oswaldo Cruz (FIOCRUZ), Av. Brazil 4036, Campus Maré, Rio de Janeiro 21040-361, RJ, Brazil
| | - Paulo Costa Carvalho
- Instituto Carlos Chagas, Fundação Oswaldo Cruz (FIOCRUZ), Rua Professor Algacyr Munhoz Mader 3775, Cidade Industrial De Curitiba, Curitiba 81310-020, PR, Brazil
| | - Flávia Cristina de Paula Freitas
- Instituto Carlos Chagas, Fundação Oswaldo Cruz (FIOCRUZ), Rua Professor Algacyr Munhoz Mader 3775, Cidade Industrial De Curitiba, Curitiba 81310-020, PR, Brazil
- Departamento de Genética e Evolução, Universidade Federal de São Carlos (UFSCar), Rodovia Washington Luis, Km 235, São Carlos 13565-905, SP, Brazil
| | - Fabio Passetti
- Instituto Carlos Chagas, Fundação Oswaldo Cruz (FIOCRUZ), Rua Professor Algacyr Munhoz Mader 3775, Cidade Industrial De Curitiba, Curitiba 81310-020, PR, Brazil
| |
Collapse
|
2
|
Leduc A, Harens H, Slavov N. Modeling and interpretation of single-cell proteogenomic data. ARXIV 2023:arXiv:2308.07465v2. [PMID: 37645043 PMCID: PMC10462161] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 08/31/2023]
Abstract
Biological functions stem from coordinated interactions among proteins, nucleic acids and small molecules. Mass spectrometry technologies for reliable, high throughput single-cell proteomics will add a new modality to genomics and enable data-driven modeling of the molecular mechanisms coordinating proteins and nucleic acids at single-cell resolution. This promising potential requires estimating the reliability of measurements and computational analysis so that models can distinguish biological regulation from technical artifacts. We highlight different measurement modes that can support single-cell proteogenomic analysis and how to estimate their reliability. We then discuss approaches for developing both abstract and mechanistic models that aim to biologically interpret the measured differences across modalities, including specific applications to directed stem cell differentiation and to inferring protein interactions in cancer cells from the buffing of DNA copy-number variations. Single-cell proteogenomic data will support mechanistic models of direct molecular interactions that will provide generalizable and predictive representations of biological systems.
Collapse
Affiliation(s)
- Andrew Leduc
- Departments of Bioengineering, Biology, Chemistry and Chemical Biology, Single Cell Proteomics Center, and Barnett Institute, Northeastern University, Boston, MA 02115, USA
| | - Hannah Harens
- Departments of Bioengineering, Biology, Chemistry and Chemical Biology, Single Cell Proteomics Center, and Barnett Institute, Northeastern University, Boston, MA 02115, USA
| | - Nikolai Slavov
- Departments of Bioengineering, Biology, Chemistry and Chemical Biology, Single Cell Proteomics Center, and Barnett Institute, Northeastern University, Boston, MA 02115, USA
- Parallel Squared Technology Institute, Watertown, MA 02472, USA
| |
Collapse
|
3
|
Gatto L, Aebersold R, Cox J, Demichev V, Derks J, Emmott E, Franks AM, Ivanov AR, Kelly RT, Khoury L, Leduc A, MacCoss MJ, Nemes P, Perlman DH, Petelski AA, Rose CM, Schoof EM, Van Eyk J, Vanderaa C, Yates JR, Slavov N. Initial recommendations for performing, benchmarking and reporting single-cell proteomics experiments. Nat Methods 2023; 20:375-386. [PMID: 36864200 PMCID: PMC10130941 DOI: 10.1038/s41592-023-01785-3] [Citation(s) in RCA: 74] [Impact Index Per Article: 37.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2022] [Accepted: 01/24/2023] [Indexed: 03/04/2023]
Abstract
Analyzing proteins from single cells by tandem mass spectrometry (MS) has recently become technically feasible. While such analysis has the potential to accurately quantify thousands of proteins across thousands of single cells, the accuracy and reproducibility of the results may be undermined by numerous factors affecting experimental design, sample preparation, data acquisition and data analysis. We expect that broadly accepted community guidelines and standardized metrics will enhance rigor, data quality and alignment between laboratories. Here we propose best practices, quality controls and data-reporting recommendations to assist in the broad adoption of reliable quantitative workflows for single-cell proteomics. Resources and discussion forums are available at https://single-cell.net/guidelines .
Collapse
Affiliation(s)
- Laurent Gatto
- Computational Biology and Bioinformatics Unit, de Duve Institute, Université Catholique de Louvain, Brussels, Belgium
| | - Ruedi Aebersold
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
| | - Juergen Cox
- Max Planck Institute of Biochemistry, Martinsried, Germany
| | | | - Jason Derks
- Departments of Bioengineering, Biology, Chemistry and Chemical Biology, Single-Cell Proteomics Center and Barnett Institute, Northeastern University, Boston, MA, USA
| | - Edward Emmott
- Centre for Proteome Research, Department of Biochemistry and Systems Biology, University of Liverpool, Liverpool, UK
| | - Alexander M Franks
- Department of Statistics and Applied Probability, University of California Santa Barbara, Santa Barbara, CA, USA
| | - Alexander R Ivanov
- Department of Chemistry and Chemical Biology, Barnett Institute of Chemical and Biological Analysis, Northeastern University, Boston, MA, USA
| | - Ryan T Kelly
- Department of Chemistry and Biochemistry, Brigham Young University, Provo, UT, USA
| | - Luke Khoury
- Departments of Bioengineering, Biology, Chemistry and Chemical Biology, Single-Cell Proteomics Center and Barnett Institute, Northeastern University, Boston, MA, USA
| | - Andrew Leduc
- Departments of Bioengineering, Biology, Chemistry and Chemical Biology, Single-Cell Proteomics Center and Barnett Institute, Northeastern University, Boston, MA, USA
| | | | - Peter Nemes
- Department of Chemistry and Biochemistry, University of Maryland, College Park, MD, USA
| | - David H Perlman
- Merck Exploratory Science Center, Merck Sharp & Dohme Corp., Cambridge, MA, USA
| | - Aleksandra A Petelski
- Departments of Bioengineering, Biology, Chemistry and Chemical Biology, Single-Cell Proteomics Center and Barnett Institute, Northeastern University, Boston, MA, USA
- Parallel Squared Technology Institute, Watertown, MA, USA
| | - Christopher M Rose
- Department of Microchemistry, Proteomics and Lipidomics, Genentech Inc., South San Francisco, CA, USA
| | - Erwin M Schoof
- Department of Biotechnology and Biomedicine, Technical University of Denmark, Lyngby, Denmark
| | | | - Christophe Vanderaa
- Computational Biology and Bioinformatics Unit, de Duve Institute, Université Catholique de Louvain, Brussels, Belgium
| | - John R Yates
- Departments of Molecular Medicine and Neurobiology, the Scripps Research Institute, La Jolla, CA, USA
| | - Nikolai Slavov
- Departments of Bioengineering, Biology, Chemistry and Chemical Biology, Single-Cell Proteomics Center and Barnett Institute, Northeastern University, Boston, MA, USA.
- Parallel Squared Technology Institute, Watertown, MA, USA.
| |
Collapse
|
4
|
Plubell DL, Käll L, Webb-Robertson BJM, Bramer LM, Ives A, Kelleher NL, Smith LM, Montine TJ, Wu CC, MacCoss MJ. Putting Humpty Dumpty Back Together Again: What Does Protein Quantification Mean in Bottom-Up Proteomics? J Proteome Res 2022; 21:891-898. [PMID: 35220718 PMCID: PMC8976764 DOI: 10.1021/acs.jproteome.1c00894] [Citation(s) in RCA: 42] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]
Abstract
Bottom-up proteomics provides peptide measurements and has been invaluable for moving proteomics into large-scale analyses. Commonly, a single quantitative value is reported for each protein-coding gene by aggregating peptide quantities into protein groups following protein inference or parsimony. However, given the complexity of both RNA splicing and post-translational protein modification, it is overly simplistic to assume that all peptides that map to a singular protein-coding gene will demonstrate the same quantitative response. By assuming that all peptides from a protein-coding sequence are representative of the same protein, we may miss the discovery of important biological differences. To capture the contributions of existing proteoforms, we need to reconsider the practice of aggregating protein values to a single quantity per protein-coding gene.
Collapse
Affiliation(s)
- Deanna L. Plubell
- Department of Genome Sciences, University of Washington, Seattle, WA, 98195 USA
| | - Lukas Käll
- Science for Life Laboratory, KTH - Royal Institute of Technology, Box 1031, 17121, Solna, Sweden
| | | | - Lisa M. Bramer
- Pacific Northwest National Laboratory, Richland, WA 99352
| | - Ashley Ives
- Proteomics Center of Excellence & Departments of Chemistry and Molecular Biosciences, Northwestern University, Evanston, IL 60208
| | - Neil L. Kelleher
- Proteomics Center of Excellence & Departments of Chemistry and Molecular Biosciences, Northwestern University, Evanston, IL 60208
| | - Lloyd M. Smith
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, 53706
| | | | - Christine C. Wu
- Department of Genome Sciences, University of Washington, Seattle, WA, 98195 USA
| | - Michael J. MacCoss
- Department of Genome Sciences, University of Washington, Seattle, WA, 98195 USA
| |
Collapse
|
5
|
Specht H, Slavov N. Beyond Protein Sequence: Protein Isomerization in Alzheimer's Disease. J Proteome Res 2022; 21:299-300. [PMID: 35114789 DOI: 10.1021/acs.jproteome.2c00016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Harrison Specht
- Department of Bioengineering, Northeastern University, Boston, Massachusetts 02115, United States.,Barnett Institute, Northeastern University, Boston, Massachusetts 02115, United States
| | - Nikolai Slavov
- Department of Bioengineering, Northeastern University, Boston, Massachusetts 02115, United States.,Barnett Institute, Northeastern University, Boston, Massachusetts 02115, United States
| |
Collapse
|
6
|
Meyer B, Chiaravalli J, Gellenoncourt S, Brownridge P, Bryne DP, Daly LA, Grauslys A, Walter M, Agou F, Chakrabarti LA, Craik CS, Eyers CE, Eyers PA, Gambin Y, Jones AR, Sierecki E, Verdin E, Vignuzzi M, Emmott E. Characterising proteolysis during SARS-CoV-2 infection identifies viral cleavage sites and cellular targets with therapeutic potential. Nat Commun 2021; 12:5553. [PMID: 34548480 PMCID: PMC8455558 DOI: 10.1038/s41467-021-25796-w] [Citation(s) in RCA: 70] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2021] [Accepted: 08/24/2021] [Indexed: 02/08/2023] Open
Abstract
SARS-CoV-2 is the causative agent behind the COVID-19 pandemic, responsible for over 170 million infections, and over 3.7 million deaths worldwide. Efforts to test, treat and vaccinate against this pathogen all benefit from an improved understanding of the basic biology of SARS-CoV-2. Both viral and cellular proteases play a crucial role in SARS-CoV-2 replication. Here, we study proteolytic cleavage of viral and cellular proteins in two cell line models of SARS-CoV-2 replication using mass spectrometry to identify protein neo-N-termini generated through protease activity. We identify previously unknown cleavage sites in multiple viral proteins, including major antigens S and N: the main targets for vaccine and antibody testing efforts. We discover significant increases in cellular cleavage events consistent with cleavage by SARS-CoV-2 main protease, and identify 14 potential high-confidence substrates of the main and papain-like proteases. We show that siRNA depletion of these cellular proteins inhibits SARS-CoV-2 replication, and that drugs targeting two of these proteins: the tyrosine kinase SRC and Ser/Thr kinase MYLK, show a dose-dependent reduction in SARS-CoV-2 titres. Overall, our study provides a powerful resource to understand proteolysis in the context of viral infection, and to inform the development of targeted strategies to inhibit SARS-CoV-2 and treat COVID-19.
Collapse
Affiliation(s)
- Bjoern Meyer
- Viral Populations and Pathogenesis Unit, CNRS, UMR 3569, Institut Pasteur, CEDEX 15, Paris, France
| | - Jeanne Chiaravalli
- Chemogenomic and Biological Screening Core Facility, C2RT, Departments of Cell Biology & Infection and of Structural Biology & Chemistry, Institut Pasteur, CEDEX 15, Paris, France
| | - Stacy Gellenoncourt
- CIVIC Group, Virus & Immunity Unit, Institut Pasteur and CNRS, UMR 3569, Paris, France
| | - Philip Brownridge
- Centre for Proteome Research, Department of Biochemistry & Systems Biology, Institute of Systems, Molecular & Integrative Biology, Biosciences Building, Crown Street, University of Liverpool, Liverpool, L69 7ZB, UK
| | - Dominic P Bryne
- Department of Biochemistry & Systems Biology, Institute of Systems, Molecular & Integrative Biology, Biosciences Building, Crown Street, University of Liverpool, Liverpool, L69 7ZB, UK
| | - Leonard A Daly
- Centre for Proteome Research, Department of Biochemistry & Systems Biology, Institute of Systems, Molecular & Integrative Biology, Biosciences Building, Crown Street, University of Liverpool, Liverpool, L69 7ZB, UK
| | - Arturas Grauslys
- Computational Biology Facility, LIV-SRF, Institute of Systems, Molecular & Integrative Biology, Biosciences Building, Crown Street, University of Liverpool, Liverpool, L69 7ZB, UK
| | - Marius Walter
- Buck Institute for Research on Aging, Novato, CA, 94945, USA
| | - Fabrice Agou
- Chemogenomic and Biological Screening Core Facility, C2RT, Departments of Cell Biology & Infection and of Structural Biology & Chemistry, Institut Pasteur, CEDEX 15, Paris, France
| | - Lisa A Chakrabarti
- CIVIC Group, Virus & Immunity Unit, Institut Pasteur and CNRS, UMR 3569, Paris, France
| | - Charles S Craik
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA, USA
| | - Claire E Eyers
- Centre for Proteome Research, Department of Biochemistry & Systems Biology, Institute of Systems, Molecular & Integrative Biology, Biosciences Building, Crown Street, University of Liverpool, Liverpool, L69 7ZB, UK
| | - Patrick A Eyers
- Department of Biochemistry & Systems Biology, Institute of Systems, Molecular & Integrative Biology, Biosciences Building, Crown Street, University of Liverpool, Liverpool, L69 7ZB, UK
| | - Yann Gambin
- EMBL Australia Node for Single Molecule Sciences, and School of Medical Sciences, Botany Road, The University of New South Wales, Sydney, NSW, 2052, Australia
| | - Andrew R Jones
- Department of Biochemistry & Systems Biology, Institute of Systems, Molecular & Integrative Biology, Biosciences Building, Crown Street, University of Liverpool, Liverpool, L69 7ZB, UK
| | - Emma Sierecki
- EMBL Australia Node for Single Molecule Sciences, and School of Medical Sciences, Botany Road, The University of New South Wales, Sydney, NSW, 2052, Australia
| | - Eric Verdin
- Buck Institute for Research on Aging, Novato, CA, 94945, USA
| | - Marco Vignuzzi
- Viral Populations and Pathogenesis Unit, CNRS, UMR 3569, Institut Pasteur, CEDEX 15, Paris, France
| | - Edward Emmott
- Centre for Proteome Research, Department of Biochemistry & Systems Biology, Institute of Systems, Molecular & Integrative Biology, Biosciences Building, Crown Street, University of Liverpool, Liverpool, L69 7ZB, UK.
| |
Collapse
|
7
|
Petelski AA, Slavov N. Analyzing Ribosome Remodeling in Health and Disease. Proteomics 2020; 20:e2000039. [PMID: 32820594 PMCID: PMC7501214 DOI: 10.1002/pmic.202000039] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2020] [Revised: 08/01/2020] [Indexed: 12/24/2022]
Abstract
Increasing evidence suggests that ribosomes actively regulate protein synthesis. However, much of this evidence is indirect, leaving this layer of gene regulation largely unexplored, in part due to methodological limitations. Indeed, evidence is reviewed demonstrating that commonly used methods, such as transcriptomics, are inadequate because the variability in mRNAs coding for ribosomal proteins (RP) does not necessarily correspond to RP variability. Thus protein remodeling of ribosomes should be investigated by methods that allow direct quantification of RPs, ideally of isolated ribosomes. Such methods are reviewed, focusing on mass spectrometry and emphasizing method-specific biases and approaches to control these biases. It is argued that using multiple complementary methods can help reduce the danger of interpreting reproducible systematic biases as evidence for ribosome remodeling.
Collapse
Affiliation(s)
- Aleksandra A Petelski
- Department of Bioengineering, Northeastern University, Boston, MA, 02115, USA
- Barnett Institute, Northeastern University, Boston, MA, 02115, USA
- Department of Biology, Northeastern University, Boston, MA, 02115, USA
| | - Nikolai Slavov
- Department of Bioengineering, Northeastern University, Boston, MA, 02115, USA
- Barnett Institute, Northeastern University, Boston, MA, 02115, USA
- Department of Biology, Northeastern University, Boston, MA, 02115, USA
| |
Collapse
|
8
|
Waas M, Kislinger T. Addressing Cellular Heterogeneity in Cancer through Precision Proteomics. J Proteome Res 2020; 19:3607-3619. [PMID: 32697918 DOI: 10.1021/acs.jproteome.0c00338] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Cells exhibit a broad spectrum of functions driven by differences in molecular phenotype. Understanding the heterogeneity between and within cell types has led to advances in our ability to diagnose and manipulate biological systems. Heterogeneity within and between tumors still poses a challenge to the development and efficacy of therapeutics. In this Perspective we review the toolkit of protein-level experimental approaches for investigating cellular heterogeneity. We describe how innovative approaches and technical developments have supported the advent of bottom-up single-cell proteomic analysis and present opportunities and challenges within cancer research. Finally, we introduce the concept of "precision proteomics" and discuss how the advantages and limitations of various experimental approaches render them suitable for different biological systems and questions.
Collapse
|
9
|
Schaffer LV, Millikin RJ, Shortreed MR, Scalf M, Smith LM. Improving Proteoform Identifications in Complex Systems Through Integration of Bottom-Up and Top-Down Data. J Proteome Res 2020; 19:3510-3517. [PMID: 32584579 DOI: 10.1021/acs.jproteome.0c00332] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Cellular functions are performed by a vast and diverse set of proteoforms. Proteoforms are the specific forms of proteins produced as a result of genetic variations, RNA splicing, and post-translational modifications (PTMs). Top-down mass spectrometric analysis of intact proteins enables proteoform identification, including proteoforms derived from sequence cleavage events or harboring multiple PTMs. In contrast, bottom-up proteomics identifies peptides, which necessitates protein inference and does not yield proteoform identifications. We seek here to exploit the synergies between these two data types to improve the quality and depth of the overall proteomic analysis. To this end, we automated the large-scale integration of results from multiprotease bottom-up and top-down analyses in the software program Proteoform Suite and applied it to the analysis of proteoforms from the human Jurkat T lymphocyte cell line. We implemented the recently developed proteoform-level classification scheme for top-down tandem mass spectrometry (MS/MS) identifications in Proteoform Suite, which enables users to observe the level and type of ambiguity for each proteoform identification, including which of the ambiguous proteoform identifications are supported by bottom-up-level evidence. We used Proteoform Suite to find instances where top-down identifications aid in protein inference from bottom-up analysis and conversely where bottom-up peptide identifications aid in proteoform PTM localization. We also show the use of bottom-up data to infer proteoform candidates potentially present in the sample, allowing confirmation of such proteoform candidates by intact-mass analysis of MS1 spectra. The implementation of these capabilities in the freely available software program Proteoform Suite enables users to integrate large-scale top-down and bottom-up data sets and to utilize the synergies between them to improve and extend the proteomic analysis.
Collapse
Affiliation(s)
- Leah V Schaffer
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| | - Robert J Millikin
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| | - Michael R Shortreed
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| | - Mark Scalf
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| | - Lloyd M Smith
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| |
Collapse
|
10
|
Yang L, George J, Wang J. Deep Profiling of Cellular Heterogeneity by Emerging Single-Cell Proteomic Technologies. Proteomics 2020; 20:e1900226. [PMID: 31729152 PMCID: PMC7225074 DOI: 10.1002/pmic.201900226] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2019] [Revised: 10/14/2019] [Indexed: 12/20/2022]
Abstract
The ability to comprehensively profile cellular heterogeneity in functional proteome is crucial in advancing the understanding of cell behavior, organism development, and disease mechanisms. Conventional bulk measurement by averaging the biological responses across a population often loses the information of cellular variations. Single-cell proteomic technologies are becoming increasingly important to understand and discern cellular heterogeneity. The well-established methods for single-cell protein analysis based on flow cytometry and fluorescence microscopy are limited by the low multiplexing ability owing to the spectra overlap of fluorophores for labeling antibodies. Recent advances in mass spectrometry (MS), microchip, and reiterative staining-based techniques for single-cell proteomics have enabled the evaluation of cellular heterogeneity with high throughput, increased multiplexity, and improved sensitivity. In this review, the principles, developments, advantages, and limitations of these advanced technologies in analysis of single-cell proteins, along with their biological applications to study cellular heterogeneity, are described. At last, the remaining challenges, possible strategies, and future opportunities that will facilitate the improvement and broad applications of single-cell proteomic technologies in cell biology and medical research are discussed.
Collapse
Affiliation(s)
- Liwei Yang
- Multiplex Biotechnology Laboratory, Department of Biomedical Engineering, State University of New York at Stony Brook, Stony Brook, NY 11794
| | - Justin George
- Department of Chemistry, State University of New York, University at Albany, Albany, NY 12222
| | - Jun Wang
- Multiplex Biotechnology Laboratory, Department of Biomedical Engineering, State University of New York at Stony Brook, Stony Brook, NY 11794
| |
Collapse
|
11
|
Slavov N. Single-cell protein analysis by mass spectrometry. Curr Opin Chem Biol 2020; 60:1-9. [PMID: 32599342 DOI: 10.1016/j.cbpa.2020.04.018] [Citation(s) in RCA: 105] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2020] [Revised: 04/23/2020] [Accepted: 04/27/2020] [Indexed: 10/24/2022]
Abstract
Human physiology and pathology arise from the coordinated interactions of diverse single cells. However, analyzing single cells has been limited by the low sensitivity and throughput of analytical methods. DNA sequencing has recently made such analysis feasible for nucleic acids but single-cell protein analysis remains limited. Mass spectrometry is the most powerful method for protein analysis, but its application to single cells faces three major challenges: efficiently delivering proteins/peptides to mass spectrometry detectors, identifying their sequences, and scaling the analysis to many thousands of single cells. These challenges have motivated corresponding solutions, including SCoPE design multiplexing and clean, automated, and miniaturized sample preparation. Synergistically applied, these solutions enable quantifying thousands of proteins across many single cells and establish a solid foundation for further advances. Building upon this foundation, the SCoPE concept will enable analyzing subcellular organelles and posttranslational modifications, while increases in multiplexing capabilities will increase the throughput and decrease cost.
Collapse
Affiliation(s)
- Nikolai Slavov
- Department of Bioengineering, Northeastern University, Boston, MA, 02115, USA; Barnett Institute, Northeastern University, Boston, MA, 02115, USA; Department of Biology, Northeastern University, Boston, MA, 02115, USA.
| |
Collapse
|
12
|
Godfrey AK, Naqvi S, Chmátal L, Chick JM, Mitchell RN, Gygi SP, Skaletsky H, Page DC. Quantitative analysis of Y-Chromosome gene expression across 36 human tissues. Genome Res 2020; 30:860-873. [PMID: 32461223 PMCID: PMC7370882 DOI: 10.1101/gr.261248.120] [Citation(s) in RCA: 58] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2020] [Accepted: 05/18/2020] [Indexed: 02/07/2023]
Abstract
Little is known about how human Y-Chromosome gene expression directly contributes to differences between XX (female) and XY (male) individuals in nonreproductive tissues. Here, we analyzed quantitative profiles of Y-Chromosome gene expression across 36 human tissues from hundreds of individuals. Although it is often said that Y-Chromosome genes are lowly expressed outside the testis, we report many instances of elevated Y-Chromosome gene expression in a nonreproductive tissue. A notable example is EIF1AY, which encodes eukaryotic translation initiation factor 1A Y-linked, together with its X-linked homolog EIF1AX. Evolutionary loss of a Y-linked microRNA target site enabled up-regulation of EIF1AY, but not of EIF1AX, in the heart. Consequently, this essential translation initiation factor is nearly twice as abundant in male as in female heart tissue at the protein level. Divergence between the X and Y Chromosomes in regulatory sequence can therefore lead to tissue-specific Y-Chromosome-driven sex biases in expression of critical, dosage-sensitive regulatory genes.
Collapse
Affiliation(s)
- Alexander K Godfrey
- Whitehead Institute, Cambridge, Massachusetts 02142, USA.,Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
| | - Sahin Naqvi
- Whitehead Institute, Cambridge, Massachusetts 02142, USA.,Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
| | - Lukáš Chmátal
- Whitehead Institute, Cambridge, Massachusetts 02142, USA
| | - Joel M Chick
- Department of Cell Biology, Harvard Medical School, Boston, Massachusetts 02115, USA
| | - Richard N Mitchell
- Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts 02115, USA
| | - Steven P Gygi
- Department of Cell Biology, Harvard Medical School, Boston, Massachusetts 02115, USA
| | - Helen Skaletsky
- Whitehead Institute, Cambridge, Massachusetts 02142, USA.,Howard Hughes Medical Institute, Whitehead Institute, Cambridge, Massachusetts 02142, USA
| | - David C Page
- Whitehead Institute, Cambridge, Massachusetts 02142, USA.,Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA.,Howard Hughes Medical Institute, Whitehead Institute, Cambridge, Massachusetts 02142, USA
| |
Collapse
|
13
|
Yugandhar K, Gupta S, Yu H. Inferring Protein-Protein Interaction Networks From Mass Spectrometry-Based Proteomic Approaches: A Mini-Review. Comput Struct Biotechnol J 2019; 17:805-811. [PMID: 31316724 PMCID: PMC6611912 DOI: 10.1016/j.csbj.2019.05.007] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2019] [Revised: 05/20/2019] [Accepted: 05/26/2019] [Indexed: 01/06/2023] Open
Abstract
Studying protein-protein interaction networks provide key evidence for the underlying molecular mechanisms. Mass spectrometry-based proteomic approaches have been playing a pivotal role in deciphering these interaction networks, along with precise quantification for individual interactions. In this mini-review we discuss the available techniques and methods for qualitative and quantitative elucidation of protein-protein interaction networks. We then summarize the down-stream computational strategies for identification and quantification of interactions from those techniques. Finally, we highlight the challenges and limitations of current computational pipelines in eliminating false positive interactors, followed by a summary of the innovative algorithms to address these issues, along with the scope for future improvements.
Collapse
Affiliation(s)
- Kumar Yugandhar
- Department of Computational Biology, Cornell University, Ithaca, New York 14853, USA
- Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, New York 14853, USA
| | - Shagun Gupta
- Department of Computational Biology, Cornell University, Ithaca, New York 14853, USA
- Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, New York 14853, USA
| | - Haiyuan Yu
- Department of Computational Biology, Cornell University, Ithaca, New York 14853, USA
- Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, New York 14853, USA
| |
Collapse
|
14
|
Huffman RG, Chen A, Specht H, Slavov N. DO-MS: Data-Driven Optimization of Mass Spectrometry Methods. J Proteome Res 2019; 18:2493-2500. [PMID: 31081635 DOI: 10.1021/acs.jproteome.9b00039] [Citation(s) in RCA: 51] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
The performance of ultrasensitive liquid chromatography and tandem mass spectrometry (LC-MS/MS) methods, such as single-cell proteomics by mass spectrometry (SCoPE-MS), depends on multiple interdependent parameters. This interdependence makes it challenging to specifically pinpoint the sources of problems in the LC-MS/MS methods and approaches for resolving them. For example, a low signal at the MS2 level can be due to poor LC separation, ionization, apex targeting, ion transfer, or ion detection. We sought to specifically diagnose such problems by interactively visualizing data from all levels of bottom-up LC-MS/MS analysis. Many software packages, such as MaxQuant, already provide such data, and we developed an open source platform for their interactive visualization and analysis: Data-driven Optimization of MS (DO-MS). We found that in many cases DO-MS not only specifically diagnosed LC-MS/MS problems but also enabled us to rationally optimize them. For example, by using DO-MS to optimize the sampling of the elution peak apexes, we increased ion accumulation times and apex sampling, which resulted in a 370% more efficient delivery of ions for MS2 analysis. DO-MS is easy to install and use, and its GUI allows for interactive data subsetting and high-quality figure generation. The modular design of DO-MS facilitates customization and expansion. DO-MS v1.0.8 is available for download from GitHub: https://github.com/SlavovLab/DO-MS . Additional documentation is available at https://do-ms.slavovlab.net .
Collapse
Affiliation(s)
- R Gray Huffman
- Department of Bioengineering , Northeastern University , Boston , Massachusetts 02115 , United States.,Barnett Institute , Northeastern University , Boston , Massachusetts 02115 , United States
| | - Albert Chen
- Department of Bioengineering , Northeastern University , Boston , Massachusetts 02115 , United States.,Barnett Institute , Northeastern University , Boston , Massachusetts 02115 , United States
| | - Harrison Specht
- Department of Bioengineering , Northeastern University , Boston , Massachusetts 02115 , United States.,Barnett Institute , Northeastern University , Boston , Massachusetts 02115 , United States
| | - Nikolai Slavov
- Department of Bioengineering , Northeastern University , Boston , Massachusetts 02115 , United States.,Barnett Institute , Northeastern University , Boston , Massachusetts 02115 , United States.,Department of Biology , Northeastern University , Boston , Massachusetts 02115 , United States
| |
Collapse
|
15
|
Emmott E, Jovanovic M, Slavov N. Approaches for Studying Ribosome Specialization. Trends Biochem Sci 2019; 44:478-479. [PMID: 30792028 DOI: 10.1016/j.tibs.2019.01.008] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2019] [Accepted: 01/23/2019] [Indexed: 12/14/2022]
Abstract
Contrary to the textbook model, recent measurements demonstrated unexpected diversity in ribosomal composition that likely enables specialized translational functions. Methods based on liquid chromatography coupled to tandem mass-spectrometry (LC-MS/MS) enable direct quantification of ribosomal proteins with high specificity, accuracy, and throughput. LC-MS/MS can be 'top-down', analyzing intact proteins, or more commonly 'bottom-up', where proteins are digested to peptides prior to analysis. Changes to rRNA can be examined using either LC-MS/MS or sequencing-based approaches. The regulation of protein synthesis by specialized ribosomes can be examined by multiple methods. These include the popular 'Ribo-Seq' method for analyzing ribosome density on a given mRNA, as well as LC-MS/MS approaches incorporating pulse-labelling with stable isotopes (SILAC) to monitor protein synthesis and degradation.
Collapse
Affiliation(s)
- Edward Emmott
- Department of Bioengineering, Northeastern University, Boston, MA, USA; Barnett Institute for Chemical and Biological Analysis, Northeastern University, Boston, MA, USA
| | - Marko Jovanovic
- Department of Biological Sciences, Columbia University, New York, NY, USA
| | - Nikolai Slavov
- Department of Bioengineering, Northeastern University, Boston, MA, USA; Barnett Institute for Chemical and Biological Analysis, Northeastern University, Boston, MA, USA; Department of Biology, Northeastern University, Boston, MA, USA.
| |
Collapse
|
16
|
Emmott E, Jovanovic M, Slavov N. Ribosome Stoichiometry: From Form to Function. Trends Biochem Sci 2019; 44:95-109. [PMID: 30473427 PMCID: PMC6340777 DOI: 10.1016/j.tibs.2018.10.009] [Citation(s) in RCA: 57] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2018] [Revised: 08/27/2018] [Accepted: 10/20/2018] [Indexed: 12/11/2022]
Abstract
The existence of eukaryotic ribosomes with distinct ribosomal protein (RP) stoichiometry and regulatory roles in protein synthesis has been speculated for over 60 years. Recent advances in mass spectrometry (MS) and high-throughput analysis have begun to identify and characterize distinct ribosome stoichiometry in yeast and mammalian systems. In addition to RP stoichiometry, ribosomes host a vast array of protein modifications, effectively expanding the number of human RPs from 80 to many thousands of distinct proteoforms. Is it possible that these proteoforms combine to function as a 'ribosome code' to tune protein synthesis? We outline the specific benefits that translational regulation by specialized ribosomes can offer and discuss the means and methodologies available to correlate and characterize RP stoichiometry with function. We highlight previous research with a focus on formulating hypotheses that can guide future experiments and crack the ribosome code.
Collapse
Affiliation(s)
- Edward Emmott
- Department of Bioengineering, Northeastern University, Boston, MA, USA
| | - Marko Jovanovic
- Department of Biological Sciences, Columbia University, New York, NY, USA
| | - Nikolai Slavov
- Department of Bioengineering, Northeastern University, Boston, MA, USA; Department of Biology, Northeastern University, Boston, MA, USA.
| |
Collapse
|