51
|
Poudel S, Vanderwall D, Yuan ZF, Wu Z, Peng J, Li Y. JUMPptm: Integrated software for sensitive identification of post-translational modifications and its application in Alzheimer's disease study. Proteomics 2023; 23:e2100369. [PMID: 36094355 PMCID: PMC9957936 DOI: 10.1002/pmic.202100369] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2022] [Revised: 07/29/2022] [Accepted: 08/23/2022] [Indexed: 01/10/2023]
Abstract
BACKGROUND Mass spectrometry (MS)-based proteomic analysis of posttranslational modifications (PTMs) usually requires the pre-enrichment of modified proteins or peptides. However, recent ultra-deep whole proteome profiling generates millions of spectra in a single experiment, leaving many unassigned spectra, some of which may be derived from PTM peptides. METHODS Here we present JUMPptm, an integrative computational pipeline, to extract PTMs from unenriched whole proteome. JUMPptm combines the advantages of JUMP, MSFragger and Comet search engines, and includes de novo tags, customized database search and peptide filtering, which iteratively analyzes each PTM by a multi-stage strategy to improve sensitivity and specificity. RESULTS We applied JUMPptm to the deep brain proteome of Alzheimer's disease (AD), and identified 34,954 unique peptides with phosphorylation, methylation, acetylation, ubiquitination, and others. The phosphorylated peptides were validated by enriched phosphoproteome from the same sample. TMT-based quantification revealed 482 PTM peptides dysregulated at different stages during AD progression. For example, the acetylation of numerous mitochondrial proteins is significantly decreased in AD. A total of 60 PTM sites are found in the pan-PTM map of the Tau protein. CONCLUSION The JUMPptm program is an effective tool for pan-PTM analysis and the resulting AD pan-PTM profile serves as a valuable resource for AD research.
Collapse
Affiliation(s)
- Suresh Poudel
- Center for Proteomics and Metabolomics, St. Jude Children’s Research Hospital, Memphis, TN 38105, USA
| | - David Vanderwall
- Departments of Structural Biology and Developmental Neurobiology, St. Jude Children’s Research Hospital, Memphis, TN 38105, USA
| | - Zuo-Fei Yuan
- Center for Proteomics and Metabolomics, St. Jude Children’s Research Hospital, Memphis, TN 38105, USA
| | - Zhiping Wu
- Departments of Structural Biology and Developmental Neurobiology, St. Jude Children’s Research Hospital, Memphis, TN 38105, USA
| | - Junmin Peng
- Center for Proteomics and Metabolomics, St. Jude Children’s Research Hospital, Memphis, TN 38105, USA,Departments of Structural Biology and Developmental Neurobiology, St. Jude Children’s Research Hospital, Memphis, TN 38105, USA,Correspondence: and
| | - Yuxin Li
- Center for Proteomics and Metabolomics, St. Jude Children’s Research Hospital, Memphis, TN 38105, USA,Correspondence: and
| |
Collapse
|
52
|
DeRosa C, Weaver SD, Wang CW, Schuster-Little N, Whelan RJ. Simultaneous N-Deglycosylation and Digestion of Complex Samples on S-Traps Enables Efficient Glycosite Hypothesis Generation. ACS OMEGA 2023; 8:4410-4418. [PMID: 36743002 PMCID: PMC9893465 DOI: 10.1021/acsomega.2c08071] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/19/2022] [Accepted: 01/10/2023] [Indexed: 06/18/2023]
Abstract
N-linked glycosylation is an important post-translational modification that is difficult to identify and quantify in traditional bottom-up proteomics experiments. Enzymatic deglycosylation of proteins by peptide:N-glycosidase F (PNGase F) prior to digestion and subsequent mass spectrometry analysis has been shown to improve coverage of various N-linked glycopeptides, but the inclusion of this step may add up to a day to an already lengthy sample preparation process. An efficient way to integrate deglycosylation with bottom-up proteomics would be a valuable contribution to the glycoproteomics field. Here, we demonstrate a proteomics workflow in which deglycosylation and proteolytic digestion of samples occur simultaneously using suspension trapping (S-Trap). This approach adds no time to standard digestion protocols. Applying this sample preparation strategy to a human serum sample, we demonstrate improved identification of potential N-glycosylated peptides in deglycosylated samples compared with non-deglycosylated samples, identifying 156 unique peptides that contain the N-glycosylation motif (asparagine-X-serine/threonine), the deamidation modification characteristic of PNGase F, and an increase in peptide intensity over a control sample. We expect that this rapid sample preparation strategy will assist in the identification and quantification of both known and potential glycoproteins. Data are available via ProteomeXchange with the identifier PXD037921.
Collapse
Affiliation(s)
- Christine
M. DeRosa
- Department
of Chemistry and Biochemistry, University
of Notre Dame, Notre
Dame, Indiana 46556, United States
| | - Simon D. Weaver
- Department
of Chemistry and Biochemistry, University
of Notre Dame, Notre
Dame, Indiana 46556, United States
- Integrated
Biomedical Sciences Graduate Program, University
of Notre Dame, Notre Dame, Indiana 46656, United States
| | - Chien-Wei Wang
- Department
of Chemistry, University of Kansas, Lawrence, Kansas 66047, United States
| | | | - Rebecca J. Whelan
- Department
of Chemistry, University of Kansas, Lawrence, Kansas 66047, United States
| |
Collapse
|
53
|
Abstract
Proteins are the key biological actors within cells, driving many biological processes integral to both healthy and diseased states. Understanding the depth of complexity represented within the proteome is crucial to our scientific understanding of cellular biology and to provide disease specific insights for clinical applications. Mass spectrometry-based proteomics is the premier method for proteome analysis, with the ability to both identify and quantify proteins. Although proteomics continues to grow as a robust field of bioanalytical chemistry, advances are still necessary to enable a more comprehensive view of the proteome. In this review, we provide a broad overview of mass spectrometry-based proteomics in general, and highlight four developing areas of bottom-up proteomics: (1) protein inference, (2) alternative proteases, (3) sample-specific databases and (4) post-translational modification discovery.
Collapse
Affiliation(s)
- Rachel M Miller
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, USA.
| | - Lloyd M Smith
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, USA.
| |
Collapse
|
54
|
Weaver S, DeRosa CM, Schultz SR, Champion MM. PrIntMap-R: An Online Application for Intraprotein Intensity and Peptide Visualization from Bottom-Up Proteomics. J Proteome Res 2023; 22:432-441. [PMID: 36652611 PMCID: PMC9904286 DOI: 10.1021/acs.jproteome.2c00606] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
Abstract
Bottom-up proteomics (BUP) produces rich data, but visualization and analysis are time-consuming and often require programming skills. Many tools analyze these data at the proteome-level, but fewer options exist for individual proteins. Sequence coverage maps are common, but do not proportion peptide intensity. Abundance-based visualization of sequence coverage facilitates detection of protein isoforms, domains, potential truncation sites, peptide "hot-spots", and localization of post-translational modifications (PTMs). Redundant stacked-sequence coverage is an important tool in designing hydrogen-deuterium exchange (HDX) experiments. Visualization tools often lack graphical and tabular-export of processed data which complicates publication of results. Quantitative peptide abundance across amino acid sequences is an essential and missing tool in proteomics toolkits. Here we created PrIntMap-R, an online application that only requires peptide files from a database search and FASTA protein sequences. PrIntMap-R produces a variety of plots for quantitative visualization of coverage; annotation of specific sequences, PTM's, and comparisons of one or many samples overlaid with calculated fold-change or several intensity metrics. We show use-cases including protein phosphorylation, identification of glycosylation, and the optimization of digestion conditions for HDX experiments. PrIntMap-R is freely available, open source, and can run online with no installation, or locally by downloading source code from GitHub.
Collapse
Affiliation(s)
- Simon
D. Weaver
- Department
of Chemistry and Biochemistry, University
of Notre Dame, Notre
Dame, Indiana 46556, United States,Integrated
Biomedical Sciences Graduate Program, University
of Notre Dame, Notre
Dame, Indiana 46556, United States
| | - Christine M. DeRosa
- Department
of Chemistry and Biochemistry, University
of Notre Dame, Notre
Dame, Indiana 46556, United States
| | - Sadie R. Schultz
- Department
of Chemistry and Biochemistry, University
of Notre Dame, Notre
Dame, Indiana 46556, United States
| | - Matthew M. Champion
- Department
of Chemistry and Biochemistry, University
of Notre Dame, Notre
Dame, Indiana 46556, United States,Integrated
Biomedical Sciences Graduate Program, University
of Notre Dame, Notre
Dame, Indiana 46556, United States,
| |
Collapse
|
55
|
Kwok N, Aretz Z, Takao S, Ser Z, Cifani P, Kentsis A. Integrative proteogenomics using ProteomeGenerator2. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.01.04.522774. [PMID: 36711693 PMCID: PMC9882001 DOI: 10.1101/2023.01.04.522774] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
Recent advances in nucleic acid sequencing now permit rapid and genome-scale analysis of genetic variation and transcription, enabling population-scale studies of human biology, disease, and diverse organisms. Likewise, advances in mass spectrometry proteomics now permit highly sensitive and accurate studies of protein expression at the whole proteome-scale. However, most proteomic studies rely on consensus databases to match spectra to peptide and proteins sequences, and thus remain limited to the analysis of canonical protein sequences. Here, we develop ProteomeGenerator2 (PG2), based on the scalable and modular ProteomeGenerator framework. PG2 integrates genome and transcriptome sequencing to incorporate protein variants containing amino acid substitutions, insertions, and deletions, as well as non-canonical reading frames, exons, and other variants caused by genomic and transcriptomic variation. We benchmarked PG2 using synthetic data and genomic, transcriptomic, and proteomic analysis of human leukemia cells. PG2 can be integrated with current and emerging sequencing technologies, assemblers, variant callers, and mass spectral analysis algorithms, and is available open-source from https://github.com/kentsisresearchgroup/ProteomeGenerator2 .
Collapse
|
56
|
Miller RM, Millikin RJ, Rolfs Z, Shortreed MR, Smith LM. Enhanced Proteomic Data Analysis with MetaMorpheus. Methods Mol Biol 2023; 2426:35-66. [PMID: 36308684 PMCID: PMC9623450 DOI: 10.1007/978-1-0716-1967-4_3] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]
Abstract
MetaMorpheus is a free and open-source software program dedicated to the comprehensive analysis of proteomic data. In bottom-up proteomics, protein samples are digested into peptides prior to chromatographic separation and tandem mass spectrometric analysis. The resulting fragmentation spectra are subsequently analyzed with search software programs to obtain peptide identifications and infer the presence of proteins in the samples. MetaMorpheus seeks to maximize the information gleaned from proteomic data through the use of (a) mass calibration, (b) post-translational modification discovery, (c) multiple search algorithms, which aid in the analysis of data from traditional, crosslinking, and glycoproteomic experiments, (d) isotope-based or label-free quantification, (e) multi-protease protein inference, and (f) spectral annotation and data visualization capabilities. This protocol provides detailed descriptions of how use MetaMorpheus and how to customize data analysis workflows using MetaMorpheus tasks to meet the specific needs of the user.
Collapse
Affiliation(s)
- Rachel M Miller
- University of Wisconsin-Madison, Department of Chemistry, Madison, WI, USA
| | - Robert J Millikin
- University of Wisconsin-Madison, Department of Chemistry, Madison, WI, USA
| | - Zach Rolfs
- University of Wisconsin-Madison, Department of Chemistry, Madison, WI, USA
| | | | - Lloyd M Smith
- University of Wisconsin-Madison, Department of Chemistry, Madison, WI, USA.
| |
Collapse
|
57
|
Eirich J, Sindlinger J, Schön S, Schwarzer D, Finkemeier I. Peptide CoA conjugates for in situ proteomics profiling of acetyltransferase activities. Methods Enzymol 2023; 684:209-252. [DOI: 10.1016/bs.mie.2022.09.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/05/2023]
|
58
|
Ryu SY, Yun MP, Kim S. Integrating Multiple Quantitative Proteomic Analyses Using MetaMSD. Methods Mol Biol 2023; 2426:361-374. [PMID: 36308697 DOI: 10.1007/978-1-0716-1967-4_16] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
MetaMSD is a proteomic software that integrates multiple quantitative mass spectrometry data analysis results using statistical summary combination approaches. By utilizing this software, scientists can combine results from their pilot and main studies to maximize their biomarker discovery while effectively controlling false discovery rates. It also works for combining proteomic datasets generated by different labeling techniques and/or different types of mass spectrometry instruments. With these advantages, MetaMSD enables biological researchers to explore various proteomic datasets in public repositories to discover new biomarkers and generate interesting hypotheses for future studies. In this protocol, we provide a step-by-step procedure on how to install and perform a meta-analysis for quantitative proteomics using MetaMSD.
Collapse
Affiliation(s)
- So Young Ryu
- School of Public Health, University of Nevada Reno, Reno, NV, USA.
| | - Miriam P Yun
- Department of Psychology Institute for Neuroscience, University of Nevada Reno, Reno, NV, USA
| | - Sujung Kim
- School of Public Health, University of Nevada Reno, Reno, NV, USA
| |
Collapse
|
59
|
Koopmans F, Li KW, Klaassen RV, Smit AB. MS-DAP Platform for Downstream Data Analysis of Label-Free Proteomics Uncovers Optimal Workflows in Benchmark Data Sets and Increased Sensitivity in Analysis of Alzheimer's Biomarker Data. J Proteome Res 2022; 22:374-386. [PMID: 36541440 PMCID: PMC9903323 DOI: 10.1021/acs.jproteome.2c00513] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
In the rapidly moving proteomics field, a diverse patchwork of data analysis pipelines and algorithms for data normalization and differential expression analysis is used by the community. We generated a mass spectrometry downstream analysis pipeline (MS-DAP) that integrates both popular and recently developed algorithms for normalization and statistical analyses. Additional algorithms can be easily added in the future as plugins. MS-DAP is open-source and facilitates transparent and reproducible proteome science by generating extensive data visualizations and quality reporting, provided as standardized PDF reports. Second, we performed a systematic evaluation of methods for normalization and statistical analysis on a large variety of data sets, including additional data generated in this study, which revealed key differences. Commonly used approaches for differential testing based on moderated t-statistics were consistently outperformed by more recent statistical models, all integrated in MS-DAP. Third, we introduced a novel normalization algorithm that rescues deficiencies observed in commonly used normalization methods. Finally, we used the MS-DAP platform to reanalyze a recently published large-scale proteomics data set of CSF from AD patients. This revealed increased sensitivity, resulting in additional significant target proteins which improved overlap with results reported in related studies and includes a large set of new potential AD biomarkers in addition to previously reported.
Collapse
Affiliation(s)
- Frank Koopmans
- Department
of Molecular and Cellular Neurobiology, Center for Neurogenomics and
Cognitive Research, Amsterdam Neuroscience, VU University, 1081 HV Amsterdam, The Netherlands,
| | - Ka Wan Li
- Department
of Molecular and Cellular Neurobiology, Center for Neurogenomics and
Cognitive Research, Amsterdam Neuroscience, VU University, 1081 HV Amsterdam, The Netherlands
| | - Remco V. Klaassen
- Department
of Molecular and Cellular Neurobiology, Center for Neurogenomics and
Cognitive Research, Amsterdam Neuroscience, VU University, 1081 HV Amsterdam, The Netherlands
| | - August B. Smit
- Department
of Molecular and Cellular Neurobiology, Center for Neurogenomics and
Cognitive Research, Amsterdam Neuroscience, VU University, 1081 HV Amsterdam, The Netherlands
| |
Collapse
|
60
|
Banerjee P, Tan X, Russell WK, Kurie JM. Analysis of Golgi Secretory Functions in Cancer. Methods Mol Biol 2022; 2557:785-810. [PMID: 36512251 DOI: 10.1007/978-1-0716-2639-9_47] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Cancer cells utilize secretory pathways for paracrine signaling and extracellular matrix remodeling to facilitate directional cell migration, invasion, and metastasis. The Golgi apparatus is a central secretory signaling hub that is often deregulated in cancer. Here we described technologies that utilize microscopic, biochemical, and proteomic approaches to analyze Golgi secretory functions in genetically heterogeneous cancer cell lines.
Collapse
Affiliation(s)
- Priyam Banerjee
- Frits and Rita Markus Bio-Imaging Resource Center, The Rockefeller University, New York, NY, USA
| | - Xiaochao Tan
- Department of Thoracic/Head and Neck Medical Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - William K Russell
- Department of Biochemistry and Molecular Biology, The University of Texas Medical Branch, Galveston, TX, USA
| | - Jonathan M Kurie
- Department of Thoracic/Head and Neck Medical Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA.
| |
Collapse
|
61
|
Slade D, Hartl M. Analysis of Golgi Protein Acetylation Using In Vitro Assays and Parallel Reaction Monitoring Mass Spectrometry. Methods Mol Biol 2022; 2557:721-741. [PMID: 36512247 DOI: 10.1007/978-1-0716-2639-9_43] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Acetylation is one of the most abundant post-translational protein modifications that regulates all cellular compartments ranging from chromatin to cytoskeleton and Golgi. The dynamic acetylation of the Golgi stacking protein GRASP55 was shown to regulate Golgi reassembly after mitosis. Here we provide a detailed protocol for the analysis of Golgi acetylation including in vitro assays to detect protein acetylation and mass spectrometry analysis to identify specific acetylation sites and their relative abundance.
Collapse
Affiliation(s)
- Dea Slade
- Department of Medical Biochemistry, Max Perutz Labs, Vienna Biocenter, Vienna, Austria.
- Department of Radiation Oncology, Medical University of Vienna, Vienna, Austria.
- Comprehensive Cancer Center, Medical University of Vienna, Vienna, Austria.
| | - Markus Hartl
- Mass Spectrometry Facility, Max Perutz Labs, University of Vienna, Vienna Biocenter, Vienna, Austria.
- Department of Biochemistry and Cell Biology, Max Perutz Labs, University of Vienna, Vienna Biocenter, Vienna, Austria.
| |
Collapse
|
62
|
Weiss CA, Myers TM, Wu CH, Jenkins C, Sondermann H, Lee V, Winkler WC. NrnA is a 5'-3' exonuclease that processes short RNA substrates in vivo and in vitro. Nucleic Acids Res 2022; 50:12369-12388. [PMID: 36478094 PMCID: PMC9757072 DOI: 10.1093/nar/gkac1091] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2022] [Revised: 10/25/2022] [Accepted: 11/30/2022] [Indexed: 12/12/2022] Open
Abstract
Bacterial RNases process RNAs until only short oligomers (2-5 nucleotides) remain, which are then processed by one or more specialized enzymes until only nucleoside monophosphates remain. Oligoribonuclease (Orn) is an essential enzyme that acts in this capacity. However, many bacteria do not encode for Orn and instead encode for NanoRNase A (NrnA). Yet, the catalytic mechanism, cellular roles and physiologically relevant substrates have not been fully resolved for NrnA proteins. We herein utilized a common set of reaction assays to directly compare substrate preferences exhibited by NrnA-like proteins from Bacillus subtilis, Enterococcus faecalis, Streptococcus pyogenes and Mycobacterium tuberculosis. While the M. tuberculosis protein specifically cleaved cyclic di-adenosine monophosphate, the B. subtilis, E. faecalis and S. pyogenes NrnA-like proteins uniformly exhibited striking preference for short RNAs between 2-4 nucleotides in length, all of which were processed from their 5' terminus. Correspondingly, deletion of B. subtilis nrnA led to accumulation of RNAs between 2 and 4 nucleotides in length in cellular extracts. Together, these data suggest that many Firmicutes NrnA-like proteins are likely to resemble B. subtilis NrnA to act as a housekeeping enzyme for processing of RNAs between 2 and 4 nucleotides in length.
Collapse
Affiliation(s)
| | | | - Chih Hao Wu
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD 20742, USA
| | - Conor Jenkins
- Department of Chemistry and Biochemistry, University of Maryland, College Park, MD 20742, USA
| | - Holger Sondermann
- CSSB – Centre for Structural Systems Biology, Deutsches Elektronen-Synchrotron (DESY), 22607 Hamburg, Germany,Christian-Albrechts-Universität, 24118 Kiel, Germany
| | - Vincent T Lee
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD 20742, USA
| | - Wade C Winkler
- To whom correspondence should be addressed. Tel: +1 301 405 7780;
| |
Collapse
|
63
|
Riley NM, Bertozzi CR. Deciphering O-glycoprotease substrate preferences with O-Pair Search. Mol Omics 2022; 18:908-922. [PMID: 36373229 PMCID: PMC10010678 DOI: 10.1039/d2mo00244b] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
O-Glycoproteases are an emerging class of enzymes that selectively digest glycoproteins at positions decorated with specific O-linked glycans. O-Glycoprotease substrates range from any O-glycoprotein (albeit with specific O-glycan modifications) to only glycoproteins harboring specific O-glycosylated sequence motifs, such as those found in mucin domains. Their utility for multiple glycoproteomic applications is driving the search to both discover new O-glycoproteases and to understand how structural features of characterized O-glycoproteases influence their substrate specificities. One challenge of defining O-glycoprotease specificity restraints is the need to characterize O-glycopeptides with site-specific analysis of O-glycosites. Here, we demonstrate how O-Pair Search, a recently developed O-glycopeptide-centric identification platform that enables rapid searches and confident O-glycosite localization, can be used to determine substrate specificities of various O-glycoproteases de novo from LC-MS/MS data of O-glycopeptides. Using secreted protease of C1 esterase inhibitor (StcE) from enterohemorrhagic Escherichia coli and O-endoprotease OgpA from Akkermansia mucinophila, we explore numerous settings that effect O-glycopeptide identification and show how non-specific and semi-tryptic searches of O-glycopeptide data can produce candidate cleavage motifs. These putative motifs can be further used to define new protease cleavage settings that lower search times and improve O-glycopeptide identifications. We use this platform to generate a consensus motif for the recently characterized immunomodulating metalloprotease (IMPa) from Pseudomonas aeruginosa and show that IMPa is a favorable O-glycoprotease for characterizing densely O-glycosylated mucin-domain glycoproteins.
Collapse
Affiliation(s)
- Nicholas M Riley
- Department of Chemistry, Sarafan ChEM-H, Stanford University, Stanford, California, USA.
| | - Carolyn R Bertozzi
- Department of Chemistry, Sarafan ChEM-H, Stanford University, Stanford, California, USA.
- Howard Hughes Medical Institute, Stanford, California, USA
| |
Collapse
|
64
|
Seeing the complete picture: proteins in top-down mass spectrometry. Essays Biochem 2022; 67:283-300. [PMID: 36468679 DOI: 10.1042/ebc20220098] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2022] [Revised: 11/11/2022] [Accepted: 11/14/2022] [Indexed: 12/12/2022]
Abstract
Abstract
Top-down protein mass spectrometry can provide unique insights into protein sequence and structure, including precise proteoform identification and study of protein–ligand and protein–protein interactions. In contrast with the commonly applied bottom-up approach, top-down approaches do not include digestion of the protein of interest into small peptides, but instead rely on the ionization and subsequent fragmentation of intact proteins. As such, it is fundamentally the only way to fully characterize the composition of a proteoform. Here, we provide an overview of how a top-down protein mass spectrometry experiment is performed and point out recent applications from the literature to the reader. While some parts of the top-down workflow are broadly applicable, different research questions are best addressed with specific experimental designs. The most important divide is between studies that prioritize sequence information (i.e., proteoform identification) versus structural information (e.g., conformational studies, or mapping protein–protein or protein–ligand interactions). Another important consideration is whether to work under native or denaturing solution conditions, and the overall complexity of the sample also needs to be taken into account, as it determines whether (chromatographic) separation is required prior to MS analysis. In this review, we aim to provide enough information to support both newcomers and more experienced readers in the decision process of how to answer a potential research question most efficiently and to provide an overview of the methods that exist to answer these questions.
Collapse
|
65
|
Lenčo J, Jadeja S, Naplekov DK, Krokhin OV, Khalikova MA, Chocholouš P, Urban J, Broeckhoven K, Nováková L, Švec F. Reversed-Phase Liquid Chromatography of Peptides for Bottom-Up Proteomics: A Tutorial. J Proteome Res 2022; 21:2846-2892. [PMID: 36355445 DOI: 10.1021/acs.jproteome.2c00407] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
The performance of the current bottom-up liquid chromatography hyphenated with mass spectrometry (LC-MS) analyses has undoubtedly been fueled by spectacular progress in mass spectrometry. It is thus not surprising that the MS instrument attracts the most attention during LC-MS method development, whereas optimizing conditions for peptide separation using reversed-phase liquid chromatography (RPLC) remains somewhat in its shadow. Consequently, the wisdom of the fundaments of chromatography is slowly vanishing from some laboratories. However, the full potential of advanced MS instruments cannot be achieved without highly efficient RPLC. This is impossible to attain without understanding fundamental processes in the chromatographic system and the properties of peptides important for their chromatographic behavior. We wrote this tutorial intending to give practitioners an overview of critical aspects of peptide separation using RPLC to facilitate setting the LC parameters so that they can leverage the full capabilities of their MS instruments. After briefly introducing the gradient separation of peptides, we discuss their properties that affect the quality of LC-MS chromatograms the most. Next, we address the in-column and extra-column broadening. The last section is devoted to key parameters of LC-MS methods. We also extracted trends in practice from recent bottom-up proteomics studies and correlated them with the current knowledge on peptide RPLC separation.
Collapse
Affiliation(s)
- Juraj Lenčo
- Department of Analytical Chemistry, Faculty of Pharmacy in Hradec Králové, Charles University, Heyrovského 1203/8, 500 05Hradec Králové, Czech Republic
| | - Siddharth Jadeja
- Department of Analytical Chemistry, Faculty of Pharmacy in Hradec Králové, Charles University, Heyrovského 1203/8, 500 05Hradec Králové, Czech Republic
| | - Denis K Naplekov
- Department of Analytical Chemistry, Faculty of Pharmacy in Hradec Králové, Charles University, Heyrovského 1203/8, 500 05Hradec Králové, Czech Republic
| | - Oleg V Krokhin
- Department of Internal Medicine, Manitoba Centre for Proteomics and Systems Biology, University of Manitoba, 799 JBRC, 715 McDermot Avenue, WinnipegR3E 3P4, Manitoba, Canada
| | - Maria A Khalikova
- Department of Analytical Chemistry, Faculty of Pharmacy in Hradec Králové, Charles University, Heyrovského 1203/8, 500 05Hradec Králové, Czech Republic
| | - Petr Chocholouš
- Department of Analytical Chemistry, Faculty of Pharmacy in Hradec Králové, Charles University, Heyrovského 1203/8, 500 05Hradec Králové, Czech Republic
| | - Jiří Urban
- Department of Chemistry, Faculty of Science, Masaryk University, Kamenice 5, 625 00Brno, Czech Republic
| | - Ken Broeckhoven
- Department of Chemical Engineering (CHIS), Faculty of Engineering, Vrije Universiteit Brussel, Pleinlaan 2, 1050Brussel, Belgium
| | - Lucie Nováková
- Department of Analytical Chemistry, Faculty of Pharmacy in Hradec Králové, Charles University, Heyrovského 1203/8, 500 05Hradec Králové, Czech Republic
| | - František Švec
- Department of Analytical Chemistry, Faculty of Pharmacy in Hradec Králové, Charles University, Heyrovského 1203/8, 500 05Hradec Králové, Czech Republic
| |
Collapse
|
66
|
An extracellular receptor tyrosine kinase motif orchestrating intracellular STAT activation. Nat Commun 2022; 13:6953. [PMID: 36376313 PMCID: PMC9663514 DOI: 10.1038/s41467-022-34539-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2020] [Accepted: 10/27/2022] [Indexed: 11/16/2022] Open
Abstract
The ErbB4 receptor isoforms JM-a and JM-b differ within their extracellular juxtamembrane (eJM) domains. Here, ErbB4 isoforms are used as a model to address the effect of structural variation in the eJM domain of receptor tyrosine kinases (RTK) on downstream signaling. A specific JM-a-like sequence motif is discovered, and its presence or absence (in JM-b-like RTKs) in the eJM domains of several RTKs is demonstrated to dictate selective STAT activation. STAT5a activation by RTKs including the JM-a like motif is shown to involve interaction with oligosaccharides of N-glycosylated cell surface proteins such as β1 integrin, whereas STAT5b activation by JM-b is dependent on TYK2. ErbB4 JM-a- and JM-b-like RTKs are shown to associate with specific signaling complexes at different cell surface compartments using analyses of RTK interactomes and super-resolution imaging. These findings provide evidence for a conserved mechanism linking a ubiquitous extracellular motif in RTKs with selective intracellular STAT signaling.
Collapse
|
67
|
Dai Y, Millikin R, Rolfs Z, Shortreed MR, Smith LM. A Hybrid Spectral Library and Protein Sequence Database Search Strategy for Bottom-Up and Top-Down Proteomic Data Analysis. J Proteome Res 2022; 21:2609-2618. [PMID: 36206157 PMCID: PMC9869658 DOI: 10.1021/acs.jproteome.2c00305] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
Tandem mass spectrometry (MS/MS) is widely employed for the analysis of complex proteomic samples. While protein sequence database searching and spectral library searching are both well-established peptide identification methods, each has shortcomings. Protein sequence databases lack fragment peak intensity information, which can result in poor discrimination between correct and incorrect spectrum assignments. Spectral libraries usually contain fewer peptides than protein sequence databases, which limits the number of peptides that can be identified. Notably, few post-translationally modified peptides are represented in spectral libraries. This is because few search engines can both identify a broad spectrum of PTMs and create corresponding spectral libraries. Also, programs that generate spectral libraries using deep learning approaches are not yet able to accurately predict spectra for the vast majority of PTMs. Here, we address these limitations through use of a hybrid search strategy that combines protein sequence database and spectral library searches to improve identification success rates and sensitivity. This software uses Global PTM Discovery (G-PTM-D) to produce spectral libraries for a wide variety of different PTMs. These features, along with a new spectrum annotation and visualization tool, have been integrated into the freely available and open-source search engine MetaMorpheus.
Collapse
Affiliation(s)
- Yuling Dai
- Department of Chemistry, University of Wisconsin, 1101 University Avenue, Madison, Wisconsin 53706, United States
| | - Robert Millikin
- Department of Chemistry, University of Wisconsin, 1101 University Avenue, Madison, Wisconsin 53706, United States
| | - Zach Rolfs
- Department of Chemistry, University of Wisconsin, 1101 University Avenue, Madison, Wisconsin 53706, United States
| | - Michael R. Shortreed
- Department of Chemistry, University of Wisconsin, 1101 University Avenue, Madison, Wisconsin 53706, United States
| | - Lloyd M. Smith
- Department of Chemistry, University of Wisconsin, 1101 University Avenue, Madison, Wisconsin 53706, United States
| |
Collapse
|
68
|
Schoffman H, Levin Y, Itzhaki-Alfia A, Tselekovits L, Gonen L, Vainer GW, Hout-Siloni G, Barshack I, Cohen ZR, Margalit N, Shahar T. Comparison of matched formalin-fixed paraffin embedded and fresh frozen meningioma tissue reveals bias in proteomic profiles. Proteomics 2022; 22:e2200085. [PMID: 36098096 DOI: 10.1002/pmic.202200085] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2022] [Revised: 08/26/2022] [Accepted: 08/26/2022] [Indexed: 12/29/2022]
Abstract
Tissue biopsies are most commonly archived in a paraffin block following tissue fixation with formaldehyde (FFPE) or as fresh frozen tissue (FFT). While both methods preserve biological samples, little is known about how they affect the quantifiable proteome. We performed a 'bottom-up' proteomic analysis (N = 20) of short and long-term archived FFPE surgical samples of human meningiomas and compared them to matched FFT specimens. FFT facilitated a similar number of proteins assigned by MetaMorpheus compared with matched FFPE specimens (5378 vs. 5338 proteins, respectively (p = 0.053), regardless of archival time. However, marked differences in the proteome composition were apparent between FFPE and FFT specimens. Twenty-three percent of FFPE-derived peptides and 8% of FFT-derived peptides contained at least one chemical modification. Methylation and formylation were most prominent in FFPE-derived peptides (36% and 17% of modified FFPE peptides, respectively) while, most of phosphorylation and iron modifications appeared in FFT-derived peptides (p < 0.001). A mean 14% (± 2.9) of peptides identified in FFPE contained at least one modified Lysine residue. Importantly, larger proteins were significantly overrepresented in FFT specimens, while FFPE specimens were enriched with smaller proteins.
Collapse
Affiliation(s)
- Hanan Schoffman
- Laboratory of Molecular Neuro Oncology, Neurosurgery Department, Shaare Zedek Medical Center, Jerusalem, Israel
| | - Yishai Levin
- de Botton Institute for Protein Profiling, The Nancy and Stephen Grand Israel National Center for Personalized Medicine, Weizmann Institute of Science, Rehovot, Israel
| | | | - Lea Tselekovits
- Laboratory of Molecular Neuro Oncology, Neurosurgery Department, Shaare Zedek Medical Center, Jerusalem, Israel
| | - Lior Gonen
- Neurosurgery Department, Shaare Zedek Medical Center, Hebrew University Medical School, Jerusalem, Israel
| | - Gilad Wolf Vainer
- Department of Pathology, Hadassah Hebrew University Medical School, Jerusalem, Israel
| | - Goni Hout-Siloni
- Department of Pathology, Sheba Medical Center, Ramat Gan, Israel
| | - Iris Barshack
- Department of Pathology, Sheba Medical Center, Ramat Gan, Israel.,Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Zvi R Cohen
- Department of Neurosurgery, Sheba Medical Center, Ramat Gan, Israel.,Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Nevo Margalit
- Neurosurgery Department, Shaare Zedek Medical Center, Hebrew University Medical School, Jerusalem, Israel
| | - Tal Shahar
- Laboratory of Molecular Neuro Oncology, Neurosurgery Department, Shaare Zedek Medical Center, Jerusalem, Israel.,Neurosurgery Department, Shaare Zedek Medical Center, Hebrew University Medical School, Jerusalem, Israel
| |
Collapse
|
69
|
A solution to the long-standing problem of actin expression and purification. Proc Natl Acad Sci U S A 2022; 119:e2209150119. [PMID: 36197995 PMCID: PMC9565351 DOI: 10.1073/pnas.2209150119] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Actin is the most abundant protein in the cytoplasm of eukaryotic cells and interacts with hundreds of proteins to perform essential functions, including cell motility and cytokinesis. Numerous diseases are caused by mutations in actin, but studying the biochemistry of actin mutants is difficult without a reliable method to obtain recombinant actin. Moreover, biochemical studies have typically used tissue-purified α-actin, whereas humans express six isoforms that are nearly identical but perform specialized functions and are difficult to obtain in isolation from natural sources. Here, we describe a solution to the problem of actin expression and purification. We obtain high yields of actin isoforms in human Expi293F cells. Experiments along the multistep purification protocol demonstrate the removal of endogenous actin and the functional integrity of recombinant actin isoforms. Proteomics analysis of endogenous vs. recombinant actin isoforms confirms the presence of native posttranslational modifications, including N-terminal acetylation achieved after affinity-tag removal using the actin-specific enzyme Naa80. The method described facilitates studies of actin under fully native conditions to determine differences among isoforms and the effects of disease-causing mutations that occur in all six isoforms.
Collapse
|
70
|
Suddhapas K, Choi MH, Shortreed MR, Timperman A. Evaluation of Variant-Specific Peptides for Detection of SARS-CoV-2 Variants of Concern. J Proteome Res 2022; 21:2443-2452. [PMID: 36108102 PMCID: PMC10318299 DOI: 10.1021/acs.jproteome.2c00325] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The SARS-CoV-2 omicron variant presented significant challenges to the global effort to counter the pandemic. SARS-CoV-2 is predicted to remain prevalent for the foreseeable future, making the ability to identify SARS-CoV-2 variants imperative in understanding and controlling the pandemic. The predominant variant discovery method, genome sequencing, is time-consuming, insensitive, and expensive. Ultraperformance liquid chromatography-mass spectrometry (UPLC-MS) offers an exciting alternative detection modality provided that variant-containing peptide markers are sufficiently detectable from their tandem mass spectra (MS/MS). We have synthesized model tryptic peptides of SARS-CoV-2 variants alpha, beta, gamma, delta, and omicron and evaluated their signal intensity, HCD spectra, and reverse phase retention time. Detection limits of 781, 781, 65, and 65 amol are obtained for the molecular ions of the proteotypic peptides, beta (QIAPGQTGNIADYNYK), gamma (TQLPSAYTNSFTR), delta (VGGNYNYR), and omicron (TLVKQLSSK), from neat solutions. These detection limits are on par with the detection limits of a previously reported proteotypic peptide from the SARS-CoV-2 spike protein, HTPINLVR. This study demonstrates the potential to differentiate SARS-CoV-2 variants through their proteotypic peptides with an approach that is broadly applicable across a wide range of pathogens.
Collapse
Affiliation(s)
- Kantaphon Suddhapas
- Department of Bioengineering, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States
| | - M Hannah Choi
- Department of Bioengineering, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States
| | - Michael R Shortreed
- Department of Chemistry, University of Wisconsin, Madison, Wisconsin 53706, United States
| | - AaronT Timperman
- Department of Bioengineering, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States
- Department of Biochemistry and Biophysics, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States
| |
Collapse
|
71
|
Frankenfield AM, Ni J, Ahmed M, Hao L. Protein Contaminants Matter: Building Universal Protein Contaminant Libraries for DDA and DIA Proteomics. J Proteome Res 2022; 21:2104-2113. [PMID: 35793413 PMCID: PMC10040255 DOI: 10.1021/acs.jproteome.2c00145] [Citation(s) in RCA: 50] [Impact Index Per Article: 25.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Mass spectrometry-based proteomics is constantly challenged by the presence of contaminant background signals. In particular, protein contaminants from reagents and sample handling are almost impossible to avoid. For data-dependent acquisition (DDA) proteomics, an exclusion list can be used to reduce the influence of protein contaminants. However, protein contamination has not been evaluated and is rarely addressed in data-independent acquisition (DIA). How protein contaminants influence proteomic data is also unclear. In this study, we established new protein contaminant FASTA and spectral libraries that are applicable to all proteomic workflows and evaluated the impact of protein contaminants on both DDA and DIA proteomics. We demonstrated that including our contaminant libraries can reduce false discoveries and increase protein identifications, without influencing the quantification accuracy in various proteomic software platforms. With the pressing need to standardize proteomic workflow in the research community, we highly recommend including our contaminant FASTA and spectral libraries in all bottom-up proteomic data analysis. Our contaminant libraries and a step-by-step tutorial to incorporate these libraries in various DDA and DIA data analysis platforms can be valuable resources for proteomic researchers, freely accessible at https://github.com/HaoGroup-ProtContLib.
Collapse
Affiliation(s)
- Ashley M Frankenfield
- Department of Chemistry, The George Washington University, Science and Engineering Hall 4000, 800, 22nd St., Northwest, Washington, DC 20052, United States
| | - Jiawei Ni
- Department of Chemistry, The George Washington University, Science and Engineering Hall 4000, 800, 22nd St., Northwest, Washington, DC 20052, United States
| | - Mustafa Ahmed
- Department of Chemistry, The George Washington University, Science and Engineering Hall 4000, 800, 22nd St., Northwest, Washington, DC 20052, United States
| | - Ling Hao
- Department of Chemistry, The George Washington University, Science and Engineering Hall 4000, 800, 22nd St., Northwest, Washington, DC 20052, United States
| |
Collapse
|
72
|
Brewitz L, Onisko BC, Schofield CJ. Combined proteomic and biochemical analyses redefine the consensus sequence requirement for epidermal growth factor-like domain hydroxylation. J Biol Chem 2022; 298:102129. [PMID: 35700824 PMCID: PMC9293771 DOI: 10.1016/j.jbc.2022.102129] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2022] [Revised: 06/07/2022] [Accepted: 06/09/2022] [Indexed: 11/16/2022] Open
Abstract
Epidermal growth factor-like domains (EGFDs) have important functions in cell-cell signaling. Both secreted and cell surface human EGFDs are subject to extensive modifications, including aspartate and asparagine residue C3-hydroxylations catalyzed by the 2-oxoglutarate oxygenase aspartate/asparagine-β-hydroxylase (AspH). Although genetic studies show AspH is important in human biology, studies on its physiological roles have been limited by incomplete knowledge of its substrates. Here, we redefine the consensus sequence requirements for AspH-catalyzed EGFD hydroxylation based on combined analysis of proteomic mass spectrometric data and mass spectrometry-based assays with isolated AspH and peptide substrates. We provide cellular and biochemical evidence that the preferred site of EGFD hydroxylation is embedded within a disulfide-bridged macrocycle formed of 10 amino acid residues. This definition enabled the identification of previously unassigned hydroxylation sites in three EGFDs of human fibulins as AspH substrates. A non-EGFD containing protein, lymphocyte antigen-6/plasminogen activator urokinase receptor domain containing protein 6B (LYPD6B) was shown to be a substrate for isolated AspH, but we did not observe evidence for LYPD6B hydroxylation in cells. AspH-catalyzed hydroxylation of fibulins is of particular interest given their important roles in extracellular matrix dynamics. In conclusion, these results lead to a revision of the consensus substrate requirements for AspH and expand the range of observed and potential AspH-catalyzed hydroxylation in cells, which will enable future study of the biological roles of AspH.
Collapse
Affiliation(s)
- Lennart Brewitz
- Chemistry Research Laboratory, Department of Chemistry and the Ineos Oxford Institute for Antimicrobial Research, University of Oxford, Oxford, United Kingdom.
| | | | - Christopher J Schofield
- Chemistry Research Laboratory, Department of Chemistry and the Ineos Oxford Institute for Antimicrobial Research, University of Oxford, Oxford, United Kingdom.
| |
Collapse
|
73
|
Hamid Z, Zimmerman KD, Guillen-Ahlers H, Li C, Nathanielsz P, Cox LA, Olivier M. Assessment of label-free quantification and missing value imputation for proteomics in non-human primates. BMC Genomics 2022; 23:496. [PMID: 35804317 PMCID: PMC9264528 DOI: 10.1186/s12864-022-08723-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2021] [Accepted: 06/23/2022] [Indexed: 11/25/2022] Open
Abstract
BACKGROUND Reliable and effective label-free quantification (LFQ) analyses are dependent not only on the method of data acquisition in the mass spectrometer, but also on the downstream data processing, including software tools, query database, data normalization and imputation. In non-human primates (NHP), LFQ is challenging because the query databases for NHP are limited since the genomes of these species are not comprehensively annotated. This invariably results in limited discovery of proteins and associated Post Translational Modifications (PTMs) and a higher fraction of missing data points. While identification of fewer proteins and PTMs due to database limitations can negatively impact uncovering important and meaningful biological information, missing data also limits downstream analyses (e.g., multivariate analyses), decreases statistical power, biases statistical inference, and makes biological interpretation of the data more challenging. In this study we attempted to address both issues: first, we used the MetaMorphues proteomics search engine to counter the limits of NHP query databases and maximize the discovery of proteins and associated PTMs, and second, we evaluated different imputation methods for accurate data inference. We used a generic approach for missing data imputation analysis without distinguising the potential source of missing data (either non-assigned m/z or missing values across runs). RESULTS Using the MetaMorpheus proteomics search engine we obtained quantitative data for 1622 proteins and 10,634 peptides including 58 different PTMs (biological, metal and artifacts) across a diverse age range of NHP brain frontal cortex. However, among the 1622 proteins identified, only 293 proteins were quantified across all samples with no missing values, emphasizing the importance of implementing an accurate and statiscaly valid imputation method to fill in missing data. In our imputation analysis we demonstrate that Single Imputation methods that borrow information from correlated proteins such as Generalized Ridge Regression (GRR), Random Forest (RF), local least squares (LLS), and a Bayesian Principal Component Analysis methods (BPCA), are able to estimate missing protein abundance values with great accuracy. CONCLUSIONS Overall, this study offers a detailed comparative analysis of LFQ data generated in NHP and proposes strategies for improved LFQ in NHP proteomics data.
Collapse
Affiliation(s)
- Zeeshan Hamid
- Center for Precision Medicine, Wake Forest University School of Medicine, Winston-Salem, NC, USA
| | - Kip D Zimmerman
- Center for Precision Medicine, Wake Forest University School of Medicine, Winston-Salem, NC, USA
| | - Hector Guillen-Ahlers
- Center for Precision Medicine, Wake Forest University School of Medicine, Winston-Salem, NC, USA
| | - Cun Li
- Southwest National Primate Research Center, San Antonio, TX, USA
- Department of Animal Science, University of Wyoming, Laramie, WY, USA
| | - Peter Nathanielsz
- Southwest National Primate Research Center, San Antonio, TX, USA
- Department of Animal Science, University of Wyoming, Laramie, WY, USA
| | - Laura A Cox
- Center for Precision Medicine, Wake Forest University School of Medicine, Winston-Salem, NC, USA
- Southwest National Primate Research Center, San Antonio, TX, USA
| | - Michael Olivier
- Center for Precision Medicine, Wake Forest University School of Medicine, Winston-Salem, NC, USA.
| |
Collapse
|
74
|
Woerner AE, Crysup B, Hewitt FC, Gardner MW, Freitas MA, Budowle B. Techniques for estimating genetically variable peptides and semi-continuous likelihoods from massively parallel sequencing data. Forensic Sci Int Genet 2022; 59:102719. [DOI: 10.1016/j.fsigen.2022.102719] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2022] [Revised: 04/25/2022] [Accepted: 05/01/2022] [Indexed: 11/25/2022]
|
75
|
Perez-Riverol Y. Proteomic repository data submission, dissemination, and reuse: key messages. Expert Rev Proteomics 2022; 19:297-310. [PMID: 36529941 PMCID: PMC7614296 DOI: 10.1080/14789450.2022.2160324] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2022] [Accepted: 12/07/2022] [Indexed: 12/23/2022]
Abstract
INTRODUCTION The creation of ProteomeXchange data workflows in 2012 transformed the field of proteomics, consisting of the standardization of data submission and dissemination and enabling the widespread reanalysis of public MS proteomics data worldwide. ProteomeXchange has triggered a growing trend toward public dissemination of proteomics data, facilitating the assessment, reuse, comparative analyses, and extraction of new findings from public datasets. By 2022, the consortium is integrated by PRIDE, PeptideAtlas, MassIVE, jPOST, iProX, and Panorama Public. AREAS COVERED Here, we review and discuss the current ecosystem of resources, guidelines, and file formats for proteomics data dissemination and reanalysis. Special attention is drawn to new exciting quantitative and post-translational modification-oriented resources. The challenges and future directions on data depositions including the lack of metadata and cloud-based and high-performance software solutions for fast and reproducible reanalysis of the available data are discussed. EXPERT OPINION The success of ProteomeXchange and the amount of proteomics data available in the public domain have triggered the creation and/or growth of other protein knowledgebase resources. Data reuse is a leading, active, and evolving field; supporting the creation of new formats, tools, and workflows to rediscover and reshape the public proteomics data.
Collapse
Affiliation(s)
- Yasset Perez-Riverol
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, UK
| |
Collapse
|
76
|
Abstract
Mucin domains are densely O-glycosylated modular protein domains found in various extracellular and transmembrane proteins. Mucin-domain glycoproteins play important roles in many human diseases, such as cancer and cystic fibrosis, but the scope of the mucinome remains poorly defined. Recently, we characterized a bacterial O-glycoprotease, StcE, and demonstrated that an inactive point mutant retains binding selectivity for mucin-domain glycoproteins. In this work, we leverage inactive StcE to selectively enrich and identify mucin-domain glycoproteins from complex samples like cell lysate and crude ovarian cancer patient ascites fluid. Our enrichment strategy is further aided by an algorithm to assign confidence to mucin-domain glycoprotein identifications. This mucinomics platform facilitates detection of hundreds of glycopeptides from mucin domains and highly overlapping populations of mucin-domain glycoproteins from ovarian cancer patients. Ultimately, we demonstrate our mucinomics approach can reveal key molecular signatures of cancer from in vitro and ex vivo sources. Mucin-domain glycoproteins are densely O-glycosylated proteins with unique secondary structure that imparts a large influence on cellular environments. Here, the authors develop a technique to selectively enrich and characterize mucin-domain glycoproteins from cell lysate and patient biofluids.
Collapse
|
77
|
Vasconcellos AF, Melo RM, Mandacaru SC, de Oliveira LS, de Oliveira AS, Moraes ECDS, Trugilho MRDO, Ricart CAO, Báo SN, Resende RO, Charneau S. Aedes aegypti Aag-2 Cell Proteome Modulation in Response to Chikungunya Virus Infection. Front Cell Infect Microbiol 2022; 12:920425. [PMID: 35782121 PMCID: PMC9240781 DOI: 10.3389/fcimb.2022.920425] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2022] [Accepted: 05/18/2022] [Indexed: 01/16/2023] Open
Abstract
Chikungunya virus (CHIKV) is a single-stranded positive RNA virus that belongs to the genus Alphavirus and is transmitted to humans by infected Aedes aegypti and Aedes albopictus bites. In humans, CHIKV usually causes painful symptoms during acute and chronic stages of infection. Conversely, virus–vector interaction does not disturb the mosquito’s fitness, allowing a persistent infection. Herein, we studied CHIKV infection of Ae. aegypti Aag-2 cells (multiplicity of infection (MOI) of 0.1) for 48 h through label-free quantitative proteomic analysis and transmission electron microscopy (TEM). TEM images showed a high load of intracellular viral cargo at 48 h postinfection (hpi), as well as an unusual elongated mitochondria morphology that might indicate a mitochondrial imbalance. Proteome analysis revealed 196 regulated protein groups upon infection, which are related to protein synthesis, energy metabolism, signaling pathways, and apoptosis. These Aag-2 proteins regulated during CHIKV infection might have roles in antiviral and/or proviral mechanisms and the balance between viral propagation and the survival of host cells, possibly leading to the persistent infection.
Collapse
Affiliation(s)
- Anna Fernanda Vasconcellos
- Laboratory of Biochemistry and Protein Chemistry, Department of Cell Biology, Institute of Biology, University of Brasilia, Brasilia, Brazil
- Laboratory of Virology, Department of Cell Biology, Institute of Biology, University of Brasilia, Brasilia, Brazil
| | - Reynaldo Magalhães Melo
- Laboratory of Biochemistry and Protein Chemistry, Department of Cell Biology, Institute of Biology, University of Brasilia, Brasilia, Brazil
| | - Samuel Coelho Mandacaru
- Laboratory of Toxinology and Center for Technological Development in Health, Oswaldo Cruz Foundation, Rio de Janeiro, Brazil
| | - Lucas Silva de Oliveira
- Laboratory of Biochemistry and Protein Chemistry, Department of Cell Biology, Institute of Biology, University of Brasilia, Brasilia, Brazil
| | - Athos Silva de Oliveira
- Laboratory of Virology, Department of Cell Biology, Institute of Biology, University of Brasilia, Brasilia, Brazil
| | | | | | - Carlos André Ornelas Ricart
- Laboratory of Biochemistry and Protein Chemistry, Department of Cell Biology, Institute of Biology, University of Brasilia, Brasilia, Brazil
| | - Sônia Nair Báo
- Laboratory of Microscopy and Microanalysis, Department of Cell Biology, Institute of Biology, University of Brasilia, Brasilia, Brazil
| | - Renato Oliveira Resende
- Laboratory of Virology, Department of Cell Biology, Institute of Biology, University of Brasilia, Brasilia, Brazil
- *Correspondence: Sébastien Charneau, ; Renato Oliveira Resende,
| | - Sébastien Charneau
- Laboratory of Biochemistry and Protein Chemistry, Department of Cell Biology, Institute of Biology, University of Brasilia, Brasilia, Brazil
- *Correspondence: Sébastien Charneau, ; Renato Oliveira Resende,
| |
Collapse
|
78
|
Black MH, Gradowski M, Pawłowski K, Tagliabracci VS. Methods for discovering catalytic activities for pseudokinases. Methods Enzymol 2022; 667:575-610. [PMID: 35525554 DOI: 10.1016/bs.mie.2022.03.047] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
Pseudoenzymes resemble active enzymes, but lack key catalytic residues believed to be required for activity. Many pseudoenzymes appear to be inactive in conventional enzyme assays. However, an alternative explanation for their apparent lack of activity is that pseudoenzymes are being assayed for the wrong reaction. We have discovered several new protein kinase-like families which have revealed how different binding orientations of adenosine triphosphate (ATP) and active site residue migration can generate a novel reaction from a common kinase scaffold. These results have exposed the catalytic versatility of the protein kinase fold and suggest that atypical kinases and pseudokinases should be analyzed for alternative transferase activities. In this chapter, we discuss a general approach for bioinformatically identifying divergent or atypical members of an enzyme superfamily, then present an experimental approach to characterize their catalytic activity.
Collapse
Affiliation(s)
- Miles H Black
- Department of Molecular Biology, University of Texas Southwestern Medical Center, Dallas, TX, United States
| | - Marcin Gradowski
- Department of Biochemistry and Microbiology, Institute of Biology, Warsaw University of Life Sciences, Warsaw, Poland
| | - Krzysztof Pawłowski
- Department of Molecular Biology, University of Texas Southwestern Medical Center, Dallas, TX, United States; Department of Biochemistry and Microbiology, Institute of Biology, Warsaw University of Life Sciences, Warsaw, Poland.
| | - Vincent S Tagliabracci
- Department of Molecular Biology, University of Texas Southwestern Medical Center, Dallas, TX, United States; Harold C. Simmons Comprehensive Cancer Center, University of Texas Southwestern Medical Center, Dallas, TX, United States; Hamon Center for Regenerative Science and Medicine, University of Texas Southwestern Medical Center, Dallas, TX, United States; Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, TX, United States.
| |
Collapse
|
79
|
Berger MT, Hemmler D, Diederich P, Rychlik M, Marshall JW, Schmitt-Kopplin P. Open Search of Peptide Glycation Products from Tandem Mass Spectra. Anal Chem 2022; 94:5953-5961. [PMID: 35389626 DOI: 10.1021/acs.analchem.2c00388] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Identification of chemically modified peptides in mass spectrometry (MS)-based glycation studies is a crucial yet challenging task. There is a need to establish a mode for matching tandem mass spectrometry (MS/MS) data, allowing for both known and unknown peptide glycation modifications. We present an open search approach that uses classic and modified peptide fragment ions. The latter are shifted by the mass delta of the modification. Both provide key structural information that can be used to assess the peptide core structure of the glycation product. We also leverage redundant neutral losses from the modification side chain, introducing a third ion class for matching referred to as characteristic fragment ions. We demonstrate that peptide glycation product MS/MS spectra contain multidimensional information and that most often, more than half of the spectral information is ignored if no attempt is made to use a multi-step matching algorithm. Compared to regular and/or modified peptide ion matching, our triple-ion strategy significantly increased the median interpretable fraction of the glycation product MS/MS spectra. For reference, we apply our approach for Amadori product characterization and identify all established diagnostic ions automatically. We further show how this method effectively applies the open search concept and allows for optimized elucidation of unknown structures by presenting two hitherto undescribed peptide glycation modifications with a delta mass of 102.0311 and 268.1768 Da. We characterize their fragmentation signature by integration with isotopically labeled glycation products, which provides high validity for non-targeted structure identification.
Collapse
Affiliation(s)
- Michelle T Berger
- Chair of Analytical Food Chemistry, Technical University Munich, Maximus-von-Imhof-Forum 2, 85354 Freising, Germany.,Research Unit Analytical BioGeoChemistry (BGC), Helmholtz Zentrum München, Ingolstädter Landstrasse 1, 85764 Neuherberg, Germany
| | - Daniel Hemmler
- Chair of Analytical Food Chemistry, Technical University Munich, Maximus-von-Imhof-Forum 2, 85354 Freising, Germany.,Research Unit Analytical BioGeoChemistry (BGC), Helmholtz Zentrum München, Ingolstädter Landstrasse 1, 85764 Neuherberg, Germany
| | - Philippe Diederich
- Research Unit Analytical BioGeoChemistry (BGC), Helmholtz Zentrum München, Ingolstädter Landstrasse 1, 85764 Neuherberg, Germany
| | - Michael Rychlik
- Chair of Analytical Food Chemistry, Technical University Munich, Maximus-von-Imhof-Forum 2, 85354 Freising, Germany
| | - James W Marshall
- The Waltham Petcare Science Institute, Mars Petcare UK, Waltham-on-the-Wolds, Leicestershire LE14 4RT, United Kingdom
| | - Philippe Schmitt-Kopplin
- Chair of Analytical Food Chemistry, Technical University Munich, Maximus-von-Imhof-Forum 2, 85354 Freising, Germany.,Research Unit Analytical BioGeoChemistry (BGC), Helmholtz Zentrum München, Ingolstädter Landstrasse 1, 85764 Neuherberg, Germany
| |
Collapse
|
80
|
Fang Z, Qin H, Mao J, Wang Z, Zhang N, Wang Y, Liu L, Nie Y, Dong M, Ye M. Glyco-Decipher enables glycan database-independent peptide matching and in-depth characterization of site-specific N-glycosylation. Nat Commun 2022; 13:1900. [PMID: 35393418 PMCID: PMC8990002 DOI: 10.1038/s41467-022-29530-y] [Citation(s) in RCA: 42] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2021] [Accepted: 03/16/2022] [Indexed: 12/20/2022] Open
Abstract
Glycopeptides with unusual glycans or poor peptide backbone fragmentation in tandem mass spectrometry are unaccounted for in typical site-specific glycoproteomics analysis and thus remain unidentified. Here, we develop a glycoproteomics tool, Glyco-Decipher, to address these issues. Glyco-Decipher conducts glycan database-independent peptide matching and exploits the fragmentation pattern of shared peptide backbones in glycopeptides to improve the spectrum interpretation. We benchmark Glyco-Decipher on several large-scale datasets, demonstrating that it identifies more peptide-spectrum matches than Byonic, MSFragger-Glyco, StrucGP and pGlyco 3.0, with a 33.5%-178.5% increase in the number of identified glycopeptide spectra. The database-independent and unbiased profiling of attached glycans enables the discovery of 164 modified glycans in mouse tissues, including glycans with chemical or biological modifications. By enabling in-depth characterization of site-specific protein glycosylation, Glyco-Decipher is a promising tool for advancing glycoproteomics analysis in biological research.
Collapse
Affiliation(s)
- Zheng Fang
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, 116023, Dalian, China
- University of Chinese Academy of Sciences, 100049, Beijing, China
| | - Hongqiang Qin
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, 116023, Dalian, China
- University of Chinese Academy of Sciences, 100049, Beijing, China
| | - Jiawei Mao
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, 116023, Dalian, China
| | - Zhongyu Wang
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, 116023, Dalian, China
- University of Chinese Academy of Sciences, 100049, Beijing, China
| | - Na Zhang
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, 116023, Dalian, China
| | - Yan Wang
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, 116023, Dalian, China
| | - Luyao Liu
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, 116023, Dalian, China
- University of Chinese Academy of Sciences, 100049, Beijing, China
| | - Yongzhan Nie
- State Key Laboratory of Cancer Biology, National Clinical Research Center for Digestive Diseases and Xijing Hospital of Digestive Diseases, Fourth Military Medical University, 710032, Xi'an, China
| | - Mingming Dong
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, 116023, Dalian, China.
- School of Bioengineering, Dalian University of Technology, 116024, Dalian, China.
| | - Mingliang Ye
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, 116023, Dalian, China.
- University of Chinese Academy of Sciences, 100049, Beijing, China.
| |
Collapse
|
81
|
Greer JB, Early BP, Durbin KR, Patrie SM, Thomas PM, Kelleher NL, LeDuc RD, Fellers RT. ProSight annotator: Complete control and customization of protein entries in UniProt xml files. Proteomics 2022; 22:e2100209. [PMID: 35286768 DOI: 10.1002/pmic.202100209] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2021] [Revised: 12/07/2021] [Accepted: 01/25/2022] [Indexed: 11/07/2022]
Abstract
The effectiveness of any proteomics database search depends on the theoretical candidate information contained in the protein database. Unfortunately, candidate entries from protein databases such as UniProt rarely contain all the post-translational modifications (PTMs), disulfide bonds, or endogenous cleavages of interest to researchers. These ommissions can limit discovery of novel and biologically important proteoforms. Conversely, searching for a specific proteoform becomes a computationally difficult task for heavily modified proteins. Both situations require updates to the database through user-annotated entries. Unfortunately, manually creating properly formatted UniProt XML files is tedious and prone to errors. ProSight Annotator solves these issues by providing a graphical interface for adding user-defined features to UniProt-formatted XML files for better informed proteoform searches. It can be downloaded from http://prosightannotator.northwestern.edu. This article is protected by copyright. All rights reserved.
Collapse
Affiliation(s)
- Joseph B Greer
- Departments of Chemistry, Molecular Biosciences and the Proteomics Center of Excellence Northwestern University, 2145 N. Sheridan Road, Evanston, IL, 60208
| | - Bryan P Early
- Departments of Chemistry, Molecular Biosciences and the Proteomics Center of Excellence Northwestern University, 2145 N. Sheridan Road, Evanston, IL, 60208
| | - Kenneth R Durbin
- Departments of Chemistry, Molecular Biosciences and the Proteomics Center of Excellence Northwestern University, 2145 N. Sheridan Road, Evanston, IL, 60208
| | - Steven M Patrie
- Departments of Chemistry, Molecular Biosciences and the Proteomics Center of Excellence Northwestern University, 2145 N. Sheridan Road, Evanston, IL, 60208
| | - Paul M Thomas
- Present address: AbbVie, Inc., 1401 Sheridan Rd, North Chicago, IL, 60064.,Departments of Chemistry, Molecular Biosciences and the Proteomics Center of Excellence Northwestern University, 2145 N. Sheridan Road, Evanston, IL, 60208
| | - Neil L Kelleher
- Departments of Chemistry, Molecular Biosciences and the Proteomics Center of Excellence Northwestern University, 2145 N. Sheridan Road, Evanston, IL, 60208
| | - Richard D LeDuc
- Departments of Chemistry, Molecular Biosciences and the Proteomics Center of Excellence Northwestern University, 2145 N. Sheridan Road, Evanston, IL, 60208
| | - Ryan T Fellers
- Departments of Chemistry, Molecular Biosciences and the Proteomics Center of Excellence Northwestern University, 2145 N. Sheridan Road, Evanston, IL, 60208
| |
Collapse
|
82
|
Yan T, Palmer AB, Geiszler DJ, Polasky DA, Boatner LM, Burton NR, Armenta E, Nesvizhskii AI, Backus KM. Enhancing Cysteine Chemoproteomic Coverage through Systematic Assessment of Click Chemistry Product Fragmentation. Anal Chem 2022; 94:3800-3810. [PMID: 35195394 DOI: 10.1021/acs.analchem.1c04402] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Mass spectrometry-based chemoproteomics has enabled functional analysis and small molecule screening at thousands of cysteine residues in parallel. Widely adopted chemoproteomic sample preparation workflows rely on the use of pan cysteine-reactive probes such as iodoacetamide alkyne combined with biotinylation via copper-catalyzed azide-alkyne cycloaddition (CuAAC) or "click chemistry" for cysteine capture. Despite considerable advances in both sample preparation and analytical platforms, current techniques only sample a small fraction of all cysteines encoded in the human proteome. Extending the recently introduced labile mode of the MSFragger search engine, here we report an in-depth analysis of cysteine biotinylation via click chemistry (CBCC) reagent gas-phase fragmentation during MS/MS analysis. We find that CBCC conjugates produce both known and novel diagnostic fragments and peptide remainder ions. Among these species, we identified a candidate signature ion for CBCC peptides, the cyclic oxonium-biotin fragment ion that is generated upon fragmentation of the N(triazole)-C(alkyl) bond. Guided by our empirical comparison of fragmentation patterns of six CBCC reagent combinations, we achieved enhanced coverage of cysteine-labeled peptides. Implementation of labile searches afforded unique PSMs and provides a roadmap for the utility of such searches in enhancing chemoproteomic peptide coverage.
Collapse
Affiliation(s)
- Tianyang Yan
- Biological Chemistry Department, David Geffen School of Medicine, UCLA, Los Angeles, California 90095, United States
- Department of Chemistry and Biochemistry, UCLA, Los Angeles, California 90095, United States
| | - Andrew B Palmer
- Biological Chemistry Department, David Geffen School of Medicine, UCLA, Los Angeles, California 90095, United States
- Department of Chemistry and Biochemistry, UCLA, Los Angeles, California 90095, United States
| | - Daniel J Geiszler
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan 48109, United States
| | - Daniel A Polasky
- Department of Pathology, University of Michigan, Ann Arbor, Michigan 48109, United States
| | - Lisa M Boatner
- Biological Chemistry Department, David Geffen School of Medicine, UCLA, Los Angeles, California 90095, United States
- Department of Chemistry and Biochemistry, UCLA, Los Angeles, California 90095, United States
| | - Nikolas R Burton
- Biological Chemistry Department, David Geffen School of Medicine, UCLA, Los Angeles, California 90095, United States
- Department of Chemistry and Biochemistry, UCLA, Los Angeles, California 90095, United States
| | - Ernest Armenta
- Biological Chemistry Department, David Geffen School of Medicine, UCLA, Los Angeles, California 90095, United States
- Department of Chemistry and Biochemistry, UCLA, Los Angeles, California 90095, United States
| | - Alexey I Nesvizhskii
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan 48109, United States
- Department of Pathology, University of Michigan, Ann Arbor, Michigan 48109, United States
| | - Keriann M Backus
- Biological Chemistry Department, David Geffen School of Medicine, UCLA, Los Angeles, California 90095, United States
- Department of Chemistry and Biochemistry, UCLA, Los Angeles, California 90095, United States
| |
Collapse
|
83
|
Miller RM, Jordan BT, Mehlferber MM, Jeffery ED, Chatzipantsiou C, Kaur S, Millikin RJ, Dai Y, Tiberi S, Castaldi PJ, Shortreed MR, Luckey CJ, Conesa A, Smith LM, Deslattes Mays A, Sheynkman GM. Enhanced protein isoform characterization through long-read proteogenomics. Genome Biol 2022; 23:69. [PMID: 35241129 PMCID: PMC8892804 DOI: 10.1186/s13059-022-02624-y] [Citation(s) in RCA: 33] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2021] [Accepted: 02/02/2022] [Indexed: 02/04/2023] Open
Abstract
BACKGROUND The detection of physiologically relevant protein isoforms encoded by the human genome is critical to biomedicine. Mass spectrometry (MS)-based proteomics is the preeminent method for protein detection, but isoform-resolved proteomic analysis relies on accurate reference databases that match the sample; neither a subset nor a superset database is ideal. Long-read RNA sequencing (e.g., PacBio or Oxford Nanopore) provides full-length transcripts which can be used to predict full-length protein isoforms. RESULTS We describe here a long-read proteogenomics approach for integrating sample-matched long-read RNA-seq and MS-based proteomics data to enhance isoform characterization. We introduce a classification scheme for protein isoforms, discover novel protein isoforms, and present the first protein inference algorithm for the direct incorporation of long-read transcriptome data to enable detection of protein isoforms previously intractable to MS-based detection. We have released an open-source Nextflow pipeline that integrates long-read sequencing in a proteomic workflow for isoform-resolved analysis. CONCLUSIONS Our work suggests that the incorporation of long-read sequencing and proteomic data can facilitate improved characterization of human protein isoform diversity. Our first-generation pipeline provides a strong foundation for future development of long-read proteogenomics and its adoption for both basic and translational research.
Collapse
Affiliation(s)
- Rachel M Miller
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, USA
| | - Ben T Jordan
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA, USA
| | - Madison M Mehlferber
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA, USA
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA, USA
| | - Erin D Jeffery
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA, USA
| | | | - Simi Kaur
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, USA
| | - Robert J Millikin
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, USA
| | - Yunxiang Dai
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, USA
| | - Simone Tiberi
- Department of Molecular Life Sciences, University of Zurich, Zurich, Switzerland
- Swiss Institute of Bioinformatics, University of Zurich, Zurich, Switzerland
| | - Peter J Castaldi
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA, USA
- Division of General Medicine and Primary Care, Brigham and Women's Hospital, Boston, MA, USA
| | | | - Chance John Luckey
- Department of Pathology, University of Virginia, Charlottesville, VA, USA
| | - Ana Conesa
- Institute for Integrative Systems Biology, Spanish National Research Council (CSIC), Paterna, Spain
- Microbiology and Cell Science Department, Institute for Food and Agricultural Sciences, University of Florida, Gainesville, FL, USA
| | - Lloyd M Smith
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, USA
| | - Anne Deslattes Mays
- Office of Data Science and Sharing, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Rockville, MD, USA
| | - Gloria M Sheynkman
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA, USA.
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA.
- UVA Cancer Center, University of Virginia, Charlottesville, VA, USA.
| |
Collapse
|
84
|
Krakovka S, Ribacke U, Miyamoto Y, Eckmann L, Svärd S. Characterization of Metronidazole-Resistant Giardia intestinalis Lines by Comparative Transcriptomics and Proteomics. Front Microbiol 2022; 13:834008. [PMID: 35222342 PMCID: PMC8866875 DOI: 10.3389/fmicb.2022.834008] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2021] [Accepted: 01/13/2022] [Indexed: 12/13/2022] Open
Abstract
Metronidazole (MTZ) is a clinically important antimicrobial agent that is active against both bacterial and protozoan organisms. MTZ has been used extensively for more than 60 years and until now resistance has been rare. However, a recent and dramatic increase in the number of MTZ resistant bacteria and protozoa is of great concern since there are few alternative drugs with a similarly broad activity spectrum. To identify key factors and mechanisms underlying MTZ resistance, we utilized the protozoan parasite Giardia intestinalis, which is commonly treated with MTZ. We characterized two in vitro selected, metronidazole resistant parasite lines, as well as one revertant, by analyzing fitness aspects associated with increased drug resistance and transcriptomes and proteomes. We also conducted a meta-analysis using already existing data from additional resistant G. intestinalis isolates. The combined data suggest that in vitro generated MTZ resistance has a substantial fitness cost to the parasite, which may partly explain why resistance is not widespread despite decades of heavy use. Mechanistically, MTZ resistance in Giardia is multifactorial and associated with complex changes, yet a core set of pathways involving oxidoreductases, oxidative stress responses and DNA repair proteins, is central to MTZ resistance in both bacteria and protozoa.
Collapse
Affiliation(s)
- Sascha Krakovka
- Department of Cell and Molecular Biology, Biomedical Center (BMC), Uppsala University, Uppsala, Sweden
| | - Ulf Ribacke
- Department of Microbiology, Tumor and Cell Biology (MTC), Karolinska Institutet, Stockholm, Sweden
| | - Yukiko Miyamoto
- Department of Medicine, University of California, San Diego, La Jolla, CA, United States
| | - Lars Eckmann
- Department of Medicine, University of California, San Diego, La Jolla, CA, United States
| | - Staffan Svärd
- Department of Cell and Molecular Biology, Biomedical Center (BMC), Uppsala University, Uppsala, Sweden.,SciLifeLab, Uppsala University, Uppsala, Sweden
| |
Collapse
|
85
|
Miller RM, Knoener RA, Benner BE, Frey BL, Scalf M, Shortreed MR, Sherer NM, Smith LM. Discovery of Dehydroamino Acid Residues in the Capsid and Matrix Structural Proteins of HIV-1. J Proteome Res 2022; 21:993-1001. [PMID: 35192358 PMCID: PMC8976760 DOI: 10.1021/acs.jproteome.1c00867] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Human immunodeficiency virus type 1 (HIV-1) remains a deadly infectious disease despite existing antiretroviral therapies. A comprehensive understanding of the specific mechanisms of viral infectivity remains elusive and currently limits the development of new and effective therapies. Through in-depth proteomic analysis of HIV-1 virions, we discovered the novel post-translational modification of highly conserved residues within the viral matrix and capsid proteins to the dehydroamino acids, dehydroalanine and dehydrobutyrine. We further confirmed their presence by labeling the reactive alkene, characteristic of dehydroamino acids, with glutathione via Michael addition. Dehydroamino acids are rare, understudied, and have been observed mainly in select bacterial and fungal species. Until now, they have not been observed in HIV proteins. We hypothesize that these residues are important in viral particle maturation and could provide valuable insight into HIV infectivity mechanisms.
Collapse
Affiliation(s)
- Rachel M Miller
- Department of Chemistry, University of Wisconsin, 1101 University Avenue, Madison, Wisconsin 53706, United States
| | - Rachel A Knoener
- Department of Chemistry, University of Wisconsin, 1101 University Avenue, Madison, Wisconsin 53706, United States.,McArdle Laboratory for Cancer Research and Institute for Molecular Virology, University of Wisconsin, Madison, Wisconsin 53705, United States
| | - Bayleigh E Benner
- McArdle Laboratory for Cancer Research and Institute for Molecular Virology, University of Wisconsin, Madison, Wisconsin 53705, United States
| | - Brian L Frey
- Department of Chemistry, University of Wisconsin, 1101 University Avenue, Madison, Wisconsin 53706, United States
| | - Mark Scalf
- Department of Chemistry, University of Wisconsin, 1101 University Avenue, Madison, Wisconsin 53706, United States
| | - Michael R Shortreed
- Department of Chemistry, University of Wisconsin, 1101 University Avenue, Madison, Wisconsin 53706, United States
| | - Nathan M Sherer
- McArdle Laboratory for Cancer Research and Institute for Molecular Virology, University of Wisconsin, Madison, Wisconsin 53705, United States
| | - Lloyd M Smith
- Department of Chemistry, University of Wisconsin, 1101 University Avenue, Madison, Wisconsin 53706, United States
| |
Collapse
|
86
|
Riffle M, Hoopmann MR, Jaschob D, Zhong G, Moritz RL, MacCoss MJ, Davis TN, Isoherranen N, Zelter A. Discovery and Visualization of Uncharacterized Drug-Protein Adducts Using Mass Spectrometry. Anal Chem 2022; 94:3501-3509. [PMID: 35184559 PMCID: PMC8892443 DOI: 10.1021/acs.analchem.1c04101] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
![]()
Drugs are often metabolized
to reactive intermediates that form
protein adducts. Adducts can inhibit protein activity, elicit immune
responses, and cause life-threatening adverse drug reactions. The
masses of reactive metabolites are frequently unknown, rendering traditional
mass spectrometry-based proteomics approaches incapable of adduct
identification. Here, we present Magnum, an open-mass search algorithm
optimized for adduct identification, and Limelight, a web-based data
processing package for analysis and visualization of data from all
existing algorithms. Limelight incorporates tools for sample comparisons
and xenobiotic-adduct discovery. We validate our tools with three
drug/protein combinations and apply our label-free workflow to identify
novel xenobiotic-protein adducts in CYP3A4. Our new methods and software
enable accurate identification of xenobiotic-protein adducts with
no prior knowledge of adduct masses or protein targets. Magnum outperforms
existing label-free tools in xenobiotic-protein adduct discovery,
while Limelight fulfills a major need in the rapidly developing field
of open-mass searching, which until now lacked comprehensive data
visualization tools.
Collapse
Affiliation(s)
- Michael Riffle
- Department of Biochemistry, University of Washington, Seattle, Washington 98195, United States
| | | | - Daniel Jaschob
- Department of Biochemistry, University of Washington, Seattle, Washington 98195, United States
| | - Guo Zhong
- Department of Pharmaceutics, University of Washington, Seattle, Washington 98195, United States
| | - Robert L Moritz
- Institute for Systems Biology, Seattle, Washington 98109, United States
| | - Michael J MacCoss
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, United States
| | - Trisha N Davis
- Department of Biochemistry, University of Washington, Seattle, Washington 98195, United States
| | - Nina Isoherranen
- Department of Pharmaceutics, University of Washington, Seattle, Washington 98195, United States
| | - Alex Zelter
- Department of Biochemistry, University of Washington, Seattle, Washington 98195, United States
| |
Collapse
|
87
|
Ahrens CH, Wade JT, Champion MM, Langer JD. A Practical Guide to Small Protein Discovery and Characterization Using Mass Spectrometry. J Bacteriol 2022; 204:e0035321. [PMID: 34748388 PMCID: PMC8765459 DOI: 10.1128/jb.00353-21] [Citation(s) in RCA: 26] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
Small proteins of up to ∼50 amino acids are an abundant class of biomolecules across all domains of life. Yet due to the challenges inherent in their size, they are often missed in genome annotations, and are difficult to identify and characterize using standard experimental approaches. Consequently, we still know few small proteins even in well-studied prokaryotic model organisms. Mass spectrometry (MS) has great potential for the discovery, validation, and functional characterization of small proteins. However, standard MS approaches are poorly suited to the identification of both known and novel small proteins due to limitations at each step of a typical proteomics workflow, i.e., sample preparation, protease digestion, liquid chromatography, MS data acquisition, and data analysis. Here, we outline the major MS-based workflows and bioinformatic pipelines used for small protein discovery and validation. Special emphasis is placed on highlighting the adjustments required to improve detection and data quality for small proteins. We discuss both the unbiased detection of small proteins and the targeted analysis of small proteins of interest. Finally, we provide guidelines to prioritize novel small proteins, and an outlook on methods with particular potential to further improve comprehensive discovery and characterization of small proteins.
Collapse
Affiliation(s)
- Christian H. Ahrens
- Agroscope, Method Development and Analytics & SIB Swiss Institute of Bioinformatics, Wädenswil, Switzerland
| | - Joseph T. Wade
- Wadsworth Center, New York State Department of Health, Albany, New York, USA
- Department of Biomedical Sciences, School of Public Health, University at Albany, Albany, New York, USA
| | - Matthew M. Champion
- Department of Chemistry and Biochemistry, University of Notre Dame, Notre Dame, Indiana, USA
| | - Julian D. Langer
- Mass Spectrometry and Proteomics, Max Planck Institute of Biophysics, Frankfurt am Main, Germany
- Proteomics, Max Planck Institute for Brain Research, Frankfurt am Main, Germany
| |
Collapse
|
88
|
Henke KB, Miller RM, Knoener RA, Scalf M, Spiniello M, Smith LM. Identifying Protein Interactomes of Target RNAs Using HyPR-MS. Methods Mol Biol 2022; 2404:219-244. [PMID: 34694612 PMCID: PMC8754189 DOI: 10.1007/978-1-0716-1851-6_12] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
RNA-protein interactions are integral to maintaining proper cellular function and homeostasis, and the disruption of key RNA-protein interactions is central to many disease states. HyPR-MS (hybridization purification of RNA-protein complexes followed by mass spectrometry) is a highly versatile and efficient technology which enables multiplexed discovery of specific RNA-protein interactomes. This chapter provides extensive guidance for successful application of HyPR-MS to the system and target RNA(s) of interest, as well as a detailed description of the fundamental HyPR-MS procedure, including: (1) experimental design of controls, capture oligonucleotides, and qPCR assays; (2) formaldehyde cross-linking of cell culture; (3) cell lysis and RNA solubilization; (4) isolation of target RNA(s); (5) RNA purification and RT-qPCR analysis; (6) protein preparation and mass spectrometric analysis; and (7) mass spectrometric data analysis.
Collapse
Affiliation(s)
- Katherine B Henke
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, USA
| | - Rachel M Miller
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, USA
| | - Rachel A Knoener
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, USA
| | - Mark Scalf
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, USA
| | - Michele Spiniello
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, USA
- Immuno-Hematology and Transfusion Medicine, Cardarelli Hospital, Naples, Italy
| | - Lloyd M Smith
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, USA.
| |
Collapse
|
89
|
Schaffer LV, Shortreed MR, Smith LM. Proteoform Analysis and Construction of Proteoform Families in Proteoform Suite. Methods Mol Biol 2022; 2500:67-81. [PMID: 35657588 PMCID: PMC9694099 DOI: 10.1007/978-1-0716-2325-1_7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
Proteoform Suite is an interactive software program for the identification and quantification of intact proteoforms from mass spectrometry data. Proteoform Suite identifies proteoforms observed by intact-mass (MS1) analysis. In intact-mass analysis, unfragmented experimental proteoforms are compared to a database of known proteoform sequences and to one another, searching for mass differences corresponding to well-known post-translational modifications or amino acids. Intact-mass analysis enables proteoforms observed in the MS1 data without MS/MS (MS2) fragmentation to be identified. Proteoform Suite further facilitates the construction and visualization of proteoform families, which are the sets of proteoforms derived from individual genes. Bottom-up peptide identifications and top-down (MS2) proteoform identifications can be integrated into the Proteoform Suite analysis to increase the sensitivity and accuracy of the analysis. Proteoform Suite is open source and freely available at https://github.com/smith-chem-wisc/proteoform-suite .
Collapse
Affiliation(s)
- Leah V Schaffer
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, USA
| | | | - Lloyd M Smith
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, USA
| |
Collapse
|
90
|
Iannetta AA, Hicks LM. Maximizing Depth of PTM Coverage: Generating Robust MS Datasets for Computational Prediction Modeling. Methods Mol Biol 2022; 2499:1-41. [PMID: 35696073 DOI: 10.1007/978-1-0716-2317-6_1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Post-translational modifications (PTMs) regulate complex biological processes through the modulation of protein activity, stability, and localization. Insights into the specific modification type and localization within a protein sequence can help ascertain functional significance. Computational models are increasingly demonstrated to offer a low-cost, high-throughput method for comprehensive PTM predictions. Algorithms are optimized using existing experimental PTM data, thus accurate prediction performance relies on the creation of robust datasets. Herein, advancements in mass spectrometry-based proteomics technologies to maximize PTM coverage are reviewed. Further, requisite experimental validation approaches for PTM predictions are explored to ensure that follow-up mechanistic studies are focused on accurate modification sites.
Collapse
Affiliation(s)
- Anthony A Iannetta
- Department of Chemistry, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Leslie M Hicks
- Department of Chemistry, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.
| |
Collapse
|
91
|
Mehlferber MM, Jeffery ED, Saquing J, Jordan BT, Sheynkman L, Murali M, Genet G, Acharya BR, Hirschi KK, Sheynkman GM. Characterization of protein isoform diversity in human umbilical vein endothelial cells via long-read proteogenomics. RNA Biol 2022; 19:1228-1243. [PMID: 36457147 PMCID: PMC9721438 DOI: 10.1080/15476286.2022.2141938] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022] Open
Abstract
Endothelial cells (ECs) comprise the lumenal lining of all blood vessels and are critical for the functioning of the cardiovascular system. Their phenotypes can be modulated by alternative splicing of RNA to produce distinct protein isoforms. To characterize the RNA and protein isoform landscape within ECs, we applied a long read proteogenomics approach to analyse human umbilical vein endothelial cells (HUVECs). Transcripts delineated from PacBio sequencing serve as the basis for a sample-specific protein database used for downstream mass-spectrometry (MS) analysis to infer protein isoform expression. We detected 53,863 transcript isoforms from 10,426 genes, with 22,195 of those transcripts being novel. Furthermore, the predominant isoform in HUVECs does not correspond with the accepted "reference isoform" 25% of the time, with vascular pathway-related genes among this group. We found 2,597 protein isoforms supported through unique peptides, with an additional 2,280 isoforms nominated upon incorporation of long-read transcript evidence. We characterized a novel alternative acceptor for endothelial-related gene CDH5, suggesting potential changes in its associated signalling pathways. Finally, we identified novel protein isoforms arising from a diversity of RNA splicing mechanisms supported by uniquely mapped novel peptides. Our results represent a high-resolution atlas of known and novel isoforms of potential relevance to endothelial phenotypes and function.[Figure: see text].
Collapse
Affiliation(s)
- Madison M. Mehlferber
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA, USA,Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia, USA
| | - Erin D. Jeffery
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia, USA
| | - Jamie Saquing
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia, USA
| | - Ben T. Jordan
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia, USA
| | - Leon Sheynkman
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia, USA
| | - Mayank Murali
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia, USA
| | - Gael Genet
- Department of Cell Biology, University of Virginia School of Medicine, Charlottesville, VA, USA
| | - Bipul R. Acharya
- Department of Cell Biology, University of Virginia School of Medicine, Charlottesville, VA, USA,Cardiovascular Research Center, University of Virginia, Charlottesville, VA, USA,Wellcome Centre for Cell-Matrix Research, Faculty of Biology, Medicine and Health, the University of Manchester, UK
| | - Karen K. Hirschi
- Department of Cell Biology, University of Virginia School of Medicine, Charlottesville, VA, USA,Cardiovascular Research Center, University of Virginia, Charlottesville, VA, USA
| | - Gloria M. Sheynkman
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA, USA,Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia, USA,Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA,UVA Comprehensive Cancer Center, University of Virginia, Charlottesville, Virginia, USA,CONTACT Gloria M. Sheynkman The Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA, USA
| |
Collapse
|
92
|
Abstract
The goal of paleoproteomics is to characterize proteins from specimens that have been subjected to the degrading and obscuring effects of time, thus obtaining biological information about tissues or organisms both unobservable in the present and unobtainable through morphological study. Although the description of sequences from Tyrannosaurus rex and Brachylophosaurus canadensis suggested that proteins may persist over tens of millions of years, the majority of paleoproteomic analyses have focused on historical, archeological, or relatively young paleontological samples that rarely exceed 1 million years in age. However, recent advances in methodology and analyses of diverse tissues types (e.g., fossil eggshell, dental enamel) have begun closing the large window of time that remains unexplored in the fossil history of the Cenozoic. In this perspective, we discuss the history and current state of deep time paleoproteomics (DTPp), here defined as paleoproteomic study of samples ∼1 million years (1 Ma) or more in age. We then discuss the future of DTPp research, including what we see as critical ways the field can expand, advancements in technology that can be utilized, and the types of questions DTPp can address if such a future is realized.
Collapse
Affiliation(s)
- Elena R Schroeter
- Department of Biological Sciences, North Carolina State University, Raleigh, North Carolina 27695, United States
| | - Timothy P Cleland
- Museum Conservation Institute, Smithsonian Institution, Suitland, Maryland 20746, United States
| | - Mary H Schweitzer
- Department of Biological Sciences, North Carolina State University, Raleigh, North Carolina 27695, United States.,North Carolina Museum of Natural Sciences, Raleigh, North Carolina 27605, United States.,Department of Geology, Lund University, Lund SE-221 00, Sweden
| |
Collapse
|
93
|
Boekweg H, Van Der Watt D, Truong T, Johnston SM, Guise AJ, Plowey ED, Kelly RT, Payne SH. Features of Peptide Fragmentation Spectra in Single-Cell Proteomics. J Proteome Res 2021; 21:182-188. [PMID: 34920664 DOI: 10.1021/acs.jproteome.1c00670] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
The goal of proteomics is to identify and quantify the complete set of proteins in a biological sample. Single-cell proteomics specializes in the identification and quantitation of proteins for individual cells, often used to elucidate cellular heterogeneity. The significant reduction in ions introduced into the mass spectrometer for single-cell samples could impact the features of MS2 fragmentation spectra. As all peptide identification software tools have been developed on spectra from bulk samples and the associated ion-rich spectra, the potential for spectral features to change is of great interest. We characterize the differences between single-cell spectra and bulk spectra by examining three fundamental spectral features that are likely to affect peptide identification performance. All features show significant changes in single-cell spectra, including the loss of annotated fragment ions, blurring signal and background peaks due to diminishing ion intensity, and distinct fragmentation pattern, compared to bulk spectra. As each of these features is a foundational part of peptide identification algorithms, it is critical to adjust algorithms to compensate for these losses.
Collapse
Affiliation(s)
- Hannah Boekweg
- Department of Biology, Brigham Young University, Provo Utah 84602, United States
| | - Daisha Van Der Watt
- Department of Biology, Brigham Young University, Provo Utah 84602, United States
| | - Thy Truong
- Department of Chemistry and Biochemistry, Brigham Young University, Provo Utah 84602, United States
| | - S Madisyn Johnston
- Department of Chemistry and Biochemistry, Brigham Young University, Provo Utah 84602, United States
| | - Amanda J Guise
- Translational Neuropathology, Biomarkers, Research and Development, Biogen Inc, Cambridge, Massachusetts 02142, United States
| | - Edward D Plowey
- Translational Neuropathology, Biomarkers, Research and Development, Biogen Inc, Cambridge, Massachusetts 02142, United States
| | - Ryan T Kelly
- Department of Chemistry and Biochemistry, Brigham Young University, Provo Utah 84602, United States
| | - Samuel H Payne
- Department of Biology, Brigham Young University, Provo Utah 84602, United States
| |
Collapse
|
94
|
Rolfs Z, Smith LM. Internal Fragment Ions Disambiguate and Increase Identifications in Top-Down Proteomics. J Proteome Res 2021; 20:5412-5418. [PMID: 34738820 DOI: 10.1021/acs.jproteome.1c00599] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
A large fraction of observed fragment ion intensity remains unidentified in top-down proteomics. The elucidation of these unknown fragment ions could enable researchers to identify additional proteoforms and reduce proteoform ambiguity in their analyses. Internal fragment ions have received considerable attention as a major source of these unidentified fragment ions. Internal fragments are product ions that contain neither protein terminus, in contrast with terminal ions that contain a single terminus. There are many more possible internal fragments than terminal fragments, and the resulting computational complexity has historically limited the application of internal fragment ions to low-complexity samples containing only one or a few proteins of interest. We implemented internal fragment ion functionality in MetaMorpheus to allow the proteome-wide annotation of internal fragment ions. MetaMorpheus first uses terminal fragment ions to identify putative proteoforms and then employs internal fragment ions to disambiguate similar proteoforms. In the analysis of mammalian cell lysates, we found that MetaMorpheus could disambiguate over half of its previously ambiguous proteoforms while also providing up to a 7% increase in proteoform-spectrum matches identified at a 1% false discovery rate.
Collapse
Affiliation(s)
- Zach Rolfs
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| | - Lloyd M Smith
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| |
Collapse
|
95
|
Farag YM, Horro C, Vaudel M, Barsnes H. PeptideShaker Online: A User-Friendly Web-Based Framework for the Identification of Mass Spectrometry-Based Proteomics Data. J Proteome Res 2021; 20:5419-5423. [PMID: 34709836 PMCID: PMC8650087 DOI: 10.1021/acs.jproteome.1c00678] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Mass spectrometry-based proteomics is a high-throughput technology generating ever-larger amounts of data per project. However, storing, processing, and interpreting these data can be a challenge. A key element in simplifying this process is the development of interactive frameworks focusing on visualization that can greatly simplify both the interpretation of data and the generation of new knowledge. Here we present PeptideShaker Online, a user-friendly web-based framework for the identification of mass spectrometry-based proteomics data, from raw file conversion to interactive visualization of the resulting data. Storage and processing of the data are performed via the versatile Galaxy platform (through SearchGUI, PeptideShaker, and moFF), while the interaction with the results happens via a locally installed web server, thus enabling researchers to process and interpret their own data without requiring advanced bioinformatics skills or direct access to compute-intensive infrastructures. The source code, additional documentation, and a fully functional demo is available at https://github.com/barsnes-group/peptide-shaker-online.
Collapse
Affiliation(s)
- Yehia Mokhtar Farag
- Proteomics Unit, Department of Biomedicine, University of Bergen, 5020 Bergen, Norway.,Computational Biology Unit, Department of Informatics, University of Bergen, 5008 Bergen, Norway
| | - Carlos Horro
- Proteomics Unit, Department of Biomedicine, University of Bergen, 5020 Bergen, Norway.,Computational Biology Unit, Department of Informatics, University of Bergen, 5008 Bergen, Norway
| | - Marc Vaudel
- Department of Clinical Sciences, University of Bergen, 5020 Bergen, Norway
| | - Harald Barsnes
- Proteomics Unit, Department of Biomedicine, University of Bergen, 5020 Bergen, Norway.,Computational Biology Unit, Department of Informatics, University of Bergen, 5008 Bergen, Norway
| |
Collapse
|
96
|
Rolfs Z, Frey BL, Shi X, Kawai Y, Smith LM, Welham NV. An atlas of protein turnover rates in mouse tissues. Nat Commun 2021; 12:6778. [PMID: 34836951 PMCID: PMC8626426 DOI: 10.1038/s41467-021-26842-3] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2021] [Accepted: 10/25/2021] [Indexed: 01/25/2023] Open
Abstract
Protein turnover is critical to cellular physiology as well as to the growth and maintenance of tissues. The unique synthesis and degradation rates of each protein help to define tissue phenotype, and knowledge of tissue- and protein-specific half-lives is directly relevant to protein-related drug development as well as the administration of medical therapies. Using stable isotope labeling and mass spectrometry, we determine the in vivo turnover rates of thousands of proteins-including those of the extracellular matrix-in a set of biologically important mouse tissues. We additionally develop a data visualization platform, named ApplE Turnover, that enables facile searching for any protein of interest in a tissue of interest and then displays its half-life, confidence interval, and supporting measurements. This extensive dataset and the corresponding visualization software provide a reference to guide future studies of mammalian protein turnover in response to physiologic perturbation, disease, or therapeutic intervention.
Collapse
Affiliation(s)
- Zach Rolfs
- grid.14003.360000 0001 2167 3675Department of Chemistry, University of Wisconsin-Madison, Madison, WI 53706 USA
| | - Brian L. Frey
- grid.14003.360000 0001 2167 3675Department of Chemistry, University of Wisconsin-Madison, Madison, WI 53706 USA
| | - Xudong Shi
- grid.14003.360000 0001 2167 3675Division of Otolaryngology–Head and Neck Surgery, Department of Surgery, University of Wisconsin School of Medicine and Public Health, Madison, WI 53792 USA
| | - Yoshitaka Kawai
- grid.14003.360000 0001 2167 3675Division of Otolaryngology–Head and Neck Surgery, Department of Surgery, University of Wisconsin School of Medicine and Public Health, Madison, WI 53792 USA ,grid.258799.80000 0004 0372 2033Present Address: Department of Otolaryngology–Head and Neck Surgery, Graduate School of Medicine, Kyoto University, Kyoto, 606-8507 Japan
| | - Lloyd M. Smith
- grid.14003.360000 0001 2167 3675Department of Chemistry, University of Wisconsin-Madison, Madison, WI 53706 USA
| | - Nathan V. Welham
- grid.14003.360000 0001 2167 3675Division of Otolaryngology–Head and Neck Surgery, Department of Surgery, University of Wisconsin School of Medicine and Public Health, Madison, WI 53792 USA
| |
Collapse
|
97
|
Doron S, Lampl N, Savidor A, Katina C, Gabashvili A, Levin Y, Rosenwasser S. SPEAR: A proteomics approach for simultaneous protein expression and redox analysis. Free Radic Biol Med 2021; 176:366-377. [PMID: 34619326 DOI: 10.1016/j.freeradbiomed.2021.10.001] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/21/2021] [Revised: 09/30/2021] [Accepted: 10/01/2021] [Indexed: 01/02/2023]
Abstract
Oxidation and reduction of protein cysteinyl thiols serve as molecular switches, which is considered the most central mechanism for redox regulation of biological processes, altering protein structure, biochemical activity, subcellular localization, and binding affinity. Redox proteomics allows global identification of redox-modified cysteine (Cys) sites and quantification of their reversible oxidation/reduction responses, serving as a hypothesis-generating platform to stimulate redox biology mechanistic research. Here, we developed Simultaneous Protein Expression and Redox (SPEAR) analysis, a new redox-proteomics approach based on differential labeling of reversibly oxidized and reduced cysteines with light and heavy isotopic forms of commercially available isotopically-labeled N-ethylmaleimide (NEM). The presented method does not require enrichment for labeled peptides, thus enabling simultaneous quantification of Cys reversible oxidation state and protein abundance. Using SPEAR, we were able to quantify the in-vivo reversible oxidation state of thousands of cysteines across the Arabidopsis proteome under steady-state and oxidative stress conditions. Functional assignment of the identified redox-sensitive proteins demonstrated the widespread effect of oxidative conditions on various cellular functions and highlighted the enrichment of chloroplastic proteins. SPEAR provides a simple, straightforward, and cost-effective means of studying redox proteome dynamics. The presented data provide a global quantitative view of the reversible oxidation of well-known redox-regulated active sites and many novel redox-sensitive sites whose role in plant acclimation to stress conditions remains to be further explored.
Collapse
Affiliation(s)
- Shani Doron
- The Robert H. Smith Institute of Plant Sciences and Genetics in Agriculture, The Hebrew University of Jerusalem, Rehovot 7610000, Israel
| | - Nardy Lampl
- The Robert H. Smith Institute of Plant Sciences and Genetics in Agriculture, The Hebrew University of Jerusalem, Rehovot 7610000, Israel
| | - Alon Savidor
- de Botton Institute for Protein Profiling, The Nancy and Stephen Grand Israel National Center for Personalized Medicine, Weizmann Institute of Science, Rehovot, Israel
| | - Corine Katina
- de Botton Institute for Protein Profiling, The Nancy and Stephen Grand Israel National Center for Personalized Medicine, Weizmann Institute of Science, Rehovot, Israel
| | - Alexandra Gabashvili
- de Botton Institute for Protein Profiling, The Nancy and Stephen Grand Israel National Center for Personalized Medicine, Weizmann Institute of Science, Rehovot, Israel
| | - Yishai Levin
- de Botton Institute for Protein Profiling, The Nancy and Stephen Grand Israel National Center for Personalized Medicine, Weizmann Institute of Science, Rehovot, Israel.
| | - Shilo Rosenwasser
- The Robert H. Smith Institute of Plant Sciences and Genetics in Agriculture, The Hebrew University of Jerusalem, Rehovot 7610000, Israel.
| |
Collapse
|
98
|
Leung SK, Jeffries AR, Castanho I, Jordan BT, Moore K, Davies JP, Dempster EL, Bray NJ, O'Neill P, Tseng E, Ahmed Z, Collier DA, Jeffery ED, Prabhakar S, Schalkwyk L, Jops C, Gandal MJ, Sheynkman GM, Hannon E, Mill J. Full-length transcript sequencing of human and mouse cerebral cortex identifies widespread isoform diversity and alternative splicing. Cell Rep 2021; 37:110022. [PMID: 34788620 PMCID: PMC8609283 DOI: 10.1016/j.celrep.2021.110022] [Citation(s) in RCA: 67] [Impact Index Per Article: 22.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2020] [Revised: 07/30/2021] [Accepted: 10/28/2021] [Indexed: 12/05/2022] Open
Abstract
Alternative splicing is a post-transcriptional regulatory mechanism producing distinct mRNA molecules from a single pre-mRNA with a prominent role in the development and function of the central nervous system. We used long-read isoform sequencing to generate full-length transcript sequences in the human and mouse cortex. We identify novel transcripts not present in existing genome annotations, including transcripts mapping to putative novel (unannotated) genes and fusion transcripts incorporating exons from multiple genes. Global patterns of transcript diversity are similar between human and mouse cortex, although certain genes are characterized by striking differences between species. We also identify developmental changes in alternative splicing, with differential transcript usage between human fetal and adult cortex. Our data confirm the importance of alternative splicing in the cortex, dramatically increasing transcriptional diversity and representing an important mechanism underpinning gene regulation in the brain. We provide transcript-level data for human and mouse cortex as a resource to the scientific community. There is widespread transcript diversity in the cortex and many novel transcripts Some genes display big differences in isoform number between human and mouse cortex There is evidence of differential transcript usage between human fetal and adult cortex There are many novel isoforms of genes associated with human brain disease
Collapse
Key Words
- isoform, transcript, expression, brain, cortex, mouse, human, adult, fetal, long-read sequencing, alternative splicing
Collapse
Affiliation(s)
| | | | - Isabel Castanho
- University of Exeter, Exeter, UK; Department of Pathology, Beth Israel Deaconess Medical Center, Boston, MA, USA; Harvard Medical School, Boston, MA, USA
| | - Ben T Jordan
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA, USA
| | | | | | | | | | | | | | | | | | - Erin D Jeffery
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA, USA
| | - Shyam Prabhakar
- Genome Institute of Singapore, Agency for Science, Technology and Research (A(∗)STAR), Singapore, Singapore
| | | | - Connor Jops
- Department of Psychiatry and Biobehavioral Sciences, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA, USA
| | - Michael J Gandal
- Department of Psychiatry and Biobehavioral Sciences, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA, USA
| | - Gloria M Sheynkman
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA, USA; Department of Human Genetics, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA, USA; UVA Cancer Center, University of Virginia, Charlottesville, VA, USA
| | | | | |
Collapse
|
99
|
Liu X, Fields R, Schweppe DK, Paulo JA. Strategies for mass spectrometry-based phosphoproteomics using isobaric tagging. Expert Rev Proteomics 2021; 18:795-807. [PMID: 34652972 DOI: 10.1080/14789450.2021.1994390] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
INTRODUCTION Protein phosphorylation is a primary mechanism of signal transduction in cellular systems. Isobaric tagging can be used to investigate alterations in phosphorylation events in sample multiplexing experiments where quantification extends across all conditions. As such, innovations in tandem mass tag methods can facilitate the expansion of the depth and breadth of phosphoproteomic analyses. AREAS COVERED This review discusses the current state of tandem mass tag-centric phosphoproteomics and highlights advances in reagent chemistry, instrumentation, data acquisition, and data analysis. We stress that approaches for phosphoproteomic investigations require high-specificity enrichment, sensitive detection, and accurate phosphorylation site localization. EXPERT OPINION Tandem mass tag-centric phosphoproteomics will continue to be an important conduit for our understanding of signal transduction in living organisms. We anticipate that progress in phosphopeptide enrichment methodologies, enhancements in instrumentation and data acquisition technologies, and further refinements in analytical strategies will be key to the discovery of biologically relevant findings from phosphoproteomics studies.
Collapse
Affiliation(s)
- Xinyue Liu
- Department of Cell Biology, Harvard Medical School, Boston, USA
| | - Rose Fields
- Department of Genome Sciences, University of Washington, Seattle, USA
| | - Devin K Schweppe
- Department of Genome Sciences, University of Washington, Seattle, USA
| | - Joao A Paulo
- Department of Cell Biology, Harvard Medical School, Boston, USA
| |
Collapse
|
100
|
Carman PJ, Barrie KR, Dominguez R. Novel human cell expression method reveals the role and prevalence of posttranslational modification in nonmuscle tropomyosins. J Biol Chem 2021; 297:101154. [PMID: 34478714 PMCID: PMC8463859 DOI: 10.1016/j.jbc.2021.101154] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2021] [Revised: 08/25/2021] [Accepted: 08/30/2021] [Indexed: 11/29/2022] Open
Abstract
Biochemical studies require large quantities of proteins, which are typically obtained using bacterial overexpression. However, the folding machinery in bacteria is inadequate for expressing many mammalian proteins, which additionally undergo posttranslational modifications (PTMs) that bacteria, yeast, or insect cells cannot perform. Many proteins also require native N- and C-termini and cannot tolerate extra tag amino acids for proper function. Tropomyosin (Tpm), a coiled coil protein that decorates most actin filaments in cells, requires both native N- and C-termini and PTMs, specifically N-terminal acetylation (Nt-acetylation), to polymerize along actin filaments. Here, we describe a new method that combines native protein expression in human cells with an intein-based purification tag that can be precisely removed after purification. Using this method, we expressed several nonmuscle Tpm isoforms (Tpm1.6, Tpm1.7, Tpm2.1, Tpm3.1, Tpm3.2, and Tpm4.2) and the muscle isoform Tpm1.1. Proteomics analysis revealed that human-cell-expressed Tpms present various PTMs, including Nt-acetylation, Ser/Thr phosphorylation, Tyr phosphorylation, and Lys acetylation. Depending on the Tpm isoform (humans express up to 40 Tpm isoforms), Nt-acetylation occurs on either the initiator methionine or on the second residue after removal of the initiator methionine. Human-cell-expressed Tpms bind F-actin differently than their Escherichia coli-expressed counterparts, with or without N-terminal extensions intended to mimic Nt-acetylation, and they can form heterodimers in cells and in vitro. The expression method described here reveals previously unknown features of nonmuscle Tpms and can be used in future structural and biochemical studies with Tpms and other proteins, as shown here for α-synuclein.
Collapse
Affiliation(s)
- Peter J Carman
- Department of Physiology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA; Biochemistry and Molecular Biophysics Graduate Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Kyle R Barrie
- Department of Physiology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA; Biochemistry and Molecular Biophysics Graduate Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Roberto Dominguez
- Department of Physiology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA.
| |
Collapse
|