51
|
Impact of DJ-1 and Helix 8 on the Proteome and Degradome of Neuron-Like Cells. Cells 2021; 10:cells10020404. [PMID: 33669258 PMCID: PMC7920061 DOI: 10.3390/cells10020404] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2020] [Revised: 02/08/2021] [Accepted: 02/10/2021] [Indexed: 12/04/2022] Open
Abstract
DJ-1 is an abundant and ubiquitous component of cellular proteomes. DJ-1 supposedly exerts a wide variety of molecular functions, ranging from enzymatic activities as a deglycase, protease, and esterase to chaperone functions. However, a consensus perspective on its molecular function in the cellular context has not yet been reached. Structurally, the C-terminal helix 8 of DJ-1 has been proposed to constitute a propeptide whose proteolytic removal transforms a DJ-1 zymogen to an active hydrolase with potential proteolytic activity. To better understand the cell-contextual functionality of DJ-1 and the role of helix 8, we employed post-mitotically differentiated, neuron-like SH-SY5Y neuroblastoma cells with stable over-expression of full length DJ-1 or DJ-1 lacking helix 8 (ΔH8), either with a native catalytically active site (C106) or an inactive site (C106A active site mutation). Global proteome comparison of cells over-expressing DJ-1 ΔH8 with native or mutated active site cysteine indicated a strong impact on mitochondrial biology. N-terminomic profiling however did not highlight direct protease substrate candidates for DJ-1 ΔH8, but linked DJ-1 to elevated levels of activated lysosomal proteases, albeit presumably in an indirect manner. Finally, we show that DJ-1 ΔH8 loses the deglycation activity of full length DJ-1. Our study further establishes DJ-1 as deglycation enzyme. Helix 8 is essential for the deglycation activity but dispensable for the impact on lysosomal and mitochondrial biology; further illustrating the pleiotropic nature of DJ-1.
Collapse
|
52
|
Hotta T, Haynes SE, Blasius TL, Gebbie M, Eberhardt EL, Sept D, Cianfrocco M, Verhey KJ, Nesvizhskii AI, Ohi R. Parthenolide Destabilizes Microtubules by Covalently Modifying Tubulin. Curr Biol 2021; 31:900-907.e6. [PMID: 33482110 DOI: 10.1016/j.cub.2020.11.055] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2020] [Revised: 11/02/2020] [Accepted: 11/19/2020] [Indexed: 12/28/2022]
Abstract
Detyrosination of the α-tubulin C-terminal tail is a post-translational modification (PTM) of microtubules that is key for many biological processes.1 Although detyrosination is the oldest known microtubule PTM,2-7 the carboxypeptidase responsible for this modification, VASH1/2-SVBP, was identified only 3 years ago,8,9 precluding genetic approaches to prevent detyrosination. Studies examining the cellular functions of detyrosination have therefore relied on a natural product, parthenolide, which is widely believed to block detyrosination of α-tubulin in cells, presumably by inhibiting the activity of the relevant carboxypeptidase(s).10 Parthenolide is a sesquiterpene lactone that forms covalent linkages predominantly with exposed thiol groups; e.g., on cysteine residues.11-13 Using mass spectrometry, we show that parthenolide forms adducts on both cysteine and histidine residues on tubulin itself, in vitro and in cells. Parthenolide causes tubulin protein aggregation and prevents the formation of microtubules. In contrast to epoY, an epoxide inhibitor of VASH1/2-SVBP,9 parthenolide does not block VASH1-SVBP activity in vitro. Lastly, we show that epoY is an efficacious inhibitor of microtubule detyrosination in cells, providing an alternative chemical means to block detyrosination. Collectively, our work supports the notion that parthenolide is a promiscuous inhibitor of many cellular processes and suggests that its ability to block detyrosination may be an indirect consequence of reducing the polymerization-competent pool of tubulin in cells.
Collapse
Affiliation(s)
- Takashi Hotta
- Department of Cell and Developmental Biology, University of Michigan, Ann Arbor, MI, USA
| | - Sarah E Haynes
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA
| | - Teresa L Blasius
- Department of Cell and Developmental Biology, University of Michigan, Ann Arbor, MI, USA
| | - Margo Gebbie
- Department of Cell and Developmental Biology, University of Michigan, Ann Arbor, MI, USA
| | - Emily L Eberhardt
- Life Sciences Institute, University of Michigan, Ann Arbor, MI, USA; Department of Biological Chemistry, University of Michigan, Ann Arbor, MI, USA
| | - David Sept
- Department of Biomedical Engineering, University of Michigan, Ann Arbor, MI, USA
| | - Michael Cianfrocco
- Life Sciences Institute, University of Michigan, Ann Arbor, MI, USA; Department of Biological Chemistry, University of Michigan, Ann Arbor, MI, USA
| | - Kristen J Verhey
- Department of Cell and Developmental Biology, University of Michigan, Ann Arbor, MI, USA
| | - Alexey I Nesvizhskii
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA; Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Ryoma Ohi
- Department of Cell and Developmental Biology, University of Michigan, Ann Arbor, MI, USA.
| |
Collapse
|
53
|
|
54
|
Polasky DA, Yu F, Teo GC, Nesvizhskii AI. Fast and comprehensive N- and O-glycoproteomics analysis with MSFragger-Glyco. Nat Methods 2020; 17:1125-1132. [PMID: 33020657 PMCID: PMC7606558 DOI: 10.1038/s41592-020-0967-9] [Citation(s) in RCA: 137] [Impact Index Per Article: 34.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2020] [Accepted: 08/31/2020] [Indexed: 12/15/2022]
Abstract
Recent advances in methods for enrichment and mass spectrometric analysis of intact glycopeptides have produced large-scale glycoproteomics datasets, but interpreting these data remains challenging. We present MSFragger-Glyco, a glycoproteomics mode of the MSFragger search engine, for fast and sensitive identification of N- and O-linked glycopeptides and open glycan searches. Reanalysis of recent N-glycoproteomics data resulted in annotation of 80% more glycopeptide spectrum matches (glycoPSMs) than previously reported. In published O-glycoproteomics data, our method more than doubled the number of glycoPSMs annotated when searching the same glycans as the original search, and yielded 4- to 6-fold increases when expanding searches to include additional glycan compositions and other modifications. Expanded searches also revealed many sulfated and complex glycans that remained hidden to the original search. With greatly improved spectral annotation, coupled with the speed of index-based scoring, MSFragger-Glyco makes it possible to comprehensively interrogate glycoproteomics data and illuminate the many roles of glycosylation.
Collapse
Affiliation(s)
- Daniel A Polasky
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA
| | - Fengchao Yu
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA
| | - Guo Ci Teo
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA
| | - Alexey I Nesvizhskii
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA.
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA.
| |
Collapse
|
55
|
Polasky DA, Yu F, Teo GC, Nesvizhskii AI. Fast and comprehensive N- and O-glycoproteomics analysis with MSFragger-Glyco. Nat Methods 2020. [PMID: 33020657 DOI: 10.1101/2020.05.18.102665] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/30/2023]
Abstract
Recent advances in methods for enrichment and mass spectrometric analysis of intact glycopeptides have produced large-scale glycoproteomics datasets, but interpreting these data remains challenging. We present MSFragger-Glyco, a glycoproteomics mode of the MSFragger search engine, for fast and sensitive identification of N- and O-linked glycopeptides and open glycan searches. Reanalysis of recent N-glycoproteomics data resulted in annotation of 80% more glycopeptide spectrum matches (glycoPSMs) than previously reported. In published O-glycoproteomics data, our method more than doubled the number of glycoPSMs annotated when searching the same glycans as the original search, and yielded 4- to 6-fold increases when expanding searches to include additional glycan compositions and other modifications. Expanded searches also revealed many sulfated and complex glycans that remained hidden to the original search. With greatly improved spectral annotation, coupled with the speed of index-based scoring, MSFragger-Glyco makes it possible to comprehensively interrogate glycoproteomics data and illuminate the many roles of glycosylation.
Collapse
Affiliation(s)
- Daniel A Polasky
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA
| | - Fengchao Yu
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA
| | - Guo Ci Teo
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA
| | - Alexey I Nesvizhskii
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA.
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA.
| |
Collapse
|
56
|
Li K, Jain A, Malovannaya A, Wen B, Zhang B. DeepRescore: Leveraging Deep Learning to Improve Peptide Identification in Immunopeptidomics. Proteomics 2020; 20:e1900334. [PMID: 32864883 PMCID: PMC7718998 DOI: 10.1002/pmic.201900334] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2020] [Revised: 08/27/2020] [Indexed: 12/23/2022]
Abstract
The identification of major histocompatibility complex (MHC)-binding peptides in mass spectrometry (MS)-based immunopeptideomics relies largely on database search engines developed for proteomics data analysis. However, because immunopeptidomics experiments do not involve enzymatic digestion at specific residues, an inflated search space leads to a high false positive rate and low sensitivity in peptide identification. In order to improve the sensitivity and reliability of peptide identification, a post-processing tool named DeepRescore is developed. DeepRescore combines peptide features derived from deep learning predictions, namely accurate retention timeand MS/MS spectra predictions, with previously used features to rescore peptide-spectrum matches. Using two public immunopeptidomics datasets, it is shown that rescoring by DeepRescore increases both the sensitivity and reliability of MHC-binding peptide and neoantigen identifications compared to existing methods. It is also shown that the performance improvement is, to a large extent, driven by the deep learning-derived features. DeepRescore is developed using NextFlow and Docker and is available at https://github.com/bzhanglab/DeepRescore.
Collapse
Affiliation(s)
- Kai Li
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Antrix Jain
- Mass Spectrometry Proteomics Core, Baylor College of Medicine, Houston, TX 77030, USA
| | - Anna Malovannaya
- Mass Spectrometry Proteomics Core, Baylor College of Medicine, Houston, TX 77030, USA
- Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, TX 77030, USA
| | - Bo Wen
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Bing Zhang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| |
Collapse
|
57
|
Zecha J, Lee CY, Bayer FP, Meng C, Grass V, Zerweck J, Schnatbaum K, Michler T, Pichlmair A, Ludwig C, Kuster B. Data, Reagents, Assays and Merits of Proteomics for SARS-CoV-2 Research and Testing. Mol Cell Proteomics 2020; 19:1503-1522. [PMID: 32591346 PMCID: PMC7780043 DOI: 10.1074/mcp.ra120.002164] [Citation(s) in RCA: 62] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2020] [Revised: 06/26/2020] [Indexed: 12/14/2022] Open
Abstract
As the COVID-19 pandemic continues to spread, thousands of scientists around the globe have changed research direction to understand better how the virus works and to find out how it may be tackled. The number of manuscripts on preprint servers is soaring and peer-reviewed publications using MS-based proteomics are beginning to emerge. To facilitate proteomic research on SARS-CoV-2, the virus that causes COVID-19, this report presents deep-scale proteomes (10,000 proteins; >130,000 peptides) of common cell line models, notably Vero E6, Calu-3, Caco-2, and ACE2-A549 that characterize their protein expression profiles including viral entry factors such as ACE2 or TMPRSS2. Using the 9 kDa protein SRP9 and the breast cancer oncogene BRCA1 as examples, we show how the proteome expression data can be used to refine the annotation of protein-coding regions of the African green monkey and the Vero cell line genomes. Monitoring changes of the proteome on viral infection revealed widespread expression changes including transcriptional regulators, protease inhibitors, and proteins involved in innate immunity. Based on a library of 98 stable-isotope labeled synthetic peptides representing 11 SARS-CoV-2 proteins, we developed PRM (parallel reaction monitoring) assays for nano-flow and micro-flow LC-MS/MS. We assessed the merits of these PRM assays using supernatants of virus-infected Vero E6 cells and challenged the assays by analyzing two diagnostic cohorts of 24 (+30) SARS-CoV-2 positive and 28 (+9) negative cases. In light of the results obtained and including recent publications or manuscripts on preprint servers, we critically discuss the merits of MS-based proteomics for SARS-CoV-2 research and testing.
Collapse
Affiliation(s)
- Jana Zecha
- Chair of Proteomics and Bioanalytics, Technical University of Munich, Freising, Germany
| | - Chien-Yun Lee
- Chair of Proteomics and Bioanalytics, Technical University of Munich, Freising, Germany
| | - Florian P Bayer
- Chair of Proteomics and Bioanalytics, Technical University of Munich, Freising, Germany
| | - Chen Meng
- Bavarian Center for Biomolecular Mass Spectrometry (BayBioMS), Technical University of Munich, Freising, Germany
| | - Vincent Grass
- Institute of Virology, School of Medicine, Technical University of Munich, Munich, Germany; German Center for Infection Research (DZIF), Munich partner site, Germany
| | | | | | - Thomas Michler
- Institute of Virology, School of Medicine, Technical University of Munich, Munich, Germany
| | - Andreas Pichlmair
- Institute of Virology, School of Medicine, Technical University of Munich, Munich, Germany; German Center for Infection Research (DZIF), Munich partner site, Germany
| | - Christina Ludwig
- Bavarian Center for Biomolecular Mass Spectrometry (BayBioMS), Technical University of Munich, Freising, Germany.
| | - Bernhard Kuster
- Chair of Proteomics and Bioanalytics, Technical University of Munich, Freising, Germany; Bavarian Center for Biomolecular Mass Spectrometry (BayBioMS), Technical University of Munich, Freising, Germany.
| |
Collapse
|
58
|
Yu F, Haynes SE, Teo GC, Avtonomov DM, Polasky DA, Nesvizhskii AI. Fast Quantitative Analysis of timsTOF PASEF Data with MSFragger and IonQuant. Mol Cell Proteomics 2020; 19:1575-1585. [PMID: 32616513 PMCID: PMC7996969 DOI: 10.1074/mcp.tir120.002048] [Citation(s) in RCA: 126] [Impact Index Per Article: 31.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2020] [Revised: 05/21/2020] [Indexed: 01/01/2023] Open
Abstract
Ion mobility brings an additional dimension of separation to LC-MS, improving identification of peptides and proteins in complex mixtures. A recently introduced timsTOF mass spectrometer (Bruker) couples trapped ion mobility separation to TOF mass analysis. With the parallel accumulation serial fragmentation (PASEF) method, the timsTOF platform achieves promising results, yet analysis of the data generated on this platform represents a major bottleneck. Currently, MaxQuant and PEAKS are most used to analyze these data. However, because of the high complexity of timsTOF PASEF data, both require substantial time to perform even standard tryptic searches. Advanced searches (e.g. with many variable modifications, semi- or non-enzymatic searches, or open searches for post-translational modification discovery) are practically impossible. We have extended our fast peptide identification tool MSFragger to support timsTOF PASEF data, and developed a label-free quantification tool, IonQuant, for fast and accurate 4-D feature extraction and quantification. Using a HeLa data set published by Meier et al. (2018), we demonstrate that MSFragger identifies significantly (∼30%) more unique peptides than MaxQuant (1.6.10.43), and performs comparably or better than PEAKS X+ (∼10% more peptides). IonQuant outperforms both in terms of number of quantified proteins while maintaining good quantification precision and accuracy. Runtime tests show that MSFragger and IonQuant can fully process a typical two-hour PASEF run in under 70 min on a typical desktop (6 CPU cores, 32 GB RAM), significantly faster than other tools. Finally, through semi-enzymatic searching, we significantly increase the number of identified peptides. Within these semi-tryptic identifications, we report evidence of gas-phase fragmentation before MS/MS analysis.
Collapse
Affiliation(s)
- Fengchao Yu
- Department of Pathology, University of Michigan, Ann Arbor, Michigan, USA
| | - Sarah E Haynes
- Department of Pathology, University of Michigan, Ann Arbor, Michigan, USA
| | - Guo Ci Teo
- Department of Pathology, University of Michigan, Ann Arbor, Michigan, USA
| | - Dmitry M Avtonomov
- Department of Pathology, University of Michigan, Ann Arbor, Michigan, USA
| | - Daniel A Polasky
- Department of Pathology, University of Michigan, Ann Arbor, Michigan, USA
| | - Alexey I Nesvizhskii
- Department of Pathology, University of Michigan, Ann Arbor, Michigan, USA; Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, USA.
| |
Collapse
|
59
|
Ming L, Zou Y, Zhao Y, Zhang L, He N, Chen Z, Li SSC, Li L. MMS2plot: An R Package for Visualizing Multiple MS/MS Spectra for Groups of Modified and Non-Modified Peptides. Proteomics 2020; 20:e2000061. [PMID: 32643287 DOI: 10.1002/pmic.202000061] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2020] [Revised: 06/06/2020] [Indexed: 11/11/2022]
Abstract
A large number of post-translational modifications (PTMs) in proteins are buried in the unassigned mass spectrometric (MS) spectra in shot-gun proteomics datasets. Because the modified peptide fragments are low in abundance relative to the corresponding non-modified versions, it is critical to develop tools that allow facile evaluation of assignment of PTMs based on the MS/MS spectra. Such tools will preferably have the ability to allow comparison of fragment ion spectra and retention time between the modified and unmodified peptide pairs or group. Herein, MMS2plot, an R package for visualizing peptide-spectrum matches (PSMs) for multiple peptides, is described. MMS2plot features a batch mode and generates the output images in vector graphics file format that facilitate evaluation and publication of the PSM assignment. MMS2plot is expected to play an important role in PTM discovery from large-scale proteomics datasets generated by liquid chromatography-MS/MS. The MMS2plot package is freely available at https://github.com/lileir/MMS2plot under the GPL-3 license.
Collapse
Affiliation(s)
- Liya Ming
- School of Basic Medicine, Qingdao University, Qingdao, 266021, China
| | - Yang Zou
- School of Basic Medicine, Qingdao University, Qingdao, 266021, China
| | - Yiming Zhao
- Data Science and Software Engineering, Qingdao University, Qingdao, 266021, China
| | - Luna Zhang
- Data Science and Software Engineering, Qingdao University, Qingdao, 266021, China
| | - Ningning He
- School of Basic Medicine, Qingdao University, Qingdao, 266021, China
| | - Zhen Chen
- School of Basic Medicine, Qingdao University, Qingdao, 266021, China
| | - Shawn S-C Li
- Department of Biochemistry, Schulich School of Medicine and Dentistry, Western University, London, N6A 5C1, Canada
| | - Lei Li
- School of Basic Medicine, Qingdao University, Qingdao, 266021, China
- Data Science and Software Engineering, Qingdao University, Qingdao, 266021, China
| |
Collapse
|
60
|
Gillette MA, Satpathy S, Cao S, Dhanasekaran SM, Vasaikar SV, Krug K, Petralia F, Li Y, Liang WW, Reva B, Krek A, Ji J, Song X, Liu W, Hong R, Yao L, Blumenberg L, Savage SR, Wendl MC, Wen B, Li K, Tang LC, MacMullan MA, Avanessian SC, Kane MH, Newton CJ, Cornwell M, Kothadia RB, Ma W, Yoo S, Mannan R, Vats P, Kumar-Sinha C, Kawaler EA, Omelchenko T, Colaprico A, Geffen Y, Maruvka YE, da Veiga Leprevost F, Wiznerowicz M, Gümüş ZH, Veluswamy RR, Hostetter G, Heiman DI, Wyczalkowski MA, Hiltke T, Mesri M, Kinsinger CR, Boja ES, Omenn GS, Chinnaiyan AM, Rodriguez H, Li QK, Jewell SD, Thiagarajan M, Getz G, Zhang B, Fenyö D, Ruggles KV, Cieslik MP, Robles AI, Clauser KR, Govindan R, Wang P, Nesvizhskii AI, Ding L, Mani DR, Carr SA. Proteogenomic Characterization Reveals Therapeutic Vulnerabilities in Lung Adenocarcinoma. Cell 2020; 182:200-225.e35. [PMID: 32649874 PMCID: PMC7373300 DOI: 10.1016/j.cell.2020.06.013] [Citation(s) in RCA: 392] [Impact Index Per Article: 98.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2019] [Revised: 03/06/2020] [Accepted: 06/03/2020] [Indexed: 12/24/2022]
Abstract
To explore the biology of lung adenocarcinoma (LUAD) and identify new therapeutic opportunities, we performed comprehensive proteogenomic characterization of 110 tumors and 101 matched normal adjacent tissues (NATs) incorporating genomics, epigenomics, deep-scale proteomics, phosphoproteomics, and acetylproteomics. Multi-omics clustering revealed four subgroups defined by key driver mutations, country, and gender. Proteomic and phosphoproteomic data illuminated biology downstream of copy number aberrations, somatic mutations, and fusions and identified therapeutic vulnerabilities associated with driver events involving KRAS, EGFR, and ALK. Immune subtyping revealed a complex landscape, reinforced the association of STK11 with immune-cold behavior, and underscored a potential immunosuppressive role of neutrophil degranulation. Smoking-associated LUADs showed correlation with other environmental exposure signatures and a field effect in NATs. Matched NATs allowed identification of differentially expressed proteins with potential diagnostic and therapeutic utility. This proteogenomics dataset represents a unique public resource for researchers and clinicians seeking to better understand and treat lung adenocarcinomas.
Collapse
Affiliation(s)
- Michael A Gillette
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA, 02142, USA; Division of Pulmonary and Critical Care Medicine, Massachusetts General Hospital, Boston, MA, 02115, USA.
| | - Shankha Satpathy
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA, 02142, USA.
| | - Song Cao
- Department of Medicine and Genetics, Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO 63110, USA
| | | | - Suhas V Vasaikar
- Department of Translational Molecular Pathology, MD Anderson Cancer Center, Houston, TX, 77030, USA
| | - Karsten Krug
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA, 02142, USA
| | - Francesca Petralia
- Department of Genetics and Genomic Sciences, Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Yize Li
- Department of Medicine and Genetics, Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO 63110, USA
| | - Wen-Wei Liang
- Department of Medicine and Genetics, Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO 63110, USA
| | - Boris Reva
- Department of Genetics and Genomic Sciences, Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Azra Krek
- Department of Genetics and Genomic Sciences, Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Jiayi Ji
- Department of Population Health Science and Policy; Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
| | - Xiaoyu Song
- Department of Population Health Science and Policy; Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
| | - Wenke Liu
- Institute for Systems Genetics and Department of Biochemistry and Molecular Pharmacology, NYU Grossman School of Medicine, New York, NY 10016, USA
| | - Runyu Hong
- Institute for Systems Genetics and Department of Biochemistry and Molecular Pharmacology, NYU Grossman School of Medicine, New York, NY 10016, USA
| | - Lijun Yao
- Department of Medicine and Genetics, Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO 63110, USA
| | - Lili Blumenberg
- Institute for Systems Genetics and Department of Medicine, NYU Grossman School of Medicine, New York, NY 10016, USA
| | - Sara R Savage
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Michael C Wendl
- Department of Medicine and Genetics, Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO 63110, USA
| | - Bo Wen
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Kai Li
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Lauren C Tang
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA, 02142, USA; Department of Biological Sciences, Columbia University, New York, NY, 10027, USA
| | - Melanie A MacMullan
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA, 02142, USA; Mork Family Department of Chemical Engineering and Materials Science, University of Southern California, Los Angeles, CA, 90089, USA
| | - Shayan C Avanessian
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA, 02142, USA
| | - M Harry Kane
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA, 02142, USA
| | | | - MacIntosh Cornwell
- Institute for Systems Genetics and Department of Medicine, NYU Grossman School of Medicine, New York, NY 10016, USA
| | - Ramani B Kothadia
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA, 02142, USA
| | - Weiping Ma
- Department of Genetics and Genomic Sciences, Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Seungyeul Yoo
- Department of Genetics and Genomic Sciences, Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Rahul Mannan
- Department of Pathology, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Pankaj Vats
- Department of Pathology, University of Michigan, Ann Arbor, MI, 48109, USA
| | | | - Emily A Kawaler
- Institute for Systems Genetics and Department of Biochemistry and Molecular Pharmacology, NYU Grossman School of Medicine, New York, NY 10016, USA
| | - Tatiana Omelchenko
- Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY, 10065, USA
| | - Antonio Colaprico
- Department of Public Health Sciences, University of Miami, Miller School of Medicine, Miami, FL, 33136, USA
| | - Yifat Geffen
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA, 02142, USA
| | - Yosef E Maruvka
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA, 02142, USA
| | | | - Maciej Wiznerowicz
- Poznan University of Medical Sciences, Poznań, 61-701, Poland; International Institute for Molecular Oncology, Poznań, 60-203, Poland
| | - Zeynep H Gümüş
- Department of Genetics and Genomic Sciences, Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Rajwanth R Veluswamy
- Division of Hematology and Medical Oncology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | | | - David I Heiman
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA, 02142, USA
| | - Matthew A Wyczalkowski
- Department of Medicine and Genetics, Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO 63110, USA
| | - Tara Hiltke
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, Bethesda, MD, 20892, USA
| | - Mehdi Mesri
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, Bethesda, MD, 20892, USA
| | - Christopher R Kinsinger
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, Bethesda, MD, 20892, USA
| | - Emily S Boja
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, Bethesda, MD, 20892, USA
| | - Gilbert S Omenn
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Arul M Chinnaiyan
- Department of Pathology, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Henry Rodriguez
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, Bethesda, MD, 20892, USA
| | - Qing Kay Li
- Sidney Kimmel Comprehensive Cancer Center, The Johns Hopkins Medical Institutions, Baltimore, MD, 21224, USA
| | - Scott D Jewell
- Van Andel Research Institute, Grand Rapids, MI, 49503, USA
| | - Mathangi Thiagarajan
- Leidos Biomedical Research Inc., Frederick National Laboratory for Cancer Research, Frederick, MD, 21702, USA
| | - Gad Getz
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA, 02142, USA
| | - Bing Zhang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, 77030, USA
| | - David Fenyö
- Institute for Systems Genetics and Department of Biochemistry and Molecular Pharmacology, NYU Grossman School of Medicine, New York, NY 10016, USA
| | - Kelly V Ruggles
- Institute for Systems Genetics and Department of Medicine, NYU Grossman School of Medicine, New York, NY 10016, USA
| | - Marcin P Cieslik
- Department of Pathology, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Ana I Robles
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, Bethesda, MD, 20892, USA
| | - Karl R Clauser
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA, 02142, USA
| | - Ramaswamy Govindan
- Division of Oncology and Siteman Cancer Center, Washington University School of Medicine in St. Louis, St. Louis, MO, 63110, USA
| | - Pei Wang
- Department of Genetics and Genomic Sciences, Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Alexey I Nesvizhskii
- Department of Pathology, University of Michigan, Ann Arbor, MI, 48109, USA; Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Li Ding
- Department of Medicine and Genetics, Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO 63110, USA
| | - D R Mani
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA, 02142, USA
| | - Steven A Carr
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA, 02142, USA.
| |
Collapse
|
61
|
Li H, Zhou R, Xu S, Chen X, Hong Y, Lu Q, Liu H, Zhou B, Liang X. Improving Gene Annotation of the Peanut Genome by Integrated Proteogenomics Workflow. J Proteome Res 2020; 19:2226-2235. [DOI: 10.1021/acs.jproteome.9b00723] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Affiliation(s)
- Haifen Li
- Crops Research Institute, Guangdong Academy of Agricultural Sciences, Guangdong Key Laboratory for Crops Genetic Improvement, South China Peanut Sub-Center of National Center of Oilseed Crops Improvement, Guangzhou 510640, China
| | - Ruo Zhou
- Deepxomics Co., Ltd., Shenzhen 518000, China
| | - Shaohang Xu
- Deepxomics Co., Ltd., Shenzhen 518000, China
| | - Xiaoping Chen
- Crops Research Institute, Guangdong Academy of Agricultural Sciences, Guangdong Key Laboratory for Crops Genetic Improvement, South China Peanut Sub-Center of National Center of Oilseed Crops Improvement, Guangzhou 510640, China
| | - Yanbin Hong
- Crops Research Institute, Guangdong Academy of Agricultural Sciences, Guangdong Key Laboratory for Crops Genetic Improvement, South China Peanut Sub-Center of National Center of Oilseed Crops Improvement, Guangzhou 510640, China
| | - Qing Lu
- Crops Research Institute, Guangdong Academy of Agricultural Sciences, Guangdong Key Laboratory for Crops Genetic Improvement, South China Peanut Sub-Center of National Center of Oilseed Crops Improvement, Guangzhou 510640, China
| | - Hao Liu
- Crops Research Institute, Guangdong Academy of Agricultural Sciences, Guangdong Key Laboratory for Crops Genetic Improvement, South China Peanut Sub-Center of National Center of Oilseed Crops Improvement, Guangzhou 510640, China
| | - Baojin Zhou
- Deepxomics Co., Ltd., Shenzhen 518000, China
| | - Xuanqiang Liang
- Crops Research Institute, Guangdong Academy of Agricultural Sciences, Guangdong Key Laboratory for Crops Genetic Improvement, South China Peanut Sub-Center of National Center of Oilseed Crops Improvement, Guangzhou 510640, China
| |
Collapse
|
62
|
Wen B, Li K, Zhang Y, Zhang B. Cancer neoantigen prioritization through sensitive and reliable proteogenomics analysis. Nat Commun 2020; 11:1759. [PMID: 32273506 PMCID: PMC7145864 DOI: 10.1038/s41467-020-15456-w] [Citation(s) in RCA: 78] [Impact Index Per Article: 19.5] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2019] [Accepted: 03/10/2020] [Indexed: 01/01/2023] Open
Abstract
Genomics-based neoantigen discovery can be enhanced by proteomic evidence, but there remains a lack of consensus on the performance of different quality control methods for variant peptide identification in proteogenomics. We propose to use the difference between accurately predicted and observed retention times for each peptide as a metric to evaluate different quality control methods. To this end, we develop AutoRT, a deep learning algorithm with high accuracy in retention time prediction. Analysis of three cancer data sets with a total of 287 tumor samples using different quality control strategies results in substantially different numbers of identified variant peptides and putative neoantigens. Our systematic evaluation, using the proposed retention time metric, provides insights and practical guidance on the selection of quality control strategies. We implement the recommended strategy in a computational workflow named NeoFlow to support proteogenomics-based neoantigen prioritization, enabling more sensitive discovery of putative neoantigens. Identifying mutation-derived neoantigens by proteogenomics requires robust strategies for quality control. Here, the authors propose peptide retention time as an evaluation metric for proteogenomics quality control methods, and develop a deep learning algorithm for accurate retention time prediction.
Collapse
Affiliation(s)
- Bo Wen
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, 77030, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Kai Li
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, 77030, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Yun Zhang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, 77030, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Bing Zhang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, 77030, USA. .,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA.
| |
Collapse
|
63
|
Quantitative proteomic landscape of metaplastic breast carcinoma pathological subtypes and their relationship to triple-negative tumors. Nat Commun 2020; 11:1723. [PMID: 32265444 PMCID: PMC7138853 DOI: 10.1038/s41467-020-15283-z] [Citation(s) in RCA: 67] [Impact Index Per Article: 16.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2019] [Accepted: 02/28/2020] [Indexed: 12/15/2022] Open
Abstract
Metaplastic breast carcinoma (MBC) is a highly aggressive form of triple-negative cancer (TNBC), defined by the presence of metaplastic components of spindle, squamous, or sarcomatoid histology. The protein profiles underpinning the pathological subtypes and metastatic behavior of MBC are unknown. Using multiplex quantitative tandem mass tag-based proteomics we quantify 5798 proteins in MBC, TNBC, and normal breast from 27 patients. Comparing MBC and TNBC protein profiles we show MBC-specific increases related to epithelial-to-mesenchymal transition and extracellular matrix, and reduced metabolic pathways. MBC subtypes exhibit distinct upregulated profiles, including translation and ribosomal events in spindle, inflammation- and apical junction-related proteins in squamous, and extracellular matrix proteins in sarcomatoid subtypes. Comparison of the proteomes of human spindle MBC with mouse spindle (CCN6 knockout) MBC tumors reveals a shared spindle-specific signature of 17 upregulated proteins involved in translation and 19 downregulated proteins with roles in cell metabolism. These data identify potential subtype specific MBC biomarkers and therapeutic targets. Metaplastic breast carcinoma (MBC) is among the most aggressive subtypes of triple-negative breast cancer (TNBC) but the underlying proteome profiles are unknown. Here, the authors characterize the protein signatures of human MBC tissue samples and their relationship to TNBC and normal breast tissue.
Collapse
|
64
|
McGowan T, Johnson JE, Kumar P, Sajulga R, Mehta S, Jagtap PD, Griffin TJ. Multi-omics Visualization Platform: An extensible Galaxy plug-in for multi-omics data visualization and exploration. Gigascience 2020; 9:giaa025. [PMID: 32236523 PMCID: PMC7102281 DOI: 10.1093/gigascience/giaa025] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2019] [Revised: 02/13/2020] [Accepted: 02/24/2020] [Indexed: 12/22/2022] Open
Abstract
BACKGROUND Proteogenomics integrates genomics, transcriptomics, and mass spectrometry (MS)-based proteomics data to identify novel protein sequences arising from gene and transcript sequence variants. Proteogenomic data analysis requires integration of disparate 'omic software tools, as well as customized tools to view and interpret results. The flexible Galaxy platform has proven valuable for proteogenomic data analysis. Here, we describe a novel Multi-omics Visualization Platform (MVP) for organizing, visualizing, and exploring proteogenomic results, adding a critically needed tool for data exploration and interpretation. FINDINGS MVP is built as an HTML Galaxy plug-in, primarily based on JavaScript. Via the Galaxy API, MVP uses SQLite databases as input-a custom data type (mzSQLite) containing MS-based peptide identification information, a variant annotation table, and a coding sequence table. Users can interactively filter identified peptides based on sequence and data quality metrics, view annotated peptide MS data, and visualize protein-level information, along with genomic coordinates. Peptides that pass the user-defined thresholds can be sent back to Galaxy via the API for further analysis; processed data and visualizations can also be saved and shared. MVP leverages the Integrated Genomics Viewer JavaScript framework, enabling interactive visualization of peptides and corresponding transcript and genomic coding information within the MVP interface. CONCLUSIONS MVP provides a powerful, extensible platform for automated, interactive visualization of proteogenomic results within the Galaxy environment, adding a unique and critically needed tool for empowering exploration and interpretation of results. The platform is extensible, providing a basis for further development of new functionalities for proteogenomic data visualization.
Collapse
Affiliation(s)
- Thomas McGowan
- Minnesota Supercomputing Institute, University of Minnesota, 599 Walter Library, 117 Pleasant Street SE, Minneapolis, MN 55455, USA
| | - James E Johnson
- Minnesota Supercomputing Institute, University of Minnesota, 599 Walter Library, 117 Pleasant Street SE, Minneapolis, MN 55455, USA
| | - Praveen Kumar
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, 6–155 Jackson Hall, 321 Church Street SE, Minneapolis, MN 55455, USA
- Bioinformatics and Computational Biology program, University of Minnesota-Rochester, 111 South Broadway, Suite 300, Rochester, MN 55904, USA
| | - Ray Sajulga
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, 6–155 Jackson Hall, 321 Church Street SE, Minneapolis, MN 55455, USA
| | - Subina Mehta
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, 6–155 Jackson Hall, 321 Church Street SE, Minneapolis, MN 55455, USA
| | - Pratik D Jagtap
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, 6–155 Jackson Hall, 321 Church Street SE, Minneapolis, MN 55455, USA
| | - Timothy J Griffin
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, 6–155 Jackson Hall, 321 Church Street SE, Minneapolis, MN 55455, USA
| |
Collapse
|
65
|
Dou Y, Kawaler EA, Cui Zhou D, Gritsenko MA, Huang C, Blumenberg L, Karpova A, Petyuk VA, Savage SR, Satpathy S, Liu W, Wu Y, Tsai CF, Wen B, Li Z, Cao S, Moon J, Shi Z, Cornwell M, Wyczalkowski MA, Chu RK, Vasaikar S, Zhou H, Gao Q, Moore RJ, Li K, Sethuraman S, Monroe ME, Zhao R, Heiman D, Krug K, Clauser K, Kothadia R, Maruvka Y, Pico AR, Oliphant AE, Hoskins EL, Pugh SL, Beecroft SJI, Adams DW, Jarman JC, Kong A, Chang HY, Reva B, Liao Y, Rykunov D, Colaprico A, Chen XS, Czekański A, Jędryka M, Matkowski R, Wiznerowicz M, Hiltke T, Boja E, Kinsinger CR, Mesri M, Robles AI, Rodriguez H, Mutch D, Fuh K, Ellis MJ, DeLair D, Thiagarajan M, Mani DR, Getz G, Noble M, Nesvizhskii AI, Wang P, Anderson ML, Levine DA, Smith RD, Payne SH, Ruggles KV, Rodland KD, Ding L, Zhang B, Liu T, Fenyö D. Proteogenomic Characterization of Endometrial Carcinoma. Cell 2020; 180:729-748.e26. [PMID: 32059776 PMCID: PMC7233456 DOI: 10.1016/j.cell.2020.01.026] [Citation(s) in RCA: 273] [Impact Index Per Article: 68.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2019] [Revised: 11/11/2019] [Accepted: 01/16/2020] [Indexed: 02/07/2023]
Abstract
We undertook a comprehensive proteogenomic characterization of 95 prospectively collected endometrial carcinomas, comprising 83 endometrioid and 12 serous tumors. This analysis revealed possible new consequences of perturbations to the p53 and Wnt/β-catenin pathways, identified a potential role for circRNAs in the epithelial-mesenchymal transition, and provided new information about proteomic markers of clinical and genomic tumor subgroups, including relationships to known druggable pathways. An extensive genome-wide acetylation survey yielded insights into regulatory mechanisms linking Wnt signaling and histone acetylation. We also characterized aspects of the tumor immune landscape, including immunogenic alterations, neoantigens, common cancer/testis antigens, and the immune microenvironment, all of which can inform immunotherapy decisions. Collectively, our multi-omic analyses provide a valuable resource for researchers and clinicians, identify new molecular associations of potential mechanistic significance in the development of endometrial cancers, and suggest novel approaches for identifying potential therapeutic targets.
Collapse
Affiliation(s)
- Yongchao Dou
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Dan L Duncan Comprehensive Cancer Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - Emily A Kawaler
- Institute for Systems Genetics, NYU School of Medicine, New York, NY 10016, USA; Department of Biochemistry and Molecular Pharmacology, NYU School of Medicine, New York, NY 10016, USA
| | - Daniel Cui Zhou
- Department of Medicine and Genetics, Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO 63110, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63108, USA
| | - Marina A Gritsenko
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99354, USA
| | - Chen Huang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Dan L Duncan Comprehensive Cancer Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - Lili Blumenberg
- Department of Medicine, NYU School of Medicine, New York, NY 10016, USA
| | - Alla Karpova
- Department of Medicine and Genetics, Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO 63110, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63108, USA
| | - Vladislav A Petyuk
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99354, USA
| | - Sara R Savage
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Dan L Duncan Comprehensive Cancer Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - Shankha Satpathy
- The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Wenke Liu
- Institute for Systems Genetics, NYU School of Medicine, New York, NY 10016, USA; Department of Biochemistry and Molecular Pharmacology, NYU School of Medicine, New York, NY 10016, USA
| | - Yige Wu
- Department of Medicine and Genetics, Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO 63110, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63108, USA
| | - Chia-Feng Tsai
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99354, USA
| | - Bo Wen
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Dan L Duncan Comprehensive Cancer Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - Zhi Li
- Institute for Systems Genetics, NYU School of Medicine, New York, NY 10016, USA; Department of Biochemistry and Molecular Pharmacology, NYU School of Medicine, New York, NY 10016, USA
| | - Song Cao
- Department of Medicine and Genetics, Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO 63110, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63108, USA
| | - Jamie Moon
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99354, USA
| | - Zhiao Shi
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Dan L Duncan Comprehensive Cancer Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - MacIntosh Cornwell
- Institute for Systems Genetics, NYU School of Medicine, New York, NY 10016, USA; Department of Biochemistry and Molecular Pharmacology, NYU School of Medicine, New York, NY 10016, USA
| | - Matthew A Wyczalkowski
- Department of Medicine and Genetics, Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO 63110, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63108, USA
| | - Rosalie K Chu
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99354, USA
| | - Suhas Vasaikar
- Department of Translational Molecular Pathology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Hua Zhou
- Institute for Systems Genetics, NYU School of Medicine, New York, NY 10016, USA; Department of Biochemistry and Molecular Pharmacology, NYU School of Medicine, New York, NY 10016, USA
| | - Qingsong Gao
- Department of Medicine and Genetics, Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO 63110, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63108, USA
| | - Ronald J Moore
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99354, USA
| | - Kai Li
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Dan L Duncan Comprehensive Cancer Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - Sunantha Sethuraman
- Department of Medicine and Genetics, Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO 63110, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63108, USA
| | - Matthew E Monroe
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99354, USA
| | - Rui Zhao
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99354, USA
| | - David Heiman
- The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Karsten Krug
- The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Karl Clauser
- The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Ramani Kothadia
- The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Yosef Maruvka
- The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Alexander R Pico
- Institute of Data Science and Biotechnology, Gladstone Institutes, San Francisco, CA 94158, USA
| | - Amanda E Oliphant
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| | - Emily L Hoskins
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| | - Samuel L Pugh
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| | - Sean J I Beecroft
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| | - David W Adams
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| | - Jonathan C Jarman
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| | - Andy Kong
- Department of Pathology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Hui-Yin Chang
- Department of Pathology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Boris Reva
- Department of Genetics and Genomic Sciences, Icahn Institute of Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Yuxing Liao
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Dan L Duncan Comprehensive Cancer Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - Dmitry Rykunov
- Department of Genetics and Genomic Sciences, Icahn Institute of Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Antonio Colaprico
- Sylvester Comprehensive Cancer Center, University of Miami Miller School of Medicine, Miami, FL 33136, USA; Division of Biostatistics, Department of Public Health Science, University of Miami Miller School of Medicine, Miami, FL 33136, USA
| | - Xi Steven Chen
- Sylvester Comprehensive Cancer Center, University of Miami Miller School of Medicine, Miami, FL 33136, USA; Division of Biostatistics, Department of Public Health Science, University of Miami Miller School of Medicine, Miami, FL 33136, USA
| | - Andrzej Czekański
- Department of Oncology, Wroclaw Medical University, 50-367 Wrocław, Poland; Wroclaw Comprehensive Cancer Center, 53-413 Wrocław, Poland
| | - Marcin Jędryka
- Department of Oncology, Wroclaw Medical University, 50-367 Wrocław, Poland; Wroclaw Comprehensive Cancer Center, 53-413 Wrocław, Poland
| | - Rafał Matkowski
- Department of Oncology, Wroclaw Medical University, 50-367 Wrocław, Poland; Wroclaw Comprehensive Cancer Center, 53-413 Wrocław, Poland
| | - Maciej Wiznerowicz
- Poznan University of Medical Sciences, 61-701 Poznań, Poland; University Hospital of Lord's Transfiguration, 60-569 Poznań, Poland; International Institute for Molecular Oncology, 60-203 Poznań, Poland
| | - Tara Hiltke
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, Bethesda, MD 20892, USA
| | - Emily Boja
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, Bethesda, MD 20892, USA
| | - Christopher R Kinsinger
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, Bethesda, MD 20892, USA
| | - Mehdi Mesri
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, Bethesda, MD 20892, USA
| | - Ana I Robles
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, Bethesda, MD 20892, USA
| | - Henry Rodriguez
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, Bethesda, MD 20892, USA
| | - David Mutch
- Division of Gynecologic Oncology, Department of Obstetrics and Gynecology, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Katherine Fuh
- Division of Gynecologic Oncology, Department of Obstetrics and Gynecology, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Matthew J Ellis
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Dan L Duncan Comprehensive Cancer Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - Deborah DeLair
- Department of Pathology, NYU Langone Health, New York, NY 10016, USA
| | - Mathangi Thiagarajan
- Leidos Biomedical Research Inc., Frederick National Laboratory for Cancer Research, Frederick, MD 21702, USA
| | - D R Mani
- The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Gad Getz
- The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Michael Noble
- The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Alexey I Nesvizhskii
- Department of Pathology, University of Michigan, Ann Arbor, MI 48109, USA; Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Pei Wang
- Department of Genetics and Genomic Sciences, Icahn Institute of Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Matthew L Anderson
- College of Medicine Obstetrics & Gynecology, University of South Florida Health, Tampa, FL 33620, USA
| | - Douglas A Levine
- Gynecologic Oncology, Laura and Isaac Perlmutter Cancer Center, NYU Langone Health, New York, NY 10016, USA
| | - Richard D Smith
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99354, USA
| | - Samuel H Payne
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| | - Kelly V Ruggles
- Department of Medicine, NYU School of Medicine, New York, NY 10016, USA
| | - Karin D Rodland
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99354, USA; Department of Cell, Developmental, and Cancer Biology, Oregon Health & Science University, Portland, OR 97221, USA.
| | - Li Ding
- Department of Medicine and Genetics, Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO 63110, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63108, USA.
| | - Bing Zhang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Dan L Duncan Comprehensive Cancer Center, Baylor College of Medicine, Houston, TX 77030, USA.
| | - Tao Liu
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99354, USA.
| | - David Fenyö
- Institute for Systems Genetics, NYU School of Medicine, New York, NY 10016, USA; Department of Biochemistry and Molecular Pharmacology, NYU School of Medicine, New York, NY 10016, USA.
| |
Collapse
|
66
|
Karunratanakul K, Tang HY, Speicher DW, Chuangsuwanich E, Sriswasdi S. Uncovering Thousands of New Peptides with Sequence-Mask-Search Hybrid De Novo Peptide Sequencing Framework. Mol Cell Proteomics 2019; 18:2478-2491. [PMID: 31591261 PMCID: PMC6885704 DOI: 10.1074/mcp.tir119.001656] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2019] [Revised: 09/09/2019] [Indexed: 01/03/2023] Open
Abstract
Typical analyses of mass spectrometry data only identify amino acid sequences that exist in reference databases. This restricts the possibility of discovering new peptides such as those that contain uncharacterized mutations or originate from unexpected processing of RNAs and proteins. De novo peptide sequencing approaches address this limitation but often suffer from low accuracy and require extensive validation by experts. Here, we develop SMSNet, a deep learning-based de novo peptide sequencing framework that achieves >95% amino acid accuracy while retaining good identification coverage. Applications of SMSNet on landmark proteomics and peptidomics studies reveal over 10,000 previously uncharacterized HLA antigens and phosphopeptides, and in conjunction with database-search methods, expand the coverage of peptide identification by almost 30%. The power to accurately identify new peptides of SMSNet would make it an invaluable tool for any future proteomics and peptidomics studies, including tumor neoantigen discovery, antibody sequencing, and proteome characterization of non-model organisms.
Collapse
Affiliation(s)
- Korrawe Karunratanakul
- Department of Computer Engineering, Faculty of Engineering, Chulalongkorn University, Bangkok 10330, Thailand
| | - Hsin-Yao Tang
- Proteomics and Metabolomics Facility, The Wistar Institute, Philadelphia, PA 19104
| | - David W Speicher
- Center for Systems and Computational Biology, The Wistar Institute, Philadelphia, PA 19104
| | - Ekapol Chuangsuwanich
- Department of Computer Engineering, Faculty of Engineering, Chulalongkorn University, Bangkok 10330, Thailand; Computational Molecular Biology Group, Faculty of Medicine, Chulalongkorn University, Bangkok 10330, Thailand. mailto:
| | - Sira Sriswasdi
- Computational Molecular Biology Group, Faculty of Medicine, Chulalongkorn University, Bangkok 10330, Thailand; Research Affairs, Faculty of Medicine, Chulalongkorn University and King Chulalongkorn Memorial Hospital, Bangkok 10330, Thailand. mailto:
| |
Collapse
|
67
|
Clark DJ, Dhanasekaran SM, Petralia F, Pan J, Song X, Hu Y, da Veiga Leprevost F, Reva B, Lih TSM, Chang HY, Ma W, Huang C, Ricketts CJ, Chen L, Krek A, Li Y, Rykunov D, Li QK, Chen LS, Ozbek U, Vasaikar S, Wu Y, Yoo S, Chowdhury S, Wyczalkowski MA, Ji J, Schnaubelt M, Kong A, Sethuraman S, Avtonomov DM, Ao M, Colaprico A, Cao S, Cho KC, Kalayci S, Ma S, Liu W, Ruggles K, Calinawan A, Gümüş ZH, Geiszler D, Kawaler E, Teo GC, Wen B, Zhang Y, Keegan S, Li K, Chen F, Edwards N, Pierorazio PM, Chen XS, Pavlovich CP, Hakimi AA, Brominski G, Hsieh JJ, Antczak A, Omelchenko T, Lubinski J, Wiznerowicz M, Linehan WM, Kinsinger CR, Thiagarajan M, Boja ES, Mesri M, Hiltke T, Robles AI, Rodriguez H, Qian J, Fenyö D, Zhang B, Ding L, Schadt E, Chinnaiyan AM, Zhang Z, Omenn GS, Cieslik M, Chan DW, Nesvizhskii AI, Wang P, Zhang H. Integrated Proteogenomic Characterization of Clear Cell Renal Cell Carcinoma. Cell 2019; 179:964-983.e31. [PMID: 31675502 PMCID: PMC7331093 DOI: 10.1016/j.cell.2019.10.007] [Citation(s) in RCA: 394] [Impact Index Per Article: 78.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2019] [Revised: 07/15/2019] [Accepted: 10/07/2019] [Indexed: 02/07/2023]
Abstract
To elucidate the deregulated functional modules that drive clear cell renal cell carcinoma (ccRCC), we performed comprehensive genomic, epigenomic, transcriptomic, proteomic, and phosphoproteomic characterization of treatment-naive ccRCC and paired normal adjacent tissue samples. Genomic analyses identified a distinct molecular subgroup associated with genomic instability. Integration of proteogenomic measurements uniquely identified protein dysregulation of cellular mechanisms impacted by genomic alterations, including oxidative phosphorylation-related metabolism, protein translation processes, and phospho-signaling modules. To assess the degree of immune infiltration in individual tumors, we identified microenvironment cell signatures that delineated four immune-based ccRCC subtypes characterized by distinct cellular pathways. This study reports a large-scale proteogenomic analysis of ccRCC to discern the functional impact of genomic alterations and provides evidence for rational treatment selection stemming from ccRCC pathobiology.
Collapse
Affiliation(s)
- David J Clark
- Department of Pathology, Johns Hopkins University, Baltimore, MD 21231, USA
| | | | - Francesca Petralia
- Department of Genetics and Genomic Sciences and Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Jianbo Pan
- Department of Pathology, Johns Hopkins University, Baltimore, MD 21231, USA
| | - Xiaoyu Song
- Department of Population Health Science and Policy, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA; Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Yingwei Hu
- Department of Pathology, Johns Hopkins University, Baltimore, MD 21231, USA
| | | | - Boris Reva
- Department of Genetics and Genomic Sciences and Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Tung-Shing M Lih
- Department of Pathology, Johns Hopkins University, Baltimore, MD 21231, USA
| | - Hui-Yin Chang
- Department of Pathology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Weiping Ma
- Department of Genetics and Genomic Sciences and Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Chen Huang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - Christopher J Ricketts
- Urologic Oncology Branch, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Lijun Chen
- Department of Pathology, Johns Hopkins University, Baltimore, MD 21231, USA
| | - Azra Krek
- Department of Genetics and Genomic Sciences and Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Yize Li
- Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Dmitry Rykunov
- Department of Genetics and Genomic Sciences and Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Qing Kay Li
- Department of Pathology, Johns Hopkins University, Baltimore, MD 21231, USA
| | - Lin S Chen
- Department of Public Health Sciences, University of Chicago, Chicago, IL 60637, USA
| | - Umut Ozbek
- Department of Population Health Science and Policy, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA; Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Suhas Vasaikar
- Department of Translational Molecular Pathology, MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Yige Wu
- Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Seungyeul Yoo
- Department of Genetics and Genomic Sciences and Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Shrabanti Chowdhury
- Department of Genetics and Genomic Sciences and Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | | | - Jiayi Ji
- Department of Population Health Science and Policy, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA; Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Michael Schnaubelt
- Department of Pathology, Johns Hopkins University, Baltimore, MD 21231, USA
| | - Andy Kong
- Department of Pathology, University of Michigan, Ann Arbor, MI 48109, USA
| | | | - Dmitry M Avtonomov
- Department of Pathology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Minghui Ao
- Department of Pathology, Johns Hopkins University, Baltimore, MD 21231, USA
| | - Antonio Colaprico
- Department of Public Health Sciences, University of Miami Miller School of Medicine, Miami, FL 33136, USA
| | - Song Cao
- Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Kyung-Cho Cho
- Department of Pathology, Johns Hopkins University, Baltimore, MD 21231, USA
| | - Selim Kalayci
- Department of Genetics and Genomic Sciences and Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Shiyong Ma
- Department of Pathology, Johns Hopkins University, Baltimore, MD 21231, USA
| | - Wenke Liu
- Institute for Systems Genetics and Department of Biochemistry and Molecular Pharmacology, New York University School of Medicine, New York, NY 10016, USA
| | - Kelly Ruggles
- Department of Medicine, New York University School of Medicine, New York, NY 10016, USA
| | - Anna Calinawan
- Department of Genetics and Genomic Sciences and Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Zeynep H Gümüş
- Department of Genetics and Genomic Sciences and Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Daniel Geiszler
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Emily Kawaler
- Institute for Systems Genetics and Department of Biochemistry and Molecular Pharmacology, New York University School of Medicine, New York, NY 10016, USA
| | - Guo Ci Teo
- Department of Pathology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Bo Wen
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - Yuping Zhang
- Department of Pathology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Sarah Keegan
- Institute for Systems Genetics and Department of Biochemistry and Molecular Pharmacology, New York University School of Medicine, New York, NY 10016, USA
| | - Kai Li
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - Feng Chen
- Departments of Medicine and Cell Biology and Physiology, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Nathan Edwards
- Department of Biochemistry and Cellular Biology, Georgetown University, Washington, DC 20007, USA
| | - Phillip M Pierorazio
- Brady Urological Institute and Department of Urology, Johns Hopkins University, Baltimore, MD 21231, USA
| | - Xi Steven Chen
- Department of Public Health Sciences, University of Miami Miller School of Medicine, Miami, FL 33136, USA; Sylvester Comprehensive Cancer Center, University of Miami Miller School of Medicine, Miami, FL 33136, USA
| | - Christian P Pavlovich
- Brady Urological Institute and Department of Urology, Johns Hopkins University, Baltimore, MD 21231, USA
| | - A Ari Hakimi
- Department of Surgery, Urology Service, Memorial Sloan Kettering Cancer Center, New York, NY 10065, USA
| | - Gabriel Brominski
- Department of Urology, Poznań University of Medical Sciences, Szwajcarska 3, Poznań 61-285, Poland
| | - James J Hsieh
- Department of Medicine, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Andrzej Antczak
- Department of Urology, Poznań University of Medical Sciences, Szwajcarska 3, Poznań 61-285, Poland
| | - Tatiana Omelchenko
- Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY 10065, USA
| | - Jan Lubinski
- Department of Genetics and Pathology, Pomeranian Medical University, Szczecin 71-252, Poland
| | - Maciej Wiznerowicz
- International Institute for Molecular Oncology, Poznań 60-203, Poland; Poznań University of Medical Sciences, Poznan 60-701, Poland
| | - W Marston Linehan
- Urologic Oncology Branch, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Christopher R Kinsinger
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, Bethesda, MD 20892, USA
| | | | - Emily S Boja
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, Bethesda, MD 20892, USA
| | - Mehdi Mesri
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, Bethesda, MD 20892, USA
| | - Tara Hiltke
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, Bethesda, MD 20892, USA
| | - Ana I Robles
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, Bethesda, MD 20892, USA
| | - Henry Rodriguez
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, Bethesda, MD 20892, USA
| | - Jiang Qian
- Department of Ophthalmology, Johns Hopkins University, Baltimore, MD 21231, USA
| | - David Fenyö
- Institute for Systems Genetics and Department of Biochemistry and Molecular Pharmacology, New York University School of Medicine, New York, NY 10016, USA
| | - Bing Zhang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Li Ding
- Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Eric Schadt
- Department of Genetics and Genomic Sciences and Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA; Sema4, Stamford, CT 06902, USA
| | - Arul M Chinnaiyan
- Department of Pathology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Zhen Zhang
- Department of Pathology, Johns Hopkins University, Baltimore, MD 21231, USA
| | - Gilbert S Omenn
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA; Department of Internal Medicine, Human Genetics, and School of Public Health, University of Michigan, Ann Arbor, MI 48109, USA
| | - Marcin Cieslik
- Department of Pathology, University of Michigan, Ann Arbor, MI 48109, USA.
| | - Daniel W Chan
- Department of Pathology, Johns Hopkins University, Baltimore, MD 21231, USA.
| | | | - Pei Wang
- Department of Genetics and Genomic Sciences and Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA.
| | - Hui Zhang
- Department of Pathology, Johns Hopkins University, Baltimore, MD 21231, USA.
| |
Collapse
|
68
|
Wen B, Wang X, Zhang B. PepQuery enables fast, accurate, and convenient proteomic validation of novel genomic alterations. Genome Res 2019; 29:485-493. [PMID: 30610011 PMCID: PMC6396417 DOI: 10.1101/gr.235028.118] [Citation(s) in RCA: 63] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2018] [Accepted: 12/28/2018] [Indexed: 12/20/2022]
Abstract
Massively parallel or second-generation sequencing-based genomic studies continuously identify new genomic alterations that may lead to novel protein sequences, which are attractive candidates for disease biomarkers and therapeutic targets after proteomic validation. Integrative proteogenomic methods have been developed to use mass spectrometry (MS)-based proteomics data for such validation. These methods replace the reference sequence database in proteomic database searching with a customized protein database that incorporates sample- or disease-specific sequences derived from DNA or RNA sequencing, thus enabling the identification of novel protein sequences. Although useful, this spectrum-centric approach requires a full evaluation of all possible spectrum-peptide pairs, which is time-consuming, error-prone, and difficult to apply. Here, we present PepQuery, a peptide-centric approach that focuses on only novel DNA or protein sequences of interest. PepQuery allows quick and easy proteomic validation of genomic alterations without customized database construction. We demonstrated the sensitivity and specificity of the approach in validating completely novel proteins, novel splice junctions, and single amino acid variants using simulations and experimental data. Notably, enabling unrestricted modification searching in PepQuery reduced false positives by up to 95%. We implemented PepQuery as both web-based and stand-alone applications. The web version provides direct access to more than half a billion MS/MS spectra from the Clinical Proteomic Tumor Analysis Consortium (CPTAC) and other cancer proteomic studies. The stand-alone version supports batch analysis and user-provided MS/MS data. PepQuery will increase the usage of proteogenomics beyond the proteomics community and will broaden the application of proteogenomics in personalized medicine.
Collapse
Affiliation(s)
- Bo Wen
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, Texas 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Xiaojing Wang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, Texas 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Bing Zhang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, Texas 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| |
Collapse
|
69
|
Rong M, Zhou B, Zhou R, Liao Q, Zeng Y, Xu S, Liu Z. PPIP: Automated Software for Identification of Bioactive Endogenous Peptides. J Proteome Res 2019; 18:721-727. [PMID: 30540478 DOI: 10.1021/acs.jproteome.8b00718] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
Endogenous peptides play an important role in multiple biological processes in many species. Liquid chromatography coupled to tandem mass spectrometry (LC-MS/MS) is an important technique for detecting these peptides on a large scale. We present PPIP, which is a dedicated peptidogenomics software for identifying endogenous peptides based on peptidomics and RNA-Seq data. This software automates the de novo transcript assembly based on RNA-Seq data, construction of a protein reference database based on the de novo assembled transcripts, peptide identification, function analysis, and HTML-based report generation. Different function components are integrated using Docker technology. The Docker image of PPIP is available at https://hub.docker.com/r/shawndp/ppip , and the source code under GPL-3 license is available at https://github.com/Shawn-Xu/PPIP . A user manual of PPIP is available at https://shawn-xu.github.io/PPIP .
Collapse
Affiliation(s)
- Mingqiang Rong
- The National & Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences , Hunan Normal University , Changsha 410081 , Hunan , China
| | - Baojin Zhou
- Deepxomics Co., Ltd. , Shenzhen 518000 , China
| | - Ruo Zhou
- Deepxomics Co., Ltd. , Shenzhen 518000 , China
| | - Qiong Liao
- The National & Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences , Hunan Normal University , Changsha 410081 , Hunan , China
| | - Yong Zeng
- The National & Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences , Hunan Normal University , Changsha 410081 , Hunan , China
| | - Shaohang Xu
- Deepxomics Co., Ltd. , Shenzhen 518000 , China
| | - Zhonghua Liu
- The National & Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences , Hunan Normal University , Changsha 410081 , Hunan , China
| |
Collapse
|
70
|
Ren Z, Qi D, Pugh N, Li K, Wen B, Zhou R, Xu S, Liu S, Jones AR. Improvements to the Rice Genome Annotation Through Large-Scale Analysis of RNA-Seq and Proteomics Data Sets. Mol Cell Proteomics 2019; 18:86-98. [PMID: 30293062 PMCID: PMC6317475 DOI: 10.1074/mcp.ra118.000832] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2018] [Revised: 08/31/2018] [Indexed: 01/22/2023] Open
Abstract
Rice (Oryza sativa) is one of the most important worldwide crops. The genome has been available for over 10 years and has undergone several rounds of annotation. We created a comprehensive database of transcripts from 29 public RNA sequencing data sets, officially predicted genes from Ensembl plants, and common contaminants in which to search for protein-level evidence. We re-analyzed nine publicly accessible rice proteomics data sets. In total, we identified 420K peptide spectrum matches from 47K peptides and 8,187 protein groups. 4168 peptides were initially classed as putative novel peptides (not matching official genes). Following a strict filtration scheme to rule out other possible explanations, we discovered 1,584 high confidence novel peptides. The novel peptides were clustered into 692 genomic loci where our results suggest annotation improvements. 80% of the novel peptides had an ortholog match in the curated protein sequence set from at least one other plant species. For the peptides clustering in intergenic regions (and thus potentially new genes), 101 loci were identified, for which 43 had a high-confidence hit for a protein domain. Our results can be displayed as tracks on the Ensembl genome or other browsers supporting Track Hubs, to support re-annotation of the rice genome.
Collapse
Affiliation(s)
- Zhe Ren
- From the ‡BGI-Shenzhen, Shenzhen 518083, China
| | - Da Qi
- From the ‡BGI-Shenzhen, Shenzhen 518083, China
| | - Nina Pugh
- §Institute of Integrative Biology, University of Liverpool, Liverpool, L69 7ZB, UK
| | - Kai Li
- From the ‡BGI-Shenzhen, Shenzhen 518083, China
| | - Bo Wen
- ‖Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030;; ¶Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, Texas 77030
| | - Ruo Zhou
- From the ‡BGI-Shenzhen, Shenzhen 518083, China
| | - Shaohang Xu
- From the ‡BGI-Shenzhen, Shenzhen 518083, China
| | - Siqi Liu
- From the ‡BGI-Shenzhen, Shenzhen 518083, China;.
| | - Andrew R Jones
- §Institute of Integrative Biology, University of Liverpool, Liverpool, L69 7ZB, UK;.
| |
Collapse
|