1
|
Illuminating the Dark Cancer Phosphoproteome Through a Machine-Learned Co-Regulation Map of 26,280 Phosphosites. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.19.585786. [PMID: 38562798 PMCID: PMC10983930 DOI: 10.1101/2024.03.19.585786] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/04/2024]
Abstract
Mass spectrometry-based phosphoproteomics offers a comprehensive view of protein phosphorylation, but limited knowledge about the regulation and function of most phosphosites restricts our ability to extract meaningful biological insights from phosphoproteomics data. To address this, we combine machine learning and phosphoproteomic data from 1,195 tumor specimens spanning 11 cancer types to construct CoPheeMap, a network mapping the co-regulation of 26,280 phosphosites. Integrating network features from CoPheeMap into a machine learning model, CoPheeKSA, we achieve superior performance in predicting kinase-substrate associations. CoPheeKSA reveals 24,015 associations between 9,399 phosphosites and 104 serine/threonine kinases, including many unannotated phosphosites and under-studied kinases. We validate the accuracy of these predictions using experimentally determined kinase-substrate specificities. By applying CoPheeMap and CoPheeKSA to phosphosites with high computationally predicted functional significance and cancer-associated phosphosites, we demonstrate the effectiveness of these tools in systematically illuminating phosphosites of interest, revealing dysregulated signaling processes in human cancer, and identifying under-studied kinases as putative therapeutic targets.
Collapse
|
2
|
Deep Learning Prediction Boosts Phosphoproteomics-Based Discoveries Through Improved Phosphopeptide Identification. Mol Cell Proteomics 2024; 23:100707. [PMID: 38154692 PMCID: PMC10831110 DOI: 10.1016/j.mcpro.2023.100707] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2023] [Revised: 11/06/2023] [Accepted: 12/23/2023] [Indexed: 12/30/2023] Open
Abstract
Shotgun phosphoproteomics enables high-throughput analysis of phosphopeptides in biological samples. One of the primary challenges associated with this technology is the relatively low rate of phosphopeptide identification during data analysis. This limitation hampers the full realization of the potential offered by shotgun phosphoproteomics. Here we present DeepRescore2, a computational workflow that leverages deep learning-based retention time and fragment ion intensity predictions to improve phosphopeptide identification and phosphosite localization. Using a state-of-the-art computational workflow as a benchmark, DeepRescore2 increases the number of correctly identified peptide-spectrum matches by 17% in a synthetic dataset and identifies 19% to 46% more phosphopeptides in biological datasets. In a liver cancer dataset, 30% of the significantly altered phosphosites between tumor and normal tissues and 60% of the prognosis-associated phosphosites identified from DeepRescore2-processed data could not be identified based on the state-of-the-art workflow. Notably, DeepRescore2-processed data uniquely identifies EGFR hyperactivation as a new target in poor-prognosis liver cancer, which is validated experimentally. Integration of deep learning prediction in DeepRescore2 improves phosphopeptide identification and facilitates biological discoveries.
Collapse
|
3
|
Kinase inhibitor pulldown assay (KiP) for clinical proteomics. Clin Proteomics 2024; 21:3. [PMID: 38225548 PMCID: PMC10790396 DOI: 10.1186/s12014-023-09448-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2023] [Accepted: 12/07/2023] [Indexed: 01/17/2024] Open
Abstract
Protein kinases are frequently dysregulated and/or mutated in cancer and represent essential targets for therapy. Accurate quantification is essential. For breast cancer treatment, the identification and quantification of the protein kinase ERBB2 is critical for therapeutic decisions. While immunohistochemistry (IHC) is the current clinical diagnostic approach, it is only semiquantitative. Mass spectrometry-based proteomics offers quantitative assays that, unlike IHC, can be used to accurately evaluate hundreds of kinases simultaneously. The enrichment of less abundant kinase targets for quantification, along with depletion of interfering proteins, improves sensitivity and thus promotes more effective downstream analyses. Multiple kinase inhibitors were therefore deployed as a capture matrix for kinase inhibitor pulldown (KiP) assays designed to profile the human protein kinome as broadly as possible. Optimized assays were initially evaluated in 16 patient derived xenograft models (PDX) where KiP identified multiple differentially expressed and biologically relevant kinases. From these analyses, an optimized single-shot parallel reaction monitoring (PRM) method was developed to improve quantitative fidelity. The PRM KiP approach was then reapplied to low quantities of proteins typical of yields from core needle biopsies of human cancers. The initial prototype targeting 100 kinases recapitulated intrinsic subtyping of PDX models obtained from comprehensive proteomic and transcriptomic profiling. Luminal and HER2 enriched OCT-frozen patient biopsies subsequently analyzed through KiP-PRM also clustered by subtype. Finally, stable isotope labeled peptide standards were developed to define a prototype clinical method. Data are available via ProteomeXchange with identifiers PXD044655 and PXD046169.
Collapse
|
4
|
IDPpub: Illuminating the Dark Phosphoproteome Through PubMed Mining. Mol Cell Proteomics 2024; 23:100682. [PMID: 37993103 PMCID: PMC10716774 DOI: 10.1016/j.mcpro.2023.100682] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2023] [Revised: 10/25/2023] [Accepted: 11/14/2023] [Indexed: 11/24/2023] Open
Abstract
Global phosphoproteomics experiments quantify tens of thousands of phosphorylation sites. However, data interpretation is hampered by our limited knowledge on functions, biological contexts, or precipitating enzymes of the phosphosites. This study establishes a repository of phosphosites with associated evidence in biomedical abstracts, using deep learning-based natural language processing techniques. Our model for illuminating the dark phosphoproteome through PubMed mining (IDPpub) was generated by fine-tuning BioBERT, a deep learning tool for biomedical text mining. Trained using sentences containing protein substrates and phosphorylation site positions from 3000 abstracts, the IDPpub model was then used to extract phosphorylation sites from all MEDLINE abstracts. The extracted proteins were normalized to gene symbols using the National Center for Biotechnology Information gene query, and sites were mapped to human UniProt sequences using ProtMapper and mouse UniProt sequences by direct match. Precision and recall were calculated using 150 curated abstracts, and utility was assessed by analyzing the CPTAC (Clinical Proteomics Tumor Analysis Consortium) pan-cancer phosphoproteomics datasets and the PhosphoSitePlus database. Using 10-fold cross validation, pairs of correct substrates and phosphosite positions were extracted with an average precision of 0.93 and recall of 0.94. After entity normalization and site mapping to human reference sequences, an independent validation achieved a precision of 0.91 and recall of 0.77. The IDPpub repository contains 18,458 unique human phosphorylation sites with evidence sentences from 58,227 abstracts and 5918 mouse sites in 14,610 abstracts. This included evidence sentences for 1803 sites identified in CPTAC studies that are not covered by manually curated functional information in PhosphoSitePlus. Evaluation results demonstrate the potential of IDPpub as an effective biomedical text mining tool for collecting phosphosites. Moreover, the repository (http://idppub.ptmax.org), which can be automatically updated, can serve as a powerful complement to existing resources.
Collapse
|
5
|
The Breast Cancer Proteome and Precision Oncology. Cold Spring Harb Perspect Med 2023; 13:a041323. [PMID: 37137501 PMCID: PMC10547392 DOI: 10.1101/cshperspect.a041323] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/05/2023]
Abstract
The goal of precision oncology is to translate the molecular features of cancer into predictive and prognostic tests that can be used to individualize treatment leading to improved outcomes and decreased toxicity. Success for this strategy in breast cancer is exemplified by efficacy of trastuzumab in tumors overexpressing ERBB2 and endocrine therapy for tumors that are estrogen receptor positive. However, other effective treatments, including chemotherapy, immune checkpoint inhibitors, and CDK4/6 inhibitors are not associated with strong predictive biomarkers. Proteomics promises another tier of information that, when added to genomic and transcriptomic features (proteogenomics), may create new opportunities to improve both treatment precision and therapeutic hypotheses. Here, we review both mass spectrometry-based and antibody-dependent proteomics as complementary approaches. We highlight how these methods have contributed toward a more complete understanding of breast cancer and describe the potential to guide diagnosis and treatment more accurately.
Collapse
|
6
|
Proteogenomic data and resources for pan-cancer analysis. Cancer Cell 2023; 41:1397-1406. [PMID: 37582339 PMCID: PMC10506762 DOI: 10.1016/j.ccell.2023.06.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/08/2022] [Revised: 11/15/2022] [Accepted: 06/27/2023] [Indexed: 08/17/2023]
Abstract
The National Cancer Institute's Clinical Proteomic Tumor Analysis Consortium (CPTAC) investigates tumors from a proteogenomic perspective, creating rich multi-omics datasets connecting genomic aberrations to cancer phenotypes. To facilitate pan-cancer investigations, we have generated harmonized genomic, transcriptomic, proteomic, and clinical data for >1000 tumors in 10 cohorts to create a cohesive and powerful dataset for scientific discovery. We outline efforts by the CPTAC pan-cancer working group in data harmonization, data dissemination, and computational resources for aiding biological discoveries. We also discuss challenges for multi-omics data integration and analysis, specifically the unique challenges of working with both nucleotide sequencing and mass spectrometry proteomics data.
Collapse
|
7
|
Kinase Inhibitor Pulldown Assay Identifies a Chemotherapy Response Signature in Triple-negative Breast Cancer Based on Purine-binding Proteins. CANCER RESEARCH COMMUNICATIONS 2023; 3:1551-1563. [PMID: 37587913 PMCID: PMC10426551 DOI: 10.1158/2767-9764.crc-22-0501] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/09/2022] [Revised: 04/10/2023] [Accepted: 06/21/2023] [Indexed: 08/18/2023]
Abstract
Triple-negative breast cancer (TNBC) constitutes 10%-15% of all breast tumors. The current standard of care is multiagent chemotherapy, which is effective in only a subset of patients. The original objective of this study was to deploy a mass spectrometry (MS)-based kinase inhibitor pulldown assay (KIPA) to identify kinases elevated in non-pCR (pathologic complete response) cases for therapeutic targeting. Frozen optimal cutting temperature compound-embedded core needle biopsies were obtained from 43 patients with TNBC before docetaxel- and carboplatin-based neoadjuvant chemotherapy. KIPA was applied to the native tumor lysates that were extracted from samples with high tumor content. Seven percent of all identified proteins were kinases, and none were significantly associated with lack of pCR. However, among a large population of "off-target" purine-binding proteins (PBP) identified, seven were enriched in pCR-associated samples (P < 0.01). In orthogonal mRNA-based TNBC datasets, this seven-gene "PBP signature" was associated with chemotherapy sensitivity and favorable clinical outcomes. Functional annotation demonstrated IFN gamma response, nuclear import of DNA repair proteins, and cell death associations. Comparisons with standard tandem mass tagged-based discovery proteomics performed on the same samples demonstrated that KIPA-nominated pCR biomarkers were unique to the platform. KIPA is a novel biomarker discovery tool with unexpected utility for the identification of PBPs related to cytotoxic drug response. The PBP signature has the potential to contribute to clinical trials designed to either escalate or de-escalate therapy based on pCR probability. Significance The identification of pretreatment predictive biomarkers for pCR in response to neoadjuvant chemotherapy would advance precision treatment for TNBC. To complement standard proteogenomic discovery profiling, a KIPA was deployed and unexpectedly identified a seven-member non-kinase PBP pCR-associated signature. Individual members served diverse pathways including IFN gamma response, nuclear import of DNA repair proteins, and cell death.
Collapse
|
8
|
Abstract P5-02-36: Proteogenomic profiling of fresh frozen core biopsies from CALGB 40601. Cancer Res 2023. [DOI: 10.1158/1538-7445.sabcs22-p5-02-36] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/06/2023]
Abstract
Abstract
Background: Targeted therapy for HER2+ breast cancer has significantly improved outcomes for this aggressive subtype. However, a subset of patients do not achieve pathological complete response (pCR). In CALGB 40601, a randomized Phase III Trial for neoadjuvant treatment of HER2+ primary breast cancer with Paclitaxel (T: taxane) combined with HER2 antibody therapy (H: Herceptin/Trastuzumab), the small molecule inhibitor Lapatinib (L), or the antibody-inhibitor combination, pCR frequency was 56% for the combination (THL arm), 46% for Trastuzumab (TH arm), and 32% for Lapatinib (TL arm, closed early because of lower efficacy) (PMID: 26527775). While a recent publication reports relapse-free survival (RFS), overall survival (OS), and RNA-based gene expression signatures that can predict pCR (PMID: 33095682), understanding the proteogenomic landscape of treatment response should facilitate identification of alternative and therapeutically tractable protein targets for treatment-resistant tumors. Methods: Microscaled proteogenomic profiling (PMID: 31988290) was performed on treatment-naïve, flash-frozen core needle biopsies from the CALGB 40601 trial obtained from the Alliance for Clinical Trials in Oncology tissue bank. Multi-omics profiling included whole-exome sequencing (WES), RNA-sequencing, and mass spectroscopy-based proteomics and phosphoproteomics from one or two cores from each patient. Results: Eighty baseline core biopsies from 54 patients, including 22 patients from the THL arm, 24 from the TH arm, and 8 from the TL arm, from the CALGB 40601 tissue archive were of sufficient quality to yield genomics, transcriptomics, and/or proteomics profiling data. The frequency of pCR for profiled samples was representative of the overall trial cohort. Linear models were employed to identify baseline determinants of pCR for each arm and to assess differences in genes associated with response between the TH and THL arms. Pathways associated with RNA processing, translation, and the proteasome were elevated in pCR tumors in TH and THL arms, while cell cycle, DNA replication and repair pathways were higher in pCR only in the THL arm. While enrichment of similar pathways was observed in pCR in the transcriptome, the proteome specifically showed enrichment of pathways associated with extracellular matrix and EMT in non-pCR in the THL but not the TH arm. In particular, “EMT”, “ECM-receptor interaction”, and “extracellular structure organization” constituted the most enriched pathways and GO terms that were higher in non-pCR than in pCR tumors from the combination arm (THL) in the proteomics data despite showing no enrichment in the transcriptomics data. Driving this pathway enrichment were several collagens and matrix metalloproteinases that were significantly elevated in non-pCR tumors at the protein but not the RNA level. Finally, kinase target enrichment of differential phosphorylation sites suggested that the activity of PAK1, a regulator of cytoskeletal remodeling, is elevated in non-pCR tumors from the THL arm (p=0.006), but not the TH arm (p=0.69). Conclusion: Proteogenomic analysis of archival HER2+ breast cancer core biopsies provides opportunities for identifying proteins and phosphorylation sites in treatment-naive tumors that are associated with pCR to neoadjuvant Paclitaxel/anti-HER2 therapy. Notably, proteomic but not transcriptomic data showed that ECM and EMT pathways were elevated in non-pCR tumors; thus, signatures encompassing these pathways may serve as biomarkers for aggressive HER2+ breast cancer that is more likely to evade treatment. Non-pCR tumors in the THL arm were also marked by elevated levels of PAK1 target phosphorylation sites, suggesting that this kinase may be a potential therapeutic target in HER2+ breast cancer that is refractory to combination anti-HER2 therapy.
Citation Format: Eric J. Jaehnig, Aranzazu Fernandez-Martinez, Tanmayi Vashist, Matthew V. Holt, LaTerrica Williams, Jonathan Lei, Beom-Jun Kim, Yongchao Dou, Viktoriya Korchina, Richard Gibbs, Donna Muzny, Harshavardhan Doddapaneni, Henry Rodriguez, Ana Robles, Tara Hiltke, DR Mani, Michael Gillette, Terry Hyslop, Yujia Wen, Linda McCart, George Miles, Steven Carr, Bing Zhang, Shankha Satpathy, Matthew Ellis, Meenakshi Anurag. Proteogenomic profiling of fresh frozen core biopsies from CALGB 40601 [abstract]. In: Proceedings of the 2022 San Antonio Breast Cancer Symposium; 2022 Dec 6-10; San Antonio, TX. Philadelphia (PA): AACR; Cancer Res 2023;83(5 Suppl):Abstract nr P5-02-36.
Collapse
|
9
|
Deep learning prediction boosts phosphoproteomics-based discoveries through improved phosphopeptide identification. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.01.11.523329. [PMID: 36711982 PMCID: PMC9882090 DOI: 10.1101/2023.01.11.523329] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Abstract
Shotgun phosphoproteomics enables high-throughput analysis of phosphopeptides in biological samples, but low phosphopeptide identification rate in data analysis limits the potential of this technology. Here we present DeepRescore2, a computational workflow that leverages deep learning-based retention time and fragment ion intensity predictions to improve phosphopeptide identification and phosphosite localization. Using a state-of-the-art computational workflow as a benchmark, DeepRescore2 increases the number of correctly identified peptide-spectrum matches by 17% in a synthetic dataset and identifies 19%-46% more phosphopeptides in biological datasets. In a liver cancer dataset, 30% of the significantly altered phosphosites between tumor and normal tissues and 60% of the prognosis-associated phosphosites identified from DeepRescore2-processed data could not be identified based on the state-of-the-art workflow. Notably, DeepRescore2-processed data uniquely identifies EGFR hyperactivation as a new target in poor-prognosis liver cancer, which is validated experimentally. Integration of deep learning prediction in DeepRescore2 improves phosphopeptide identification and facilitates biological discoveries.
Collapse
|
10
|
Proteogenomic Markers of Chemotherapy Resistance and Response in Triple-Negative Breast Cancer. Cancer Discov 2022; 12:2586-2605. [PMID: 36001024 PMCID: PMC9627136 DOI: 10.1158/2159-8290.cd-22-0200] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2022] [Revised: 06/08/2022] [Accepted: 08/18/2022] [Indexed: 01/12/2023]
Abstract
Microscaled proteogenomics was deployed to probe the molecular basis for differential response to neoadjuvant carboplatin and docetaxel combination chemotherapy for triple-negative breast cancer (TNBC). Proteomic analyses of pretreatment patient biopsies uniquely revealed metabolic pathways, including oxidative phosphorylation, adipogenesis, and fatty acid metabolism, that were associated with resistance. Both proteomics and transcriptomics revealed that sensitivity was marked by elevation of DNA repair, E2F targets, G2-M checkpoint, interferon-gamma signaling, and immune-checkpoint components. Proteogenomic analyses of somatic copy-number aberrations identified a resistance-associated 19q13.31-33 deletion where LIG1, POLD1, and XRCC1 are located. In orthogonal datasets, LIG1 (DNA ligase I) gene deletion and/or low mRNA expression levels were associated with lack of pathologic complete response, higher chromosomal instability index (CIN), and poor prognosis in TNBC, as well as carboplatin-selective resistance in TNBC preclinical models. Hemizygous loss of LIG1 was also associated with higher CIN and poor prognosis in other cancer types, demonstrating broader clinical implications. SIGNIFICANCE Proteogenomic analysis of triple-negative breast tumors revealed a complex landscape of chemotherapy response associations, including a 19q13.31-33 somatic deletion encoding genes serving lagging-strand DNA synthesis (LIG1, POLD1, and XRCC1), that correlate with lack of pathologic response, carboplatin-selective resistance, and, in pan-cancer studies, poor prognosis and CIN. This article is highlighted in the In This Issue feature, p. 2483.
Collapse
|
11
|
OmicsEV: a tool for comprehensive quality evaluation of omics data tables. Bioinformatics 2022; 38:5463-5465. [PMID: 36271853 PMCID: PMC9750102 DOI: 10.1093/bioinformatics/btac698] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2022] [Accepted: 10/20/2022] [Indexed: 12/25/2022] Open
Abstract
SUMMARY RNA-Seq and mass spectrometry-based studies generate omics data tables with measurements for tens of thousands of genes across all samples in a study. The success of a study relies on the quality of these data tables, which is determined by both experimental data generation and computational methods used to process raw experimental data into quantitative data tables. We present OmicsEV, an R package for the quality evaluation of omics data tables. For each data table, OmicsEV uses a series of methods to evaluate data depth, data normalization, batch effect, biological signal, platform reproducibility and multi-omics concordance, producing comprehensive visual and quantitative evaluation results that help assess the data quality of individual data tables and facilitate the identification of the optimal data processing method and parameters for the omics study under investigation. AVAILABILITY AND IMPLEMENTATION The source code and the user manual of OmicsEV are available at https://github.com/bzhanglab/OmicsEV, and the source code is released under the GPL-3 license.
Collapse
|
12
|
A proteogenomic portrait of lung squamous cell carcinoma. Cell 2021; 184:4348-4371.e40. [PMID: 34358469 PMCID: PMC8475722 DOI: 10.1016/j.cell.2021.07.016] [Citation(s) in RCA: 138] [Impact Index Per Article: 46.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2020] [Revised: 04/26/2021] [Accepted: 07/12/2021] [Indexed: 02/07/2023]
Abstract
Lung squamous cell carcinoma (LSCC) remains a leading cause of cancer death with few therapeutic options. We characterized the proteogenomic landscape of LSCC, providing a deeper exposition of LSCC biology with potential therapeutic implications. We identify NSD3 as an alternative driver in FGFR1-amplified tumors and low-p63 tumors overexpressing the therapeutic target survivin. SOX2 is considered undruggable, but our analyses provide rationale for exploring chromatin modifiers such as LSD1 and EZH2 to target SOX2-overexpressing tumors. Our data support complex regulation of metabolic pathways by crosstalk between post-translational modifications including ubiquitylation. Numerous immune-related proteogenomic observations suggest directions for further investigation. Proteogenomic dissection of CDKN2A mutations argue for more nuanced assessment of RB1 protein expression and phosphorylation before declaring CDK4/6 inhibition unsuccessful. Finally, triangulation between LSCC, LUAD, and HNSCC identified both unique and common therapeutic vulnerabilities. These observations and proteogenomics data resources may guide research into the biology and treatment of LSCC.
Collapse
|
13
|
Proteogenomic insights into the biology and treatment of HPV-negative head and neck squamous cell carcinoma. Cancer Cell 2021; 39:361-379.e16. [PMID: 33417831 PMCID: PMC7946781 DOI: 10.1016/j.ccell.2020.12.007] [Citation(s) in RCA: 162] [Impact Index Per Article: 54.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/12/2020] [Revised: 09/13/2020] [Accepted: 12/07/2020] [Indexed: 02/08/2023]
Abstract
We present a proteogenomic study of 108 human papilloma virus (HPV)-negative head and neck squamous cell carcinomas (HNSCCs). Proteomic analysis systematically catalogs HNSCC-associated proteins and phosphosites, prioritizes copy number drivers, and highlights an oncogenic role for RNA processing genes. Proteomic investigation of mutual exclusivity between FAT1 truncating mutations and 11q13.3 amplifications reveals dysregulated actin dynamics as a common functional consequence. Phosphoproteomics characterizes two modes of EGFR activation, suggesting a new strategy to stratify HNSCCs based on EGFR ligand abundance for effective treatment with inhibitory EGFR monoclonal antibodies. Widespread deletion of immune modulatory genes accounts for low immune infiltration in immune-cold tumors, whereas concordant upregulation of multiple immune checkpoint proteins may underlie resistance to anti-programmed cell death protein 1 monotherapy in immune-hot tumors. Multi-omic analysis identifies three molecular subtypes with high potential for treatment with CDK inhibitors, anti-EGFR antibody therapy, and immunotherapy, respectively. Altogether, proteogenomics provides a systematic framework to inform HNSCC biology and treatment.
Collapse
|
14
|
Proteogenomic Landscape of Breast Cancer Tumorigenesis and Targeted Therapy. Cell 2020; 183:1436-1456.e31. [PMID: 33212010 PMCID: PMC8077737 DOI: 10.1016/j.cell.2020.10.036] [Citation(s) in RCA: 223] [Impact Index Per Article: 55.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2020] [Revised: 07/14/2020] [Accepted: 10/21/2020] [Indexed: 02/08/2023]
Abstract
The integration of mass spectrometry-based proteomics with next-generation DNA and RNA sequencing profiles tumors more comprehensively. Here this "proteogenomics" approach was applied to 122 treatment-naive primary breast cancers accrued to preserve post-translational modifications, including protein phosphorylation and acetylation. Proteogenomics challenged standard breast cancer diagnoses, provided detailed analysis of the ERBB2 amplicon, defined tumor subsets that could benefit from immune checkpoint therapy, and allowed more accurate assessment of Rb status for prediction of CDK4/6 inhibitor responsiveness. Phosphoproteomics profiles uncovered novel associations between tumor suppressor loss and targetable kinases. Acetylproteome analysis highlighted acetylation on key nuclear proteins involved in the DNA damage response and revealed cross-talk between cytoplasmic and mitochondrial acetylation and metabolism. Our results underscore the potential of proteogenomics for clinical investigation of breast cancer through more accurate annotation of targetable pathways and biological features of this remarkably heterogeneous malignancy.
Collapse
|
15
|
WebGestalt 2019: gene set analysis toolkit with revamped UIs and APIs. Nucleic Acids Res 2020; 47:W199-W205. [PMID: 31114916 PMCID: PMC6602449 DOI: 10.1093/nar/gkz401] [Citation(s) in RCA: 1759] [Impact Index Per Article: 439.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2019] [Revised: 04/23/2019] [Accepted: 05/01/2019] [Indexed: 02/07/2023] Open
Abstract
WebGestalt is a popular tool for the interpretation of gene lists derived from large scale -omics studies. In the 2019 update, WebGestalt supports 12 organisms, 342 gene identifiers and 155 175 functional categories, as well as user-uploaded functional databases. To address the growing and unique need for phosphoproteomics data interpretation, we have implemented phosphosite set analysis to identify important kinases from phosphoproteomics data. We have completely redesigned result visualizations and user interfaces to improve user-friendliness and to provide multiple types of interactive and publication-ready figures. To facilitate comprehension of the enrichment results, we have implemented two methods to reduce redundancy between enriched gene sets. We introduced a web API for other applications to get data programmatically from the WebGestalt server or pass data to WebGestalt for analysis. We also wrapped the core computation into an R package called WebGestaltR for users to perform analysis locally or in third party workflows. WebGestalt can be freely accessed at http://www.webgestalt.org.
Collapse
|
16
|
Identification and characterization of a supraclavicular brown adipose tissue in mice. JCI Insight 2017; 2:93166. [PMID: 28570265 DOI: 10.1172/jci.insight.93166] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2017] [Accepted: 04/20/2017] [Indexed: 12/29/2022] Open
Abstract
A fundamental challenge to our understanding of brown adipose tissue (BAT) is the lack of an animal model that faithfully represents human BAT. Such a model is essential for direct assessment of the function and therapeutic potential of BAT depots in humans. In human adults, most of the thermoactive BAT depots are located in the supraclavicular region of the neck, while mouse studies focus on depots located in the interscapular region of the torso. We recently discovered BAT depots that are located in a region analogous to that of human supraclavicular BAT (scBAT). Here, we report that the mouse scBAT depot has morphological characteristics of classical BAT, possesses the potential for high thermogenic activity, and expresses a gene signature that is similar to that of human scBAT. Taken together, our studies reveal a mouse BAT depot that represents human BAT and provides a unique tool for developing new translatable approaches for utilizing human scBAT.
Collapse
|
17
|
MEF2C regulates outflow tract alignment and transcriptional control of Tdgf1. Development 2016; 143:774-9. [PMID: 26811383 DOI: 10.1242/dev.126383] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2015] [Accepted: 01/19/2016] [Indexed: 01/24/2023]
Abstract
Congenital heart defects are the most common birth defects in humans, and those that affect the proper alignment of the outflow tracts and septation of the ventricles are a highly significant cause of morbidity and mortality in infants. A late differentiating population of cardiac progenitors, referred to as the anterior second heart field (AHF), gives rise to the outflow tract and the majority of the right ventricle and provides an embryological context for understanding cardiac outflow tract alignment and membranous ventricular septal defects. However, the transcriptional pathways controlling AHF development and their roles in congenital heart defects remain incompletely elucidated. Here, we inactivated the gene encoding the transcription factor MEF2C in the AHF in mice. Loss of Mef2c function in the AHF results in a spectrum of outflow tract alignment defects ranging from overriding aorta to double-outlet right ventricle and dextro-transposition of the great arteries. We identify Tdgf1, which encodes a Nodal co-receptor (also known as Cripto), as a direct transcriptional target of MEF2C in the outflow tract via an AHF-restricted Tdgf1 enhancer. Importantly, both the MEF2C and TDGF1 genes are associated with congenital heart defects in humans. Thus, these studies establish a direct transcriptional pathway between the core cardiac transcription factor MEF2C and the human congenital heart disease gene TDGF1. Moreover, we found a range of outflow tract alignment defects resulting from a single genetic lesion, supporting the idea that AHF-derived outflow tract alignment defects may constitute an embryological spectrum rather than distinct anomalies.
Collapse
|
18
|
Checkpoint kinases regulate a global network of transcription factors in response to DNA damage. Cell Rep 2013; 4:174-88. [PMID: 23810556 DOI: 10.1016/j.celrep.2013.05.041] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2012] [Revised: 04/04/2013] [Accepted: 05/24/2013] [Indexed: 01/01/2023] Open
Abstract
DNA damage activates checkpoint kinases that induce several downstream events, including widespread changes in transcription. However, the specific connections between the checkpoint kinases and downstream transcription factors (TFs) are not well understood. Here, we integrate kinase mutant expression profiles, transcriptional regulatory interactions, and phosphoproteomics to map kinases and downstream TFs to transcriptional regulatory networks. Specifically, we investigate the role of the Saccharomyces cerevisiae checkpoint kinases (Mec1, Tel1, Chk1, Rad53, and Dun1) in the transcriptional response to DNA damage caused by methyl methanesulfonate. The result is a global kinase-TF regulatory network in which Mec1 and Tel1 signal through Rad53 to synergistically regulate the expression of more than 600 genes. This network involves at least nine TFs, many of which have Rad53-dependent phosphorylation sites, as regulators of checkpoint-kinase-dependent genes. We also identify a major DNA damage-induced transcriptional network that regulates stress response genes independently of the checkpoint kinases.
Collapse
|
19
|
Abstract
Although cellular behaviors are dynamic, the networks that govern these behaviors have been mapped primarily as static snapshots. Using an approach called differential epistasis mapping, we have discovered widespread changes in genetic interaction among yeast kinases, phosphatases, and transcription factors as the cell responds to DNA damage. Differential interactions uncover many gene functions that go undetected in static conditions. They are very effective at identifying DNA repair pathways, highlighting new damage-dependent roles for the Slt2 kinase, Pph3 phosphatase, and histone variant Htz1. The data also reveal that protein complexes are generally stable in response to perturbation, but the functional relations between these complexes are substantially reorganized. Differential networks chart a new type of genetic landscape that is invaluable for mapping cellular responses to stimuli.
Collapse
|
20
|
Abstract
The DNA damage and replication checkpoints are believed to primarily slow the progression of the cell cycle to allow DNA repair to occur. Here we summarize known aspects of the Saccharomyces cerevisiae checkpoints including how these responses are integrated into downstream effects on the cell cycle, chromatin, DNA repair, and cytoplasmic targets. Analysis of the transcriptional response demonstrates that it is far more complex and less relevant to the repair of DNA damage than the bacterial SOS response. We also address more speculative questions regarding potential roles of the checkpoint during the normal S-phase and how current evidence hints at a checkpoint activation mechanism mediated by positive feedback that amplifies initial damage signals above a minimum threshold.
Collapse
|
21
|
Increased susceptibility to isoproterenol-induced cardiac hypertrophy and impaired weight gain in mice lacking the histidine-rich calcium-binding protein. Mol Cell Biol 2006; 26:9315-26. [PMID: 17030629 PMCID: PMC1698540 DOI: 10.1128/mcb.00482-06] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
The sarcoplasmic reticulum (SR) plays a critical role in excitation-contraction coupling by regulating the cytoplasmic calcium concentration of striated muscle. The histidine-rich calcium-binding protein (HRCBP) is expressed in the junctional SR, the site of calcium release from the SR. HRCBP is expressed exclusively in muscle tissues and binds calcium with low affinity and high capacity. In addition, HRCBP interacts with triadin, a protein associated with the ryanodine receptor and thought to be involved in calcium release. Its calcium binding properties, localization to the SR, and interaction with triadin suggest that HRCBP is involved in calcium handling by the SR. To determine the function of HRCBP in vivo, we inactivated HRC, the gene encoding HRCBP, in mice. HRC knockout mice exhibited impaired weight gain beginning at 11 months of age, which was marked by reduced skeletal muscle and fat mass, and triadin protein expression was upregulated in the heart of HRC knockout mice. In addition, HRC null mice displayed a significantly exaggerated response to the induction of cardiac hypertrophy by isoproterenol compared to their wild-type littermates. The exaggerated response of HRC knockout mice to the induction of cardiac hypertrophy is consistent with a regulatory role for HRCBP in calcium handling in vivo and suggests that mutations in HRC, in combination with other genetic or environmental factors, might contribute to pathological hypertrophy and heart failure.
Collapse
|
22
|
HRC is a direct transcriptional target of MEF2 during cardiac, skeletal, and arterial smooth muscle development in vivo. Mol Cell Biol 2004; 24:3757-68. [PMID: 15082771 PMCID: PMC387749 DOI: 10.1128/mcb.24.9.3757-3768.2004] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The HRC gene encodes the histidine-rich calcium-binding protein, which is found in the lumen of the junctional sarcoplasmic reticulum (SR) of cardiac and skeletal muscle and within calciosomes of arterial smooth muscle. The expression of HRC in cardiac, skeletal, and smooth muscle raises the possibility of a common transcriptional mechanism governing its expression in all three muscle cell types. In this study, we identified a transcriptional enhancer from the HRC gene that is sufficient to direct the expression of lacZ in the expression pattern of endogenous HRC in transgenic mice. The HRC enhancer contains a small, highly conserved sequence that is required for expression in all three muscle lineages. Within this conserved region is a consensus site for myocyte enhancer factor 2 (MEF2) proteins that we show is bound efficiently by MEF2 and is required for transgene expression in all three muscle lineages in vivo. Furthermore, the entire HRC enhancer sequence lacks any discernible CArG motifs, the binding site for serum response factor (SRF), and we show that the enhancer is not activated by SRF. Thus, these studies identify the HRC enhancer as the first MEF2-dependent, CArG-independent transcriptional target in smooth muscle and represent the first analysis of the transcriptional regulation of an SR gene in vivo.
Collapse
MESH Headings
- Amino Acid Motifs
- Animals
- Base Sequence
- Calcium-Binding Proteins/genetics
- Calcium-Binding Proteins/metabolism
- DNA-Binding Proteins/genetics
- DNA-Binding Proteins/metabolism
- Embryo, Mammalian/anatomy & histology
- Embryo, Mammalian/physiology
- Enhancer Elements, Genetic
- Gene Expression Regulation, Developmental
- Genes, Reporter
- Heart/embryology
- Heart/physiology
- Humans
- MEF2 Transcription Factors
- Mice
- Mice, Transgenic
- Molecular Sequence Data
- Muscle Proteins/genetics
- Muscle Proteins/metabolism
- Muscle, Skeletal/cytology
- Muscle, Skeletal/embryology
- Muscle, Skeletal/physiology
- Muscle, Smooth, Vascular/embryology
- Muscle, Smooth, Vascular/physiology
- Myogenic Regulatory Factors
- Promoter Regions, Genetic
- Sequence Alignment
- Transcription Factors/genetics
- Transcription Factors/metabolism
- Transcription, Genetic
Collapse
|