1
|
Proteogenomic and metabolomic characterization of human glioblastoma. Cancer Cell 2021; 39:509-528.e20. [PMID: 33577785 PMCID: PMC8044053 DOI: 10.1016/j.ccell.2021.01.006] [Citation(s) in RCA: 275] [Impact Index Per Article: 91.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/03/2020] [Revised: 06/02/2020] [Accepted: 01/11/2021] [Indexed: 02/07/2023]
Abstract
Glioblastoma (GBM) is the most aggressive nervous system cancer. Understanding its molecular pathogenesis is crucial to improving diagnosis and treatment. Integrated analysis of genomic, proteomic, post-translational modification and metabolomic data on 99 treatment-naive GBMs provides insights to GBM biology. We identify key phosphorylation events (e.g., phosphorylated PTPN11 and PLCG1) as potential switches mediating oncogenic pathway activation, as well as potential targets for EGFR-, TP53-, and RB1-altered tumors. Immune subtypes with distinct immune cell types are discovered using bulk omics methodologies, validated by snRNA-seq, and correlated with specific expression and histone acetylation patterns. Histone H2B acetylation in classical-like and immune-low GBM is driven largely by BRDs, CREBBP, and EP300. Integrated metabolomic and proteomic data identify specific lipid distributions across subtypes and distinct global metabolic changes in IDH-mutated tumors. This work highlights biological relationships that could contribute to stratification of GBM patients for more effective treatment.
Collapse
|
2
|
Proteogenomic insights into the biology and treatment of HPV-negative head and neck squamous cell carcinoma. Cancer Cell 2021; 39:361-379.e16. [PMID: 33417831 PMCID: PMC7946781 DOI: 10.1016/j.ccell.2020.12.007] [Citation(s) in RCA: 162] [Impact Index Per Article: 54.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/12/2020] [Revised: 09/13/2020] [Accepted: 12/07/2020] [Indexed: 02/08/2023]
Abstract
We present a proteogenomic study of 108 human papilloma virus (HPV)-negative head and neck squamous cell carcinomas (HNSCCs). Proteomic analysis systematically catalogs HNSCC-associated proteins and phosphosites, prioritizes copy number drivers, and highlights an oncogenic role for RNA processing genes. Proteomic investigation of mutual exclusivity between FAT1 truncating mutations and 11q13.3 amplifications reveals dysregulated actin dynamics as a common functional consequence. Phosphoproteomics characterizes two modes of EGFR activation, suggesting a new strategy to stratify HNSCCs based on EGFR ligand abundance for effective treatment with inhibitory EGFR monoclonal antibodies. Widespread deletion of immune modulatory genes accounts for low immune infiltration in immune-cold tumors, whereas concordant upregulation of multiple immune checkpoint proteins may underlie resistance to anti-programmed cell death protein 1 monotherapy in immune-hot tumors. Multi-omic analysis identifies three molecular subtypes with high potential for treatment with CDK inhibitors, anti-EGFR antibody therapy, and immunotherapy, respectively. Altogether, proteogenomics provides a systematic framework to inform HNSCC biology and treatment.
Collapse
|
3
|
Proteogenomic Landscape of Breast Cancer Tumorigenesis and Targeted Therapy. Cell 2020; 183:1436-1456.e31. [PMID: 33212010 PMCID: PMC8077737 DOI: 10.1016/j.cell.2020.10.036] [Citation(s) in RCA: 223] [Impact Index Per Article: 55.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2020] [Revised: 07/14/2020] [Accepted: 10/21/2020] [Indexed: 02/08/2023]
Abstract
The integration of mass spectrometry-based proteomics with next-generation DNA and RNA sequencing profiles tumors more comprehensively. Here this "proteogenomics" approach was applied to 122 treatment-naive primary breast cancers accrued to preserve post-translational modifications, including protein phosphorylation and acetylation. Proteogenomics challenged standard breast cancer diagnoses, provided detailed analysis of the ERBB2 amplicon, defined tumor subsets that could benefit from immune checkpoint therapy, and allowed more accurate assessment of Rb status for prediction of CDK4/6 inhibitor responsiveness. Phosphoproteomics profiles uncovered novel associations between tumor suppressor loss and targetable kinases. Acetylproteome analysis highlighted acetylation on key nuclear proteins involved in the DNA damage response and revealed cross-talk between cytoplasmic and mitochondrial acetylation and metabolism. Our results underscore the potential of proteogenomics for clinical investigation of breast cancer through more accurate annotation of targetable pathways and biological features of this remarkably heterogeneous malignancy.
Collapse
|
4
|
Integrated Proteomic and Glycoproteomic Characterization of Human High-Grade Serous Ovarian Carcinoma. Cell Rep 2020; 33:108276. [PMID: 33086064 PMCID: PMC7970828 DOI: 10.1016/j.celrep.2020.108276] [Citation(s) in RCA: 69] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2019] [Revised: 06/18/2020] [Accepted: 09/23/2020] [Indexed: 12/12/2022] Open
Abstract
Many gene products exhibit great structural heterogeneity because of an array of modifications. These modifications are not directly encoded in the genomic template but often affect the functionality of proteins. Protein glycosylation plays a vital role in proper protein functions. However, the analysis of glycoproteins has been challenging compared with other protein modifications, such as phosphorylation. Here, we perform an integrated proteomic and glycoproteomic analysis of 83 prospectively collected high-grade serous ovarian carcinoma (HGSC) and 23 non-tumor tissues. Integration of the expression data from global proteomics and glycoproteomics reveals tumor-specific glycosylation, uncovers different glycosylation associated with three tumor clusters, and identifies glycosylation enzymes that were correlated with the altered glycosylation. In addition to providing a valuable resource, these results provide insights into the potential roles of glycosylation in the pathogenesis of HGSC, with the possibility of distinguishing pathological outcomes of ovarian tumors from non-tumors, as well as classifying tumor clusters.
Collapse
|
5
|
Correction: Proteogenomic Characterization of Ovarian HGSC Implicates Mitotic Kinases, Replication Stress in Observed Chromosomal Instability. CELL REPORTS MEDICINE 2020; 1. [PMID: 32954372 PMCID: PMC7500561 DOI: 10.1016/j.xcrm.2020.100075] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
6
|
Abstract LB-242: Proteomic Data Commons: A resource for proteogenomic analysis. Cancer Res 2020. [DOI: 10.1158/1538-7445.am2020-lb-242] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
The objective of the National Cancer Institutes' Proteomic Data Commons (PDC) is to make cancer-related proteomic datasets accessible to the public. The PDC provides the cancer research community with a unified data repository that enables data sharing across cancer proteomic studies and also enables multi-omic integration in support of precision medicine. As a domain-specific repository within the Cancer Research Data Commons (CRDC), the vision for the PDC is to provide researchers the ability to find and analyze proteomic data across a wide variety of tumor types. Currently, the PDC houses data, supported by a large collection of metadata attributes, for nearly 40 datasets from over 12 cancer types produced by several large-scale cancer research programs, each with cohort sizes greater than 100 patients.
The PDC facilitates the analysis of proteomic, genomic, and imaging data derived from the same tumor. Most of the datasets in the PDC also have corresponding genomic and imaging data available in the Genomic Data Commons and The Cancer Imaging Archive respectively. Researchers can discover which genomic variants are detectable at the protein-level or better understand associations between gene expression, copy number variation, and protein abundance. The resource is currently available to the public in beta phase (https://pdc.esacinc.com) and will be officially launched on the cancer.gov domain in March 2020.
The PDC data portal is supported by a robust and extensible data model and provides user-friendly exploration, visualization and data analysis. This allows researchers to search for and visualize expression of proteins (through their mapped genes) across all studies, analyze protein abundance for all cases in a study through heatmaps, build and explore pan-cancer cohorts using highly curated, clinical metadata, and comprehensively view a study without needing to download the data.
The PDC provides quick access to mapping of peptide identities and quantities on the human genome as well as protein databases containing patient/tumor-specific variants and novel splicing events. It also enables fast, accurate, and convenient proteomic validation of novel genomic alterations through the PepQuery algorithm.
Through a highly versatile application programming interface (API), PDC allows users to interact with data programmatically and facilitates integration with data from other resources in their scripts for multi-omic analysis.
Big data interoperability is critical for progress in precision medicine. PDC is designed to interoperate with other resources including the CRDC nodes, allowing users to analyze PDC data with the tools and pipelines available on the NCI cloud resources. It further allows users to use their own tools to co-analyze genomic and proteomic data available from a common sample on Amazon Web Services (AWS) platform or on a local system.
The presentation will provide an overview of the PDC and it's available datasets, as well as a discussion of how it facilitates multi-omic data analyses.
Citation Format: Ratna Rajesh Thangudu, Paul A. Rudnick, Michael Holck, Deepak Singhal, Michael J. MacCoss, Nathan J. Edwards, Karen A. Ketchum, Christopher R. Kinsinger, Erika Kim, Anand Basu. Proteomic Data Commons: A resource for proteogenomic analysis [abstract]. In: Proceedings of the Annual Meeting of the American Association for Cancer Research 2020; 2020 Apr 27-28 and Jun 22-24. Philadelphia (PA): AACR; Cancer Res 2020;80(16 Suppl):Abstract nr LB-242.
Collapse
|
7
|
Abstract 5122: NCI's Clinical Proteomic Tumor Analysis Consortium: A proteogenomic cancer analysis program. Cancer Res 2020. [DOI: 10.1158/1538-7445.am2020-5122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
The National Cancer Institute's Clinical Proteomic Tumor Analysis Consortium (NCI-CPTAC) is an integrative proteogenomic program composed of a Proteogenomic Tumor Characterization Program, and a Proteogenomic Translational Research Program. The goal of CPTAC is to improve prevention, early detection, diagnosis, and treatment of cancer by enhancing the understanding of the molecular mechanisms of cancer, advancing proteogenome science and technology development through community resources (data and reagents), and accelerating the translation of molecular findings into the clinic.
Using state-of-the-art, high throughput standardized mass spectrometry-based methods, the Proteogenomic Tumor Characterization Program performs deep comprehensive proteogenomic analysis of cancer types with all data and assays to be released to the public. In addition to retrospectively collected TCGA and prospectively collected CPTAC samples from colon, breast and ovarian cancers, additional prospectively-collected, treatment-naïve tumors including clear cell renal cell carcinoma (CCRCC), uterine corpus endometrial carcinoma (UCEC), lung adenocarcinoma (LUAD), glioblastoma multiforme (GBM), and retrospectively collected pediatric brain cancers have been characterized. In the Proteogenomic Translational Research Program, CPTAC is partnering for the first time with NCI-sponsored clinical trials to support clinically-relevant research projects that would elucidate biological mechanisms of therapeutic response, resistance, and/or toxicity. The Proteogenomic Translational Research Program currently explores triple negative breast cancer (TNBC), high-grade serous ovarian cancer (HGSOC), and acute myeloid leukemia (AML).
Raw and processed mass spectrometry-based proteomic, and genomic data are publicly available at the CPTAC Data Portal (http://proteomics.cancer.gov), and GDC Data Portal (https://portal.gdc.cancer.gov/) respectively. CPTAC is also supporting development of new proteogenomic data analysis tools, such as network visualization tools (http://ccrcc.cptac-network-view.org). In addition, the CPTAC Assay Portal (http://assays.cancer.gov) is a public resource populated with mass spectrometry-based targeted proteomic assays developed by the consortium for quantitatively measuring proteins of interest, including those discovered through comprehensive tumor characterization. Lastly, well-characterized monoclonal antibodies targeting cancer-specific proteins and peptides are also made available at CPTAC's Antibody Portal (http://antibodies.cancer.gov).
Citation Format: Mehdi Mesri, Emily Boja, Tara Hiltke, Christopher R. Kinsinger, Annette Marrero-Oliveras, Ana Robles, Henry Rodriguez, CPTAC Investigators. NCI's Clinical Proteomic Tumor Analysis Consortium: A proteogenomic cancer analysis program [abstract]. In: Proceedings of the Annual Meeting of the American Association for Cancer Research 2020; 2020 Apr 27-28 and Jun 22-24. Philadelphia (PA): AACR; Cancer Res 2020;80(16 Suppl):Abstract nr 5122.
Collapse
|
8
|
Proteogenomic Characterization Reveals Therapeutic Vulnerabilities in Lung Adenocarcinoma. Cell 2020; 182:200-225.e35. [PMID: 32649874 PMCID: PMC7373300 DOI: 10.1016/j.cell.2020.06.013] [Citation(s) in RCA: 338] [Impact Index Per Article: 84.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2019] [Revised: 03/06/2020] [Accepted: 06/03/2020] [Indexed: 12/24/2022]
Abstract
To explore the biology of lung adenocarcinoma (LUAD) and identify new therapeutic opportunities, we performed comprehensive proteogenomic characterization of 110 tumors and 101 matched normal adjacent tissues (NATs) incorporating genomics, epigenomics, deep-scale proteomics, phosphoproteomics, and acetylproteomics. Multi-omics clustering revealed four subgroups defined by key driver mutations, country, and gender. Proteomic and phosphoproteomic data illuminated biology downstream of copy number aberrations, somatic mutations, and fusions and identified therapeutic vulnerabilities associated with driver events involving KRAS, EGFR, and ALK. Immune subtyping revealed a complex landscape, reinforced the association of STK11 with immune-cold behavior, and underscored a potential immunosuppressive role of neutrophil degranulation. Smoking-associated LUADs showed correlation with other environmental exposure signatures and a field effect in NATs. Matched NATs allowed identification of differentially expressed proteins with potential diagnostic and therapeutic utility. This proteogenomics dataset represents a unique public resource for researchers and clinicians seeking to better understand and treat lung adenocarcinomas.
Collapse
|
9
|
Proteogenomic Characterization of Ovarian HGSC Implicates Mitotic Kinases, Replication Stress in Observed Chromosomal Instability. CELL REPORTS MEDICINE 2020; 1. [PMID: 32529193 PMCID: PMC7289043 DOI: 10.1016/j.xcrm.2020.100004] [Citation(s) in RCA: 34] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
In the absence of a dominant driving mutation other than uniformly present TP53 mutations, deeper understanding of the biology driving ovarian high-grade serous cancer (HGSC) requires analysis at a functional level, including post-translational modifications. Comprehensive proteogenomic and phosphoproteomic characterization of 83 prospectively collected ovarian HGSC and appropriate normal precursor tissue samples (fallopian tube) under strict control of ischemia time reveals pathways that significantly differentiate between HGSC and relevant normal tissues in the context of homologous repair deficiency (HRD) status. In addition to confirming key features of HGSC from previous studies, including a potential survival-associated signature and histone acetylation as a marker of HRD, deep phosphoproteomics provides insights regarding the potential role of proliferation-induced replication stress in promoting the characteristic chromosomal instability of HGSC and suggests potential therapeutic targets for use in precision medicine trials. Comparison of ovarian cancer and normal precursors identifies key signaling pathways Mitotic and cyclin-dependent kinases emerge as potential therapeutic targets Previously identified hallmarks of homologous repair status and survival are confirmed Replication stress appears to drive increased chromosomal instability
Collapse
|
10
|
Proteogenomic Characterization of Endometrial Carcinoma. Cell 2020; 180:729-748.e26. [PMID: 32059776 PMCID: PMC7233456 DOI: 10.1016/j.cell.2020.01.026] [Citation(s) in RCA: 247] [Impact Index Per Article: 61.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2019] [Revised: 11/11/2019] [Accepted: 01/16/2020] [Indexed: 02/07/2023]
Abstract
We undertook a comprehensive proteogenomic characterization of 95 prospectively collected endometrial carcinomas, comprising 83 endometrioid and 12 serous tumors. This analysis revealed possible new consequences of perturbations to the p53 and Wnt/β-catenin pathways, identified a potential role for circRNAs in the epithelial-mesenchymal transition, and provided new information about proteomic markers of clinical and genomic tumor subgroups, including relationships to known druggable pathways. An extensive genome-wide acetylation survey yielded insights into regulatory mechanisms linking Wnt signaling and histone acetylation. We also characterized aspects of the tumor immune landscape, including immunogenic alterations, neoantigens, common cancer/testis antigens, and the immune microenvironment, all of which can inform immunotherapy decisions. Collectively, our multi-omic analyses provide a valuable resource for researchers and clinicians, identify new molecular associations of potential mechanistic significance in the development of endometrial cancers, and suggest novel approaches for identifying potential therapeutic targets.
Collapse
|
11
|
|
12
|
Integrated Proteogenomic Characterization of Clear Cell Renal Cell Carcinoma. Cell 2019; 179:964-983.e31. [PMID: 31675502 PMCID: PMC7331093 DOI: 10.1016/j.cell.2019.10.007] [Citation(s) in RCA: 357] [Impact Index Per Article: 71.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2019] [Revised: 07/15/2019] [Accepted: 10/07/2019] [Indexed: 02/07/2023]
Abstract
To elucidate the deregulated functional modules that drive clear cell renal cell carcinoma (ccRCC), we performed comprehensive genomic, epigenomic, transcriptomic, proteomic, and phosphoproteomic characterization of treatment-naive ccRCC and paired normal adjacent tissue samples. Genomic analyses identified a distinct molecular subgroup associated with genomic instability. Integration of proteogenomic measurements uniquely identified protein dysregulation of cellular mechanisms impacted by genomic alterations, including oxidative phosphorylation-related metabolism, protein translation processes, and phospho-signaling modules. To assess the degree of immune infiltration in individual tumors, we identified microenvironment cell signatures that delineated four immune-based ccRCC subtypes characterized by distinct cellular pathways. This study reports a large-scale proteogenomic analysis of ccRCC to discern the functional impact of genomic alterations and provides evidence for rational treatment selection stemming from ccRCC pathobiology.
Collapse
|
13
|
Regulated Phosphosignaling Associated with Breast Cancer Subtypes and Druggability. Mol Cell Proteomics 2019; 18:1630-1650. [PMID: 31196969 PMCID: PMC6682998 DOI: 10.1074/mcp.ra118.001243] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2018] [Revised: 05/02/2019] [Indexed: 12/25/2022] Open
Abstract
Aberrant phospho-signaling is a hallmark of cancer. We investigated kinase-substrate regulation of 33,239 phosphorylation sites (phosphosites) in 77 breast tumors and 24 breast cancer xenografts. Our search discovered 2134 quantitatively correlated kinase-phosphosite pairs, enriching for and extending experimental or binding-motif predictions. Among the 91 kinases with auto-phosphorylation, elevated EGFR, ERBB2, PRKG1, and WNK1 phosphosignaling were enriched in basal, HER2-E, Luminal A, and Luminal B breast cancers, respectively, revealing subtype-specific regulation. CDKs, MAPKs, and ataxia-telangiectasia proteins were dominant, master regulators of substrate-phosphorylation, whose activities are not captured by genomic evidence. We unveiled phospho-signaling and druggable targets from 113 kinase-substrate pairs and cascades downstream of kinases, including AKT1, BRAF and EGFR. We further identified kinase-substrate-pairs associated with clinical or immune signatures and experimentally validated activated phosphosites of ERBB2, EIF4EBP1, and EGFR. Overall, kinase-substrate regulation revealed by the largest unbiased global phosphorylation data to date connects driver events to their signaling effects.
Collapse
|
14
|
Abstract LB-006: Proteogenomic characterization of human colon cancer reveals new therapeutic opportunities. Cancer Res 2019. [DOI: 10.1158/1538-7445.am2019-lb-006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
We prospectively collected matched tumor specimens, adjacent non-tumor tissues, and blood samples from 110 colon cancer patients and analyzed the samples using seven omics platforms, including whole-exome sequencing, copy number arrays, RNA-Seq, miRNA-Seq, label-free global proteomics, isobaric tandem mass tag (TMT) labeling-based global proteomics, and TMT-based phosphoproteomics. Comparative proteomic and phosphoproteomic analysis of paired tumor and adjacent normal samples produced the first comprehensive catalogue of colon cancer-associated proteins and phosphosites, including known and putative new biomarkers and drug targets. These cancer-associated proteins and phosphosites had very little overlap with known cancer genes in the Cancer Gene Census, providing a novel information layer to our knowledge about colon cancer. One notable finding in differential proteome analysis is the identification of several cancer/testis antigens that were recurrently over-expressed in tumors compared to adjacent normal tissue, including IGF2BP3 (51%), SPAG1 (14%), and ATAD2 (8%). Through integrative analysis of the whole-exome sequencing, RNA-Seq, and proteomics data, we further predicted personalized neoantigens for 38% of the patients. In total, we found proteomics-supported neoantigens or cancer/testis antigens for 78% of the tumors in this cohort, demonstrating the potential of proteogenomics in identifying tumor antigens for cancer vaccine development. Proteomics data complemented somatic copy number analysis results and showed that multiple somatic copy number deletion events converge to repress the endocytosis pathway, suggesting its tumor suppressor role in colon cancer. In addition to reinforcing or complementing genomic findings, proteogenomic integration may also contradict genomics data-based inferences and lead to unexpected discoveries and therapeutic opportunities. Proteomics data identified SOX9 as an oncogene in colon cancer, whereas it was predicted to be a tumor suppressor based on somatic mutation data in the TCGA study. Phosphoproteomics data revealed a dual role of Rb phosphorylation in promoting proliferation and repressing apoptosis in colon cancer, clarifying the long-standing puzzle of colon cancer-specific amplification of this tumor suppressor and highlighting a unique opportunity for targeting Rb phosphorylation in colon cancer. Microsatellite instability status has been approved by the FDA as a biomarker for selecting patients for checkpoint inhibitor therapy in colorectal and other solid tumors. However, many MSI-high tumors fail to respond to checkpoint inhibition. Our proteogenomic analysis identified a subtype-specific association between increased glycolysis and decreased CD8 T cell infiltration in MSI-high colon tumors, suggesting glycolysis as a target for overcoming immune evasion in this MSI-H tumors. We make the primary and processed datasets available in publicly accessible data repositories and portals to allow broad use of these datasets for new biological discoveries and therapeutic hypothesis generation.
Citation Format: Bing Zhang, Suhas Vasaikar, Chen Huang, Xiaojing Wang, Vladislav A. Petyuk, Sara R. Savage, Bo Wen, Yongchao Dou, Yun Zhang, Zhiao Shi, Osama A. Arshad, Marina A. Gritsenko, Lisa J. Zimmerman, Jason E. McDermott, Therese R. Clauss, Ronald J. Moore, Rui Zhao, Matthew E. Monroe, Yi-Ting Wang, Matthew C. Chambers, Robbert J. Slebos, Ken S. Lau, Qianxing Mo, Li Ding, Matthew Ellis, Mathangi Thiagarajan, Christopher R. Kinsinger, Henry Rodriguez, Richard D. Smith, Karin D. Rodland, Daniel C. Liebler, Tao Liu, CPTAC Investigators. Proteogenomic characterization of human colon cancer reveals new therapeutic opportunities [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2019; 2019 Mar 29-Apr 3; Atlanta, GA. Philadelphia (PA): AACR; Cancer Res 2019;79(13 Suppl):Abstract nr LB-006.
Collapse
|
15
|
Proteogenomic Analysis of Human Colon Cancer Reveals New Therapeutic Opportunities. Cell 2019; 177:1035-1049.e19. [PMID: 31031003 DOI: 10.1016/j.cell.2019.03.030] [Citation(s) in RCA: 407] [Impact Index Per Article: 81.4] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2018] [Revised: 11/22/2018] [Accepted: 03/12/2019] [Indexed: 12/12/2022]
Abstract
We performed the first proteogenomic study on a prospectively collected colon cancer cohort. Comparative proteomic and phosphoproteomic analysis of paired tumor and normal adjacent tissues produced a catalog of colon cancer-associated proteins and phosphosites, including known and putative new biomarkers, drug targets, and cancer/testis antigens. Proteogenomic integration not only prioritized genomically inferred targets, such as copy-number drivers and mutation-derived neoantigens, but also yielded novel findings. Phosphoproteomics data associated Rb phosphorylation with increased proliferation and decreased apoptosis in colon cancer, which explains why this classical tumor suppressor is amplified in colon tumors and suggests a rationale for targeting Rb phosphorylation in colon cancer. Proteomics identified an association between decreased CD8 T cell infiltration and increased glycolysis in microsatellite instability-high (MSI-H) tumors, suggesting glycolysis as a potential target to overcome the resistance of MSI-H tumors to immune checkpoint blockade. Proteogenomics presents new avenues for biological discoveries and therapeutic development.
Collapse
|
16
|
Challenges and opportunities in the proteomic characterization of clear cell renal cell carcinoma (ccRCC): A critical step towards the personalized care of renal cancers. Semin Cancer Biol 2018; 55:8-15. [PMID: 30055950 DOI: 10.1016/j.semcancer.2018.06.004] [Citation(s) in RCA: 45] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2018] [Revised: 06/22/2018] [Accepted: 06/28/2018] [Indexed: 12/28/2022]
Abstract
Clear cell renal cell carcinoma (ccRCC) is the most common type of kidney cancer, comprising approximately 75% of all kidney tumors. Recent the Cancer Genome Atlas (TCGA) and International Cancer Genome Consortium (ICGC) studies have significantly advanced the molecular characterization of RCC and facilitated the development of targeted therapies. Such advances have improved the median survival of patients with advanced disease from less than 10 months prior to 2004 to 30 months by 2011. However, approximately 30% of localized ccRCC patients will nevertheless develop recurrence or metastasis after surgical resection of their tumor. Therefore, it is critical to further analyze potential tumor-associated proteins and their profiles during disease progression. Over the past decade, tremendous effort has been focused on the study of molecular pathways, including genomics, transcriptomics, and proteomics in order to identify potential molecular biomarkers, as well as to facilitate early detection, monitor tumor progression and uncover potentially therapeutic targets. In this review, we focus on recent advances in the proteomic analysis of ccRCC, current strategies and challenges, and perspectives in the field. This insight will highlight the discovery of tumor-associated proteins, and their potential clinical impact on personalized precision-based care in ccRCC.
Collapse
|
17
|
The cancer proteomic landscape and the HUPO Cancer Proteome Project. Clin Proteomics 2018; 15:4. [PMID: 29416445 PMCID: PMC5785860 DOI: 10.1186/s12014-018-9180-6] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2017] [Accepted: 01/17/2018] [Indexed: 01/05/2023] Open
Abstract
The Human Cancer Proteome Project (Cancer-HPP) is an international initiative organized by HUPO whose key objective is to decipher the human cancer proteome through a coordinated effort by cancer proteome researchers around the world. The ultimate goal is to map the entire human cancer proteome to disclose tumor biology and drive improved diagnostics, treatment and management of cancer. Here we report the progress in the cancer proteomics field to date, and discuss future proteomic developments that will be needed to optimally delineate cancer phenotypes and advance the molecular characterization of this significant disease that is one of the leading causes of death worldwide.
Collapse
|
18
|
Quality Assessments of Long-Term Quantitative Proteomic Analysis of Breast Cancer Xenograft Tissues. J Proteome Res 2017; 16:4523-4530. [PMID: 29124938 DOI: 10.1021/acs.jproteome.7b00362] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Clinical proteomics requires large-scale analysis of human specimens to achieve statistical significance. We evaluated the long-term reproducibility of an iTRAQ (isobaric tags for relative and absolute quantification)-based quantitative proteomics strategy using one channel for reference across all samples in different iTRAQ sets. A total of 148 liquid chromatography tandem mass spectrometric (LC-MS/MS) analyses were completed, generating six 2D LC-MS/MS data sets for human-in-mouse breast cancer xenograft tissues representative of basal and luminal subtypes. Such large-scale studies require the implementation of robust metrics to assess the contributions of technical and biological variability in the qualitative and quantitative data. Accordingly, we derived a quantification confidence score based on the quality of each peptide-spectrum match to remove quantification outliers from each analysis. After combining confidence score filtering and statistical analysis, reproducible protein identification and quantitative results were achieved from LC-MS/MS data sets collected over a 7-month period. This study provides the first quality assessment on long-term stability and technical considerations for study design of a large-scale clinical proteomics project.
Collapse
|
19
|
Corrigendum: Proteogenomic integration reveals therapeutic targets in breast cancer xenografts. Nat Commun 2017; 8:15479. [PMID: 28440318 PMCID: PMC5414030 DOI: 10.1038/ncomms15479] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/05/2022] Open
|
20
|
Proteogenomic integration reveals therapeutic targets in breast cancer xenografts. Nat Commun 2017; 8:14864. [PMID: 28348404 PMCID: PMC5379071 DOI: 10.1038/ncomms14864] [Citation(s) in RCA: 95] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2016] [Accepted: 02/06/2017] [Indexed: 01/08/2023] Open
Abstract
Recent advances in mass spectrometry (MS) have enabled extensive analysis of cancer proteomes. Here, we employed quantitative proteomics to profile protein expression across 24 breast cancer patient-derived xenograft (PDX) models. Integrated proteogenomic analysis shows positive correlation between expression measurements from transcriptomic and proteomic analyses; further, gene expression-based intrinsic subtypes are largely re-capitulated using non-stromal protein markers. Proteogenomic analysis also validates a number of predicted genomic targets in multiple receptor tyrosine kinases. However, several protein/phosphoprotein events such as overexpression of AKT proteins and ARAF, BRAF, HSP90AB1 phosphosites are not readily explainable by genomic analysis, suggesting that druggable translational and/or post-translational regulatory events may be uniquely diagnosed by MS. Drug treatment experiments targeting HER2 and components of the PI3K pathway supported proteogenomic response predictions in seven xenograft models. Our study demonstrates that MS-based proteomics can identify therapeutic targets and highlights the potential of PDX drug response evaluation to annotate MS-based pathway activities. Patient-derived xenografts recapitulate major genomic signatures and transcriptome profiles of their original tumours. Here, the authors, performing proteomic and phosphoproteomic analyses of 24 breast cancer PDX models, demonstrate that druggable candidates can be identified based on a comprehensive proteogenomic profiling.
Collapse
|
21
|
Proteome Profiling Outperforms Transcriptome Profiling for Coexpression Based Gene Function Prediction. Mol Cell Proteomics 2016; 16:121-134. [PMID: 27836980 PMCID: PMC5217778 DOI: 10.1074/mcp.m116.060301] [Citation(s) in RCA: 91] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2016] [Revised: 11/07/2016] [Indexed: 01/05/2023] Open
Abstract
Coexpression of mRNAs under multiple conditions is commonly used to infer cofunctionality of their gene products despite well-known limitations of this "guilt-by-association" (GBA) approach. Recent advancements in mass spectrometry-based proteomic technologies have enabled global expression profiling at the protein level; however, whether proteome profiling data can outperform transcriptome profiling data for coexpression based gene function prediction has not been systematically investigated. Here, we address this question by constructing and analyzing mRNA and protein coexpression networks for three cancer types with matched mRNA and protein profiling data from The Cancer Genome Atlas (TCGA) and the Clinical Proteomic Tumor Analysis Consortium (CPTAC). Our analyses revealed a marked difference in wiring between the mRNA and protein coexpression networks. Whereas protein coexpression was driven primarily by functional similarity between coexpressed genes, mRNA coexpression was driven by both cofunction and chromosomal colocalization of the genes. Functionally coherent mRNA modules were more likely to have their edges preserved in corresponding protein networks than functionally incoherent mRNA modules. Proteomic data strengthened the link between gene expression and function for at least 75% of Gene Ontology (GO) biological processes and 90% of KEGG pathways. A web application Gene2Net (http://cptac.gene2net.org) developed based on the three protein coexpression networks revealed novel gene-function relationships, such as linking ERBB2 (HER2) to lipid biosynthetic process in breast cancer, identifying PLG as a new gene involved in complement activation, and identifying AEBP1 as a new epithelial-mesenchymal transition (EMT) marker. Our results demonstrate that proteome profiling outperforms transcriptome profiling for coexpression based gene function prediction. Proteomics should be integrated if not preferred in gene function and human disease studies.
Collapse
|
22
|
Recommendations for the Generation, Quantification, Storage, and Handling of Peptides Used for Mass Spectrometry-Based Assays. Clin Chem 2016; 62:48-69. [PMID: 26719571 DOI: 10.1373/clinchem.2015.250563] [Citation(s) in RCA: 151] [Impact Index Per Article: 18.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
BACKGROUND For many years, basic and clinical researchers have taken advantage of the analytical sensitivity and specificity afforded by mass spectrometry in the measurement of proteins. Clinical laboratories are now beginning to deploy these work flows as well. For assays that use proteolysis to generate peptides for protein quantification and characterization, synthetic stable isotope-labeled internal standard peptides are of central importance. No general recommendations are currently available surrounding the use of peptides in protein mass spectrometric assays. CONTENT The Clinical Proteomic Tumor Analysis Consortium of the National Cancer Institute has collaborated with clinical laboratorians, peptide manufacturers, metrologists, representatives of the pharmaceutical industry, and other professionals to develop a consensus set of recommendations for peptide procurement, characterization, storage, and handling, as well as approaches to the interpretation of the data generated by mass spectrometric protein assays. Additionally, the importance of carefully characterized reference materials-in particular, peptide standards for the improved concordance of amino acid analysis methods across the industry-is highlighted. The alignment of practices around the use of peptides and the transparency of sample preparation protocols should allow for the harmonization of peptide and protein quantification in research and clinical care.
Collapse
|
23
|
Proteogenomics connects somatic mutations to signalling in breast cancer. Nature 2016; 534:55-62. [PMID: 27251275 PMCID: PMC5102256 DOI: 10.1038/nature18003] [Citation(s) in RCA: 1104] [Impact Index Per Article: 138.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2015] [Accepted: 04/13/2016] [Indexed: 12/17/2022]
Abstract
Somatic mutations have been extensively characterized in breast cancer, but the effects of these genetic alterations on the proteomic landscape remain poorly understood. We describe quantitative mass spectrometry-based proteomic and phosphoproteomic analyses of 105 genomically annotated breast cancers of which 77 provided high-quality data. Integrated analyses allowed insights into the somatic cancer genome including the consequences of chromosomal loss, such as the 5q deletion characteristic of basal-like breast cancer. The 5q trans effects were interrogated against the Library of Integrated Network-based Cellular Signatures, thereby connecting CETN3 and SKP1 loss to elevated expression of EGFR, and SKP1 loss also to increased SRC. Global proteomic data confirmed a stromal-enriched group in addition to basal and luminal clusters and pathway analysis of the phosphoproteome identified a G Protein-coupled receptor cluster that was not readily identified at the mRNA level. Besides ERBB2, other amplicon-associated, highly phosphorylated kinases were identified, including CDK12, PAK1, PTK2, RIPK2 and TLK2. We demonstrate that proteogenomic analysis of breast cancer elucidates functional consequences of somatic mutations, narrows candidate nominations for driver genes within large deletions and amplified regions, and identifies therapeutic targets.
Collapse
|
24
|
A Description of the Clinical Proteomic Tumor Analysis Consortium (CPTAC) Common Data Analysis Pipeline. J Proteome Res 2016; 15:1023-32. [PMID: 26860878 DOI: 10.1021/acs.jproteome.5b01091] [Citation(s) in RCA: 76] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The Clinical Proteomic Tumor Analysis Consortium (CPTAC) has produced large proteomics data sets from the mass spectrometric interrogation of tumor samples previously analyzed by The Cancer Genome Atlas (TCGA) program. The availability of the genomic and proteomic data is enabling proteogenomic study for both reference (i.e., contained in major sequence databases) and nonreference markers of cancer. The CPTAC laboratories have focused on colon, breast, and ovarian tissues in the first round of analyses; spectra from these data sets were produced from 2D liquid chromatography-tandem mass spectrometry analyses and represent deep coverage. To reduce the variability introduced by disparate data analysis platforms (e.g., software packages, versions, parameters, sequence databases, etc.), the CPTAC Common Data Analysis Platform (CDAP) was created. The CDAP produces both peptide-spectrum-match (PSM) reports and gene-level reports. The pipeline processes raw mass spectrometry data according to the following: (1) peak-picking and quantitative data extraction, (2) database searching, (3) gene-based protein parsimony, and (4) false-discovery rate-based filtering. The pipeline also produces localization scores for the phosphopeptide enrichment studies using the PhosphoRS program. Quantitative information for each of the data sets is specific to the sample processing, with PSM and protein reports containing the spectrum-level or gene-level ("rolled-up") precursor peak areas and spectral counts for label-free or reporter ion log-ratios for 4plex iTRAQ. The reports are available in simple tab-delimited formats and, for the PSM-reports, in mzIdentML. The goal of the CDAP is to provide standard, uniform reports for all of the CPTAC data to enable comparisons between different samples and cancer types as well as across the major omics fields.
Collapse
|
25
|
Abstract
The NCI Clinical Proteomic Tumor Analysis Consortium (CPTAC) employed a pair of reference xenograft proteomes for initial platform validation and ongoing quality control of its data collection for The Cancer Genome Atlas (TCGA) tumors. These two xenografts, representing basal and luminal-B human breast cancer, were fractionated and analyzed on six mass spectrometers in a total of 46 replicates divided between iTRAQ and label-free technologies, spanning a total of 1095 LC-MS/MS experiments. These data represent a unique opportunity to evaluate the stability of proteomic differentiation by mass spectrometry over many months of time for individual instruments or across instruments running dissimilar workflows. We evaluated iTRAQ reporter ions, label-free spectral counts, and label-free extracted ion chromatograms as strategies for data interpretation (source code is available from http://homepages.uc.edu/~wang2x7/Research.htm ). From these assessments, we found that differential genes from a single replicate were confirmed by other replicates on the same instrument from 61 to 93% of the time. When comparing across different instruments and quantitative technologies, using multiple replicates, differential genes were reproduced by other data sets from 67 to 99% of the time. Projecting gene differences to biological pathways and networks increased the degree of similarity. These overlaps send an encouraging message about the maturity of technologies for proteomic differentiation.
Collapse
|
26
|
An Analysis of the Sensitivity of Proteogenomic Mapping of Somatic Mutations and Novel Splicing Events in Cancer. Mol Cell Proteomics 2015; 15:1060-71. [PMID: 26631509 DOI: 10.1074/mcp.m115.056226] [Citation(s) in RCA: 90] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2015] [Indexed: 11/06/2022] Open
Abstract
Improvements in mass spectrometry (MS)-based peptide sequencing provide a new opportunity to determine whether polymorphisms, mutations, and splice variants identified in cancer cells are translated. Herein, we apply a proteogenomic data integration tool (QUILTS) to illustrate protein variant discovery using whole genome, whole transcriptome, and global proteome datasets generated from a pair of luminal and basal-like breast-cancer-patient-derived xenografts (PDX). The sensitivity of proteogenomic analysis for singe nucleotide variant (SNV) expression and novel splice junction (NSJ) detection was probed using multiple MS/MS sample process replicates defined here as an independent tandem MS experiment using identical sample material. Despite analysis of over 30 sample process replicates, only about 10% of SNVs (somatic and germline) detected by both DNA and RNA sequencing were observed as peptides. An even smaller proportion of peptides corresponding to NSJ observed by RNA sequencing were detected (<0.1%). Peptides mapping to DNA-detected SNVs without a detectable mRNA transcript were also observed, suggesting that transcriptome coverage was incomplete (∼80%). In contrast to germline variants, somatic variants were less likely to be detected at the peptide level in the basal-like tumor than in the luminal tumor, raising the possibility of differential translation or protein degradation effects. In conclusion, this large-scale proteogenomic integration allowed us to determine the degree to which mutations are translated and identify gaps in sequence coverage, thereby benchmarking current technology and progress toward whole cancer proteome and transcriptome analysis.
Collapse
|
27
|
Large-Scale Interlaboratory Study to Develop, Analytically Validate and Apply Highly Multiplexed, Quantitative Peptide Assays to Measure Cancer-Relevant Proteins in Plasma. Mol Cell Proteomics 2015; 14:2357-74. [PMID: 25693799 DOI: 10.1074/mcp.m114.047050] [Citation(s) in RCA: 138] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2014] [Indexed: 11/06/2022] Open
Abstract
There is an increasing need in biology and clinical medicine to robustly and reliably measure tens to hundreds of peptides and proteins in clinical and biological samples with high sensitivity, specificity, reproducibility, and repeatability. Previously, we demonstrated that LC-MRM-MS with isotope dilution has suitable performance for quantitative measurements of small numbers of relatively abundant proteins in human plasma and that the resulting assays can be transferred across laboratories while maintaining high reproducibility and quantitative precision. Here, we significantly extend that earlier work, demonstrating that 11 laboratories using 14 LC-MS systems can develop, determine analytical figures of merit, and apply highly multiplexed MRM-MS assays targeting 125 peptides derived from 27 cancer-relevant proteins and seven control proteins to precisely and reproducibly measure the analytes in human plasma. To ensure consistent generation of high quality data, we incorporated a system suitability protocol (SSP) into our experimental design. The SSP enabled real-time monitoring of LC-MRM-MS performance during assay development and implementation, facilitating early detection and correction of chromatographic and instrumental problems. Low to subnanogram/ml sensitivity for proteins in plasma was achieved by one-step immunoaffinity depletion of 14 abundant plasma proteins prior to analysis. Median intra- and interlaboratory reproducibility was <20%, sufficient for most biological studies and candidate protein biomarker verification. Digestion recovery of peptides was assessed and quantitative accuracy improved using heavy-isotope-labeled versions of the proteins as internal standards. Using the highly multiplexed assay, participating laboratories were able to precisely and reproducibly determine the levels of a series of analytes in blinded samples used to simulate an interlaboratory clinical study of patient samples. Our study further establishes that LC-MRM-MS using stable isotope dilution, with appropriate attention to analytical validation and appropriate quality control measures, enables sensitive, specific, reproducible, and quantitative measurements of proteins and peptides in complex biological matrices such as plasma.
Collapse
|
28
|
Abstract
In the past two decades, our ability to study cellular and molecular systems has been transformed through the development of omics sciences. While unlimited potential lies within massive omics datasets, the success of omics sciences to further our understanding of human disease and/or translating these findings to clinical utility remains elusive due to a number of factors. A significant limiting factor is the integration of different omics datasets (i.e., integromics) for extraction of biological and clinical insights. To this end, the National Cancer Institute (NCI) and the National Heart, Lung and Blood Institute (NHLBI) organized a joint workshop in June 2012 with the focus on integration issues related to multi-omics technologies that needed to be resolved in order to realize the full utility of integrating omics datasets by providing a glimpse into the disease as an integrated “system”. The overarching goals were to (1) identify challenges and roadblocks in omics integration, and (2) facilitate the full maturation of ‘integromics’ in biology and medicine. Participants reached a consensus on the most significant barriers for integrating omics sciences and provided recommendations on viable approaches to overcome each of these barriers within the areas of technology, bioinformatics and clinical medicine.
Collapse
|
29
|
Statistical design for biospecimen cohort size in proteomics-based biomarker discovery and verification studies. J Proteome Res 2013; 12:5383-94. [PMID: 24063748 DOI: 10.1021/pr400132j] [Citation(s) in RCA: 88] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Protein biomarkers are needed to deepen our understanding of cancer biology and to improve our ability to diagnose, monitor, and treat cancers. Important analytical and clinical hurdles must be overcome to allow the most promising protein biomarker candidates to advance into clinical validation studies. Although contemporary proteomics technologies support the measurement of large numbers of proteins in individual clinical specimens, sample throughput remains comparatively low. This problem is amplified in typical clinical proteomics research studies, which routinely suffer from a lack of proper experimental design, resulting in analysis of too few biospecimens to achieve adequate statistical power at each stage of a biomarker pipeline. To address this critical shortcoming, a joint workshop was held by the National Cancer Institute (NCI), National Heart, Lung, and Blood Institute (NHLBI), and American Association for Clinical Chemistry (AACC) with participation from the U.S. Food and Drug Administration (FDA). An important output from the workshop was a statistical framework for the design of biomarker discovery and verification studies. Herein, we describe the use of quantitative clinical judgments to set statistical criteria for clinical relevance and the development of an approach to calculate biospecimen sample size for proteomic studies in discovery and verification stages prior to clinical validation stage. This represents a first step toward building a consensus on quantitative criteria for statistical design of proteomics biomarker discovery and verification research.
Collapse
|
30
|
Design, implementation and multisite evaluation of a system suitability protocol for the quantitative assessment of instrument performance in liquid chromatography-multiple reaction monitoring-MS (LC-MRM-MS). Mol Cell Proteomics 2013; 12:2623-39. [PMID: 23689285 DOI: 10.1074/mcp.m112.027078] [Citation(s) in RCA: 90] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open
Abstract
Multiple reaction monitoring (MRM) mass spectrometry coupled with stable isotope dilution (SID) and liquid chromatography (LC) is increasingly used in biological and clinical studies for precise and reproducible quantification of peptides and proteins in complex sample matrices. Robust LC-SID-MRM-MS-based assays that can be replicated across laboratories and ultimately in clinical laboratory settings require standardized protocols to demonstrate that the analysis platforms are performing adequately. We developed a system suitability protocol (SSP), which employs a predigested mixture of six proteins, to facilitate performance evaluation of LC-SID-MRM-MS instrument platforms, configured with nanoflow-LC systems interfaced to triple quadrupole mass spectrometers. The SSP was designed for use with low multiplex analyses as well as high multiplex approaches when software-driven scheduling of data acquisition is required. Performance was assessed by monitoring of a range of chromatographic and mass spectrometric metrics including peak width, chromatographic resolution, peak capacity, and the variability in peak area and analyte retention time (RT) stability. The SSP, which was evaluated in 11 laboratories on a total of 15 different instruments, enabled early diagnoses of LC and MS anomalies that indicated suboptimal LC-MRM-MS performance. The observed range in variation of each of the metrics scrutinized serves to define the criteria for optimized LC-SID-MRM-MS platforms for routine use, with pass/fail criteria for system suitability performance measures defined as peak area coefficient of variation <0.15, peak width coefficient of variation <0.15, standard deviation of RT <0.15 min (9 s), and the RT drift <0.5min (30 s). The deleterious effect of a marginally performing LC-SID-MRM-MS system on the limit of quantification (LOQ) in targeted quantitative assays illustrates the use and need for a SSP to establish robust and reliable system performance. Use of a SSP helps to ensure that analyte quantification measurements can be replicated with good precision within and across multiple laboratories and should facilitate more widespread use of MRM-MS technology by the basic biomedical and clinical laboratory research communities.
Collapse
|
31
|
Recommendations for mass spectrometry data quality metrics for open access data (corollary to the Amsterdam principles). Proteomics Clin Appl 2012; 5:580-9. [PMID: 22213554 DOI: 10.1002/prca.201100097] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Policies supporting the rapid and open sharing of proteomic data are being implemented by the leading journals in the field. The proteomics community is taking steps to ensure that data are made publicly accessible and are of high quality, a challenging task that requires the development and deployment of methods for measuring and documenting data quality metrics. On September 18, 2010, the U.S. National Cancer Institute (NCI) convened the "International Workshop on Proteomic Data Quality Metrics" in Sydney, Australia, to identify and address issues facing the development and use of such methods for open access proteomics data. The stakeholders at the workshop enumerated the key principles underlying a framework for data quality assessment in mass spectrometry data that will meet the needs of the research community, journals, funding agencies, and data repositories. Attendees discussed and agreed up on two primary needs for the wide use of quality metrics: (i) an evolving list of comprehensive quality metrics and (ii) standards accompanied by software analytics. Attendees stressed the importance of increased education and training programs to promote reliable protocols in proteomics. This workshop report explores the historic precedents, key discussions, and necessary next steps to enhance the quality of open access data. By agreement, this article is published simultaneously in Proteomics, Proteomics Clinical Applications, Journal of Proteome Research, and Molecular and Cellular Proteomics, as a public service to the research community. The peer review process was a coordinated effort conducted by a panel of referees selected by the journals.
Collapse
|
32
|
Recommendations for mass spectrometry data quality metrics for open access data (corollary to the Amsterdam Principles). J Proteome Res 2012; 11:1412-9. [PMID: 22053864 PMCID: PMC3272102 DOI: 10.1021/pr201071t] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
Policies supporting the rapid and open sharing of proteomic data are being implemented by the leading journals in the field. The proteomics community is taking steps to ensure that data are made publicly accessible and are of high quality, a challenging task that requires the development and deployment of methods for measuring and documenting data quality metrics. On September 18, 2010, the U.S. National Cancer Institute (NCI) convened the "International Workshop on Proteomic Data Quality Metrics" in Sydney, Australia, to identify and address issues facing the development and use of such methods for open access proteomics data. The stakeholders at the workshop enumerated the key principles underlying a framework for data quality assessment in mass spectrometry data that will meet the needs of the research community, journals, funding agencies, and data repositories. Attendees discussed and agreed up on two primary needs for the wide use of quality metrics: (1) an evolving list of comprehensive quality metrics and (2) standards accompanied by software analytics. Attendees stressed the importance of increased education and training programs to promote reliable protocols in proteomics. This workshop report explores the historic precedents, key discussions, and necessary next steps to enhance the quality of open access data. By agreement, this article is published simultaneously in the Journal of Proteome Research, Molecular and Cellular Proteomics, Proteomics, and Proteomics Clinical Applications as a public service to the research community. The peer review process was a coordinated effort conducted by a panel of referees selected by the journals.
Collapse
|
33
|
Recommendations for mass spectrometry data quality metrics for open access data (corollary to the Amsterdam principles). Proteomics 2011; 12:11-20. [PMID: 22069307 DOI: 10.1002/pmic.201100562] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2011] [Accepted: 10/27/2011] [Indexed: 11/10/2022]
Abstract
Policies supporting the rapid and open sharing of proteomic data are being implemented by the leading journals in the field. The proteomics community is taking steps to ensure that data are made publicly accessible and are of high quality, a challenging task that requires the development and deployment of methods for measuring and documenting data quality metrics. On September 18, 2010, the U.S. National Cancer Institute (NCI) convened the "International Workshop on Proteomic Data Quality Metrics" in Sydney, Australia, to identify and address issues facing the development and use of such methods for open access proteomics data. The stakeholders at the workshop enumerated the key principles underlying a framework for data quality assessment in mass spectrometry data that will meet the needs of the research community, journals, funding agencies, and data repositories. Attendees discussed and agreed upon two primary needs for the wide use of quality metrics: (i) an evolving list of comprehensive quality metrics and (ii) standards accompanied by software analytics. Attendees stressed the importance of increased education and training programs to promote reliable protocols in proteomics. This workshop report explores the historic precedents, key discussions, and necessary next steps to enhance the quality of open access data. By agreement, this article is published simultaneously in Proteomics, Proteomics Clinical Applications, Journal of Proteome Research, and Molecular and Cellular Proteomics, as a public service to the research community. The peer review process was a coordinated effort conducted by a panel of referees selected by the journals.
Collapse
|
34
|
Recommendations for mass spectrometry data quality metrics for open access data (corollary to the Amsterdam Principles). Mol Cell Proteomics 2011; 10:O111.015446. [PMID: 22052993 DOI: 10.1074/mcp.o111.015446] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Policies supporting the rapid and open sharing of proteomic data are being implemented by the leading journals in the field. The proteomics community is taking steps to ensure that data are made publicly accessible and are of high quality, a challenging task that requires the development and deployment of methods for measuring and documenting data quality metrics. On September 18, 2010, the United States National Cancer Institute convened the "International Workshop on Proteomic Data Quality Metrics" in Sydney, Australia, to identify and address issues facing the development and use of such methods for open access proteomics data. The stakeholders at the workshop enumerated the key principles underlying a framework for data quality assessment in mass spectrometry data that will meet the needs of the research community, journals, funding agencies, and data repositories. Attendees discussed and agreed up on two primary needs for the wide use of quality metrics: 1) an evolving list of comprehensive quality metrics and 2) standards accompanied by software analytics. Attendees stressed the importance of increased education and training programs to promote reliable protocols in proteomics. This workshop report explores the historic precedents, key discussions, and necessary next steps to enhance the quality of open access data. By agreement, this article is published simultaneously in the Journal of Proteome Research, Molecular and Cellular Proteomics, Proteomics, and Proteomics Clinical Applications as a public service to the research community. The peer review process was a coordinated effort conducted by a panel of referees selected by the journals.
Collapse
|
35
|
Repeatability and reproducibility in proteomic identifications by liquid chromatography-tandem mass spectrometry. J Proteome Res 2010; 9:761-76. [PMID: 19921851 DOI: 10.1021/pr9006365] [Citation(s) in RCA: 409] [Impact Index Per Article: 29.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The complexity of proteomic instrumentation for LC-MS/MS introduces many possible sources of variability. Data-dependent sampling of peptides constitutes a stochastic element at the heart of discovery proteomics. Although this variation impacts the identification of peptides, proteomic identifications are far from completely random. In this study, we analyzed interlaboratory data sets from the NCI Clinical Proteomic Technology Assessment for Cancer to examine repeatability and reproducibility in peptide and protein identifications. Included data spanned 144 LC-MS/MS experiments on four Thermo LTQ and four Orbitrap instruments. Samples included yeast lysate, the NCI-20 defined dynamic range protein mix, and the Sigma UPS 1 defined equimolar protein mix. Some of our findings reinforced conventional wisdom, such as repeatability and reproducibility being higher for proteins than for peptides. Most lessons from the data, however, were more subtle. Orbitraps proved capable of higher repeatability and reproducibility, but aberrant performance occasionally erased these gains. Even the simplest protein digestions yielded more peptide ions than LC-MS/MS could identify during a single experiment. We observed that peptide lists from pairs of technical replicates overlapped by 35-60%, giving a range for peptide-level repeatability in these experiments. Sample complexity did not appear to affect peptide identification repeatability, even as numbers of identified spectra changed by an order of magnitude. Statistical analysis of protein spectral counts revealed greater stability across technical replicates for Orbitraps, making them superior to LTQ instruments for biomarker candidate discovery. The most repeatable peptides were those corresponding to conventional tryptic cleavage sites, those that produced intense MS signals, and those that resulted from proteins generating many distinct peptides. Reproducibility among different instruments of the same type lagged behind repeatability of technical replicates on a single instrument by several percent. These findings reinforce the importance of evaluating repeatability as a fundamental characteristic of analytical technologies.
Collapse
|
36
|
Abstract
As a part of ongoing efforts of the NCI-FDA Interagency Oncology Task Force subcommittee on molecular diagnostics, members of the Clinical Proteomic Technology Assessment for Cancer program of the National Cancer Institute have submitted 2 protein-based multiplex assay descriptions to the Office of In Vitro Diagnostic Device Evaluation and Safety, US Food and Drug Administration. The objective was to evaluate the analytical measurement criteria and studies needed to validate protein-based multiplex assays. Each submission described a different protein-based platform: a multiplex immunoaffinity mass spectrometry platform for protein quantification, and an immunological array platform quantifying glycoprotein isoforms. Submissions provided a mutually beneficial way for members of the proteomics and regulatory communities to identify the analytical issues that the field should address when developing protein-based multiplex clinical assays.
Collapse
|
37
|
Analytical validation of protein-based multiplex assays: a workshop report by the NCI-FDA interagency oncology task force on molecular diagnostics. Clin Chem 2009; 56:237-43. [PMID: 20007859 DOI: 10.1373/clinchem.2009.136416] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Clinical proteomics has the potential to enable the early detection of cancer through the development of multiplex assays that can inform clinical decisions. However, there has been some uncertainty among translational researchers and developers as to the specific analytical measurement criteria needed to validate protein-based multiplex assays. To begin to address the causes of this uncertainty, a day-long workshop titled "Interagency Oncology Task Force Molecular Diagnostics Workshop" was held in which members of the proteomics and regulatory communities discussed many of the analytical evaluation issues that the field should address in development of protein-based multiplex assays for clinical use. This meeting report explores the issues raised at the workshop and details the recommendations that came out of the day's discussions, such as a workshop summary discussing the analytical evaluation issues that specific proteomic technologies should address when seeking US Food and Drug Administration approval.
Collapse
|
38
|
Interlaboratory study characterizing a yeast performance standard for benchmarking LC-MS platform performance. Mol Cell Proteomics 2009; 9:242-54. [PMID: 19858499 DOI: 10.1074/mcp.m900222-mcp200] [Citation(s) in RCA: 140] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Optimal performance of LC-MS/MS platforms is critical to generating high quality proteomics data. Although individual laboratories have developed quality control samples, there is no widely available performance standard of biological complexity (and associated reference data sets) for benchmarking of platform performance for analysis of complex biological proteomes across different laboratories in the community. Individual preparations of the yeast Saccharomyces cerevisiae proteome have been used extensively by laboratories in the proteomics community to characterize LC-MS platform performance. The yeast proteome is uniquely attractive as a performance standard because it is the most extensively characterized complex biological proteome and the only one associated with several large scale studies estimating the abundance of all detectable proteins. In this study, we describe a standard operating protocol for large scale production of the yeast performance standard and offer aliquots to the community through the National Institute of Standards and Technology where the yeast proteome is under development as a certified reference material to meet the long term needs of the community. Using a series of metrics that characterize LC-MS performance, we provide a reference data set demonstrating typical performance of commonly used ion trap instrument platforms in expert laboratories; the results provide a basis for laboratories to benchmark their own performance, to improve upon current methods, and to evaluate new technologies. Additionally, we demonstrate how the yeast reference, spiked with human proteins, can be used to benchmark the power of proteomics platforms for detection of differentially expressed proteins at different levels of concentration in a complex matrix, thereby providing a metric to evaluate and minimize pre-analytical and analytical variation in comparative proteomics experiments.
Collapse
|
39
|
Performance metrics for liquid chromatography-tandem mass spectrometry systems in proteomics analyses. Mol Cell Proteomics 2009; 9:225-41. [PMID: 19837981 PMCID: PMC2830836 DOI: 10.1074/mcp.m900223-mcp200] [Citation(s) in RCA: 158] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
A major unmet need in LC-MS/MS-based proteomics analyses is a set of tools for quantitative assessment of system performance and evaluation of technical variability. Here we describe 46 system performance metrics for monitoring chromatographic performance, electrospray source stability, MS1 and MS2 signals, dynamic sampling of ions for MS/MS, and peptide identification. Applied to data sets from replicate LC-MS/MS analyses, these metrics displayed consistent, reasonable responses to controlled perturbations. The metrics typically displayed variations less than 10% and thus can reveal even subtle differences in performance of system components. Analyses of data from interlaboratory studies conducted under a common standard operating procedure identified outlier data and provided clues to specific causes. Moreover, interlaboratory variation reflected by the metrics indicates which system components vary the most between laboratories. Application of these metrics enables rational, quantitative quality assessment for proteomics and other LC-MS/MS analytical applications.
Collapse
|
40
|
Recommendations from the 2008 International Summit on Proteomics Data Release and Sharing Policy: the Amsterdam principles. J Proteome Res 2009; 8:3689-92. [PMID: 19344107 DOI: 10.1021/pr900023z] [Citation(s) in RCA: 65] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Policies supporting the rapid and open sharing of genomic data have directly fueled the accelerated pace of discovery in large-scale genomics research. The proteomics community is starting to implement analogous policies and infrastructure for making large-scale proteomics data widely available on a precompetitive basis. On August 14, 2008, the National Cancer Institute (NCI) convened the "International Summit on Proteomics Data Release and Sharing Policy" in Amsterdam, The Netherlands, to identify and address potential roadblocks to rapid and open access to data. The six principles agreed upon by key stakeholders at the summit addressed issues surrounding (1) timing, (2) comprehensiveness, (3) format, (4) deposition to repositories, (5) quality metrics, and (6) responsibility for proteomics data release. This summit report explores various approaches to develop a framework of data release and sharing principles that will most effectively fulfill the needs of the funding agencies and the research community.
Collapse
|
41
|
Erratum: Corrigendum: Multi-site assessment of the precision and reproducibility of multiple reaction monitoring–based measurements of proteins in plasma. Nat Biotechnol 2009. [DOI: 10.1038/nbt0909-864b] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
42
|
Multi-site assessment of the precision and reproducibility of multiple reaction monitoring-based measurements of proteins in plasma. Nat Biotechnol 2009; 27:633-41. [PMID: 19561596 DOI: 10.1038/nbt.1546] [Citation(s) in RCA: 819] [Impact Index Per Article: 54.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2009] [Accepted: 05/31/2009] [Indexed: 01/13/2023]
Abstract
Verification of candidate biomarkers relies upon specific, quantitative assays optimized for selective detection of target proteins, and is increasingly viewed as a critical step in the discovery pipeline that bridges unbiased biomarker discovery to preclinical validation. Although individual laboratories have demonstrated that multiple reaction monitoring (MRM) coupled with isotope dilution mass spectrometry can quantify candidate protein biomarkers in plasma, reproducibility and transferability of these assays between laboratories have not been demonstrated. We describe a multilaboratory study to assess reproducibility, recovery, linear dynamic range and limits of detection and quantification of multiplexed, MRM-based assays, conducted by NCI-CPTAC. Using common materials and standardized protocols, we demonstrate that these assays can be highly reproducible within and across laboratories and instrument platforms, and are sensitive to low mug/ml protein concentrations in unfractionated plasma. We provide data and benchmarks against which individual laboratories can compare their performance and evaluate new technologies for biomarker verification in plasma.
Collapse
|
43
|
|
44
|
How useful are vibrational frequencies of isotopomeric O2 fragments for assessing local symmetry? Some simple systems and the vexing case of a galactose oxidase model. J Biol Inorg Chem 2005; 10:778-89. [PMID: 16187071 DOI: 10.1007/s00775-005-0026-0] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2005] [Accepted: 08/23/2005] [Indexed: 11/27/2022]
Abstract
The tendency for mixed-isotope O2 fragments to exhibit different stretching frequencies in asymmetric environments is examined with various levels of electronic structure theory for simple peroxides and peroxyl radicals, as well as for a variety of monocopper-O2 complexes. The study of the monocopper species is motivated by their relevance to the active site of galactose oxidase. Extensive theoretical work with an experimental model characterized by Jazdzewski et al. (J. Biol. Inorg. Chem. 8:381-393, 2003) suggests that the failure to observe a splitting between 16O18O and 18O16O isotopomers cannot be taken as evidence against end-on O2 coordination. Conformational analysis on an energetic basis, however, is complicated by biradical character inherent in all of the copper-O2 singlet structures.
Collapse
|
45
|
Abstract
Two highly reactive heterodinuclear bis(mu-oxo) complexes were prepared by combining mononuclear peroxo species with reduced metal precursors at -80 degrees C and were identified by UV-vis, EPR/NMR, and resonance Raman spectroscopy, with corroboration in the case of the CuPd system from density functional calculations.
Collapse
|
46
|
Mechanism of intramolecular C–H bond activation in [(LCu)2(μ-O)2]2+ (L=1,4,7-trialkyl-1,4,7-triazacyclononane): quantum mechanical/molecular mechanical modeling. ACTA ACUST UNITED AC 2003. [DOI: 10.1016/s0166-1280(03)00292-6] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
47
|
|
48
|
|
49
|
Electronic Structure and Bonding in Hexacoordinate Silyl–Palladium Complexes Support from the National Science Foundation (CHE-9876792) is gratefully acknowledged. Angew Chem Int Ed Engl 2002. [DOI: 10.1002/1521-3773(20020603)41:11<1953::aid-anie1953>3.0.co;2-h] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
50
|
Electronic Structure and Bonding in Hexacoordinate Silyl–Palladium Complexes Support from the National Science Foundation (CHE-9876792) is gratefully acknowledged. Angew Chem Int Ed Engl 2002. [DOI: 10.1002/1521-3757(20020603)114:11<2033::aid-ange2033>3.0.co;2-5] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|