1
|
Wang XY, Xu YM, Lau ATY. Proteogenomics in Cancer: Then and Now. J Proteome Res 2023; 22:3103-3122. [PMID: 37725793 DOI: 10.1021/acs.jproteome.3c00196] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/21/2023]
Abstract
For years, the paths of sequencing technologies and mass spectrometry have occurred in isolation, with each developing its own unique culture and expertise. These two technologies are crucial for inspecting complementary aspects of the molecular phenotype across the central dogma. Integrative multiomics strives to bridge the analysis gap among different fields to complete more comprehensive mechanisms of life events and diseases. Proteogenomics is one integrated multiomics field. Here in this review, we mainly summarize and discuss three aspects: workflow of proteogenomics, proteogenomics applications in cancer research, and the SWOT (Strengths, Weaknesses, Opportunities, Threats) analysis of proteogenomics in cancer research. In conclusion, proteogenomics has a promising future as it clarifies the functional consequences of many unannotated genomic abnormalities or noncanonical variants and identifies driver genes and novel therapeutic targets across cancers, which would substantially accelerate the development of precision oncology.
Collapse
Affiliation(s)
- Xiu-Yun Wang
- Laboratory of Cancer Biology and Epigenetics, Department of Cell Biology and Genetics, Shantou University Medical College, Shantou, Guangdong 515041, People's Republic of China
| | - Yan-Ming Xu
- Laboratory of Cancer Biology and Epigenetics, Department of Cell Biology and Genetics, Shantou University Medical College, Shantou, Guangdong 515041, People's Republic of China
| | - Andy T Y Lau
- Laboratory of Cancer Biology and Epigenetics, Department of Cell Biology and Genetics, Shantou University Medical College, Shantou, Guangdong 515041, People's Republic of China
| |
Collapse
|
2
|
Li Y, Dou Y, Da Veiga Leprevost F, Geffen Y, Calinawan AP, Aguet F, Akiyama Y, Anand S, Birger C, Cao S, Chaudhary R, Chilappagari P, Cieslik M, Colaprico A, Zhou DC, Day C, Domagalski MJ, Esai Selvan M, Fenyö D, Foltz SM, Francis A, Gonzalez-Robles T, Gümüş ZH, Heiman D, Holck M, Hong R, Hu Y, Jaehnig EJ, Ji J, Jiang W, Katsnelson L, Ketchum KA, Klein RJ, Lei JT, Liang WW, Liao Y, Lindgren CM, Ma W, Ma L, MacCoss MJ, Martins Rodrigues F, McKerrow W, Nguyen N, Oldroyd R, Pilozzi A, Pugliese P, Reva B, Rudnick P, Ruggles KV, Rykunov D, Savage SR, Schnaubelt M, Schraink T, Shi Z, Singhal D, Song X, Storrs E, Terekhanova NV, Thangudu RR, Thiagarajan M, Wang LB, Wang JM, Wang Y, Wen B, Wu Y, Wyczalkowski MA, Xin Y, Yao L, Yi X, Zhang H, Zhang Q, Zuhl M, Getz G, Ding L, Nesvizhskii AI, Wang P, Robles AI, Zhang B, Payne SH. Proteogenomic data and resources for pan-cancer analysis. Cancer Cell 2023; 41:1397-1406. [PMID: 37582339 PMCID: PMC10506762 DOI: 10.1016/j.ccell.2023.06.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/08/2022] [Revised: 11/15/2022] [Accepted: 06/27/2023] [Indexed: 08/17/2023]
Abstract
The National Cancer Institute's Clinical Proteomic Tumor Analysis Consortium (CPTAC) investigates tumors from a proteogenomic perspective, creating rich multi-omics datasets connecting genomic aberrations to cancer phenotypes. To facilitate pan-cancer investigations, we have generated harmonized genomic, transcriptomic, proteomic, and clinical data for >1000 tumors in 10 cohorts to create a cohesive and powerful dataset for scientific discovery. We outline efforts by the CPTAC pan-cancer working group in data harmonization, data dissemination, and computational resources for aiding biological discoveries. We also discuss challenges for multi-omics data integration and analysis, specifically the unique challenges of working with both nucleotide sequencing and mass spectrometry proteomics data.
Collapse
Affiliation(s)
- Yize Li
- Department of Medicine, Washington University in St. Louis, St. Louis, MO 63130, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Yongchao Dou
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | | | - Yifat Geffen
- Broad Institute of MIT and Harvard, Cambridge, MA 02141, USA
| | - Anna P Calinawan
- Department of Genetic and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - François Aguet
- Broad Institute of MIT and Harvard, Cambridge, MA 02141, USA
| | - Yo Akiyama
- Broad Institute of MIT and Harvard, Cambridge, MA 02141, USA
| | - Shankara Anand
- Broad Institute of MIT and Harvard, Cambridge, MA 02141, USA
| | - Chet Birger
- Broad Institute of MIT and Harvard, Cambridge, MA 02141, USA
| | - Song Cao
- Department of Medicine, Washington University in St. Louis, St. Louis, MO 63130, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63130, USA
| | | | | | - Marcin Cieslik
- Department of Computational Medicine & Bioinformatics, Department of Pathology, University of Michigan Medical School, Ann Arbor, MI 48109, USA
| | - Antonio Colaprico
- Department of Public Health Sciences, University of Miami Miller School of Medicine, Miami, FL 33136, USA; Sylvester Comprehensive Cancer Center, University of Miami Miller School of Medicine, Miami, FL 33136, USA
| | - Daniel Cui Zhou
- Department of Medicine, Washington University in St. Louis, St. Louis, MO 63130, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Corbin Day
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| | | | - Myvizhi Esai Selvan
- Department of Genetic and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - David Fenyö
- Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY 10016, USA; Department of Biochemistry and Molecular Pharmacology, NYU Grossman School of Medicine, New York, NY 10016, USA
| | - Steven M Foltz
- Department of Medicine, Washington University in St. Louis, St. Louis, MO 63130, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63130, USA
| | | | - Tania Gonzalez-Robles
- Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY 10016, USA; Department of Biochemistry and Molecular Pharmacology, NYU Grossman School of Medicine, New York, NY 10016, USA; Department of Medicine, NYU Grossman School of Medicine, New York, NY 10016, USA
| | - Zeynep H Gümüş
- Department of Genetic and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - David Heiman
- Broad Institute of MIT and Harvard, Cambridge, MA 02141, USA
| | | | - Runyu Hong
- Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY 10016, USA; Department of Biochemistry and Molecular Pharmacology, NYU Grossman School of Medicine, New York, NY 10016, USA
| | - Yingwei Hu
- Department of Pathology, Johns Hopkins University, Baltimore, MD 21231, USA
| | - Eric J Jaehnig
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Jiayi Ji
- Tisch Cancer Institute and Department of Population Health Science and Policy, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Wen Jiang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Lizabeth Katsnelson
- Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY 10016, USA; Department of Biochemistry and Molecular Pharmacology, NYU Grossman School of Medicine, New York, NY 10016, USA
| | | | - Robert J Klein
- Department of Genetic and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Jonathan T Lei
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Wen-Wei Liang
- Department of Medicine, Washington University in St. Louis, St. Louis, MO 63130, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Yuxing Liao
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Caleb M Lindgren
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| | - Weiping Ma
- Department of Genetic and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Lei Ma
- ICF, Rockville, MD 20850, USA
| | - Michael J MacCoss
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Fernanda Martins Rodrigues
- Department of Medicine, Washington University in St. Louis, St. Louis, MO 63130, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Wilson McKerrow
- Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY 10016, USA; Department of Biochemistry and Molecular Pharmacology, NYU Grossman School of Medicine, New York, NY 10016, USA
| | | | - Robert Oldroyd
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| | | | - Pietro Pugliese
- Department of Sciences and Technologies, University of Sannio, Benevento 82100, Italy
| | - Boris Reva
- Department of Genetic and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Paul Rudnick
- Spectragen Informatics, Bainbridge Island, WA 98110, USA
| | - Kelly V Ruggles
- Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY 10016, USA; Department of Medicine, NYU Grossman School of Medicine, New York, NY 10016, USA
| | - Dmitry Rykunov
- Department of Genetic and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Sara R Savage
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Michael Schnaubelt
- Department of Pathology, Johns Hopkins University, Baltimore, MD 21231, USA
| | - Tobias Schraink
- Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY 10016, USA; Department of Biochemistry and Molecular Pharmacology, NYU Grossman School of Medicine, New York, NY 10016, USA; Department of Medicine, NYU Grossman School of Medicine, New York, NY 10016, USA
| | - Zhiao Shi
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | | | - Xiaoyu Song
- Tisch Cancer Institute and Department of Population Health Science and Policy, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Erik Storrs
- Department of Medicine, Washington University in St. Louis, St. Louis, MO 63130, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Nadezhda V Terekhanova
- Department of Medicine, Washington University in St. Louis, St. Louis, MO 63130, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63130, USA
| | | | | | - Liang-Bo Wang
- Department of Medicine, Washington University in St. Louis, St. Louis, MO 63130, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Joshua M Wang
- Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY 10016, USA; Department of Biochemistry and Molecular Pharmacology, NYU Grossman School of Medicine, New York, NY 10016, USA
| | - Ying Wang
- Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY 10016, USA; Department of Biochemistry and Molecular Pharmacology, NYU Grossman School of Medicine, New York, NY 10016, USA
| | - Bo Wen
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Yige Wu
- Department of Medicine, Washington University in St. Louis, St. Louis, MO 63130, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Matthew A Wyczalkowski
- Department of Medicine, Washington University in St. Louis, St. Louis, MO 63130, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Yi Xin
- ICF, Rockville, MD 20850, USA
| | - Lijun Yao
- Department of Medicine, Washington University in St. Louis, St. Louis, MO 63130, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Xinpei Yi
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Hui Zhang
- Department of Pathology, Johns Hopkins University, Baltimore, MD 21231, USA
| | - Qing Zhang
- Broad Institute of MIT and Harvard, Cambridge, MA 02141, USA
| | | | - Gad Getz
- Broad Institute of MIT and Harvard, Cambridge, MA 02141, USA; Cancer Center and Department of Pathology, Mass. General Hospital, Boston, MA 02114, USA; Harvard Medical School, Boston, MA 02115, USA
| | - Li Ding
- Department of Medicine, Washington University in St. Louis, St. Louis, MO 63130, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63130, USA; Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO 63130, USA; Department of Genetics, Washington University in St. Louis, St. Louis, MO 63130, USA
| | | | - Pei Wang
- Department of Genetic and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Ana I Robles
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, Rockville, MD 20850, USA.
| | - Bing Zhang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA.
| | - Samuel H Payne
- Department of Biology, Brigham Young University, Provo, UT 84602, USA.
| |
Collapse
|
3
|
Lee RY, Ng CW, Rajapakse MP, Ang N, Yeong JPS, Lau MC. The promise and challenge of spatial omics in dissecting tumour microenvironment and the role of AI. Front Oncol 2023; 13:1172314. [PMID: 37197415 PMCID: PMC10183599 DOI: 10.3389/fonc.2023.1172314] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2023] [Accepted: 04/18/2023] [Indexed: 05/19/2023] Open
Abstract
Growing evidence supports the critical role of tumour microenvironment (TME) in tumour progression, metastases, and treatment response. However, the in-situ interplay among various TME components, particularly between immune and tumour cells, are largely unknown, hindering our understanding of how tumour progresses and responds to treatment. While mainstream single-cell omics techniques allow deep, single-cell phenotyping, they lack crucial spatial information for in-situ cell-cell interaction analysis. On the other hand, tissue-based approaches such as hematoxylin and eosin and chromogenic immunohistochemistry staining can preserve the spatial information of TME components but are limited by their low-content staining. High-content spatial profiling technologies, termed spatial omics, have greatly advanced in the past decades to overcome these limitations. These technologies continue to emerge to include more molecular features (RNAs and/or proteins) and to enhance spatial resolution, opening new opportunities for discovering novel biological knowledge, biomarkers, and therapeutic targets. These advancements also spur the need for novel computational methods to mine useful TME insights from the increasing data complexity confounded by high molecular features and spatial resolution. In this review, we present state-of-the-art spatial omics technologies, their applications, major strengths, and limitations as well as the role of artificial intelligence (AI) in TME studies.
Collapse
Affiliation(s)
- Ren Yuan Lee
- Singapore Thong Chai Medical Institution, Singapore, Singapore
- Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
| | - Chan Way Ng
- Singapore Immunology Network (SIgN), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
| | | | - Nicholas Ang
- Singapore Immunology Network (SIgN), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
| | - Joe Poh Sheng Yeong
- Department of Anatomical Pathology, Singapore General Hospital, Singapore, Singapore
- Cancer Science Institute of Singapore, National University of Singapore, Singapore, Singapore
- *Correspondence: Joe Poh Sheng Yeong, ; Mai Chan Lau,
| | - Mai Chan Lau
- Singapore Immunology Network (SIgN), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
- *Correspondence: Joe Poh Sheng Yeong, ; Mai Chan Lau,
| |
Collapse
|
4
|
Rajczewski AT, Han Q, Mehta S, Kumar P, Jagtap PD, Knutson CG, Fox JG, Tretyakova NY, Griffin TJ. Quantitative Proteogenomic Characterization of Inflamed Murine Colon Tissue Using an Integrated Discovery, Verification, and Validation Proteogenomic Workflow. Proteomes 2022; 10:proteomes10020011. [PMID: 35466239 PMCID: PMC9036229 DOI: 10.3390/proteomes10020011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2022] [Revised: 03/27/2022] [Accepted: 04/07/2022] [Indexed: 11/24/2022] Open
Abstract
Chronic inflammation of the colon causes genomic and/or transcriptomic events, which can lead to expression of non-canonical protein sequences contributing to oncogenesis. To better understand these mechanisms, Rag2−/−Il10−/− mice were infected with Helicobacter hepaticus to induce chronic inflammation of the cecum and the colon. Transcriptomic data from harvested proximal colon samples were used to generate a customized FASTA database containing non-canonical protein sequences. Using a proteogenomic approach, mass spectrometry data for proximal colon proteins were searched against this custom FASTA database using the Galaxy for Proteomics (Galaxy-P) platform. In addition to the increased abundance in inflammatory response proteins, we also discovered several non-canonical peptide sequences derived from unique proteoforms. We confirmed the veracity of these novel sequences using an automated bioinformatics verification workflow with targeted MS-based assays for peptide validation. Our bioinformatics discovery workflow identified 235 putative non-canonical peptide sequences, of which 58 were verified with high confidence and 39 were validated in targeted proteomics assays. This study provides insights into challenges faced when identifying non-canonical peptides using a proteogenomics approach and demonstrates an integrated workflow addressing these challenges. Our bioinformatic discovery and verification workflow is publicly available and accessible via the Galaxy platform and should be valuable in non-canonical peptide identification using proteogenomics.
Collapse
Affiliation(s)
- Andrew T. Rajczewski
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA; (A.T.R.); (Q.H.); (S.M.); (P.K.); (P.D.J.)
| | - Qiyuan Han
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA; (A.T.R.); (Q.H.); (S.M.); (P.K.); (P.D.J.)
| | - Subina Mehta
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA; (A.T.R.); (Q.H.); (S.M.); (P.K.); (P.D.J.)
| | - Praveen Kumar
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA; (A.T.R.); (Q.H.); (S.M.); (P.K.); (P.D.J.)
| | - Pratik D. Jagtap
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA; (A.T.R.); (Q.H.); (S.M.); (P.K.); (P.D.J.)
| | - Charles G. Knutson
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; (C.G.K.); (J.G.F.)
| | - James G. Fox
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; (C.G.K.); (J.G.F.)
| | - Natalia Y. Tretyakova
- Department of Medicinal Chemistry, the Masonic Cancer Center, University of Minnesota, Minneapolis, MN 55455, USA;
| | - Timothy J. Griffin
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA; (A.T.R.); (Q.H.); (S.M.); (P.K.); (P.D.J.)
- Correspondence:
| |
Collapse
|
5
|
Cappello F, Fais S. Extracellular vesicles in cancer pros and cons: the importance of the evidence-based medicine. Semin Cancer Biol 2022; 86:4-12. [DOI: 10.1016/j.semcancer.2022.01.011] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2021] [Revised: 01/18/2022] [Accepted: 01/28/2022] [Indexed: 12/17/2022]
|
6
|
Qi F, Tan Y, Yao A, Yang X, He Y. Psoriasis to Psoriatic Arthritis: The Application of Proteomics Technologies. Front Med (Lausanne) 2021; 8:681172. [PMID: 34869404 PMCID: PMC8635007 DOI: 10.3389/fmed.2021.681172] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2021] [Accepted: 10/18/2021] [Indexed: 11/13/2022] Open
Abstract
Psoriatic disease (PsD) is a spectrum of diseases that affect both skin [cutaneous psoriasis (PsC)] and musculoskeletal features [psoriatic arthritis (PsA)]. A considerable number of patients with PsC have asymptomatic synovio-entheseal inflammations, and approximately one-third of those eventually progress to PsA with an enigmatic mechanism. Published studies have shown that early interventions to the very early-stage PsA would effectively prevent substantial bone destructions or deformities, suggesting an unmet goal for exploring early PsA biomarkers. The emergence of proteomics technologies brings a complete view of all involved proteins in PsA transitions, offers a unique chance to map all potential peptides, and allows a direct head-to-head comparison of interaction pathways in PsC and PsA. This review summarized the latest development of proteomics technologies, highlighted its application in PsA biomarker discovery, and discussed the possible clinical detectable PsA risk factors in patients with PsC.
Collapse
Affiliation(s)
- Fei Qi
- Department of Dermatology, Capital Medical University Affiliated Beijing Chaoyang Hospital, Beijing, China
| | - Yaqi Tan
- Department of Dermatology, Capital Medical University Affiliated Beijing Chaoyang Hospital, Beijing, China
| | - Amin Yao
- Department of Dermatology, Capital Medical University Affiliated Beijing Chaoyang Hospital, Beijing, China
| | - Xutong Yang
- Department of Dermatology, Capital Medical University Affiliated Beijing Chaoyang Hospital, Beijing, China
| | - Yanling He
- Department of Dermatology, Capital Medical University Affiliated Beijing Chaoyang Hospital, Beijing, China
| |
Collapse
|
7
|
Babu N, Bhat MY, John AE, Chatterjee A. The role of proteomics in the multiplexed analysis of gene alterations in human cancer. Expert Rev Proteomics 2021; 18:737-756. [PMID: 34602018 DOI: 10.1080/14789450.2021.1984884] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Abstract
INTRODUCTION Proteomics has played a pivotal role in identifying proteins perturbed in disease conditions when compared with healthy samples. Study of dysregulated proteins aids in identifying diagnostic markers and potential therapeutic targets. Cancer is an outcome of interplay of several such disarrayed proteins and molecular pathways which perturb cellular homeostasis, resulting in transformation. In this review, we discuss various facets of proteomic approaches, including tools and technological advancements, aiding in understanding differentially expressed molecules and signaling mechanisms. AREAS COVERED In this review, we have taken the approach of documenting the different methods of proteomic studies, ranging from labeling techniques, data analysis methods, and the nature of molecule detected. We summarize each technique and provide a glimpse of cancer research carried out using them, highlighting the advantages and drawbacks in comparison with others. Literature search using online resources, such as PubMed and Google Scholar were carried out for this approach. EXPERT OPINION Technological advancements in proteomics studies have come a long way from the study of two-dimensional mapping of proteins separated on gels in the early 1970s. Higher precision in molecular identification and quantification (high throughput), and greater number of samples analyzed have been the focus of researchers.
Collapse
Affiliation(s)
- Niraj Babu
- Institute of Bioinformatics, International Technology Park, Bangalore, Bangalore, 560066, India.,Manipal Academy of Higher Education (MAHE), Manipal, India
| | - Mohd Younis Bhat
- Institute of Bioinformatics, International Technology Park, Bangalore, Bangalore, 560066, India
| | | | - Aditi Chatterjee
- Institute of Bioinformatics, International Technology Park, Bangalore, Bangalore, 560066, India.,Manipal Academy of Higher Education (MAHE), Manipal, India
| |
Collapse
|
8
|
Chen X, Sun Y, Zhang T, Shu L, Roepstorff P, Yang F. Quantitative Proteomics Using Isobaric Labeling: A Practical Guide. GENOMICS, PROTEOMICS & BIOINFORMATICS 2021; 19:689-706. [PMID: 35007772 PMCID: PMC9170757 DOI: 10.1016/j.gpb.2021.08.012] [Citation(s) in RCA: 40] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/12/2020] [Revised: 05/19/2021] [Accepted: 09/27/2021] [Indexed: 01/09/2023]
Abstract
In the past decade, relative proteomic quantification using isobaric labeling technology has developed into a key tool for comparing the expression of proteins in biological samples. Although its multiplexing capacity and flexibility make this a valuable technology for addressing various biological questions, its quantitative accuracy and precision still pose significant challenges to the reliability of its quantification results. Here, we give a detailed overview of the different kinds of isobaric mass tags and the advantages and disadvantages of the isobaric labeling method. We also discuss which precautions should be taken at each step of the isobaric labeling workflow, to obtain reliable quantification results in large-scale quantitative proteomics experiments. In the last section, we discuss the broad applications of the isobaric labeling technology in biological and clinical studies, with an emphasis on thermal proteome profiling and proteogenomics.
Collapse
Affiliation(s)
- Xiulan Chen
- Key Laboratory of Protein and Peptide Pharmaceuticals & Laboratory of Proteomics, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100149, China.
| | - Yaping Sun
- Key Laboratory of Protein and Peptide Pharmaceuticals & Laboratory of Proteomics, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100149, China
| | - Tingting Zhang
- Key Laboratory of Protein and Peptide Pharmaceuticals & Laboratory of Proteomics, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100149, China
| | - Lian Shu
- Key Laboratory of Protein and Peptide Pharmaceuticals & Laboratory of Proteomics, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100149, China
| | - Peter Roepstorff
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, Campusvej 55, DK-5230 Odense M, Denmark
| | - Fuquan Yang
- Key Laboratory of Protein and Peptide Pharmaceuticals & Laboratory of Proteomics, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100149, China.
| |
Collapse
|
9
|
Scull KE, Pandey K, Ramarathinam SH, Purcell AW. Immunopeptidogenomics: Harnessing RNA-Seq to Illuminate the Dark Immunopeptidome. Mol Cell Proteomics 2021; 20:100143. [PMID: 34509645 PMCID: PMC8724885 DOI: 10.1016/j.mcpro.2021.100143] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2021] [Revised: 08/10/2021] [Accepted: 08/24/2021] [Indexed: 01/08/2023] Open
Abstract
Human leukocyte antigen (HLA) molecules are cell-surface glycoproteins that present peptide antigens on the cell surface for surveillance by T lymphocytes, which contemporaneously seek signs of disease. Mass spectrometric analysis allows us to identify large numbers of these peptides (the immunopeptidome) following affinity purification of solubilized HLA-peptide complexes. However, in recent years, there has been a growing awareness of the "dark side" of the immunopeptidome: unconventional peptide epitopes, including neoepitopes, which elude detection by conventional search methods because their sequences are not present in reference protein databases (DBs). Here, we establish a bioinformatics workflow to aid identification of peptides generated by noncanonical translation of mRNA or by genome variants. The workflow incorporates both standard transcriptomics software and novel computer programs to produce cell line-specific protein DBs based on three-frame translation of the transcriptome. The final protein DB also includes sequences resulting from variants determined by variant calling on the same RNA-Seq data. We then searched our experimental data against both transcriptome-based and standard DBs using PEAKS Studio (Bioinformatics Solutions, Inc). Finally, further novel software helps to compare the various result sets arising for each sample, pinpoint putative genomic origins for unconventional sequences, and highlight potential neoepitopes. We applied the workflow to study the immunopeptidome of the acute myeloid leukemia cell line THP-1, using RNA-Seq and immunopeptidome data. We confidently identified over 14,000 peptides from three replicates of purified HLA peptides derived from THP-1 cells using the conventional UniProt human proteome. Using the transcriptome-based DB generated using our workflow, we recapitulated >85% of these and also identified 1029 unconventional peptides not explained by UniProt, including 16 sequences caused by nonsynonymous variants. Our workflow, which we term "immunopeptidogenomics," can provide DBs, which include pertinent unconventional sequences and allow neoepitope discovery, without becoming too large to search. Immunopeptidogenomics is a step toward unbiased search approaches that are needed to illuminate the dark side of the immunopeptidome.
Collapse
Affiliation(s)
- Katherine E Scull
- Department of Biochemistry and Molecular Biology and Infection and Immunity Program, Monash Biomedicine Discovery Institute, Monash University, Clayton, Victoria, Australia
| | - Kirti Pandey
- Department of Biochemistry and Molecular Biology and Infection and Immunity Program, Monash Biomedicine Discovery Institute, Monash University, Clayton, Victoria, Australia
| | - Sri H Ramarathinam
- Department of Biochemistry and Molecular Biology and Infection and Immunity Program, Monash Biomedicine Discovery Institute, Monash University, Clayton, Victoria, Australia.
| | - Anthony W Purcell
- Department of Biochemistry and Molecular Biology and Infection and Immunity Program, Monash Biomedicine Discovery Institute, Monash University, Clayton, Victoria, Australia.
| |
Collapse
|
10
|
Halperin RF, Hegde A, Lang JD, Raupach EA, Legendre C, Liang WS, LoRusso PM, Sekulic A, Sosman JA, Trent JM, Rangasamy S, Pirrotte P, Schork NJ. Improved methods for RNAseq-based alternative splicing analysis. Sci Rep 2021; 11:10740. [PMID: 34031440 PMCID: PMC8144374 DOI: 10.1038/s41598-021-89938-2] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2020] [Accepted: 04/13/2021] [Indexed: 01/04/2023] Open
Abstract
The robust detection of disease-associated splice events from RNAseq data is challenging due to the potential confounding effect of gene expression levels and the often limited number of patients with relevant RNAseq data. Here we present a novel statistical approach to splicing outlier detection and differential splicing analysis. Our approach tests for differences in the percentages of sequence reads representing local splice events. We describe a software package called Bisbee which can predict the protein-level effect of splice alterations, a key feature lacking in many other splicing analysis resources. We leverage Bisbee's prediction of protein level effects as a benchmark of its capabilities using matched sets of RNAseq and mass spectrometry data from normal tissues. Bisbee exhibits improved sensitivity and specificity over existing approaches and can be used to identify tissue-specific splice variants whose protein-level expression can be confirmed by mass spectrometry. We also applied Bisbee to assess evidence for a pathogenic splicing variant contributing to a rare disease and to identify tumor-specific splice isoforms associated with an oncogenic mutation. Bisbee was able to rediscover previously validated results in both of these cases and also identify common tumor-associated splice isoforms replicated in two independent melanoma datasets.
Collapse
Affiliation(s)
- Rebecca F Halperin
- Quantitative Medicine and Systems Biology Division, Translational Genomics Research Institute, Phoenix, AZ, USA.
| | - Apurva Hegde
- Collaborative Center for Translational Mass Spectrometry, Translational Genomics Research Institute, Phoenix, AZ, USA
| | - Jessica D Lang
- Integrated Cancer Genomics Division, Translational Genomics Research Institute, Phoenix, AZ, USA
| | - Elizabeth A Raupach
- Integrated Cancer Genomics Division, Translational Genomics Research Institute, Phoenix, AZ, USA
| | - Christophe Legendre
- Integrated Cancer Genomics Division, Translational Genomics Research Institute, Phoenix, AZ, USA
| | - Winnie S Liang
- Integrated Cancer Genomics Division, Translational Genomics Research Institute, Phoenix, AZ, USA
- Neurogenomics Division, Translational Genomics Research Institute, Phoenix, AZ, USA
| | | | | | | | - Jeffrey M Trent
- Integrated Cancer Genomics Division, Translational Genomics Research Institute, Phoenix, AZ, USA
| | | | - Patrick Pirrotte
- Collaborative Center for Translational Mass Spectrometry, Translational Genomics Research Institute, Phoenix, AZ, USA
| | - Nicholas J Schork
- Quantitative Medicine and Systems Biology Division, Translational Genomics Research Institute, Phoenix, AZ, USA
| |
Collapse
|
11
|
Ferreira JA, Relvas-Santos M, Peixoto A, M N Silva A, Lara Santos L. Glycoproteogenomics: Setting the Course for Next-generation Cancer Neoantigen Discovery for Cancer Vaccines. GENOMICS, PROTEOMICS & BIOINFORMATICS 2021; 19:25-43. [PMID: 34118464 PMCID: PMC8498922 DOI: 10.1016/j.gpb.2021.03.005] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/11/2020] [Revised: 01/25/2021] [Accepted: 03/01/2021] [Indexed: 12/24/2022]
Abstract
Molecular-assisted precision oncology gained tremendous ground with high-throughput next-generation sequencing (NGS), supported by robust bioinformatics. The quest for genomics-based cancer medicine set the foundations for improved patient stratification, while unveiling a wide array of neoantigens for immunotherapy. Upfront pre-clinical and clinical studies have successfully used tumor-specific peptides in vaccines with minimal off-target effects. However, the low mutational burden presented by many lesions challenges the generalization of these solutions, requiring the diversification of neoantigen sources. Oncoproteogenomics utilizing customized databases for protein annotation by mass spectrometry (MS) is a powerful tool toward this end. Expanding the concept toward exploring proteoforms originated from post-translational modifications (PTMs) will be decisive to improve molecular subtyping and provide potentially targetable functional nodes with increased cancer specificity. Walking through the path of systems biology, we highlight that alterations in protein glycosylation at the cell surface not only have functional impact on cancer progression and dissemination but also originate unique molecular fingerprints for targeted therapeutics. Moreover, we discuss the outstanding challenges required to accommodate glycoproteomics in oncoproteogenomics platforms. We envisage that such rationale may flag a rather neglected research field, generating novel paradigms for precision oncology and immunotherapy.
Collapse
Affiliation(s)
- José Alexandre Ferreira
- Experimental Pathology and Therapeutics Group, Portuguese Institute of Oncology, Porto 4200-072, Portugal; Institute of Biomedical Sciences Abel Salazar, University of Porto, Porto 4050-313, Portugal; Porto Comprehensive Cancer Center (P.ccc), Porto 4200-072, Portugal.
| | - Marta Relvas-Santos
- Experimental Pathology and Therapeutics Group, Portuguese Institute of Oncology, Porto 4200-072, Portugal; Institute of Biomedical Sciences Abel Salazar, University of Porto, Porto 4050-313, Portugal; REQUIMTE-LAQV, Department of Chemistry and Biochemistry, Faculty of Sciences of the University of Porto, Porto 4169-007, Portugal
| | - Andreia Peixoto
- Experimental Pathology and Therapeutics Group, Portuguese Institute of Oncology, Porto 4200-072, Portugal; Institute of Biomedical Sciences Abel Salazar, University of Porto, Porto 4050-313, Portugal
| | - André M N Silva
- REQUIMTE-LAQV, Department of Chemistry and Biochemistry, Faculty of Sciences of the University of Porto, Porto 4169-007, Portugal
| | - Lúcio Lara Santos
- Experimental Pathology and Therapeutics Group, Portuguese Institute of Oncology, Porto 4200-072, Portugal; Institute of Biomedical Sciences Abel Salazar, University of Porto, Porto 4050-313, Portugal; Porto Comprehensive Cancer Center (P.ccc), Porto 4200-072, Portugal
| |
Collapse
|
12
|
Verma A, Halder A, Marathe S, Purwar R, Srivastava S. A proteogenomic approach to target neoantigens in solid tumors. Expert Rev Proteomics 2021; 17:797-812. [PMID: 33491499 DOI: 10.1080/14789450.2020.1881889] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
Abstract
INTRODUCTION Proteogenomic techniques find applications in identifying novel cancer-specific peptides called neoantigens; they are non-self peptides derived from tumor-specific non-synonymous mutations. These peptides with MHCs are recognized by the T cells and induce an antitumor response. Due to their selective expression of tumor cells, neoantigens are considered attractive targets for cancer immunotherapy. AREAS COVERED In this review, we have discussed the proteogenomic strategies to identify neoantigens. We have also provided a neoantigen identification pipeline using data from whole-exome sequencing, RNA sequencing, and MHC peptidomics. Further, we have reviewed recent tools for neoantigen discovery. EXPERT COMMENTARY The limitations in instrument sensitivity and availability of bioinformatics tools have restricted the identification of neoantigens from tumor samples. Nonetheless, the recent improvement in genome sequencing, mass spectrometry technologies, and the development of reliable algorithms for epitope prediction provide hope for efficient identification of neoantigens. Translating this workflow on patient samples would represent a massive advancement in neoantigen identification methods, leading to the constitution of novel personalized neoantigen cancer vaccines.
Collapse
Affiliation(s)
- Ayushi Verma
- Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay , Mumbai, India
| | - Ankit Halder
- Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay , Mumbai, India
| | - Soumitra Marathe
- Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay , Mumbai, India
| | - Rahul Purwar
- Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay , Mumbai, India
| | - Sanjeeva Srivastava
- Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay , Mumbai, India
| |
Collapse
|
13
|
Chen MX, Lu CC, Sun PC, Nie YX, Tian Y, Hu QJ, Das D, Hou XX, Gao B, Chen X, Liu SX, Zheng CC, Zhao XY, Dai L, Zhang J, Liu YG. Comprehensive transcriptome and proteome analyses reveal a novel sodium chloride responsive gene network in maize seed tissues during germination. PLANT, CELL & ENVIRONMENT 2021; 44:88-101. [PMID: 32677712 DOI: 10.1111/pce.13849] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/20/2020] [Revised: 04/25/2020] [Accepted: 05/12/2020] [Indexed: 05/20/2023]
Abstract
Germination is a plant developmental process by which radicle of mature seeds start to penetrate surrounding barriers for seedling establishment and multiple environmental factors have been shown to affect it. Little is known how high salinity affects seed germination of C4 plant, Zea mays. Preliminary germination assay suggested that isolated embryo alone was able to germinate under 200 mM NaCl treatment, whereas the intact seeds were highly repressed. We hypothesized that maize endosperm may function in perception and transduction of salt signal to surrounding tissues such as embryo, showing a completely different response to that in Arabidopsis. Since salt response involves ABA, we analysed in vivo ABA distribution and quantity and the result demonstrated that ABA level in isolated embryo under NaCl treatment failed to increase in comparison with the water control, suggesting that the elevation of ABA level is an endosperm dependent process. Subsequently, by using advanced profiling techniques such as RNA sequencing and SWATH-MS-based quantitative proteomics, we found substantial differences in post-transcriptional and translational changes between salt-treated embryo and endosperm. In summary, our results indicate that these regulatory mechanisms, such as alternative splicing, are likely to mediate early responses to salt stress during maize seed germination.
Collapse
Affiliation(s)
- Mo-Xian Chen
- State Key Laboratory of Crop Biology, College of Life Science, Shandong Agricultural University, Taian, China
- Southern Regional Collaborative Innovation Centre for Grain and Oil Crops in China, Hunan Agricultural University, Changsha, China
- CAS Key Laboratory of Quantitative Engineering Biology, Shenzhen Institute of Synthetic Biology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
| | - Chong-Chong Lu
- State Key Laboratory of Crop Biology, College of Life Science, Shandong Agricultural University, Taian, China
| | - Peng-Cheng Sun
- State Key Laboratory of Crop Biology, College of Life Science, Shandong Agricultural University, Taian, China
| | - Yong-Xin Nie
- State Key Laboratory of Crop Biology, College of Life Science, Shandong Agricultural University, Taian, China
| | - Yuan Tian
- State Key Laboratory of Crop Biology, College of Life Science, Shandong Agricultural University, Taian, China
| | - Qi-Juan Hu
- Department of Biology, Hong Kong Baptist University, Shatin, Hong Kong
- State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Shatin, Hong Kong
| | - Debatosh Das
- Department of Biology, Hong Kong Baptist University, Shatin, Hong Kong
- State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Shatin, Hong Kong
| | - Xuan-Xuan Hou
- State Key Laboratory of Crop Biology, College of Life Science, Shandong Agricultural University, Taian, China
| | - Bei Gao
- Department of Biology, Hong Kong Baptist University, Shatin, Hong Kong
- State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Shatin, Hong Kong
| | - Xi Chen
- Wuhan Institute of Biotechnology, Wuhan, China
| | - Shou-Xu Liu
- State Key Laboratory of Crop Biology, College of Life Science, Shandong Agricultural University, Taian, China
| | - Cheng-Chao Zheng
- State Key Laboratory of Crop Biology, College of Life Science, Shandong Agricultural University, Taian, China
| | - Xiang-Yu Zhao
- State Key Laboratory of Crop Biology, College of Life Science, Shandong Agricultural University, Taian, China
| | - Lei Dai
- CAS Key Laboratory of Quantitative Engineering Biology, Shenzhen Institute of Synthetic Biology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
| | - Jianhua Zhang
- State Key Laboratory of Crop Biology, College of Life Science, Shandong Agricultural University, Taian, China
- Department of Biology, Hong Kong Baptist University, Shatin, Hong Kong
| | - Ying-Gao Liu
- State Key Laboratory of Crop Biology, College of Life Science, Shandong Agricultural University, Taian, China
| |
Collapse
|
14
|
Tariq MU, Haseeb M, Aledhari M, Razzak R, Parizi RM, Saeed F. Methods for Proteogenomics Data Analysis, Challenges, and Scalability Bottlenecks: A Survey. IEEE ACCESS : PRACTICAL INNOVATIONS, OPEN SOLUTIONS 2020; 9:5497-5516. [PMID: 33537181 PMCID: PMC7853650 DOI: 10.1109/access.2020.3047588] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/17/2023]
Abstract
Big Data Proteogenomics lies at the intersection of high-throughput Mass Spectrometry (MS) based proteomics and Next Generation Sequencing based genomics. The combined and integrated analysis of these two high-throughput technologies can help discover novel proteins using genomic, and transcriptomic data. Due to the biological significance of integrated analysis, the recent past has seen an influx of proteogenomic tools that perform various tasks, including mapping proteins to the genomic data, searching experimental MS spectra against a six-frame translation genome database, and automating the process of annotating genome sequences. To date, most of such tools have not focused on scalability issues that are inherent in proteogenomic data analysis where the size of the database is much larger than a typical protein database. These state-of-the-art tools can take more than half a month to process a small-scale dataset of one million spectra against a genome of 3 GB. In this article, we provide an up-to-date review of tools that can analyze proteogenomic datasets, providing a critical analysis of the techniques' relative merits and potential pitfalls. We also point out potential bottlenecks and recommendations that can be incorporated in the future design of these workflows to ensure scalability with the increasing size of proteogenomic data. Lastly, we make a case of how high-performance computing (HPC) solutions may be the best bet to ensure the scalability of future big data proteogenomic data analysis.
Collapse
Affiliation(s)
- Muhammad Usman Tariq
- School of Computing and Information Sciences, Florida International University, Miami, FL 33199, USA
| | - Muhammad Haseeb
- School of Computing and Information Sciences, Florida International University, Miami, FL 33199, USA
| | - Mohammed Aledhari
- College of Computing and Software Engineering, Kennesaw State University, Marietta, GA 30060, USA
| | - Rehma Razzak
- College of Computing and Software Engineering, Kennesaw State University, Marietta, GA 30060, USA
| | - Reza M Parizi
- College of Computing and Software Engineering, Kennesaw State University, Marietta, GA 30060, USA
| | - Fahad Saeed
- School of Computing and Information Sciences, Florida International University, Miami, FL 33199, USA
| |
Collapse
|
15
|
Lau E, Han Y, Williams DR, Thomas CT, Shrestha R, Wu JC, Lam MPY. Splice-Junction-Based Mapping of Alternative Isoforms in the Human Proteome. Cell Rep 2020; 29:3751-3765.e5. [PMID: 31825849 PMCID: PMC6961840 DOI: 10.1016/j.celrep.2019.11.026] [Citation(s) in RCA: 41] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2019] [Revised: 09/24/2019] [Accepted: 11/06/2019] [Indexed: 12/18/2022] Open
Abstract
The protein-level translational status and function of many alternative splicing events remain poorly understood. We use an RNA sequencing (RNA-seq)-guided proteomics method to identify protein alternative splicing isoforms in the human proteome by constructing tissue-specific protein databases that prioritize transcript splice junction pairs with high translational potential. Using the custom databases to reanalyze ~80 million mass spectra in public proteomics datasets, we identify more than 1,500 noncanonical protein isoforms across 12 human tissues, including ~400 sequences undocumented on TrEMBL and RefSeq databases. We apply the method to original quantitative mass spectrometry experiments and observe widespread isoform regulation during human induced pluripotent stem cell cardiomyocyte differentiation. On a proteome scale, alternative isoform regions overlap frequently with disordered sequences and post-translational modification sites, suggesting that alternative splicing may regulate protein function through modulating intrinsically disordered regions. The described approach may help elucidate functional consequences of alternative splicing and expand the scope of proteomics investigations in various systems. The translation and function of many alternative splicing events await confirmation at the protein level. Lau et al. use an integrated proteotranscriptomics approach to identify non-canonical and undocumented isoforms from 12 organs in the human proteome. Alternative isoforms interfere with functional sequence features and are differentially regulated during iPSC cardiomyocyte differentiation.
Collapse
Affiliation(s)
- Edward Lau
- Stanford Cardiovascular Institute, Department of Medicine, Stanford University, Palo Alto, CA, USA
| | - Yu Han
- Consortium for Fibrosis Research and Translation, Anschutz Medical Campus, University of Colorado, Aurora, CO, USA; Departments of Medicine-Cardiology and Biochemistry and Molecular Genetics, Anschutz Medical Campus, University of Colorado, Aurora, CO, USA
| | - Damon R Williams
- Stanford Cardiovascular Institute, Department of Medicine, Stanford University, Palo Alto, CA, USA
| | - Cody T Thomas
- Departments of Medicine-Cardiology and Biochemistry and Molecular Genetics, Anschutz Medical Campus, University of Colorado, Aurora, CO, USA
| | - Rajani Shrestha
- Stanford Cardiovascular Institute, Department of Medicine, Stanford University, Palo Alto, CA, USA
| | - Joseph C Wu
- Stanford Cardiovascular Institute, Department of Medicine, Stanford University, Palo Alto, CA, USA; Department of Radiology, School of Medicine, Stanford University, Palo Alto, CA, USA
| | - Maggie P Y Lam
- Consortium for Fibrosis Research and Translation, Anschutz Medical Campus, University of Colorado, Aurora, CO, USA; Departments of Medicine-Cardiology and Biochemistry and Molecular Genetics, Anschutz Medical Campus, University of Colorado, Aurora, CO, USA.
| |
Collapse
|
16
|
Nerurkar SN, Goh D, Cheung CCL, Nga PQY, Lim JCT, Yeong JPS. Transcriptional Spatial Profiling of Cancer Tissues in the Era of Immunotherapy: The Potential and Promise. Cancers (Basel) 2020; 12:E2572. [PMID: 32917035 PMCID: PMC7563386 DOI: 10.3390/cancers12092572] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2020] [Revised: 09/05/2020] [Accepted: 09/06/2020] [Indexed: 12/18/2022] Open
Abstract
Intratumoral heterogeneity poses a major challenge to making an accurate diagnosis and establishing personalized treatment strategies for cancer patients. Moreover, this heterogeneity might underlie treatment resistance, disease progression, and cancer relapse. For example, while immunotherapies can confer a high success rate, selective pressures coupled with dynamic evolution within a tumour can drive the emergence of drug-resistant clones that allow tumours to persist in certain patients. To improve immunotherapy efficacy, researchers have used transcriptional spatial profiling techniques to identify and subsequently block the source of tumour heterogeneity. In this review, we describe and assess the different technologies available for such profiling within a cancer tissue. We first outline two well-known approaches, in situ hybridization and digital spatial profiling. Then, we highlight the features of an emerging technology known as Visium Spatial Gene Expression Solution. Visium generates quantitative gene expression data and maps them to the tissue architecture. By retaining spatial information, we are well positioned to identify novel biomarkers and perform computational analyses that might inform on novel combinatorial immunotherapies.
Collapse
Affiliation(s)
- Sanjna Nilesh Nerurkar
- Yong Loo Lin School of Medicine, National University of Singapore, Singapore 117597, Singapore;
| | - Denise Goh
- Institute of Molecular and Cell Biology (IMCB), Agency of Science, Technology and Research (A*STAR), Singapore 169856, Singapore; (D.G.); (P.Q.Y.N.); (J.C.T.L.)
| | | | - Pei Qi Yvonne Nga
- Institute of Molecular and Cell Biology (IMCB), Agency of Science, Technology and Research (A*STAR), Singapore 169856, Singapore; (D.G.); (P.Q.Y.N.); (J.C.T.L.)
| | - Jeffrey Chun Tatt Lim
- Institute of Molecular and Cell Biology (IMCB), Agency of Science, Technology and Research (A*STAR), Singapore 169856, Singapore; (D.G.); (P.Q.Y.N.); (J.C.T.L.)
| | - Joe Poh Sheng Yeong
- Institute of Molecular and Cell Biology (IMCB), Agency of Science, Technology and Research (A*STAR), Singapore 169856, Singapore; (D.G.); (P.Q.Y.N.); (J.C.T.L.)
- Department of Anatomical Pathology, Singapore General Hospital, Singapore 169856, Singapore
- Singapore Immunology Network (SIgN), Agency of Science, Technology and Research (A*STAR), Singapore 138648, Singapore
| |
Collapse
|
17
|
Experimental limitations of extracellular vesicle-based therapies for the treatment of myocardial infarction. Trends Cardiovasc Med 2020; 31:405-415. [PMID: 32822840 PMCID: PMC8501308 DOI: 10.1016/j.tcm.2020.08.003] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/15/2020] [Revised: 08/13/2020] [Accepted: 08/14/2020] [Indexed: 12/20/2022]
Abstract
Extracellular vesicles (EVs) are particles secreted by a vast variety of cells and are often recognised to mimic the properties of their parent cell, as such those derived from developmental sources hold promise for the treatment of various diseases including myocardial infarction (MI). Here we review the experimental approaches taken for assessing the therapeutic efficacy of EVs for MI and find overt shortcomings regarding purity of isolated EVs, quantitation, dosing, EV labelling/uptake, route of administration and use of appropriate controls that renders much of the data uninterpretable. Overall, the EV/MI field has suffered from experimental approaches that are not fully standardised or validated. Fundamental improvements in EV study design are required to improve interpretation of efficacy and to ensure reproducibility and comparability across preclinical MI studies.
Collapse
|
18
|
Waas M, Kislinger T. Addressing Cellular Heterogeneity in Cancer through Precision Proteomics. J Proteome Res 2020; 19:3607-3619. [PMID: 32697918 DOI: 10.1021/acs.jproteome.0c00338] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Cells exhibit a broad spectrum of functions driven by differences in molecular phenotype. Understanding the heterogeneity between and within cell types has led to advances in our ability to diagnose and manipulate biological systems. Heterogeneity within and between tumors still poses a challenge to the development and efficacy of therapeutics. In this Perspective we review the toolkit of protein-level experimental approaches for investigating cellular heterogeneity. We describe how innovative approaches and technical developments have supported the advent of bottom-up single-cell proteomic analysis and present opportunities and challenges within cancer research. Finally, we introduce the concept of "precision proteomics" and discuss how the advantages and limitations of various experimental approaches render them suitable for different biological systems and questions.
Collapse
|
19
|
Yang M, Petralia F, Li Z, Li H, Ma W, Song X, Kim S, Lee H, Yu H, Lee B, Bae S, Heo E, Kaczmarczyk J, Stępniak P, Warchoł M, Yu T, Calinawan AP, Boutros PC, Payne SH, Reva B, Boja E, Rodriguez H, Stolovitzky G, Guan Y, Kang J, Wang P, Fenyö D, Saez-Rodriguez J, Aderinwale T, Afyounian E, Agrawal P, Ali M, Amadoz A, Azuaje F, Bachman J, Bae S, Bhalla S, Carbonell-Caballero J, Chakraborty P, Chaudhary K, Choi Y, Choi Y, Çubuk C, Dhanda SK, Dopazo J, Elo LL, Fóthi Á, Gevaert O, Granberg K, Greiner R, Heo E, Hidalgo MR, Jayaswal V, Jeon H, Jeon M, Kalmady SV, Kambara Y, Kang J, Kang K, Kaoma T, Kaur H, Kazan H, Kesar D, Kesseli J, Kim D, Kim K, Kim SY, Kim S, Kumar S, Lee B, Lee H, Liu Y, Luethy R, Mahajan S, Mahmoudian M, Muller A, Nazarov PV, Nguyen H, Nykter M, Okuda S, Park S, Pal Singh Raghava G, Rajapakse JC, Rantapero T, Ryu H, Salavert F, Saraei S, Sharma R, Siitonen A, Sokolov A, Subramanian K, Suni V, Suomi T, Tranchevent LC, Usmani SS, Välikangas T, Vega R, Zhong H. Community Assessment of the Predictability of Cancer Protein and Phosphoprotein Levels from Genomics and Transcriptomics. Cell Syst 2020; 11:186-195.e9. [DOI: 10.1016/j.cels.2020.06.013] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2019] [Revised: 03/12/2020] [Accepted: 06/29/2020] [Indexed: 10/23/2022]
|
20
|
Macklin A, Khan S, Kislinger T. Recent advances in mass spectrometry based clinical proteomics: applications to cancer research. Clin Proteomics 2020; 17:17. [PMID: 32489335 PMCID: PMC7247207 DOI: 10.1186/s12014-020-09283-w] [Citation(s) in RCA: 150] [Impact Index Per Article: 37.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2019] [Accepted: 05/15/2020] [Indexed: 02/07/2023] Open
Abstract
Cancer biomarkers have transformed current practices in the oncology clinic. Continued discovery and validation are crucial for improving early diagnosis, risk stratification, and monitoring patient response to treatment. Profiling of the tumour genome and transcriptome are now established tools for the discovery of novel biomarkers, but alterations in proteome expression are more likely to reflect changes in tumour pathophysiology. In the past, clinical diagnostics have strongly relied on antibody-based detection strategies, but these methods carry certain limitations. Mass spectrometry (MS) is a powerful method that enables increasingly comprehensive insights into changes of the proteome to advance personalized medicine. In this review, recent improvements in MS-based clinical proteomics are highlighted with a focus on oncology. We will provide a detailed overview of clinically relevant samples types, as well as, consideration for sample preparation methods, protein quantitation strategies, MS configurations, and data analysis pipelines currently available to researchers. Critical consideration of each step is necessary to address the pressing clinical questions that advance cancer patient diagnosis and prognosis. While the majority of studies focus on the discovery of clinically-relevant biomarkers, there is a growing demand for rigorous biomarker validation. These studies focus on high-throughput targeted MS assays and multi-centre studies with standardized protocols. Additionally, improvements in MS sensitivity are opening the door to new classes of tumour-specific proteoforms including post-translational modifications and variants originating from genomic aberrations. Overlaying proteomic data to complement genomic and transcriptomic datasets forges the growing field of proteogenomics, which shows great potential to improve our understanding of cancer biology. Overall, these advancements not only solidify MS-based clinical proteomics' integral position in cancer research, but also accelerate the shift towards becoming a regular component of routine analysis and clinical practice.
Collapse
Affiliation(s)
- Andrew Macklin
- Princess Margaret Cancer Centre, University Health Network, Toronto, Canada
| | - Shahbaz Khan
- Princess Margaret Cancer Centre, University Health Network, Toronto, Canada
| | - Thomas Kislinger
- Princess Margaret Cancer Centre, University Health Network, Toronto, Canada
- Department of Medical Biophysics, University of Toronto, Toronto, Canada
| |
Collapse
|
21
|
Wang Z, Yu K, Tan H, Wu Z, Cho JH, Han X, Sun H, Beach TG, Peng J. 27-Plex Tandem Mass Tag Mass Spectrometry for Profiling Brain Proteome in Alzheimer's Disease. Anal Chem 2020; 92:7162-7170. [PMID: 32343560 PMCID: PMC8176402 DOI: 10.1021/acs.analchem.0c00655] [Citation(s) in RCA: 69] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
Multiplexed isobaric labeling methods, such as tandem mass tags (TMT), remarkably improve the throughput of quantitative mass spectrometry. Here, we present a 27-plex TMT method coupled with two-dimensional liquid chromatography (LC/LC) for extensive peptide fractionation and high-resolution tandem mass spectrometry (MS/MS) for peptide quantification and then apply the method to profile the complex human brain proteome of Alzheimer's disease (AD). The 27-plex method combines multiplexed capacities of the 11-plex and the 16-plex TMT, as the peptides labeled by the two TMT sets display different mass and hydrophobicity, which can be well separated in LC-MS/MS. We first systematically optimized the protocol for the newly developed 16-plex TMT, including labeling reaction, desalting, and MS conditions, and then directly compared the 11-plex and 16-plex methods by analyzing the same human AD samples. Both methods yielded similar proteome coverage, analyzing >100 000 peptides in >10 000 human proteins. Furthermore, the 11-plex and 16-plex samples were mixed for a 27-plex assay, resulting in more than 8000 protein measurements within the same MS time. The 27-plex results are highly consistent with those of the individual 11-plex and 16-plex TMT analyses. We also used these proteomics data sets to compare the AD brain with the nondementia controls, discovering major AD-related proteins and revealing numerous novel protein alterations enriched in the pathways of amyloidosis, immunity, mitochondrial, and synaptic functions. Overall, our data strongly demonstrate that this new 27-plex strategy is highly feasible for routine large-scale proteomic analysis.
Collapse
Affiliation(s)
- Zhen Wang
- Departments of Structural Biology and Developmental Neurobiology, St. Jude Children's Research Hospital, Memphis, TN 38105, USA
- Contribute equally
| | - Kaiwen Yu
- Departments of Structural Biology and Developmental Neurobiology, St. Jude Children's Research Hospital, Memphis, TN 38105, USA
- Contribute equally
| | - Haiyan Tan
- Center for Proteomics and Metabolomics, St. Jude Children's Research Hospital, Memphis, TN 38105, USA
- Contribute equally
| | - Zhiping Wu
- Departments of Structural Biology and Developmental Neurobiology, St. Jude Children's Research Hospital, Memphis, TN 38105, USA
| | - Ji-Hoon Cho
- Center for Proteomics and Metabolomics, St. Jude Children's Research Hospital, Memphis, TN 38105, USA
| | - Xian Han
- Departments of Structural Biology and Developmental Neurobiology, St. Jude Children's Research Hospital, Memphis, TN 38105, USA
- University of Tennessee Health Science Center, Memphis, TN 38163, USA
| | - Huan Sun
- Departments of Structural Biology and Developmental Neurobiology, St. Jude Children's Research Hospital, Memphis, TN 38105, USA
| | - Thomas G. Beach
- Banner Sun Health Research Institute, Sun City, AZ 85351, USA
| | - Junmin Peng
- Departments of Structural Biology and Developmental Neurobiology, St. Jude Children's Research Hospital, Memphis, TN 38105, USA
- Center for Proteomics and Metabolomics, St. Jude Children's Research Hospital, Memphis, TN 38105, USA
| |
Collapse
|
22
|
Gallant J, Mouton J, Ummels R, Ten Hagen-Jongman C, Kriel N, Pain A, Warren RM, Bitter W, Heunis T, Sampson SL. Identification of gene fusion events in Mycobacterium tuberculosis that encode chimeric proteins. NAR Genom Bioinform 2020; 2:lqaa033. [PMID: 33575588 PMCID: PMC7671302 DOI: 10.1093/nargab/lqaa033] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2020] [Revised: 04/16/2020] [Accepted: 05/05/2020] [Indexed: 02/07/2023] Open
Abstract
Mycobacterium tuberculosis is a facultative intracellular pathogen responsible for causing tuberculosis. The harsh environment in which M. tuberculosis survives requires this pathogen to continuously adapt in order to maintain an evolutionary advantage. However, the apparent absence of horizontal gene transfer in M. tuberculosis imposes restrictions in the ways by which evolution can occur. Large-scale changes in the genome can be introduced through genome reduction, recombination events and structural variation. Here, we identify a functional chimeric protein in the ppe38-71 locus, the absence of which is known to have an impact on protein secretion and virulence. To examine whether this approach was used more often by this pathogen, we further develop software that detects potential gene fusion events from multigene deletions using whole genome sequencing data. With this software we could identify a number of other putative gene fusion events within the genomes of M. tuberculosis isolates. We were able to demonstrate the expression of one of these gene fusions at the protein level using mass spectrometry. Therefore, gene fusions may provide an additional means of evolution for M. tuberculosis in its natural environment whereby novel chimeric proteins and functions can arise.
Collapse
Affiliation(s)
- James Gallant
- DST/NRF Centre of Excellence for Biomedical Tuberculosis Research, South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Department of Biomedical Science, Faculty of Medicine and Health Science, Stellenbosch University, Tygerberg, Cape Town 7505, South Africa.,Section of Molecular Microbiology, Amsterdam Institute for Molecules, Medicines and Systems, Vrije Universiteit Amsterdam, 1081 HZ Amsterdam, The Netherlands
| | - Jomien Mouton
- DST/NRF Centre of Excellence for Biomedical Tuberculosis Research, South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Department of Biomedical Science, Faculty of Medicine and Health Science, Stellenbosch University, Tygerberg, Cape Town 7505, South Africa
| | - Roy Ummels
- Medical Microbiology and Infection Control, Vrije Universiteit Amsterdam, Amsterdam UMC, 1081 HZ Amsterdam, The Netherlands
| | - Corinne Ten Hagen-Jongman
- Section of Molecular Microbiology, Amsterdam Institute for Molecules, Medicines and Systems, Vrije Universiteit Amsterdam, 1081 HZ Amsterdam, The Netherlands
| | - Nastassja Kriel
- DST/NRF Centre of Excellence for Biomedical Tuberculosis Research, South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Department of Biomedical Science, Faculty of Medicine and Health Science, Stellenbosch University, Tygerberg, Cape Town 7505, South Africa
| | - Arnab Pain
- Biological and Environmental Sciences and Engineering (BESE) Division, King Abdullah University of Science and Technology, Thuwal 23955-6900, Kingdom of Saudi Arabia.,Global Station for Zoonosis Control, GI-CoRE, Hokkaido University, 001-0020, N20 W10 Kita-ku, Sapporo, Japan
| | - Robin M Warren
- DST/NRF Centre of Excellence for Biomedical Tuberculosis Research, South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Department of Biomedical Science, Faculty of Medicine and Health Science, Stellenbosch University, Tygerberg, Cape Town 7505, South Africa
| | - Wilbert Bitter
- Section of Molecular Microbiology, Amsterdam Institute for Molecules, Medicines and Systems, Vrije Universiteit Amsterdam, 1081 HZ Amsterdam, The Netherlands.,Medical Microbiology and Infection Control, Vrije Universiteit Amsterdam, Amsterdam UMC, 1081 HZ Amsterdam, The Netherlands
| | - Tiaan Heunis
- DST/NRF Centre of Excellence for Biomedical Tuberculosis Research, South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Department of Biomedical Science, Faculty of Medicine and Health Science, Stellenbosch University, Tygerberg, Cape Town 7505, South Africa.,Biosciences Institute, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne NE2 4HH, UK
| | - Samantha L Sampson
- DST/NRF Centre of Excellence for Biomedical Tuberculosis Research, South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Department of Biomedical Science, Faculty of Medicine and Health Science, Stellenbosch University, Tygerberg, Cape Town 7505, South Africa
| |
Collapse
|
23
|
Li Y, Wang G, Tan X, Ouyang J, Zhang M, Song X, Liu Q, Leng Q, Chen L, Xie L. ProGeo-neo: a customized proteogenomic workflow for neoantigen prediction and selection. BMC Med Genomics 2020; 13:52. [PMID: 32241270 PMCID: PMC7118832 DOI: 10.1186/s12920-020-0683-4] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND Neoantigens can be differentially recognized by T cell receptor (TCR) as these sequences are derived from mutant proteins and are unique to the tumor. The discovery of neoantigens is the first key step for tumor-specific antigen (TSA) based immunotherapy. Based on high-throughput tumor genomic analysis, each missense mutation can potentially give rise to multiple neopeptides, resulting in a vast total number, but only a small percentage of these peptides may achieve immune-dominant status with a given major histocompatibility complex (MHC) class I allele. Specific identification of immunogenic candidate neoantigens is consequently a major challenge. Currently almost all neoantigen prediction tools are based on genomics data. RESULTS Here we report the construction of proteogenomics prediction of neoantigen (ProGeo-neo) pipeline, which incorporates the following modules: mining tumor specific antigens from next-generation sequencing genomic and mRNA expression data, predicting the binding mutant peptides to class I MHC molecules by latest netMHCpan (v.4.0), verifying MHC-peptides by MaxQuant with mass spectrometry proteomics data searched against customized protein database, and checking potential immunogenicity of T-cell-recognization by additional screening methods. ProGeo-neo pipeline achieves proteogenomics strategy and the neopeptides identified were of much higher quality as compared to those identified using genomic data only. CONCLUSIONS The pipeline was constructed based on the genomics and proteomics data of Jurkat leukemia cell line but is generally applicable to other solid cancer research. With massively parallel sequencing and proteomics profiling increasing, this proteogenomics workflow should be useful for neoantigen oriented research and immunotherapy.
Collapse
Affiliation(s)
- Yuyu Li
- Key Laboratory of Quality and Safety Risk Assessment for Aquatic Products on Storage and Preservation (Shanghai), China Ministry of Agriculture; College of Food Science and Technology, Shanghai Ocean University, 999 Hu Cheng Huan Road, Shanghai, 201306, China.,Shanghai Center for Bioinformation Technology, Shanghai Academy of Science and Technology, 1278 Keyuan Road, Shanghai, 201203, China
| | - Guangzhi Wang
- Key Laboratory of Quality and Safety Risk Assessment for Aquatic Products on Storage and Preservation (Shanghai), China Ministry of Agriculture; College of Food Science and Technology, Shanghai Ocean University, 999 Hu Cheng Huan Road, Shanghai, 201306, China.,Shanghai Center for Bioinformation Technology, Shanghai Academy of Science and Technology, 1278 Keyuan Road, Shanghai, 201203, China
| | - Xiaoxiu Tan
- Shanghai Center for Bioinformation Technology, Shanghai Academy of Science and Technology, 1278 Keyuan Road, Shanghai, 201203, China
| | - Jian Ouyang
- Shanghai Center for Bioinformation Technology, Shanghai Academy of Science and Technology, 1278 Keyuan Road, Shanghai, 201203, China
| | - Menghuan Zhang
- Shanghai Center for Bioinformation Technology, Shanghai Academy of Science and Technology, 1278 Keyuan Road, Shanghai, 201203, China
| | - Xiaofeng Song
- Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing, 211106, China
| | - Qi Liu
- Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 20009, China
| | - Qibin Leng
- Affiliated Cancer Hospital & Institute of Guangzhou Medical University, 78 Heng Zhi Gang, Lu Hu Road, Guangzhou, 510095, China
| | - Lanming Chen
- Key Laboratory of Quality and Safety Risk Assessment for Aquatic Products on Storage and Preservation (Shanghai), China Ministry of Agriculture; College of Food Science and Technology, Shanghai Ocean University, 999 Hu Cheng Huan Road, Shanghai, 201306, China.
| | - Lu Xie
- Shanghai Center for Bioinformation Technology, Shanghai Academy of Science and Technology, 1278 Keyuan Road, Shanghai, 201203, China.
| |
Collapse
|
24
|
Cantor DI, Cheruku HR, Westacott J, Shin JS, Mohamedali A, Ahn SB. Proteomic investigations into resistance in colorectal cancer. Expert Rev Proteomics 2020; 17:49-65. [PMID: 31914823 DOI: 10.1080/14789450.2020.1713103] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Introduction: Despite advances in screening and treatment options, colorectal cancer (CRC) remains one of the most prevalent and lethal cancer subtypes. Resistance to cytotoxic or targeted therapy has remained a constant challenge to the treatment and long-term management of patients, attracting intense worldwide investigation since the 1950s. Through extensive investigations into the proteomic mechanisms and functions that convey resistance to therapy/s, researchers have become able to implicate alterations in several signaling pathways that provide and sustain resistance to treatment.Areas covered: In this review, we summarize how protein alterations are associated with resistance to therapy, with particular emphasis on CRC. An overview of the mechanisms of therapeutic resistance is described, highlighting recent studies which endeavor to elucidate the proteomic changes that are associated with the acquisition and promulgation of therapeutic resistance.Expert opinion: While cancers such as CRC have been intensively studied for decades, unresponsiveness and the resistance to therapy remain critical obstacles in the treatment of patients. Due to the inherent biological and clinical heterogeneity of individual CRCs, proteomic methods stand to become powerful tools to provide biological insights that may guide therapeutic strategies with the ultimate goal of refining emergent immunotherapeutic treatments.
Collapse
Affiliation(s)
- David I Cantor
- Australian Proteome Analysis Facility, Macquarie University, Sydney, Australia
| | | | - Jack Westacott
- Faculty of Science and Engineering, Macquarie University, Sydney, Australia
| | - Joo-Shik Shin
- Tissue Pathology and Diagnostic Oncology, Royal Prince Alfred Hospital and Faculty of Medicine, University of Sydney, Sydney, Australia
| | - Abidali Mohamedali
- Faculty of Science and Engineering, Macquarie University, Sydney, Australia
| | - Seong Boem Ahn
- Faculty of Health and Medical Sciences, Macquarie University, Sydney, Australia
| |
Collapse
|
25
|
Ang MY, Low TY, Lee PY, Wan Mohamad Nazarie WF, Guryev V, Jamal R. Proteogenomics: From next-generation sequencing (NGS) and mass spectrometry-based proteomics to precision medicine. Clin Chim Acta 2019; 498:38-46. [DOI: 10.1016/j.cca.2019.08.010] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2019] [Revised: 08/13/2019] [Accepted: 08/13/2019] [Indexed: 12/14/2022]
|
26
|
Paunovska K, Loughrey D, Sago CD, Langer R, Dahlman JE. Using Large Datasets to Understand Nanotechnology. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2019; 31:e1902798. [PMID: 31429126 PMCID: PMC6810779 DOI: 10.1002/adma.201902798] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/02/2019] [Revised: 06/24/2019] [Indexed: 05/02/2023]
Abstract
Advances in sequencing technologies have made studying biological processes with genomics, transcriptomics, and proteomics commonplace. As a result, this suite of increasingly integrated techniques is well positioned to study drug delivery, a process that is influenced by many biomolecules working in concert. Omics-based approaches can be used to study the vast nanomaterial chemical space as well as the biological factors that affect the safety, toxicity, and efficacy of nanotechnologies. The generation and analysis of large datasets, methods to interpret them, and dataset applications to nanomaterials to date, are demonstrated here. Finally, new approaches for how sequencing-generated datasets can answer fundamental questions in nanotechnology based drug delivery are proposed.
Collapse
Affiliation(s)
- Kalina Paunovska
- Wallace H. Coulter Department of Biomedical Engineering, Georgia Institute of Technology and Emory School of Medicine, Atlanta, GA, 30332, USA
| | - David Loughrey
- Wallace H. Coulter Department of Biomedical Engineering, Georgia Institute of Technology and Emory School of Medicine, Atlanta, GA, 30332, USA
| | - Cory D Sago
- Wallace H. Coulter Department of Biomedical Engineering, Georgia Institute of Technology and Emory School of Medicine, Atlanta, GA, 30332, USA
| | - Robert Langer
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
| | - James E Dahlman
- Wallace H. Coulter Department of Biomedical Engineering, Georgia Institute of Technology and Emory School of Medicine, Atlanta, GA, 30332, USA
| |
Collapse
|
27
|
Prakash A, Majumder S, Ahmad S, Varkey M, Anish TA, Jenkins C, Rigby M, Orsburn B. Detection and verification of 2.3 million cancer mutations in NCI60 cancer cell lines with a cloud search engine. J Proteomics 2019; 209:103488. [PMID: 31445215 DOI: 10.1016/j.jprot.2019.103488] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2019] [Revised: 07/25/2019] [Accepted: 08/12/2019] [Indexed: 12/13/2022]
Abstract
Today we have unprecedented access to human genomic and proteomic data that appear to be rapidly approaching our current understanding of comprehensive coverage. Combining genomic information with shotgun proteomics remains challenging due to the large increase in proteomics search space. However, making this connection between genomic and proteomic information is critical for cancer studies to vaccine development. Furthermore, as we progress towards personalized medicine, it will be essential for proteomics analysis to identify individual mutations and variants in order to fully understand protein networks and to develop personalized therapies. While these advantages are well-established, only a few studies have demonstrated the successful integration of proteomic data with large genomic input. We present and examine the abilities of Bolt, a new cloud-based proteomics search engine to search for the presence of over 2.3 million known cancer mutations in a matter of minutes while still performing a standard proteomics search that includes 31 post translational modifications. We use previously published proteomics data sets and identify mutations that are verified using genomic studies as well as previous proteomics efforts. Our results also emphasize the need to search for mutations in a comprehensive manner while still searching for both common and rare PTMs. SIGNIFICANCE: We present and examine the abilities of Bolt, a new cloud-based proteomics search engine to search for the presence of over 2.3 million known cancer mutations in a matter of minutes while still performing a standard proteomics search that includes 31 post translational modifications. No other proteomics search software can do so.
Collapse
Affiliation(s)
- Amol Prakash
- Optys Tech Corporation, Shrewsbury, MA, United States of America.
| | | | - Shadab Ahmad
- Optys Tech Corporation, Shrewsbury, MA, United States of America
| | - Manu Varkey
- Optys Tech Corporation, Shrewsbury, MA, United States of America
| | - T A Anish
- Optys Tech Corporation, Shrewsbury, MA, United States of America
| | - Conor Jenkins
- Proteomic und Genomic Sciences, Baltimore, MD 21214, United States of America
| | - Megan Rigby
- Hood College Department of Biology, Frederick, MD 21701, United States of America
| | - Benjamin Orsburn
- Proteomic und Genomic Sciences, Baltimore, MD 21214, United States of America
| |
Collapse
|
28
|
Zografos E, Anagnostopoulos AK, Papadopoulou A, Legaki E, Zagouri F, Marinos E, Tsangaris GT, Gazouli M. Serum Proteomic Signatures of Male Breast Cancer. Cancer Genomics Proteomics 2019; 16:129-137. [PMID: 30850364 DOI: 10.21873/cgp.20118] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2018] [Revised: 01/12/2019] [Accepted: 01/16/2019] [Indexed: 12/17/2022] Open
Abstract
BACKGROUND To date, the elucidation of serum protein alterations in male breast cancer (MBC) has not been extensively studied, due to the rarity of the disease. MATERIALS AND METHODS In the present work, two-dimensional gel electrophoresis (2-DE) and matrix-assisted laser desorption ionization-time of flight mass spectrometry (MALDI-TOF MS) were employed to detect differences in serum protein expression between patients with MBC and healthy controls. RESULTS A panel of differentially expressed serum proteins was identified, including proteins involved in the regulation of the cell cycle [e.g. cell division cycle 7-related protein kinase (CDC7)], in mitochondrial function [e.g. mitochondrial aldehyde dehydrogenase (ALDH2) and dimethyladenosine transferase 1 (TFB1M)], in lipid metabolism and transport [e.g. apolipoprotein A-I (APOA1) and E (APOE)], in apoptosis and immune response [e.g. CD5 antigen-like (CD5L), clusterin (CLUS) and C-C motif chemokine 14 (CCL14)], in transcription (e.g. protein SSX3 and signal transducer and activator of transcription 3 (STAT3)], in invasion and metastasis (e.g. alpha-2-HS-glycoprotein (FETUA)], in estrogen synthesis [aromatase (CYP19A1)] and other diverse biological roles [e.g. actin-related protein 2/3 complex subunit 4 (ARPC4), dual specificity mitogen-activated protein kinase kinase 4 (MP2K4), ectoderm-neural cortex protein 1 (ENC1), and matrix metalloproteinase-27 (MMP27)]. CONCLUSION These findings provide valuable insight into the distinct clinicopathological features of MBC and indicate that select serum proteomic markers may help improve MBC management.
Collapse
Affiliation(s)
- Eleni Zografos
- Department of Basic Medical Sciences, Laboratory of Biology, School of Medicine, National and Kapodistrian University of Athens, Athens, Greece
| | - Athanasios K Anagnostopoulos
- Proteomics Research Unit, Center of Basic Research II, Biomedical Research Foundation of the Academy of Athens, Athens, Greece
| | - Aggeliki Papadopoulou
- Proteomics Research Unit, Center of Basic Research II, Biomedical Research Foundation of the Academy of Athens, Athens, Greece
| | - Evangelia Legaki
- Department of Basic Medical Sciences, Laboratory of Biology, School of Medicine, National and Kapodistrian University of Athens, Athens, Greece
| | - Flora Zagouri
- Department of Clinical Therapeutics, Alexandra Hospital, School of Medicine, National and Kapodistrian University of Athens, Athens, Greece
| | - Evangelos Marinos
- Department of Basic Medical Sciences, Laboratory of Biology, School of Medicine, National and Kapodistrian University of Athens, Athens, Greece
| | - George T Tsangaris
- Proteomics Research Unit, Center of Basic Research II, Biomedical Research Foundation of the Academy of Athens, Athens, Greece
| | - Maria Gazouli
- Department of Basic Medical Sciences, Laboratory of Biology, School of Medicine, National and Kapodistrian University of Athens, Athens, Greece
| |
Collapse
|
29
|
He Y, Mohamedali A, Huang C, Baker MS, Nice EC. Oncoproteomics: Current status and future opportunities. Clin Chim Acta 2019; 495:611-624. [PMID: 31176645 DOI: 10.1016/j.cca.2019.06.006] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2019] [Revised: 06/05/2019] [Accepted: 06/05/2019] [Indexed: 02/07/2023]
Abstract
Oncoproteomics is the systematic study of cancer samples using omics technologies to detect changes implicated in tumorigenesis. Recent progress in oncoproteomics is already opening new avenues for the identification of novel biomarkers for early clinical stage cancer detection, targeted molecular therapies, disease monitoring, and drug development. Such information will lead to new understandings of cancer biology and impact dramatically on the future care of cancer patients. In this review, we will summarize the advantages and limitations of the key technologies used in (onco)proteogenomics, (the Omics Pipeline), explain how they can assist us in understanding the biology behind the overarching "Hallmarks of Cancer", discuss how they can advance the development of precision/personalised medicine and the future directions in the field.
Collapse
Affiliation(s)
- Yujia He
- West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu 610041, PR China
| | - Abidali Mohamedali
- Department of Molecular Sciences, Faculty of Science and Engineering, Macquarie University, New South Wales 2109, Australia
| | - Canhua Huang
- West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu 610041, PR China
| | - Mark S Baker
- Department of Biomedical Sciences, Faculty of Medicine and Health Sciences, Macquarie University, New South Wales 2109, Australia.
| | - Edouard C Nice
- West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu 610041, PR China; Department of Biomedical Sciences, Faculty of Medicine and Health Sciences, Macquarie University, New South Wales 2109, Australia; Department of Biochemistry and Molecular Biology, Monash University, Clayton, Australia.
| |
Collapse
|
30
|
Comprehensive proteome and phosphoproteome profiling shows negligible influence of RNAlater on protein abundance and phosphorylation. Clin Proteomics 2019; 16:18. [PMID: 31049047 PMCID: PMC6482574 DOI: 10.1186/s12014-019-9239-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2018] [Accepted: 04/17/2019] [Indexed: 01/06/2023] Open
Abstract
Certain tumors such as pancreatic ductal adenocarcinoma (PDAC) are known to contain a variety of hydrolytic enzymes including RNases and proteases that may lead to degradation of RNA and proteins during sample processing. For such tumor tissues with RNA instability, RNAlater containing a high concentration of quaternary ammonium sulfates that denature RNA-hydrolyzing enzymes is often used to protect RNAs from hydrolysis. Although a few studies have been carried out to determine the effect of RNAlater on DNA and RNA, whether RNAlater influences the proteome and phosphoproteome is largely unknown. In this study we carried out a systematic and comprehensive analysis of the effect of RNAlater on the proteome and phosphoproteome using high-resolution mass spectrometry. PDAC tissues from three patients were individually pulverized and the tissue powders of each patient were divided into two portions, one of which was incubated in RNAlater at 4 °C for 24 h (RNAlater tissue) while the other was kept at - 80 °C (frozen tissue). Comprehensive quantitative profiling experiments on the RNAlater tissues and the frozen tissues resulted in the identification of 99,136 distinct peptides of 8803 protein groups and 17,345 phosphopeptides of 16,436 phosphosites. The data exhibited no significant quantitative changes in both proteins and phosphorylation between the RNAlater tissues and the frozen tissue. In addition, the phosphoproteome data showed heterogeneously activated pathways among the three patients that were not altered by RNAlater. These results indicate that the tissue preservation method using RNAlater can be effectively used on PDAC tissues for proteogenomic studies where preservation of intact DNA, RNA and proteins is prerequisite. Data from this study are available via ProteomeXchange with the identifier PXD010710.
Collapse
|
31
|
Zhang B, Whiteaker JR, Hoofnagle AN, Baird GS, Rodland KD, Paulovich AG. Clinical potential of mass spectrometry-based proteogenomics. Nat Rev Clin Oncol 2019; 16:256-268. [PMID: 30487530 PMCID: PMC6448780 DOI: 10.1038/s41571-018-0135-7] [Citation(s) in RCA: 130] [Impact Index Per Article: 26.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Cancer genomics research aims to advance personalized oncology by finding and targeting specific genetic alterations associated with cancers. In genome-driven oncology, treatments are selected for individual patients on the basis of the findings of tumour genome sequencing. This personalized approach has prolonged the survival of subsets of patients with cancer. However, many patients do not respond to the predicted therapies based on the genomic profiles of their tumours. Furthermore, studies pairing genomic and proteomic analyses of samples from the same tumours have shown that the proteome contains novel information that cannot be discerned through genomic analysis alone. This observation has led to the concept of proteogenomics, in which both types of data are leveraged for a more complete view of tumour biology that might enable patients to be more successfully matched to effective treatments than they would using genomics alone. In this Perspective, we discuss the added value of proteogenomics over the current genome-driven approach to the clinical characterization of cancers and summarize current efforts to incorporate targeted proteomic measurements based on selected/multiple reaction monitoring (SRM/MRM) mass spectrometry into the clinical laboratory to facilitate clinical proteogenomics.
Collapse
Affiliation(s)
- Bing Zhang
- Department of Molecular and Human Genetics, Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, USA
| | - Jeffrey R Whiteaker
- Clinical Research Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Andrew N Hoofnagle
- Department of Medicine, University of Washington, Seattle, WA, USA
- Department of Laboratory Medicine, University of Washington, Seattle, WA, USA
| | - Geoffrey S Baird
- Department of Laboratory Medicine, University of Washington, Seattle, WA, USA
- Department of Pathology, University of Washington, Seattle, WA, USA
| | - Karin D Rodland
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA, USA
- Department of Cell, Development and Cancer Biology, Oregon Health & Sciences University, Portland, OR, USA
| | - Amanda G Paulovich
- Clinical Research Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA.
- Division of Medical Oncology, University of Washington School of Medicine, Seattle, WA, USA.
| |
Collapse
|
32
|
Mnatsakanyan R, Shema G, Basik M, Batist G, Borchers CH, Sickmann A, Zahedi RP. Detecting post-translational modification signatures as potential biomarkers in clinical mass spectrometry. Expert Rev Proteomics 2019; 15:515-535. [PMID: 29893147 DOI: 10.1080/14789450.2018.1483340] [Citation(s) in RCA: 48] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
INTRODUCTION Numerous diseases are caused by changes in post-translational modifications (PTMs). Therefore, the number of clinical proteomics studies that include the analysis of PTMs is increasing. Combining complementary information-for example changes in protein abundance, PTM levels, with the genome and transcriptome (proteogenomics)-holds great promise for discovering important drivers and markers of disease, as variations in copy number, expression levels, or mutations without spatial/functional/isoform information is often insufficient or even misleading. Areas covered: We discuss general considerations, requirements, pitfalls, and future perspectives in applying PTM-centric proteomics to clinical samples. This includes samples obtained from a human subject, for instance (i) bodily fluids such as plasma, urine, or cerebrospinal fluid, (ii) primary cells such as reproductive cells, blood cells, and (iii) tissue samples/biopsies. Expert commentary: PTM-centric discovery proteomics can substantially contribute to the understanding of disease mechanisms by identifying signatures with potential diagnostic or even therapeutic relevance but may require coordinated efforts of interdisciplinary and eventually multi-national consortia, such as initiated in the cancer moonshot program. Additionally, robust and standardized mass spectrometry (MS) assays-particularly targeted MS, MALDI imaging, and immuno-MALDI-may be transferred to the clinic to improve patient stratification for precision medicine, and guide therapies.
Collapse
Affiliation(s)
- Ruzanna Mnatsakanyan
- a Protein Dynamics , Leibniz-Institut für Analytische Wissenschaften - ISAS - e.V , Dortmund , 44227 , Germany
| | - Gerta Shema
- a Protein Dynamics , Leibniz-Institut für Analytische Wissenschaften - ISAS - e.V , Dortmund , 44227 , Germany
| | - Mark Basik
- b Gerald Bronfman Department of Oncology , Jewish General Hospital, McGill University , Montreal , Quebec H4A 3T2 , Canada
| | - Gerald Batist
- b Gerald Bronfman Department of Oncology , Jewish General Hospital, McGill University , Montreal , Quebec H4A 3T2 , Canada
| | - Christoph H Borchers
- b Gerald Bronfman Department of Oncology , Jewish General Hospital, McGill University , Montreal , Quebec H4A 3T2 , Canada.,c University of Victoria-Genome British Columbia Proteomics Centre, University of Victoria , Victoria , British Columbia V8Z 7X8 , Canada.,d Department of Biochemistry and Microbiology , University of Victoria , Victoria , British Columbia , V8P 5C2 , Canada.,e Segal Cancer Proteomics Centre, Lady Davis Institute, Jewish General Hospital, McGill University , Montreal , Quebec H3T 1E2 , Canada
| | - Albert Sickmann
- a Protein Dynamics , Leibniz-Institut für Analytische Wissenschaften - ISAS - e.V , Dortmund , 44227 , Germany.,f Medizinische Fakultät, Medizinische Proteom-Center (MPC), Ruhr-Universität Bochum , 44801 Bochum , Germany.,g Department of Chemistry , College of Physical Sciences, University of Aberdeen , Aberdeen AB24 3FX , Scotland , United Kingdom
| | - René P Zahedi
- a Protein Dynamics , Leibniz-Institut für Analytische Wissenschaften - ISAS - e.V , Dortmund , 44227 , Germany.,b Gerald Bronfman Department of Oncology , Jewish General Hospital, McGill University , Montreal , Quebec H4A 3T2 , Canada.,e Segal Cancer Proteomics Centre, Lady Davis Institute, Jewish General Hospital, McGill University , Montreal , Quebec H3T 1E2 , Canada
| |
Collapse
|
33
|
Abstract
Breaking down the silos between disciplines to accelerate the pace of cancer research is a key paradigm for the Cancer Moonshot. Molecular analyses of cancer biology have tended to segregate between a focus on nucleic acids-DNA, RNA, and their modifications-and a focus on proteins and protein function. Proteogenomics represents a fusion of those two approaches, leveraging the strengths of each to provide a more integrated vision of the flow of information from DNA to RNA to protein and eventually function at the molecular level. Proteogenomic studies have been incorporated into multiple activities associated with the Cancer Moonshot, demonstrating substantial added value. Innovative study designs integrating genomic, transcriptomic, and proteomic data, particularly those using clinically relevant samples and involving clinical trials, are poised to provide new insights regarding cancer risk, progression, and response to therapy.
Collapse
|
34
|
Lorentzian A, Uzozie A, Lange PF. Origins and clinical relevance of proteoforms in pediatric malignancies. Expert Rev Proteomics 2019; 16:185-200. [PMID: 30700156 DOI: 10.1080/14789450.2019.1575206] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
INTRODUCTION Cancer changes the proteome in complex ways that reach well beyond simple changes in protein abundance. Genomic and transcriptional variations and post-translational protein modification create functional variants of a protein, known as proteoforms. Childhood cancers have fewer genomic alterations but show equally dramatic phenotypic changes as malignant cells in adults. Therefore, unraveling the complexities of the proteome is even more important in pediatric malignancies. Areas covered: In this review, the biological origins of proteoforms and technological advancements in the study of proteoforms are discussed. Particular emphasis is given to their implication in childhood malignancies and the critical role of cancer-specific proteoforms for the next generation of cancer therapies and diagnostics. Expert opinion: Recent advancements in technology have led to a better understanding of the underlying mechanisms of tumorigenesis. This has been critical for the development of more effective and less harmful treatments that are based on direct targeting of altered proteins and deregulated pathways. As proteome coverage and the ability to detect complex proteoforms increase, the most need for change is in data compilation and database availability to mediate high-level data analysis and allow for better functional annotation of proteoforms.
Collapse
Affiliation(s)
- Amanda Lorentzian
- a Department of Cell and Developmental Biology , University of British Columbia , Vancouver , BC , Canada.,b Michael Cuccione Childhood Cancer Research Program , BC Children's Hospital Research Institute , Vancouver , BC , Canada
| | - Anuli Uzozie
- b Michael Cuccione Childhood Cancer Research Program , BC Children's Hospital Research Institute , Vancouver , BC , Canada.,c Department of Pathology and Laboratory Medicine , University of British Columbia , Vancouver , BC , Canada
| | - Philipp F Lange
- a Department of Cell and Developmental Biology , University of British Columbia , Vancouver , BC , Canada.,b Michael Cuccione Childhood Cancer Research Program , BC Children's Hospital Research Institute , Vancouver , BC , Canada.,c Department of Pathology and Laboratory Medicine , University of British Columbia , Vancouver , BC , Canada
| |
Collapse
|
35
|
Chen MX, Zhu FY, Wang FZ, Ye NH, Gao B, Chen X, Zhao SS, Fan T, Cao YY, Liu TY, Su ZZ, Xie LJ, Hu QJ, Wu HJ, Xiao S, Zhang J, Liu YG. Alternative splicing and translation play important roles in hypoxic germination in rice. JOURNAL OF EXPERIMENTAL BOTANY 2019; 70:817-833. [PMID: 30535157 PMCID: PMC6363088 DOI: 10.1093/jxb/ery393] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/10/2018] [Accepted: 10/27/2018] [Indexed: 05/04/2023]
Abstract
Post-transcriptional mechanisms (PTMs), including alternative splicing (AS) and alternative translation initiation (ATI), may explain the diversity of proteins involved in plant development and stress responses. Transcriptional regulation is important during the hypoxic germination of rice seeds, but the potential roles of PTMs in this process have not been characterized. We used a combination of proteomics and RNA sequencing to discover how AS and ATI contribute to plant responses to hypoxia. In total, 10 253 intron-containing genes were identified. Of these, ~1741 differentially expressed AS (DAS) events from 811 genes were identified in hypoxia-treated seeds compared with controls. Over 95% of these were not present in the list of differentially expressed genes. In particular, regulatory pathways such as the spliceosome, ribosome, endoplasmic reticulum protein processing and export, proteasome, phagosome, oxidative phosphorylation, and mRNA surveillance showed substantial AS changes under hypoxia, suggesting that AS responses are largely independent of transcriptional regulation. Considerable AS changes were identified, including the preferential usage of some non-conventional splice sites and enrichment of splicing factors in the DAS data sets. Taken together, these results not only demonstrate that AS and ATI function during hypoxic germination but they have also allowed the identification of numerous novel proteins/peptides produced via ATI.
Collapse
Affiliation(s)
- Mo-Xian Chen
- State Key Laboratory of Crop Biology, College of Life Science, Shandong Agricultural University, Taian, Shandong, China
- Co-Innovation Center for Sustainable Forestry in Southern China, College of Biology and the Environment, Nanjing Forestry University, Nanjing, China
- Shenzhen Research Institute, The Chinese University of Hong Kong, Shenzhen, China
| | - Fu-Yuan Zhu
- Co-Innovation Center for Sustainable Forestry in Southern China, College of Biology and the Environment, Nanjing Forestry University, Nanjing, China
- Shenzhen Research Institute, The Chinese University of Hong Kong, Shenzhen, China
| | - Feng-Zhu Wang
- State Key Laboratory of Biocontrol and Guangdong Provincial Key Laboratory of Plant Resources, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - Neng-Hui Ye
- Southern Regional Collaborative Innovation Center for Grain and Oil Crops in China, Hunan Agricultural University, Changsha, China
| | - Bei Gao
- School of Life Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong
| | - Xi Chen
- SpecAlly Life Technology Co., Ltd, Wuhan, China
| | - Shan-Shan Zhao
- State Key Laboratory of Biocontrol and Guangdong Provincial Key Laboratory of Plant Resources, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - Tao Fan
- State Key Laboratory of Crop Biology, College of Life Science, Shandong Agricultural University, Taian, Shandong, China
| | - Yun-Ying Cao
- School of Life Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong
- College of Life Sciences, Nantong University, Nantong, Jiangsu, China
| | - Tie-Yuan Liu
- School of Life Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong
| | - Ze-Zhuo Su
- State Key Laboratory of Biocontrol and Guangdong Provincial Key Laboratory of Plant Resources, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - Li-Juan Xie
- State Key Laboratory of Biocontrol and Guangdong Provincial Key Laboratory of Plant Resources, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - Qi-Juan Hu
- Shenzhen Research Institute, The Chinese University of Hong Kong, Shenzhen, China
| | - Hui-Jie Wu
- College of Life Sciences, Nantong University, Nantong, Jiangsu, China
| | - Shi Xiao
- State Key Laboratory of Biocontrol and Guangdong Provincial Key Laboratory of Plant Resources, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - Jianhua Zhang
- Co-Innovation Center for Sustainable Forestry in Southern China, College of Biology and the Environment, Nanjing Forestry University, Nanjing, China
- Department of Biology, Hong Kong Baptist University, and State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Shatin, Hong Kong
- Correspondence: or
| | - Ying-Gao Liu
- State Key Laboratory of Crop Biology, College of Life Science, Shandong Agricultural University, Taian, Shandong, China
- Correspondence: or
| |
Collapse
|
36
|
González-Gomariz J, Guruceaga E, López-Sánchez M, Segura V. Proteogenomics in the context of the Human Proteome Project (HPP). Expert Rev Proteomics 2019; 16:267-275. [PMID: 30654666 DOI: 10.1080/14789450.2019.1571916] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
INTRODUCTION The technological and scientific progress performed in the Human Proteome Project (HPP) has provided to the scientific community a new set of experimental and bioinformatic methods in the challenging field of shotgun and SRM/MRM-based Proteomics. The requirements for a protein to be considered experimentally validated are now well-established, and the information about the human proteome is available in the neXtProt database, while targeted proteomic assays are stored in SRMAtlas. However, the study of the missing proteins continues being an outstanding issue. Areas covered: This review is focused on the implementation of proteogenomic methods designed to improve the detection and validation of the missing proteins. The evolution of the methodological strategies based on the combination of different omic technologies and the use of huge publicly available datasets is shown taking the Chromosome 16 Consortium as reference. Expert commentary: Proteogenomics and other strategies of data analysis implemented within the C-HPP initiative could be used as guidance to complete in a near future the catalog of the human proteins. Besides, in the next years, we will probably witness their use in the B/D-HPP initiative to go a step forward on the implications of the proteins in the human biology and disease.
Collapse
Affiliation(s)
- José González-Gomariz
- a Bioinformatics Platform, Center for Applied Medical Research , University of Navarra , Pamplona , Spain.,b IdiSNA , Navarra Institute for Health Research , Pamplona , Spain
| | - Elizabeth Guruceaga
- a Bioinformatics Platform, Center for Applied Medical Research , University of Navarra , Pamplona , Spain.,b IdiSNA , Navarra Institute for Health Research , Pamplona , Spain
| | - Macarena López-Sánchez
- a Bioinformatics Platform, Center for Applied Medical Research , University of Navarra , Pamplona , Spain
| | - Victor Segura
- a Bioinformatics Platform, Center for Applied Medical Research , University of Navarra , Pamplona , Spain.,b IdiSNA , Navarra Institute for Health Research , Pamplona , Spain
| |
Collapse
|
37
|
Dimitrakopoulos L, Prassas I, Sieuwerts AM, Diamandis EP, Martens JWM, Charames GS. Proteome-wide onco-proteogenomic somatic variant identification in ER-positive breast cancer. Clin Biochem 2019; 66:63-75. [PMID: 30684468 DOI: 10.1016/j.clinbiochem.2019.01.005] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2018] [Revised: 01/14/2019] [Accepted: 01/18/2019] [Indexed: 01/19/2023]
Abstract
BACKGROUND Recent advances in mass spectrometric instrumentation and bioinformatics have critically contributed to the field of proteogenomics. Nonetheless, whether that integrative approach has reached the point of maturity to effectively reveal the flow of genetic variants from DNA to proteins still remains elusive. The objective of this study was to detect somatically acquired protein variants in breast cancer specimens for which full genome and transcriptome data was already available (BASIS cohort). METHODS LC-MS/MS shotgun proteomic results of 21 breast cancer tissues were coupled to DNA sequencing data to identify variants at the protein level and finally were used to associate protein expression with gene expression levels. RESULTS Here we report the observation of three sequencing-predicted single amino acid somatic variants. The sensitivity of single amino acid variant (SAAV) detection based on DNA sequencing-predicted single nucleotide variants was 0.4%. This sensitivity was increased to 0.6% when all the predicted variants were filtered for MS "compatibility" and was further increased to 2.9% when only proteins with at least one wild type peptide detected were taken into account. A correlation of mRNA abundance and variant peptide detection revealed that transcripts for which variant proteins were detected ranked among the top 6.3% most abundant transcripts. The variants were detected in highly abundant proteins as well, thus establishing transcript and protein abundance and MS "compatibility" as the main factors affecting variant onco-proteogenomic identification. CONCLUSIONS While proteomics fails to identify the vast majority of exome DNA variants in the resulting proteome, its ability to detect a small subset of SAAVs could prove valuable for precision medicine applications.
Collapse
Affiliation(s)
- Lampros Dimitrakopoulos
- Department of Laboratory Medicine and Pathobiology, University of Toronto, 1 King's College Circle, Toronto, ON M5S 1A8, Canada; Department of Pathology and Laboratory Medicine, Mount Sinai Hospital, Joseph and Wolf Lebovic Health Complex, 600 University Avenue, Toronto, ON M5G 1X5, Canada; Lunenfeld-Tanenbaum Research Institute, Sinai Health System, 600 University Avenue, Toronto, ON M5G 1X5, Canada
| | - Ioannis Prassas
- Department of Pathology and Laboratory Medicine, Mount Sinai Hospital, Joseph and Wolf Lebovic Health Complex, 600 University Avenue, Toronto, ON M5G 1X5, Canada; Lunenfeld-Tanenbaum Research Institute, Sinai Health System, 600 University Avenue, Toronto, ON M5G 1X5, Canada
| | - Anieta M Sieuwerts
- Department of Medical Oncology and Cancer Genomics Netherlands, Erasmus MC Cancer Institute, Erasmus University Medical Center, Wytemaweg 80, 3015 CN Rotterdam, The Netherlands
| | - Eleftherios P Diamandis
- Department of Laboratory Medicine and Pathobiology, University of Toronto, 1 King's College Circle, Toronto, ON M5S 1A8, Canada; Department of Pathology and Laboratory Medicine, Mount Sinai Hospital, Joseph and Wolf Lebovic Health Complex, 600 University Avenue, Toronto, ON M5G 1X5, Canada; Lunenfeld-Tanenbaum Research Institute, Sinai Health System, 600 University Avenue, Toronto, ON M5G 1X5, Canada; Department of Clinical Biochemistry, University Health Network, 190 Elizabeth Street, Toronto, ON M5G 2C4, Canada
| | - John W M Martens
- Department of Medical Oncology and Cancer Genomics Netherlands, Erasmus MC Cancer Institute, Erasmus University Medical Center, Wytemaweg 80, 3015 CN Rotterdam, The Netherlands.
| | - George S Charames
- Department of Laboratory Medicine and Pathobiology, University of Toronto, 1 King's College Circle, Toronto, ON M5S 1A8, Canada; Department of Pathology and Laboratory Medicine, Mount Sinai Hospital, Joseph and Wolf Lebovic Health Complex, 600 University Avenue, Toronto, ON M5G 1X5, Canada; Lunenfeld-Tanenbaum Research Institute, Sinai Health System, 600 University Avenue, Toronto, ON M5G 1X5, Canada.
| |
Collapse
|
38
|
Rinschen MM, Limbutara K, Knepper MA, Payne DM, Pisitkun T. From Molecules to Mechanisms: Functional Proteomics and Its Application to Renal Tubule Physiology. Physiol Rev 2019; 98:2571-2606. [PMID: 30182799 DOI: 10.1152/physrev.00057.2017] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
Classical physiological studies using electrophysiological, biophysical, biochemical, and molecular techniques have created a detailed picture of molecular transport, bioenergetics, contractility and movement, and growth, as well as the regulation of these processes by external stimuli in cells and organisms. Newer systems biology approaches are beginning to provide deeper and broader understanding of these complex biological processes and their dynamic responses to a variety of environmental cues. In the past decade, advances in mass spectrometry-based proteomic technologies have provided invaluable tools to further elucidate these complex cellular processes, thereby confirming, complementing, and advancing common views of physiology. As one notable example, the application of proteomics to study the regulation of kidney function has yielded novel insights into the chemical and physical processes that tightly control body fluids, electrolytes, and metabolites to provide optimal microenvironments for various cellular and organ functions. Here, we systematically review, summarize, and discuss the most significant key findings from functional proteomic studies in renal epithelial physiology. We also identify further improvements in technological and bioinformatics methods that will be essential to advance precision medicine in nephrology.
Collapse
Affiliation(s)
- Markus M Rinschen
- Department II of Internal Medicine, University Hospital Cologne , Cologne , Germany ; Center for Molecular Medicine Cologne, University of Cologne , Cologne , Germany ; Cologne Excellence Cluster on Cellular Stress Responses in Aging-Associated Diseases, University of Cologne , Cologne , Germany ; Division of Nephrology, Faculty of Medicine, Chulalongkorn University , Bangkok , Thailand ; Epithelial Systems Biology Laboratory, National Heart, Lung, and Blood Institute, National Institutes of Health , Bethesda, Maryland ; and Center of Excellence in Systems Biology, Research Affairs, Faculty of Medicine, Chulalongkorn University , Bangkok , Thailand
| | - Kavee Limbutara
- Department II of Internal Medicine, University Hospital Cologne , Cologne , Germany ; Center for Molecular Medicine Cologne, University of Cologne , Cologne , Germany ; Cologne Excellence Cluster on Cellular Stress Responses in Aging-Associated Diseases, University of Cologne , Cologne , Germany ; Division of Nephrology, Faculty of Medicine, Chulalongkorn University , Bangkok , Thailand ; Epithelial Systems Biology Laboratory, National Heart, Lung, and Blood Institute, National Institutes of Health , Bethesda, Maryland ; and Center of Excellence in Systems Biology, Research Affairs, Faculty of Medicine, Chulalongkorn University , Bangkok , Thailand
| | - Mark A Knepper
- Department II of Internal Medicine, University Hospital Cologne , Cologne , Germany ; Center for Molecular Medicine Cologne, University of Cologne , Cologne , Germany ; Cologne Excellence Cluster on Cellular Stress Responses in Aging-Associated Diseases, University of Cologne , Cologne , Germany ; Division of Nephrology, Faculty of Medicine, Chulalongkorn University , Bangkok , Thailand ; Epithelial Systems Biology Laboratory, National Heart, Lung, and Blood Institute, National Institutes of Health , Bethesda, Maryland ; and Center of Excellence in Systems Biology, Research Affairs, Faculty of Medicine, Chulalongkorn University , Bangkok , Thailand
| | - D Michael Payne
- Department II of Internal Medicine, University Hospital Cologne , Cologne , Germany ; Center for Molecular Medicine Cologne, University of Cologne , Cologne , Germany ; Cologne Excellence Cluster on Cellular Stress Responses in Aging-Associated Diseases, University of Cologne , Cologne , Germany ; Division of Nephrology, Faculty of Medicine, Chulalongkorn University , Bangkok , Thailand ; Epithelial Systems Biology Laboratory, National Heart, Lung, and Blood Institute, National Institutes of Health , Bethesda, Maryland ; and Center of Excellence in Systems Biology, Research Affairs, Faculty of Medicine, Chulalongkorn University , Bangkok , Thailand
| | - Trairak Pisitkun
- Department II of Internal Medicine, University Hospital Cologne , Cologne , Germany ; Center for Molecular Medicine Cologne, University of Cologne , Cologne , Germany ; Cologne Excellence Cluster on Cellular Stress Responses in Aging-Associated Diseases, University of Cologne , Cologne , Germany ; Division of Nephrology, Faculty of Medicine, Chulalongkorn University , Bangkok , Thailand ; Epithelial Systems Biology Laboratory, National Heart, Lung, and Blood Institute, National Institutes of Health , Bethesda, Maryland ; and Center of Excellence in Systems Biology, Research Affairs, Faculty of Medicine, Chulalongkorn University , Bangkok , Thailand
| |
Collapse
|
39
|
Hattori E, Kondo T. Current status of cancer proteogenomics: a brief introduction. ACTA ACUST UNITED AC 2019. [DOI: 10.2198/jelectroph.63.33] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Affiliation(s)
- Emi Hattori
- Division of Rare Cancer Research, National Cancer Center Research Institute
| | - Tadashi Kondo
- Division of Rare Cancer Research, National Cancer Center Research Institute
| |
Collapse
|
40
|
Low TY, Mohtar MA, Ang MY, Jamal R. Connecting Proteomics to Next‐Generation Sequencing: Proteogenomics and Its Current Applications in Biology. Proteomics 2018; 19:e1800235. [DOI: 10.1002/pmic.201800235] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2018] [Revised: 10/09/2018] [Indexed: 12/17/2022]
Affiliation(s)
- Teck Yew Low
- UKM Medical Molecular Biology Institute (UMBI)Universiti Kebangsaan Malaysia 56000 Kuala Lumpur Malaysia
| | - M. Aiman Mohtar
- UKM Medical Molecular Biology Institute (UMBI)Universiti Kebangsaan Malaysia 56000 Kuala Lumpur Malaysia
| | - Mia Yang Ang
- UKM Medical Molecular Biology Institute (UMBI)Universiti Kebangsaan Malaysia 56000 Kuala Lumpur Malaysia
| | - Rahman Jamal
- UKM Medical Molecular Biology Institute (UMBI)Universiti Kebangsaan Malaysia 56000 Kuala Lumpur Malaysia
| |
Collapse
|
41
|
Laumont CM, Vincent K, Hesnard L, Audemard É, Bonneil É, Laverdure JP, Gendron P, Courcelles M, Hardy MP, Côté C, Durette C, St-Pierre C, Benhammadi M, Lanoix J, Vobecky S, Haddad E, Lemieux S, Thibault P, Perreault C. Noncoding regions are the main source of targetable tumor-specific antigens. Sci Transl Med 2018; 10:10/470/eaau5516. [DOI: 10.1126/scitranslmed.aau5516] [Citation(s) in RCA: 256] [Impact Index Per Article: 42.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2018] [Accepted: 11/15/2018] [Indexed: 12/15/2022]
Abstract
Tumor-specific antigens (TSAs) represent ideal targets for cancer immunotherapy, but few have been identified thus far. We therefore developed a proteogenomic approach to enable the high-throughput discovery of TSAs coded by potentially all genomic regions. In two murine cancer cell lines and seven human primary tumors, we identified a total of 40 TSAs, about 90% of which derived from allegedly noncoding regions and would have been missed by standard exome-based approaches. Moreover, most of these TSAs derived from nonmutated yet aberrantly expressed transcripts (such as endogenous retroelements) that could be shared by multiple tumor types. Last, we demonstrated that, in mice, the strength of antitumor responses after TSA vaccination was influenced by two parameters that can be estimated in humans and could serve for TSA prioritization in clinical studies: TSA expression and the frequency of TSA-responsive T cells in the preimmune repertoire. In conclusion, the strategy reported herein could considerably facilitate the identification and prioritization of actionable human TSAs.
Collapse
|
42
|
Kwon OK, Jeon JM, Sung E, Na AY, Kim SJ, Lee S. Comparative Secretome Profiling and Mutant Protein Identification in Metastatic Prostate Cancer Cells by Quantitative Mass Spectrometry-based Proteomics. Cancer Genomics Proteomics 2018; 15:279-290. [PMID: 29976633 DOI: 10.21873/cgp.20086] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2018] [Revised: 06/04/2018] [Accepted: 06/06/2018] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND Secreted proteins play an important role in promoting cancer (PCa) cell migration and invasion. Proteogenomics helps elucidate the mechanism of diseases, discover therapeutic targets, and generate biomarkers for diagnosis through protein variations. MATERIALS AND METHODS We carried out mass a spectrometry-based proteomic analysis of the conditioned media (CM) from two human prostate cancer cell lines, belonging to different metastatic sites, to identify potential metastatic and/or aggressive factors. RESULTS We identified a total of 598 proteins, among which 561 were quantified based on proteomic analysis. Among the quantified proteins, 128 were up-regulated and 83 were down-regulated in DU145/PC3 cells. Six mutant peptides were identified in the CM of prostate cancer cell lines using proteogenomics approach. CONCLUSION This is the first proteogenomics study in PCa aiming at exploring a new type of metastatic factor, which are mutant peptides, predicting a novel biomarker of metastatic PCa for diagnosis, prognosis and drug targeting.
Collapse
Affiliation(s)
- Oh Kwang Kwon
- College of Pharmacy, Research Institute of Pharmaceutical Sciences, BK21 Plus KNU Multi-Omics-based Creative Drug Research Team, Kyungpook National University, Daegu, Republic of Korea
| | - Ju Mi Jeon
- College of Pharmacy, Research Institute of Pharmaceutical Sciences, BK21 Plus KNU Multi-Omics-based Creative Drug Research Team, Kyungpook National University, Daegu, Republic of Korea
| | - Eunji Sung
- College of Pharmacy, Research Institute of Pharmaceutical Sciences, BK21 Plus KNU Multi-Omics-based Creative Drug Research Team, Kyungpook National University, Daegu, Republic of Korea
| | - Ann-Yea Na
- College of Pharmacy, Research Institute of Pharmaceutical Sciences, BK21 Plus KNU Multi-Omics-based Creative Drug Research Team, Kyungpook National University, Daegu, Republic of Korea
| | - Sun Joo Kim
- College of Pharmacy, Research Institute of Pharmaceutical Sciences, BK21 Plus KNU Multi-Omics-based Creative Drug Research Team, Kyungpook National University, Daegu, Republic of Korea
| | - Sangkyu Lee
- College of Pharmacy, Research Institute of Pharmaceutical Sciences, BK21 Plus KNU Multi-Omics-based Creative Drug Research Team, Kyungpook National University, Daegu, Republic of Korea
| |
Collapse
|
43
|
Mamie Lih TS, Choong WK, Chen YJ, Sung TY. Evaluating the Possibility of Detecting Variants in Shotgun Proteomics via LeTE-Fusion Analysis Pipeline. J Proteome Res 2018; 17:2937-2952. [PMID: 30088773 DOI: 10.1021/acs.jproteome.8b00052] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
In proteogenomic studies, many genome-annotated events, for example, single amino acid variation (SAAV) and short INDEL, are often unobserved in shotgun proteomics. Therefore, we propose an analysis pipeline called LeTE-fusion (Le, peptide length; T, theoretical values; E, experimental data) to first investigate whether peptides with certain lengths are observed more often in mass spectrometry (MS)-based proteomics, which may hinder peptide identification causing difficulty in detecting genome-annotated events. By applying LeTE-fusion on different MS-based proteome data sets, we found peptides within 7-20 amino acids are more frequently identified, possibly attributed to MS-related factors instead of proteases. We then further extended the usage of LeTE-fusion on four variant-containing-sequence data sets (SAAV-only) with various sample complexity up to the whole human proteome scale, which yields theoretically ∼70% variants observable in an ideal shotgun proteomics. However, only ∼40% of variants might be detectable in real shotgun proteomic experiments when LeTE-fusion utilizes the experimentally observed variant-site-containing wild-type peptides in PeptideAtlas to estimate the expected observable coverage of variants. Finally, we conducted a case study on HEK293 cell line with variants reported at genomic level that were also identified in shotgun proteomics to demonstrate the efficacy of LeTE-fusion on estimating expected observable coverage of variants. To the best of our knowledge, this is the first study to systematically investigate the detection limits of genome-annotated events via shotgun proteomics using such analysis pipeline.
Collapse
|
44
|
Ni Y, Stingo FC, Ha MJ, Akbani R, Baladandayuthapani V. Bayesian Hierarchical Varying-sparsity Regression Models with Application to Cancer Proteogenomics. J Am Stat Assoc 2018; 114:48-60. [PMID: 31178611 PMCID: PMC6552682 DOI: 10.1080/01621459.2018.1434529] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2017] [Revised: 10/01/2017] [Indexed: 10/18/2022]
Abstract
Identifying patient-specific prognostic biomarkers is of critical importance in developing personalized treatment for clinically and molecularly heterogeneous diseases such as cancer. In this article, we propose a novel regression framework, Bayesian hierarchical varying-sparsity regression (BEHAVIOR) models to select clinically relevant disease markers by integrating proteogenomic (proteomic+genomic) and clinical data. Our methods allow flexible modeling of protein-gene relationships as well as induces sparsity in both protein-gene and protein-survival relationships, to select ge-nomically driven prognostic protein markers at the patient-level. Simulation studies demonstrate the superior performance of BEHAVIOR against competing method in terms of both protein marker selection and survival prediction. We apply BEHAV-IOR to The Cancer Genome Atlas (TCGA) proteogenomic pan-cancer data and find several interesting prognostic proteins and pathways that are shared across multiple cancers and some that exclusively pertain to specific cancers.
Collapse
Affiliation(s)
- Yang Ni
- Department of Statistics and Data Sciences, The University of Texas at Austin
| | - Francesco C Stingo
- Department of Statistics, Computer Science, Applications "G. Parenti", The University of Florence
| | - Min Jin Ha
- Department of Biostatistics, The University of Texas MD Anderson Cancer Center
| | - Rehan Akbani
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center
| | | |
Collapse
|
45
|
Yi X, Wang B, An Z, Gong F, Li J, Fu Y. Quality control of single amino acid variations detected by tandem mass spectrometry. J Proteomics 2018; 187:144-151. [PMID: 30012419 DOI: 10.1016/j.jprot.2018.07.004] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2018] [Revised: 06/26/2018] [Accepted: 07/02/2018] [Indexed: 02/04/2023]
Abstract
Study of single amino acid variations (SAVs) of proteins, resulting from single nucleotide polymorphisms, is of great importance for understanding the relationships between genotype and phenotype. In mass spectrometry based shotgun proteomics, identification of peptides with SAVs often suffers from high error rates on the variant sites detected. These site errors are due to multiple reasons and can be confirmed by manual inspection or genomic sequencing. Here, we present a software tool, named SAVControl, for site-level quality control of variant peptide identifications. It mainly includes strict false discovery rate control of variant peptide identifications and variant site verification by unrestrictive mass shift relocalization. SAVControl was validated on three colorectal adenocarcinoma cell line datasets with genomic sequencing evidences and tested on a colorectal cancer dataset from The Cancer Genome Atlas. The results show that SAVControl can effectively remove false detections of SAVs. SIGNIFICANCE Protein sequence variations caused by single nucleotide polymorphisms (SNPs) are single amino acid variations (SAVs). The investigation of SAVs may provide a chance for understanding the relationships between genotype and phenotype. Mass spectrometry (MS) based proteomics provides a large-scale way to detect SAVs. However, using the current analysis strategy to detect SAVs may lead to high rate of false positives. The SAVControl we present here is a computational workflow and software tool for site-level quality control of SAVs detected by MS. It accesses the confidence of detected variant sites by relocating the mass shift responsible for an SAV to search for alternative interpretations. In addition, it uses a strict false discovery rate control method for variant peptide identifications. The advantages of SAVControl were demonstrated on three colorectal adenocarcinoma cell line datasets and a colorectal cancer dataset. We believe that SAVControl will be a powerful tool for computational proteomics and proteogenomics.
Collapse
Affiliation(s)
- Xinpei Yi
- NCMIS, RCSDS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China; School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Bo Wang
- Department of Bioinformatics and Biostatistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Zhiwu An
- NCMIS, RCSDS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China; School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Fuzhou Gong
- NCMIS, RCSDS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China; School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing 100049, China.
| | - Jing Li
- Department of Bioinformatics and Biostatistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China.
| | - Yan Fu
- NCMIS, RCSDS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China; School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing 100049, China.
| |
Collapse
|
46
|
Lobas AA, Pyatnitskiy MA, Chernobrovkin AL, Ilina IY, Karpov DS, Solovyeva EM, Kuznetsova KG, Ivanov MV, Lyssuk EY, Kliuchnikova AA, Voronko OE, Larin SS, Zubarev RA, Gorshkov MV, Moshkovskii SA. Proteogenomics of Malignant Melanoma Cell Lines: The Effect of Stringency of Exome Data Filtering on Variant Peptide Identification in Shotgun Proteomics. J Proteome Res 2018; 17:1801-1811. [PMID: 29619825 DOI: 10.1021/acs.jproteome.7b00841] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
The identification of genetically encoded variants at the proteome level is an important problem in cancer proteogenomics. The generation of customized protein databases from DNA or RNA sequencing data is a crucial stage of the identification workflow. Genomic data filtering applied at this stage may significantly modify variant search results, yet its effect is generally left out of the scope of proteogenomic studies. In this work, we focused on this impact using data of exome sequencing and LC-MS/MS analyses of six replicates for eight melanoma cell lines processed by a proteogenomics workflow. The main objectives were identifying variant peptides and revealing the role of the genomic data filtering in the variant identification. A series of six confidence thresholds for single nucleotide polymorphisms and indels from the exome data were applied to generate customized sequence databases of different stringency. In the searches against unfiltered databases, between 100 and 160 variant peptides were identified for each of the cell lines using X!Tandem and MS-GF+ search engines. The recovery rate for variant peptides was ∼1%, which is approximately three times lower than that of the wild-type peptides. Using unfiltered genomic databases for variant searches resulted in higher sensitivity and selectivity of the proteogenomic workflow and positively affected the ability to distinguish the cell lines based on variant peptide signatures.
Collapse
Affiliation(s)
- Anna A Lobas
- Moscow Institute of Physics and Technology (State University) , Dolgoprudny 141700 , Moscow Region , Russia.,V.L. Talrose Institute for Energy Problems of Chemical Physics , Russian Academy of Sciences , Moscow 119334 , Russia
| | - Mikhail A Pyatnitskiy
- Institute of Biomedical Chemistry , Moscow 119121 , Russia.,Higher School of Economics , Moscow 101000 , Russia
| | | | - Irina Y Ilina
- Institute of Biomedical Chemistry , Moscow 119121 , Russia
| | - Dmitry S Karpov
- Institute of Biomedical Chemistry , Moscow 119121 , Russia.,Engelhardt Institute of Molecular Biology , Russian Academy of Sciences , Moscow 119991 , Russia
| | - Elizaveta M Solovyeva
- V.L. Talrose Institute for Energy Problems of Chemical Physics , Russian Academy of Sciences , Moscow 119334 , Russia
| | | | - Mark V Ivanov
- V.L. Talrose Institute for Energy Problems of Chemical Physics , Russian Academy of Sciences , Moscow 119334 , Russia
| | - Elena Y Lyssuk
- Institute of Gene Biology , Russian Academy of Sciences , Moscow 119334 , Russia
| | - Anna A Kliuchnikova
- Institute of Biomedical Chemistry , Moscow 119121 , Russia.,Pirogov Russian National Research Medical University , Moscow 117997 , Russia
| | - Olga E Voronko
- Institute of Biomedical Chemistry , Moscow 119121 , Russia
| | - Sergey S Larin
- Institute of Gene Biology , Russian Academy of Sciences , Moscow 119334 , Russia
| | | | - Mikhail V Gorshkov
- V.L. Talrose Institute for Energy Problems of Chemical Physics , Russian Academy of Sciences , Moscow 119334 , Russia
| | - Sergei A Moshkovskii
- Institute of Biomedical Chemistry , Moscow 119121 , Russia.,Pirogov Russian National Research Medical University , Moscow 117997 , Russia
| |
Collapse
|
47
|
Discovery of coding regions in the human genome by integrated proteogenomics analysis workflow. Nat Commun 2018; 9:903. [PMID: 29500430 PMCID: PMC5834625 DOI: 10.1038/s41467-018-03311-y] [Citation(s) in RCA: 86] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2017] [Accepted: 02/02/2018] [Indexed: 01/23/2023] Open
Abstract
Proteogenomics enable the discovery of novel peptides (from unannotated genomic protein-coding loci) and single amino acid variant peptides (derived from single-nucleotide polymorphisms and mutations). Increasing the reliability of these identifications is crucial to ensure their usefulness for genome annotation and potential application as neoantigens in cancer immunotherapy. We here present integrated proteogenomics analysis workflow (IPAW), which combines peptide discovery, curation, and validation. IPAW includes the SpectrumAI tool for automated inspection of MS/MS spectra, eliminating false identifications of single-residue substitution peptides. We employ IPAW to analyze two proteomics data sets acquired from A431 cells and five normal human tissues using extended (pH range, 3–10) high-resolution isoelectric focusing (HiRIEF) pre-fractionation and TMT-based peptide quantitation. The IPAW results provide evidence for the translation of pseudogenes, lncRNAs, short ORFs, alternative ORFs, N-terminal extensions, and intronic sequences. Moreover, our quantitative analysis indicates that protein production from certain pseudogenes and lncRNAs is tissue specific. Proteogenomics enables the discovery of protein coding regions and disease-relevant mutations but their verification remains challenging. Here, the authors combine peptide discovery, curation and validation in an integrated proteogenomics workflow, robustly identifying unknown coding regions and mutations.
Collapse
|
48
|
Validation of a multi-omics strategy for prioritizing personalized candidate driver genes. Oncotarget 2018; 7:38440-38450. [PMID: 27469031 PMCID: PMC5122402 DOI: 10.18632/oncotarget.9540] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2016] [Accepted: 05/08/2016] [Indexed: 01/13/2023] Open
Abstract
Significant heterogeneity between different tumors prevents the discovery of cancer driver genes, especially in a patient-specific manner. We previously prioritized five personalized candidate mutation-driver genes in a hyper-mutated hepatocellular carcinoma patient using a multi-omics strategy. However, the roles of the prioritized driver genes and patient-specific mutations in hepatocarcinogenesis are unclear. We investigated the impact of the tumor-mutated allele on structure-function relationship of the encoded protein and assessed both loss- and gain-of-function of these genes and mutations on hepatoma cell behaviors in vitro. The prioritized mutation-driver genes act as tumor suppressor genes and inhibit cell proliferation and migration. In addition, the loss-of-function effect of the patient-specific mutations promoted cell proliferation and migration. Of note, the HNF1A S247T mutation significantly reduced the HNF1A transcriptional activity for hepatocyte nuclear factor 4 alpha (HNF4A) but did not disrupt nuclear localization of HNF1A. The results provide evidence for supporting the validity of our proposed multi-omics strategy, which supplies a new avenue for prioritizing mutation-drivers towards personalized cancer therapy.
Collapse
|
49
|
Next-Generation Sequencing and Mutational Analysis: Implications for Genes Encoding LINC Complex Proteins. Methods Mol Biol 2018; 1840:321-336. [PMID: 30141054 DOI: 10.1007/978-1-4939-8691-0_22] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Targeted panel, whole exome, or whole genome DNA sequencing using next-generation sequencing (NGS) allows for extensive high-throughput investigation of molecular machines/systems such as the LINC complex. This includes the identification of genetic variants in humans that cause disease, as is the case for some genes encoding LINC complex proteins. The relatively low cost and high speed of the sequencing process results in large datasets at various stages of analysis and interpretation. For those not intimately familiar with the process, interpretation of the data might prove challenging. This review lays out the most important and most commonly used materials and methods of NGS. It also discusses data analysis and potential pitfalls one might encounter because of peculiarities of the laboratory methodology or data analysis pipelines.
Collapse
|
50
|
Chambers MC, Jagtap PD, Johnson JE, McGowan T, Kumar P, Onsongo G, Guerrero CR, Barsnes H, Vaudel M, Martens L, Grüning B, Cooke IR, Heydarian M, Reddy KL, Griffin TJ. An Accessible Proteogenomics Informatics Resource for Cancer Researchers. Cancer Res 2017; 77:e43-e46. [PMID: 29092937 DOI: 10.1158/0008-5472.can-17-0331] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2017] [Revised: 04/07/2017] [Accepted: 06/30/2017] [Indexed: 11/16/2022]
Abstract
Proteogenomics has emerged as a valuable approach in cancer research, which integrates genomic and transcriptomic data with mass spectrometry-based proteomics data to directly identify expressed, variant protein sequences that may have functional roles in cancer. This approach is computationally intensive, requiring integration of disparate software tools into sophisticated workflows, challenging its adoption by nonexpert, bench scientists. To address this need, we have developed an extensible, Galaxy-based resource aimed at providing more researchers access to, and training in, proteogenomic informatics. Our resource brings together software from several leading research groups to address two foundational aspects of proteogenomics: (i) generation of customized, annotated protein sequence databases from RNA-Seq data; and (ii) accurate matching of tandem mass spectrometry data to putative variants, followed by filtering to confirm their novelty. Directions for accessing software tools and workflows, along with instructional documentation, can be found at z.umn.edu/canresgithub. Cancer Res; 77(21); e43-46. ©2017 AACR.
Collapse
Affiliation(s)
| | - Pratik D Jagtap
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, Minnesota
| | - James E Johnson
- Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota
| | - Thomas McGowan
- Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota
| | - Praveen Kumar
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, Minnesota.,Bioinformatics and Computational Biology Program, University of Minnesota-Rochester, Rochester, Minnesota
| | - Getiria Onsongo
- Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota
| | - Candace R Guerrero
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, Minnesota
| | - Harald Barsnes
- Proteomics Unit, Department of Biomedicine, University of Bergen, Bergen, Norway.,Computational Biology Unit, Department of Informatics, University of Bergen, Bergen, Norway
| | - Marc Vaudel
- KG Jebsen Center for Diabetes Research, Department of Clinical Science, University of Bergen, Bergen, Norway.,Center for Medical Genetics and Molecular Medicine, Haukeland University Hospital, Bergen, Norway
| | - Lennart Martens
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium.,Department of Biochemistry, Ghent University, Ghent, Belgium.,Bioinformatics Institute Ghent, Ghent University, Ghent, Belgium
| | - Björn Grüning
- Department of Computer Science, Albert-Ludwigs-University, Freiburg, Freiburg, Germany.,Center for Biological Systems Analysis (ZBSA), University of Freiburg, Freiburg, Germany
| | - Ira R Cooke
- Comparative Genomics Centre and Department of Molecular and Cell Biology, James Cook University, Queensland, Australia
| | | | - Karen L Reddy
- Department of Biological Chemistry, Center for Epigenetics and Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University, Baltimore, Maryland
| | - Timothy J Griffin
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, Minnesota.
| |
Collapse
|