Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Cesnik AJ, Shortreed MR, Sheynkman GM, Frey BL, Smith LM. Human Proteomic Variation Revealed by Combining RNA-Seq Proteogenomics and Global Post-Translational Modification (G-PTM) Search Strategy. J Proteome Res 2016;15:800-8. [PMID: 26704769 PMCID: PMC4779408 DOI: 10.1021/acs.jproteome.5b00817] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]

For:	Cesnik AJ, Shortreed MR, Sheynkman GM, Frey BL, Smith LM. Human Proteomic Variation Revealed by Combining RNA-Seq Proteogenomics and Global Post-Translational Modification (G-PTM) Search Strategy. J Proteome Res 2016;15:800-8. [PMID: 26704769 PMCID: PMC4779408 DOI: 10.1021/acs.jproteome.5b00817] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]

Number

Cited by Other Article(s)

Gulhane P, Singh S. Unraveling the Post-Translational Modifications and therapeutical approach in NSCLC pathogenesis. Transl Oncol 2023;33:101673. [PMID: 37062237 PMCID: PMC10133877 DOI: 10.1016/j.tranon.2023.101673] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Revised: 04/09/2023] [Accepted: 04/10/2023] [Indexed: 04/18/2023] Open

Harney DJ, Larance M. Annotated Protein Database Using Known Cleavage Sites for Rapid Detection of Secreted Proteins. J Proteome Res 2022;21:965-974. [DOI: 10.1021/acs.jproteome.1c00806] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Hyung D, Baek MJ, Lee J, Cho J, Kim HS, Park C, Cho SY. Protein-gene Expression Nexus: Comprehensive characterization of human cancer cell lines with proteogenomic analysis. Comput Struct Biotechnol J 2021;19:4759-4769. [PMID: 34504668 PMCID: PMC8405889 DOI: 10.1016/j.csbj.2021.08.022] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2021] [Revised: 08/13/2021] [Accepted: 08/14/2021] [Indexed: 12/30/2022] Open

Cesnik AJ, Miller RM, Ibrahim K, Lu L, Millikin RJ, Shortreed MR, Frey BL, Smith LM. Spritz: A Proteogenomic Database Engine. J Proteome Res 2020;20:1826-1834. [PMID: 32967423 DOI: 10.1021/acs.jproteome.0c00407] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]

Ramesh P, Nagarajan V, Khanchandani V, Desai VK, Niranjan V. Proteomic variations of esophageal squamous cell carcinoma revealed by combining RNA-seq proteogenomics and G-PTM search strategy. Heliyon 2020;6:e04813. [PMID: 32913912 PMCID: PMC7472856 DOI: 10.1016/j.heliyon.2020.e04813] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2020] [Revised: 07/10/2020] [Accepted: 08/25/2020] [Indexed: 02/07/2023] Open

Abstract

BACKGROUND

Cancer that arises from epithelial cells of the esophagus is called esophagus squamous cell carcinoma (ESCC) and is mostly observed in developing nations. Evaluation of cancer genomes and its regulation into proteins plays a predominant role in understanding the cancer progressions. Mass-spectrometry-based proteomics is a consequential tool to estimate proteomic variation and posttranslational modifications (PTMs) from standard protein databases. Post-translational modifications play a crucial role in protein folding and PTMs can be accounted for as a biological signal to interpret the structural changes and transition order of proteins. Functional validation of cancer-related mutations can explain the effects of mutations on genes and the identification of Oncogenes and tumor suppressor genes. Therefore, we present a study on protein variations to interpret the structural changes and transition order of proteins in ESCC carcinogenesis.

METHODOLOGY

We are using a bottom-up proteomics approach with Galaxy-P framework and RNA sequence data analysis to generate the sample-specific databases containing details of RNA splicing and variant peptides. Once the database generated with information on variable modification, only the curated PTMs at specific positions are considered to perform spectral matching. Proteogenomics mapping was performed to identify protein variations in ESCC.

RESULTS

RNA-sequence proteogenomics with G-PTM (Global Post-Translational Modification) searching strategy has revealed proteomic events including several peptides that contain single amino acid variations, novel splice junction peptides and posttranslationally modified peptides. Proteogenomic mapping exhibited the splice junction peptides mapped predominantly for Malic enzyme exon type (ME-3) and MCM7 protein-coding genes that promote cancer progression, found to be exhibited in ESCC samples. Approximately 25 ± types of PTM modifications were recorded, and Protein Phosphorylation was largely noted.

CONCLUSION

ESCC cancer prognosis at the molecular level enables a better understanding of cancer carcinogenesis and protein modifications can be used as potential biomarkers.

Collapse

Hubler SL, Kumar P, Mehta S, Easterly C, Johnson JE, Jagtap PD, Griffin TJ. Challenges in Peptide-Spectrum Matching: A Robust and Reproducible Statistical Framework for Removing Low-Accuracy, High-Scoring Hits. J Proteome Res 2019;19:161-173. [DOI: 10.1021/acs.jproteome.9b00478] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Myburg AA, Hussey SG, Wang JP, Street NR, Mizrachi E. Systems and Synthetic Biology of Forest Trees: A Bioengineering Paradigm for Woody Biomass Feedstocks. FRONTIERS IN PLANT SCIENCE 2019;10:775. [PMID: 31281326 PMCID: PMC6597874 DOI: 10.3389/fpls.2019.00775] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/22/2018] [Accepted: 05/28/2019] [Indexed: 05/07/2023]

Li S, Cha SW, Heffner K, Hizal DB, Bowen MA, Chaerkady R, Cole RN, Tejwani V, Kaushik P, Henry M, Meleady P, Sharfstein ST, Betenbaugh MJ, Bafna V, Lewis NE. Proteogenomic Annotation of Chinese Hamsters Reveals Extensive Novel Translation Events and Endogenous Retroviral Elements. J Proteome Res 2019;18:2433-2445. [PMID: 31020842 DOI: 10.1021/acs.jproteome.8b00935] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]

Wang Q, Peng WX, Wang L, Ye L. Toward multiomics-based next-generation diagnostics for precision medicine. Per Med 2019;16:157-170. [PMID: 30816060 DOI: 10.2217/pme-2018-0085] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

González-Gomariz J, Guruceaga E, López-Sánchez M, Segura V. Proteogenomics in the context of the Human Proteome Project (HPP). Expert Rev Proteomics 2019;16:267-275. [PMID: 30654666 DOI: 10.1080/14789450.2019.1571916] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]

Low TY, Mohtar MA, Ang MY, Jamal R. Connecting Proteomics to Next‐Generation Sequencing: Proteogenomics and Its Current Applications in Biology. Proteomics 2018;19:e1800235. [DOI: 10.1002/pmic.201800235] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2018] [Revised: 10/09/2018] [Indexed: 12/17/2022]

Cifani P, Dhabaria A, Chen Z, Yoshimi A, Kawaler E, Abdel-Wahab O, Poirier JT, Kentsis A. ProteomeGenerator: A Framework for Comprehensive Proteomics Based on de Novo Transcriptome Assembly and High-Accuracy Peptide Mass Spectral Matching. J Proteome Res 2018;17:3681-3692. [PMID: 30295032 DOI: 10.1021/acs.jproteome.8b00295] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Abstract

Modern mass spectrometry now permits genome-scale and quantitative measurements of biological proteomes. However, analysis of specific specimens is currently hindered by the incomplete representation of biological variability of protein sequences in canonical reference proteomes and the technical demands for their construction. Here, we report ProteomeGenerator, a framework for de novo and reference-assisted proteogenomic database construction and analysis based on sample-specific transcriptome sequencing and high-accuracy mass spectrometry proteomics. This enables the assembly of proteomes encoded by actively transcribed genes, including sample-specific protein isoforms resulting from non-canonical mRNA transcription, splicing, or editing. To improve the accuracy of protein isoform identification in non-canonical proteomes, ProteomeGenerator relies on statistical target-decoy database matching calibrated using sample-specific controls. Its current implementation includes automatic integration with MaxQuant mass spectrometry proteomics algorithms. We applied this method for the proteogenomic analysis of splicing factor SRSF2 mutant leukemia cells, demonstrating high-confidence identification of non-canonical protein isoforms arising from alternative transcriptional start sites, intron retention, and cryptic exon splicing as well as improved accuracy of genome-scale proteome discovery. Additionally, we report proteogenomic performance metrics for current state-of-the-art implementations of SEQUEST HT, MaxQuant, Byonic, and PEAKS mass spectral analysis algorithms. Finally, ProteomeGenerator is implemented as a Snakemake workflow within a Singularity container for one-step installation in diverse computing environments, thereby enabling open, scalable, and facile discovery of sample-specific, non-canonical, and neomorphic biological proteomes.

Collapse

Kiseleva OI, Lisitsa AV, Poverennaya EV. Proteoforms: Methods of Analysis and Clinical Prospects. Mol Biol 2018. [DOI: 10.1134/s0026893318030068] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Lu L, Millikin RJ, Solntsev SK, Rolfs Z, Scalf M, Shortreed MR, Smith LM. Identification of MS-Cleavable and Noncleavable Chemically Cross-Linked Peptides with MetaMorpheus. J Proteome Res 2018;17:2370-2376. [PMID: 29793340 DOI: 10.1021/acs.jproteome.8b00141] [Citation(s) in RCA: 40] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]

Lin YY, Gawronski A, Hach F, Li S, Numanagić I, Sarrafi I, Mishra S, McPherson A, Collins CC, Radovich M, Tang H, Sahinalp SC. Computational identification of micro-structural variations and their proteogenomic consequences in cancer. Bioinformatics 2018;34:1672-1681. [PMID: 29267878 PMCID: PMC5946953 DOI: 10.1093/bioinformatics/btx807] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2017] [Revised: 11/24/2017] [Accepted: 12/15/2017] [Indexed: 12/18/2022] Open

Abstract

Motivation

Rapid advancement in high throughput genome and transcriptome sequencing (HTS) and mass spectrometry (MS) technologies has enabled the acquisition of the genomic, transcriptomic and proteomic data from the same tissue sample. We introduce a computational framework, ProTIE, to integratively analyze all three types of omics data for a complete molecular profile of a tissue sample. Our framework features MiStrVar, a novel algorithmic method to identify micro structural variants (microSVs) on genomic HTS data. Coupled with deFuse, a popular gene fusion detection method we developed earlier, MiStrVar can accurately profile structurally aberrant transcripts in tumors. Given the breakpoints obtained by MiStrVar and deFuse, our framework can then identify all relevant peptides that span the breakpoint junctions and match them with unique proteomic signatures. Observing structural aberrations in all three types of omics data validates their presence in the tumor samples.

Results

We have applied our framework to all The Cancer Genome Atlas (TCGA) breast cancer Whole Genome Sequencing (WGS) and/or RNA-Seq datasets, spanning all four major subtypes, for which proteomics data from Clinical Proteomic Tumor Analysis Consortium (CPTAC) have been released. A recent study on this dataset focusing on SNVs has reported many that lead to novel peptides. Complementing and significantly broadening this study, we detected 244 novel peptides from 432 candidate genomic or transcriptomic sequence aberrations. Many of the fusions and microSVs we discovered have not been reported in the literature. Interestingly, the vast majority of these translated aberrations, fusions in particular, were private, demonstrating the extensive inter-genomic heterogeneity present in breast cancer. Many of these aberrations also have matching out-of-frame downstream peptides, potentially indicating novel protein sequence and structure.

Availability and implementation

MiStrVar is available for download at https://bitbucket.org/compbio/mistrvar, and ProTIE is available at https://bitbucket.org/compbio/protie.

Contact

cenksahi@indiana.edu.

Supplementary information

Supplementary data are available at Bioinformatics online.

Collapse

Solntsev SK, Shortreed MR, Frey BL, Smith LM. Enhanced Global Post-translational Modification Discovery with MetaMorpheus. J Proteome Res 2018;17:1844-1851. [PMID: 29578715 DOI: 10.1021/acs.jproteome.7b00873] [Citation(s) in RCA: 168] [Impact Index Per Article: 28.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]

Proffitt JM, Glenn J, Cesnik AJ, Jadhav A, Shortreed MR, Smith LM, Kavanagh K, Cox LA, Olivier M. Proteomics in non-human primates: utilizing RNA-Seq data to improve protein identification by mass spectrometry in vervet monkeys. BMC Genomics 2017;18:877. [PMID: 29132314 PMCID: PMC5683380 DOI: 10.1186/s12864-017-4279-0] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2017] [Accepted: 11/03/2017] [Indexed: 01/05/2023] Open

Abstract

Background

Shotgun proteomics utilizes a database search strategy to compare detected mass spectra to a library of theoretical spectra derived from reference genome information. As such, the robustness of proteomics results is contingent upon the completeness and accuracy of the gene annotation in the reference genome. For animal models of disease where genomic annotation is incomplete, such as non-human primates, proteogenomic methods can improve the detection of proteins by incorporating transcriptional data from RNA-Seq to improve proteomics search databases used for peptide spectral matching. Customized search databases derived from RNA-Seq data are capable of identifying unannotated genetic and splice variants while simultaneously reducing the number of comparisons to only those transcripts actively expressed in the tissue.

Results

We collected RNA-Seq and proteomic data from 10 vervet monkey liver samples and used the RNA-Seq data to curate sample-specific search databases which were analyzed in the program Morpheus. We compared these results against those from a search database generated from the reference vervet genome. A total of 284 previously unannotated splice junctions were predicted by the RNA-Seq data, 92 of which were confirmed by peptide spectral matches. More than half (53/92) of these unannotated splice variants had orthologs in other non-human primates, suggesting that failure to match these peptides in the reference analyses likely arose from incomplete gene model information. The sample-specific databases also identified 101 unique peptides containing single amino acid substitutions which were missed by the reference database. Because the sample-specific searches were restricted to actively expressed transcripts, the search databases were smaller, more computationally efficient, and identified more peptides at the empirically derived 1 % false discovery rate.

Conclusion

Proteogenomic approaches are ideally suited to facilitate the discovery and annotation of proteins in less widely studies animal models such as non-human primates. We expect that these approaches will help to improve existing genome annotations of non-human primate species such as vervet.

Electronic supplementary material

The online version of this article (doi: 10.1186/s12864-017-4279-0) contains supplementary material, which is available to authorized users.

Collapse

Detecting protein variants by mass spectrometry: a comprehensive study in cancer cell-lines. Genome Med 2017;9:62. [PMID: 28716134 PMCID: PMC5514513 DOI: 10.1186/s13073-017-0454-9] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2017] [Accepted: 06/22/2017] [Indexed: 02/07/2023] Open

Post-translational modifications of FDA-approved plasma biomarkers in glioblastoma samples. PLoS One 2017;12:e0177427. [PMID: 28493947 PMCID: PMC5426747 DOI: 10.1371/journal.pone.0177427] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2016] [Accepted: 04/27/2017] [Indexed: 01/08/2023] Open

Willems P, Ndah E, Jonckheere V, Stael S, Sticker A, Martens L, Van Breusegem F, Gevaert K, Van Damme P. N-terminal Proteomics Assisted Profiling of the Unexplored Translation Initiation Landscape in Arabidopsis thaliana. Mol Cell Proteomics 2017;16:1064-1080. [PMID: 28432195 PMCID: PMC5461538 DOI: 10.1074/mcp.m116.066662] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2016] [Revised: 04/11/2017] [Indexed: 01/05/2023] Open

Affiliation(s)

Patrick Willems From the ‡VIB/UGent Center for Plant Systems Biology, 9052 Ghent, Belgium.,§Ghent University, Department of Plant Biotechnology and Bioinformatics, 9052 Ghent.,¶VIB/UGent Center for Medical Biotechnology, 9000 Ghent, Belgium.,‖Ghent University, Department of Biochemistry, 9000 Ghent, Belgium
Elvis Ndah ¶VIB/UGent Center for Medical Biotechnology, 9000 Ghent, Belgium.,‖Ghent University, Department of Biochemistry, 9000 Ghent, Belgium.,**Ghent University, Department of Mathematical Modeling, Statistics and Bioinformatics, 9000 Ghent, Belgium
Veronique Jonckheere ¶VIB/UGent Center for Medical Biotechnology, 9000 Ghent, Belgium.,‖Ghent University, Department of Biochemistry, 9000 Ghent, Belgium
Simon Stael From the ‡VIB/UGent Center for Plant Systems Biology, 9052 Ghent, Belgium.,§Ghent University, Department of Plant Biotechnology and Bioinformatics, 9052 Ghent.,¶VIB/UGent Center for Medical Biotechnology, 9000 Ghent, Belgium.,‖Ghent University, Department of Biochemistry, 9000 Ghent, Belgium
Adriaan Sticker ¶VIB/UGent Center for Medical Biotechnology, 9000 Ghent, Belgium.,‖Ghent University, Department of Biochemistry, 9000 Ghent, Belgium.,**Ghent University, Department of Mathematical Modeling, Statistics and Bioinformatics, 9000 Ghent, Belgium
Lennart Martens ¶VIB/UGent Center for Medical Biotechnology, 9000 Ghent, Belgium.,‖Ghent University, Department of Biochemistry, 9000 Ghent, Belgium.,**Ghent University, Department of Mathematical Modeling, Statistics and Bioinformatics, 9000 Ghent, Belgium
Frank Van Breusegem From the ‡VIB/UGent Center for Plant Systems Biology, 9052 Ghent, Belgium.,§Ghent University, Department of Plant Biotechnology and Bioinformatics, 9052 Ghent
Kris Gevaert ¶VIB/UGent Center for Medical Biotechnology, 9000 Ghent, Belgium.,‖Ghent University, Department of Biochemistry, 9000 Ghent, Belgium
Petra Van Damme ¶VIB/UGent Center for Medical Biotechnology, 9000 Ghent, Belgium; .,‖Ghent University, Department of Biochemistry, 9000 Ghent, Belgium

Collapse

Li Q, Shortreed MR, Wenger CD, Frey BL, Schaffer LV, Scalf M, Smith LM. Global Post-Translational Modification Discovery. J Proteome Res 2017;16:1383-1390. [PMID: 28248113 PMCID: PMC5387672 DOI: 10.1021/acs.jproteome.6b00034] [Citation(s) in RCA: 63] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]

Kumar D, Bansal G, Narang A, Basak T, Abbas T, Dash D. Integrating transcriptome and proteome profiling: Strategies and applications. Proteomics 2016;16:2533-2544. [PMID: 27343053 DOI: 10.1002/pmic.201600140] [Citation(s) in RCA: 108] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2016] [Revised: 06/12/2016] [Accepted: 06/23/2016] [Indexed: 12/17/2022]

Kohlbacher O, Vitek O, Weintraub ST. Challenges in Large-Scale Computational Mass Spectrometry and Multiomics. J Proteome Res 2016;15:681-2. [DOI: 10.1021/acs.jproteome.6b00067] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]