Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Li J, Su Z, Ma ZQ, Slebos RJC, Halvey P, Tabb DL, Liebler DC, Pao W, Zhang B. A bioinformatics workflow for variant peptide detection in shotgun proteomics. Mol Cell Proteomics 2011;10:M110.006536. [PMID: 21389108 DOI: 10.1074/mcp.m110.006536] [Citation(s) in RCA: 75] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open

For:	Li J, Su Z, Ma ZQ, Slebos RJC, Halvey P, Tabb DL, Liebler DC, Pao W, Zhang B. A bioinformatics workflow for variant peptide detection in shotgun proteomics. Mol Cell Proteomics 2011;10:M110.006536. [PMID: 21389108 DOI: 10.1074/mcp.m110.006536] [Citation(s) in RCA: 75] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open

Number

Cited by Other Article(s)

Raj A, Aggarwal S, Singh P, Yadav AK, Dash D. PgxSAVy: A tool for comprehensive evaluation of variant peptide quality in proteogenomics - catching the (un)usual suspects. Comput Struct Biotechnol J 2024;23:711-722. [PMID: 38292474 PMCID: PMC10825656 DOI: 10.1016/j.csbj.2023.12.033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2023] [Revised: 12/19/2023] [Accepted: 12/23/2023] [Indexed: 02/01/2024] Open

Abstract

Variant peptides resulting from single nucleotide polymorphisms (SNPs) can lead to aberrant protein functions and have translational potential for disease diagnosis and personalized therapy. Variant peptides detected by proteogenomics are fraught with high number of false positives, but there is no uniform and comprehensive approach to assess variant quality across analysis pipelines. Despite class-specific FDR along with ad-hoc filters, the problem is far from solved. These protocols are typically manual and tedious, and thus not uniform across labs. We demonstrate that variant peptide rescoring, integrated with intensity, variant event information and search result features, allows better discrimination of correct variant peptides. Implemented into PgxSAVy - a tool for quality control of variant peptides, this method can tackle the high rate of false positives. PgxSAVy provides a rigorous framework for quality control and annotations of variant peptides on the basis of (i) variant quality, (ii) isobaric masses, and (iii) disease annotation. PgxSAVy demonstrated high accuracy by identifying true variants with 98.43% accuracy on simulated data. Large-scale proteogenomic reanalysis of ∼2.8 million spectra (PXD004010 and PXD001468) resulted in 12,705 variant peptide spectrum matches (PSMs), of which PgxSAVy evaluated 3028 (23.8%), 1409 (11.1%) and 8268 (65.1%) as confident, semi-confident and doubtful respectively. PgxSAVy also annotates the variants based on their pathogenicity and provides support for assisted manual validation. The analysis of proteins carrying variants can provide fine granularity in discovering important pathways. PgxSAVy will advance personalized medicine by providing a comprehensive framework for quality control and prioritization of proteogenomics variants. PgxSAVy is freely available at https://pgxsavy.igib.res.in/ as a webserver and https://github.com/anuragraj/PgxSAVy as a stand-alone tool.

Collapse

Zhang M, Gong C, Ge F, Yu DJ. FCMSTrans: Accurate Prediction of Disease-Associated nsSNPs by Utilizing Multiscale Convolution and Deep Feature Combination within a Transformer Framework. J Chem Inf Model 2024;64:1394-1406. [PMID: 38349747 DOI: 10.1021/acs.jcim.3c02025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/27/2024]

Abstract

Nonsynonymous single-nucleotide polymorphisms (nsSNPs), implicated in over 6000 diseases, necessitate accurate prediction for expedited drug discovery and improved disease diagnosis. In this study, we propose FCMSTrans, a novel nsSNP predictor that innovatively combines the transformer framework and multiscale modules for comprehensive feature extraction. The distinctive attribute of FCMSTrans resides in a deep feature combination strategy. This strategy amalgamates evolutionary-scale modeling (ESM) and ProtTrans (PT) features, providing an understanding of protein biochemical properties, and position-specific scoring matrix, secondary structure, predicted relative solvent accessibility, and predicted disorder (PSPP) features, which are derived from four protein sequences and structure-oriented characteristics. This feature combination offers a comprehensive view of the molecular dynamics involving nsSNPs. Our model employs the transformer's self-attention mechanisms across multiple layers, extracting higher-level and abstract representations. Simultaneously, varied-level features are captured by multiscale convolutions, enriching feature abstraction at multiple echelons. Our comparative analyses with existing methodologies highlight significant improvements made possible by the integrated feature fusion approach adopted in FCMSTrans. This is further substantiated by performance assessments based on diverse data sets, such as PredictSNP, MMP, and PMD, with areas under the curve (AUCs) of 0.869, 0.819, and 0.693, respectively. Furthermore, FCMSTrans shows robustness and superiority by outperforming the current best predictor, PROVEAN, in a blind test conducted on a third-party data set, achieving an impressive AUC score of 0.7838. The Python code of FCMSTrans is available at https://github.com/gc212/FCMSTrans for academic usage.

Collapse

Lin TT, Zhang T, Kitata RB, Liu T, Smith RD, Qian WJ, Shi T. Mass spectrometry-based targeted proteomics for analysis of protein mutations. MASS SPECTROMETRY REVIEWS 2023;42:796-821. [PMID: 34719806 PMCID: PMC9054944 DOI: 10.1002/mas.21741] [Citation(s) in RCA: 17] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/19/2021] [Revised: 09/28/2021] [Accepted: 10/07/2021] [Indexed: 05/03/2023]

Abstract

Cancers are caused by accumulated DNA mutations. This recognition of the central role of mutations in cancer and recent advances in next-generation sequencing, has initiated the massive screening of clinical samples and the identification of 1000s of cancer-associated gene mutations. However, proteomic analysis of the expressed mutation products lags far behind genomic (transcriptomic) analysis. With comprehensive global proteomics analysis, only a small percentage of single nucleotide variants detected by DNA and RNA sequencing have been observed as single amino acid variants due to current technical limitations. Proteomic analysis of mutations is important with the potential to advance cancer biomarker development and the discovery of new therapeutic targets for more effective disease treatment. Targeted proteomics using selected reaction monitoring (also known as multiple reaction monitoring) and parallel reaction monitoring, has emerged as a powerful tool with significant advantages over global proteomics for analysis of protein mutations in terms of detection sensitivity, quantitation accuracy and overall practicality (e.g., reliable identification and the scale of quantification). Herein we review recent advances in the targeted proteomics technology for enhancing detection sensitivity and multiplexing capability and highlight its broad biomedical applications for analysis of protein mutations in human bodily fluids, tissues, and cell lines. Furthermore, we review recent applications of top-down proteomics for analysis of protein mutations. Unlike the commonly used bottom-up proteomics which requires digestion of proteins into peptides, top-down proteomics directly analyzes intact proteins for more precise characterization of mutation isoforms. Finally, general perspectives on the potential of achieving both high sensitivity and high sample throughput for large-scale targeted detection and quantification of important protein mutations are discussed.

Collapse

Yu Y, Zhang Z, Dong X, Yang R, Duan Z, Xiang Z, Li J, Li G, Yan F, Xue H, Jiao D, Lu J, Lu H, Zhang W, Wei Y, Fan S, Li J, Jia J, Zhang J, Ji J, Liu P, Lu H, Zhao H, Chen S, Wei C, Chen H, Zhu Z. Pangenomic analysis of Chinese gastric cancer. Nat Commun 2022;13:5412. [PMID: 36109518 PMCID: PMC9477819 DOI: 10.1038/s41467-022-33073-7] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2022] [Accepted: 08/31/2022] [Indexed: 11/25/2022] Open

Fancello L, Burger T. An analysis of proteogenomics and how and when transcriptome-informed reduction of protein databases can enhance eukaryotic proteomics. Genome Biol 2022;23:132. [PMID: 35725496 PMCID: PMC9208142 DOI: 10.1186/s13059-022-02701-2] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2021] [Accepted: 06/09/2022] [Indexed: 12/03/2022] Open

Abstract

Background

Proteogenomics aims to identify variant or unknown proteins in bottom-up proteomics, by searching transcriptome- or genome-derived custom protein databases. However, empirical observations reveal that these large proteogenomic databases produce lower-sensitivity peptide identifications. Various strategies have been proposed to avoid this, including the generation of reduced transcriptome-informed protein databases, which only contain proteins whose transcripts are detected in the sample-matched transcriptome. These were found to increase peptide identification sensitivity. Here, we present a detailed evaluation of this approach.

Results

We establish that the increased sensitivity in peptide identification is in fact a statistical artifact, directly resulting from the limited capability of target-decoy competition to accurately model incorrect target matches when using excessively small databases. As anti-conservative false discovery rates (FDRs) are likely to hamper the robustness of the resulting biological conclusions, we advocate for alternative FDR control methods that are less sensitive to database size. Nevertheless, reduced transcriptome-informed databases are useful, as they reduce the ambiguity of protein identifications, yielding fewer shared peptides. Furthermore, searching the reference database and subsequently filtering proteins whose transcripts are not expressed reduces protein identification ambiguity to a similar extent, but is more transparent and reproducible.

Conclusions

In summary, using transcriptome information is an interesting strategy that has not been promoted for the right reasons. While the increase in peptide identifications from searching reduced transcriptome-informed databases is an artifact caused by the use of an FDR control method unsuitable to excessively small databases, transcriptome information can reduce the ambiguity of protein identifications.

Supplementary Information

The online version contains supplementary material available at 10.1186/s13059-022-02701-2.

Collapse

Ge F, Zhang Y, Xu J, Muhammad A, Song J, Yu DJ. Prediction of disease-associated nsSNPs by integrating multi-scale ResNet models with deep feature fusion. Brief Bioinform 2021;23:6483068. [PMID: 34953462 DOI: 10.1093/bib/bbab530] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2021] [Revised: 11/13/2021] [Accepted: 11/16/2021] [Indexed: 11/13/2022] Open

Abstract

More than 6000 human diseases have been recorded to be caused by non-synonymous single nucleotide polymorphisms (nsSNPs). Rapid and accurate prediction of pathogenic nsSNPs can improve our understanding of the principle and design of new drugs, which remains an unresolved challenge. In the present work, a new computational approach, termed MSRes-MutP, is proposed based on ResNet blocks with multi-scale kernel size to predict disease-associated nsSNPs. By feeding the serial concatenation of the extracted four types of features, the performance of MSRes-MutP does not obviously improve. To address this, a second model FFMSRes-MutP is developed, which utilizes deep feature fusion strategy and multi-scale 2D-ResNet and 1D-ResNet blocks to extract relevant two-dimensional features and physicochemical properties. FFMSRes-MutP with the concatenated features achieves a better performance than that with individual features. The performance of FFMSRes-MutP is benchmarked on five different datasets. It achieves the Matthew's correlation coefficient (MCC) of 0.593 and 0.618 on the PredictSNP and MMP datasets, which are 0.101 and 0.210 higher than that of the existing best method PredictSNP1. When tested on the HumDiv and HumVar datasets, it achieves MCC of 0.9605 and 0.9507, and area under curve (AUC) of 0.9796 and 0.9748, which are 0.1747 and 0.2669, 0.0853 and 0.1335, respectively, higher than the existing best methods PolyPhen-2 and FATHMM (weighted). In addition, on blind test using a third-party dataset, FFMSRes-MutP performs as the second-best predictor (with MCC and AUC of 0.5215 and 0.7633, respectively), when compared with the other four predictors. Extensive benchmarking experiments demonstrate that FFMSRes-MutP achieves effective feature fusion and can be explored as a useful approach for predicting disease-associated nsSNPs. The webserver is freely available at http://csbio.njust.edu.cn/bioinf/ffmsresmutp/ for academic use.

Collapse

The structure-based cancer-related single amino acid variation prediction. Sci Rep 2021;11:13599. [PMID: 34193921 PMCID: PMC8245468 DOI: 10.1038/s41598-021-92793-w] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2021] [Accepted: 06/16/2021] [Indexed: 11/09/2022] Open

Salz R, Bouwmeester R, Gabriels R, Degroeve S, Martens L, Volders PJ, 't Hoen PAC. Personalized Proteome: Comparing Proteogenomics and Open Variant Search Approaches for Single Amino Acid Variant Detection. J Proteome Res 2021;20:3353-3364. [PMID: 33998808 PMCID: PMC8280751 DOI: 10.1021/acs.jproteome.1c00264] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2021] [Indexed: 12/30/2022]

Forensic proteomics. Forensic Sci Int Genet 2021;54:102529. [PMID: 34139528 DOI: 10.1016/j.fsigen.2021.102529] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2020] [Revised: 05/06/2021] [Accepted: 05/07/2021] [Indexed: 12/19/2022]

Ge F, Hu J, Zhu YH, Arif M, Yu DJ. TargetMM: Accurate Missense Mutation Prediction by Utilizing Local and Global Sequence Information with Classifier Ensemble. Comb Chem High Throughput Screen 2021;25:38-52. [DOI: 10.2174/1386207323666201204140438] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2020] [Revised: 10/22/2020] [Accepted: 10/26/2020] [Indexed: 11/22/2022]

Jorge GL, Balbuena TS. Identification of novel protein-coding sequences in Eucalyptus grandis plants by high-resolution mass spectrometry. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2020;1869:140594. [PMID: 33385527 DOI: 10.1016/j.bbapap.2020.140594] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Received: 10/27/2020] [Revised: 12/11/2020] [Accepted: 12/23/2020] [Indexed: 10/22/2022]

Shukla N, Siva N, Malik B, Suravajhala P. Current Challenges and Implications of Proteogenomic Approaches in Prostate Cancer. Curr Top Med Chem 2020;20:1968-1980. [PMID: 32703135 DOI: 10.2174/1568026620666200722112450] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2020] [Revised: 05/30/2020] [Accepted: 06/29/2020] [Indexed: 12/16/2022]

Kiseleva O, Zgoda V, Naryzhny S, Poverennaya E. Empowering Shotgun Mass Spectrometry with 2DE: A HepG2 Study. Int J Mol Sci 2020;21:E3813. [PMID: 32471280 PMCID: PMC7312985 DOI: 10.3390/ijms21113813] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2020] [Revised: 05/21/2020] [Accepted: 05/26/2020] [Indexed: 01/07/2023] Open

Yi X, Gong F, Fu Y. Transfer posterior error probability estimation for peptide identification. BMC Bioinformatics 2020;21:173. [PMID: 32366221 PMCID: PMC7199311 DOI: 10.1186/s12859-020-3485-y] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2019] [Accepted: 04/08/2020] [Indexed: 12/31/2022] Open

Abstract

BACKGROUND

In shotgun proteomics, database searching of tandem mass spectra results in a great number of peptide-spectrum matches (PSMs), many of which are false positives. Quality control of PSMs is a multiple hypothesis testing problem, and the false discovery rate (FDR) or the posterior error probability (PEP) is the commonly used statistical confidence measure. PEP, also called local FDR, can evaluate the confidence of individual PSMs and thus is more desirable than FDR, which evaluates the global confidence of a collection of PSMs. Estimation of PEP can be achieved by decomposing the null and alternative distributions of PSM scores as long as the given data is sufficient. However, in many proteomic studies, only a group (subset) of PSMs, e.g. those with specific post-translational modifications, are of interest. The group can be very small, making the direct PEP estimation by the group data inaccurate, especially for the high-score area where the score threshold is taken. Using the whole set of PSMs to estimate the group PEP is inappropriate either, because the null and/or alternative distributions of the group can be very different from those of combined scores.

RESULTS

The transfer PEP algorithm is proposed to more accurately estimate the PEPs of peptide identifications in small groups. Transfer PEP derives the group null distribution through its empirical relationship with the combined null distribution, and estimates the group alternative distribution, as well as the null proportion, using an iterative semi-parametric method. Validated on both simulated data and real proteomic data, transfer PEP showed remarkably higher accuracy than the direct combined and separate PEP estimation methods.

CONCLUSIONS

We presented a novel approach to group PEP estimation for small groups and implemented it for the peptide identification problem in proteomics. The methodology of the approach is in principle applicable to the small-group PEP estimation problems in other fields.

Collapse

Wen B, Li K, Zhang Y, Zhang B. Cancer neoantigen prioritization through sensitive and reliable proteogenomics analysis. Nat Commun 2020;11:1759. [PMID: 32273506 PMCID: PMC7145864 DOI: 10.1038/s41467-020-15456-w] [Citation(s) in RCA: 69] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2019] [Accepted: 03/10/2020] [Indexed: 01/01/2023] Open

Kwon OK, Ha YS, Lee JN, Kim S, Lee H, Chun SY, Kwon TG, Lee S. Comparative Proteome Profiling and Mutant Protein Identification in Metastatic Prostate Cancer Cells by Quantitative Mass Spectrometry-based Proteogenomics. Cancer Genomics Proteomics 2019;16:273-286. [PMID: 31243108 DOI: 10.21873/cgp.20132] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2019] [Revised: 04/16/2019] [Accepted: 04/18/2019] [Indexed: 12/13/2022] Open

Na S, Kim J, Paek E. MODplus: Robust and Unrestrictive Identification of Post-Translational Modifications Using Mass Spectrometry. Anal Chem 2019;91:11324-11333. [PMID: 31365238 DOI: 10.1021/acs.analchem.9b02445] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]

Weldatsadik R, Datta N, Kolmeder C, Vuopio J, Kere J, Wilkman S, Flatt J, Vuento R, Haapasalo K, Keskitalo S, Varjosalo M, Jokiranta T. Pool-seq driven proteogenomic database for Group G Streptococcus. J Proteomics 2019;201:84-92. [DOI: 10.1016/j.jprot.2019.04.015] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2019] [Revised: 03/29/2019] [Accepted: 04/17/2019] [Indexed: 02/07/2023]

Schiza C, Korbakis D, Jarvi K, Diamandis EP, Drabovich AP. Identification of TEX101-associated Proteins Through Proteomic Measurement of Human Spermatozoa Homozygous for the Missense Variant rs35033974. Mol Cell Proteomics 2019;18:338-351. [PMID: 30429210 PMCID: PMC6356071 DOI: 10.1074/mcp.ra118.001170] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2018] [Indexed: 01/19/2023] Open

Abstract

TEX101 is a germ-cell-specific protein and a validated biomarker of male infertility. Mouse TEX101 was found essential for male fertility and was suggested to function as a cell surface chaperone involved in maturation of proteins required for sperm migration and sperm-oocyte interaction. However, the precise functional role of human TEX101 is not known and cannot be studied in vitro due to the lack of human germ cell lines. Here, we genotyped 386 men for a common missense variant rs35033974 of TEX101 and identified 52 heterozygous and 4 homozygous men. We then discovered by targeted proteomics that the variant allele rs35033974 was associated with the near-complete degradation (>97%) of the corresponding G99V TEX101 form and suggested that spermatozoa of homozygous men could serve as a knockdown model to study TEX101 function in humans. Differential proteomic profiling with label-free quantification measured 8,046 proteins in spermatozoa of eight men and identified eight cell-surface and nine secreted testis-specific proteins significantly down-regulated in four patients homozygous for rs35033974. Substantially reduced levels of testis-specific cell-surface proteins potentially involved in sperm migration and sperm-oocyte interaction (including LY6K and ADAM29) were confirmed by targeted proteomics and Western blotting assays. Because recent population-scale genomic data revealed homozygous fathers with biological children, rs35033974 is not a monogenic factor of male infertility in humans. However, median TEX101 levels in seminal plasma were found fivefold lower (p = 0.0005) in heterozygous than in wild-type men of European ancestry. We conclude that spermatozoa of rs35033974 homozygous men have substantially reduced levels of TEX101 and could be used as a model to elucidate the precise TEX101 function, which will advance biology of human reproduction.

Collapse

Wen B, Wang X, Zhang B. PepQuery enables fast, accurate, and convenient proteomic validation of novel genomic alterations. Genome Res 2019;29:485-493. [PMID: 30610011 PMCID: PMC6396417 DOI: 10.1101/gr.235028.118] [Citation(s) in RCA: 57] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2018] [Accepted: 12/28/2018] [Indexed: 12/20/2022]

Abstract

Massively parallel or second-generation sequencing-based genomic studies continuously identify new genomic alterations that may lead to novel protein sequences, which are attractive candidates for disease biomarkers and therapeutic targets after proteomic validation. Integrative proteogenomic methods have been developed to use mass spectrometry (MS)-based proteomics data for such validation. These methods replace the reference sequence database in proteomic database searching with a customized protein database that incorporates sample- or disease-specific sequences derived from DNA or RNA sequencing, thus enabling the identification of novel protein sequences. Although useful, this spectrum-centric approach requires a full evaluation of all possible spectrum-peptide pairs, which is time-consuming, error-prone, and difficult to apply. Here, we present PepQuery, a peptide-centric approach that focuses on only novel DNA or protein sequences of interest. PepQuery allows quick and easy proteomic validation of genomic alterations without customized database construction. We demonstrated the sensitivity and specificity of the approach in validating completely novel proteins, novel splice junctions, and single amino acid variants using simulations and experimental data. Notably, enabling unrestricted modification searching in PepQuery reduced false positives by up to 95%. We implemented PepQuery as both web-based and stand-alone applications. The web version provides direct access to more than half a billion MS/MS spectra from the Clinical Proteomic Tumor Analysis Consortium (CPTAC) and other cancer proteomic studies. The stand-alone version supports batch analysis and user-provided MS/MS data. PepQuery will increase the usage of proteogenomics beyond the proteomics community and will broaden the application of proteogenomics in personalized medicine.

Collapse

Tan Z, Yi X, Carruthers NJ, Stemmer PM, Lubman DM. Single Amino Acid Variant Discovery in Small Numbers of Cells. J Proteome Res 2018;18:417-425. [PMID: 30404448 DOI: 10.1021/acs.jproteome.8b00694] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]

Bubis JA, Levitsky LI, Ivanov MV, Gorshkov MV. Validation of Peptide Identification Results in Proteomics Using Amino Acid Counting. Proteomics 2018;18:e1800117. [PMID: 30307114 DOI: 10.1002/pmic.201800117] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2018] [Revised: 09/12/2018] [Indexed: 01/11/2023]

Robin T, Bairoch A, Müller M, Lisacek F, Lane L. Large-Scale Reanalysis of Publicly Available HeLa Cell Proteomics Data in the Context of the Human Proteome Project. J Proteome Res 2018;17:4160-4170. [DOI: 10.1021/acs.jproteome.8b00392] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]

Yi X, Wang B, An Z, Gong F, Li J, Fu Y. Quality control of single amino acid variations detected by tandem mass spectrometry. J Proteomics 2018;187:144-151. [PMID: 30012419 DOI: 10.1016/j.jprot.2018.07.004] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2018] [Revised: 06/26/2018] [Accepted: 07/02/2018] [Indexed: 02/04/2023]

Abstract

Study of single amino acid variations (SAVs) of proteins, resulting from single nucleotide polymorphisms, is of great importance for understanding the relationships between genotype and phenotype. In mass spectrometry based shotgun proteomics, identification of peptides with SAVs often suffers from high error rates on the variant sites detected. These site errors are due to multiple reasons and can be confirmed by manual inspection or genomic sequencing. Here, we present a software tool, named SAVControl, for site-level quality control of variant peptide identifications. It mainly includes strict false discovery rate control of variant peptide identifications and variant site verification by unrestrictive mass shift relocalization. SAVControl was validated on three colorectal adenocarcinoma cell line datasets with genomic sequencing evidences and tested on a colorectal cancer dataset from The Cancer Genome Atlas. The results show that SAVControl can effectively remove false detections of SAVs.

SIGNIFICANCE

Protein sequence variations caused by single nucleotide polymorphisms (SNPs) are single amino acid variations (SAVs). The investigation of SAVs may provide a chance for understanding the relationships between genotype and phenotype. Mass spectrometry (MS) based proteomics provides a large-scale way to detect SAVs. However, using the current analysis strategy to detect SAVs may lead to high rate of false positives. The SAVControl we present here is a computational workflow and software tool for site-level quality control of SAVs detected by MS. It accesses the confidence of detected variant sites by relocating the mass shift responsible for an SAV to search for alternative interpretations. In addition, it uses a strict false discovery rate control method for variant peptide identifications. The advantages of SAVControl were demonstrated on three colorectal adenocarcinoma cell line datasets and a colorectal cancer dataset. We believe that SAVControl will be a powerful tool for computational proteomics and proteogenomics.

Collapse

Discovery of coding regions in the human genome by integrated proteogenomics analysis workflow. Nat Commun 2018;9:903. [PMID: 29500430 PMCID: PMC5834625 DOI: 10.1038/s41467-018-03311-y] [Citation(s) in RCA: 86] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2017] [Accepted: 02/02/2018] [Indexed: 01/23/2023] Open

Xiao J, Tanca A, Jia B, Yang R, Wang B, Zhang Y, Li J. Metagenomic Taxonomy-Guided Database-Searching Strategy for Improving Metaproteomic Analysis. J Proteome Res 2018;17:1596-1605. [DOI: 10.1021/acs.jproteome.7b00894] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]

Dimitrakopoulos L, Prassas I, Diamandis EP, Charames GS. Onco-proteogenomics: Multi-omics level data integration for accurate phenotype prediction. Crit Rev Clin Lab Sci 2017;54:414-432. [DOI: 10.1080/10408363.2017.1384446] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]

Choong WK, Lih TSM, Chen YJ, Sung TY. Decoding the Effect of Isobaric Substitutions on Identifying Missing Proteins and Variant Peptides in Human Proteome. J Proteome Res 2017;16:4415-4424. [DOI: 10.1021/acs.jproteome.7b00342] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Hernandez-Valladares M, Vaudel M, Selheim F, Berven F, Bruserud Ø. Proteogenomics approaches for studying cancer biology and their potential in the identification of acute myeloid leukemia biomarkers. Expert Rev Proteomics 2017;14:649-663. [DOI: 10.1080/14789450.2017.1352474] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]

Detecting protein variants by mass spectrometry: a comprehensive study in cancer cell-lines. Genome Med 2017;9:62. [PMID: 28716134 PMCID: PMC5514513 DOI: 10.1186/s13073-017-0454-9] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2017] [Accepted: 06/22/2017] [Indexed: 02/07/2023] Open

Li H, Park J, Kim H, Hwang KB, Paek E. Systematic Comparison of False-Discovery-Rate-Controlling Strategies for Proteogenomic Search Using Spike-in Experiments. J Proteome Res 2017;16:2231-2239. [PMID: 28452485 DOI: 10.1021/acs.jproteome.7b00033] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]

Zhang M, Wang B, Xu J, Wang X, Xie L, Zhang B, Li Y, Li J. CanProVar 2.0: An Updated Database of Human Cancer Proteome Variation. J Proteome Res 2017;16:421-432. [PMID: 27977206 PMCID: PMC5515284 DOI: 10.1021/acs.jproteome.6b00505] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]

Tan Z, Nie S, McDermott SP, Wicha MS, Lubman DM. Single Amino Acid Variant Profiles of Subpopulations in the MCF-7 Breast Cancer Cell Line. J Proteome Res 2017;16:842-851. [PMID: 28076950 DOI: 10.1021/acs.jproteome.6b00824] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]

Cao R, Shi Y, Chen S, Ma Y, Chen J, Yang J, Chen G, Shi T. dbSAP: single amino-acid polymorphism database for protein variation detection. Nucleic Acids Res 2017;45:D827-D832. [PMID: 27903894 PMCID: PMC5210569 DOI: 10.1093/nar/gkw1096] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2016] [Revised: 10/25/2016] [Accepted: 11/01/2016] [Indexed: 12/13/2022] Open

Affiliation(s)

Ruifang Cao The Center for Bioinformatics and Computational Biology, Shanghai Key Laboratory of Regulatory Biology, the Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai 200241, China
Yan Shi The Center for Bioinformatics and Computational Biology, Shanghai Key Laboratory of Regulatory Biology, the Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai 200241, China
Shuangguan Chen The Center for Bioinformatics and Computational Biology, Shanghai Key Laboratory of Regulatory Biology, the Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai 200241, China
Yimin Ma The Center for Bioinformatics and Computational Biology, Shanghai Key Laboratory of Regulatory Biology, the Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai 200241, China
Jiajun Chen The Center for Bioinformatics and Computational Biology, Shanghai Key Laboratory of Regulatory Biology, the Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai 200241, China
Juan Yang The Center for Bioinformatics and Computational Biology, Shanghai Key Laboratory of Regulatory Biology, the Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai 200241, China
Geng Chen The Center for Bioinformatics and Computational Biology, Shanghai Key Laboratory of Regulatory Biology, the Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai 200241, China
Tieliu Shi The Center for Bioinformatics and Computational Biology, Shanghai Key Laboratory of Regulatory Biology, the Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai 200241, China

Collapse

Rahman SMJ, Ji X, Zimmerman LJ, Li M, Harris BK, Hoeksema MD, Trenary IA, Zou Y, Qian J, Slebos RJ, Beane J, Spira A, Shyr Y, Eisenberg R, Liebler DC, Young JD, Massion PP. The airway epithelium undergoes metabolic reprogramming in individuals at high risk for lung cancer. JCI Insight 2016;1:e88814. [PMID: 27882349 DOI: 10.1172/jci.insight.88814] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open

Affiliation(s)

S M Jamshedur Rahman Division of Allergy, Pulmonary and Critical Care Medicine, Department of Medicine, Cancer Early Detection and Prevention Initiative, Vanderbilt Ingram Cancer Center
Xiangming Ji Division of Allergy, Pulmonary and Critical Care Medicine, Department of Medicine, Cancer Early Detection and Prevention Initiative, Vanderbilt Ingram Cancer Center
Lisa J Zimmerman Department of Biochemistry
Ming Li Department of Biostatistics, and
Bradford K Harris Division of Allergy, Pulmonary and Critical Care Medicine, Department of Medicine, Cancer Early Detection and Prevention Initiative, Vanderbilt Ingram Cancer Center
Megan D Hoeksema Division of Allergy, Pulmonary and Critical Care Medicine, Department of Medicine, Cancer Early Detection and Prevention Initiative, Vanderbilt Ingram Cancer Center
Irina A Trenary Department of Chemical and Biomolecular Engineering, Vanderbilt University Medical Center, Nashville, Tennessee, USA
Yong Zou Division of Allergy, Pulmonary and Critical Care Medicine, Department of Medicine, Cancer Early Detection and Prevention Initiative, Vanderbilt Ingram Cancer Center
Jun Qian Division of Allergy, Pulmonary and Critical Care Medicine, Department of Medicine, Cancer Early Detection and Prevention Initiative, Vanderbilt Ingram Cancer Center
Robbert Jc Slebos Department of Biochemistry
Jennifer Beane Pulmonary Center and Section of Computational Biomedicine, Department of Medicine, Boston University Medical Center, Boston, Massachusetts, USA
Avrum Spira Pulmonary Center and Section of Computational Biomedicine, Department of Medicine, Boston University Medical Center, Boston, Massachusetts, USA
Yu Shyr Department of Biostatistics, and
Rosana Eisenberg Departments of Pathology, Microbiology, and Immunology
Daniel C Liebler Department of Biochemistry
Jamey D Young Department of Chemical and Biomolecular Engineering, Vanderbilt University Medical Center, Nashville, Tennessee, USA.,Department of Molecular Physiology and Biophysics, and
Pierre P Massion Division of Allergy, Pulmonary and Critical Care Medicine, Department of Medicine, Cancer Early Detection and Prevention Initiative, Vanderbilt Ingram Cancer Center.,Department of Cancer Biology, Vanderbilt University Medical Center, Nashville, Tennessee, USA.,Veterans Affairs, Tennessee Valley Healthcare System, Nashville, Tennessee, USA

Collapse

On the privacy risks of sharing clinical proteomics data. AMIA JOINT SUMMITS ON TRANSLATIONAL SCIENCE PROCEEDINGS. AMIA JOINT SUMMITS ON TRANSLATIONAL SCIENCE 2016;2016:122-31. [PMID: 27595046 PMCID: PMC5009298] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]

Wen B, Xu S, Zhou R, Zhang B, Wang X, Liu X, Xu X, Liu S. PGA: an R/Bioconductor package for identification of novel peptides using a customized database derived from RNA-Seq. BMC Bioinformatics 2016;17:244. [PMID: 27316337 PMCID: PMC4912784 DOI: 10.1186/s12859-016-1133-3] [Citation(s) in RCA: 42] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2015] [Accepted: 06/09/2016] [Indexed: 11/27/2022] Open

Abstract

Background

Peptide identification based upon mass spectrometry (MS) is generally achieved by comparison of the experimental mass spectra with the theoretically digested peptides derived from a reference protein database. Obviously, this strategy could not identify peptide and protein sequences that are absent from a reference database. A customized protein database on the basis of RNA-Seq data is thus proposed to assist with and improve the identification of novel peptides. Correspondingly, development of a comprehensive pipeline, which provides an end-to-end solution for novel peptide detection with the customized protein database, is necessary.

Results

A pipeline with an R package, assigned as a PGA utility, was developed that enables automated treatment to the tandem mass spectrometry (MS/MS) data acquired from different MS platforms and construction of customized protein databases based on RNA-Seq data with or without a reference genome guide. Hence, PGA can identify novel peptides and generate an HTML-based report with a visualized interface. On the basis of a published dataset, PGA was employed to identify peptides, resulting in 636 novel peptides, including 510 single amino acid polymorphism (SAP) peptides, 2 INDEL peptides, 49 splice junction peptides, and 75 novel transcript-derived peptides. The software is freely available from http://bioconductor.org/packages/PGA/, and the example reports are available at http://wenbostar.github.io/PGA/.

Conclusions

The pipeline of PGA, aimed at being platform-independent and easy-to-use, was successfully developed and shown to be capable of identifying novel peptides by searching the customized protein database derived from RNA-Seq data.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-016-1133-3) contains supplementary material, which is available to authorized users.

Collapse

Li Y, Wang X, Cho JH, Shaw TI, Wu Z, Bai B, Wang H, Zhou S, Beach TG, Wu G, Zhang J, Peng J. JUMPg: An Integrative Proteogenomics Pipeline Identifying Unannotated Proteins in Human Brain and Cancer Cells. J Proteome Res 2016;15:2309-20. [PMID: 27225868 DOI: 10.1021/acs.jproteome.6b00344] [Citation(s) in RCA: 62] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]

Sheynkman GM, Shortreed MR, Cesnik AJ, Smith LM. Proteogenomics: Integrating Next-Generation Sequencing and Mass Spectrometry to Characterize Human Proteomic Variation. ANNUAL REVIEW OF ANALYTICAL CHEMISTRY (PALO ALTO, CALIF.) 2016;9:521-45. [PMID: 27049631 PMCID: PMC4991544 DOI: 10.1146/annurev-anchem-071015-041722] [Citation(s) in RCA: 73] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/09/2023]

Xiong Y, Guo Y, Xiao W, Cao Q, Li S, Qi X, Zhang Z, Wang Q, Shui W. An NGS-Independent Strategy for Proteome-Wide Identification of Single Amino Acid Polymorphisms by Mass Spectrometry. Anal Chem 2016;88:2784-91. [PMID: 26810586 DOI: 10.1021/acs.analchem.5b04417] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Cesnik AJ, Shortreed MR, Sheynkman GM, Frey BL, Smith LM. Human Proteomic Variation Revealed by Combining RNA-Seq Proteogenomics and Global Post-Translational Modification (G-PTM) Search Strategy. J Proteome Res 2016;15:800-8. [PMID: 26704769 PMCID: PMC4779408 DOI: 10.1021/acs.jproteome.5b00817] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]

Giese SH, Zickmann F, Renard BY. Detection of Unknown Amino Acid Substitutions Using Error-Tolerant Database Search. Methods Mol Biol 2016;1362:247-264. [PMID: 26519182 DOI: 10.1007/978-1-4939-3106-4_16] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]

Mutant Proteogenomics. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2016;926:77-91. [PMID: 27686807 DOI: 10.1007/978-3-319-42316-6_6] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]

Askenazi M, Ruggles KV, Fenyö D. PGx: Putting Peptides to BED. J Proteome Res 2015;15:795-9. [PMID: 26638927 PMCID: PMC4782174 DOI: 10.1021/acs.jproteome.5b00870] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Ruggles KV, Tang Z, Wang X, Grover H, Askenazi M, Teubl J, Cao S, McLellan MD, Clauser KR, Tabb DL, Mertins P, Slebos R, Erdmann-Gilmore P, Li S, Gunawardena HP, Xie L, Liu T, Zhou JY, Sun S, Hoadley KA, Perou CM, Chen X, Davies SR, Maher CA, Kinsinger CR, Rodland KD, Zhang H, Zhang Z, Ding L, Townsend RR, Rodriguez H, Chan D, Smith RD, Liebler DC, Carr SA, Payne S, Ellis MJ, Fenyő D. An Analysis of the Sensitivity of Proteogenomic Mapping of Somatic Mutations and Novel Splicing Events in Cancer. Mol Cell Proteomics 2015;15:1060-71. [PMID: 26631509 DOI: 10.1074/mcp.m115.056226] [Citation(s) in RCA: 90] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2015] [Indexed: 11/06/2022] Open

Affiliation(s)

Kelly V Ruggles From the ‡New York University School of Medicine, New York, NY
Zuojian Tang From the ‡New York University School of Medicine, New York, NY
Xuya Wang From the ‡New York University School of Medicine, New York, NY
Himanshu Grover From the ‡New York University School of Medicine, New York, NY
Manor Askenazi §Biomedical Hosting, LLC, Arlington, MA
Jennifer Teubl From the ‡New York University School of Medicine, New York, NY
Song Cao ¶Washington University in St. Louis, St. Louis, MO
Michael D McLellan ¶Washington University in St. Louis, St. Louis, MO
Karl R Clauser ‖Broad Institute of Harvard and MIT, Cambridge, MA
David L Tabb **Vanderbilt University School of Medicine, Nashville, TN
Philipp Mertins ‖Broad Institute of Harvard and MIT, Cambridge, MA
Robbert Slebos **Vanderbilt University School of Medicine, Nashville, TN
Petra Erdmann-Gilmore ¶Washington University in St. Louis, St. Louis, MO
Shunqiang Li ¶Washington University in St. Louis, St. Louis, MO
Harsha P Gunawardena ‡‡Universtiy of North Carolina School of Medicine, Chapel Hill, NC
Ling Xie ‡‡Universtiy of North Carolina School of Medicine, Chapel Hill, NC
Tao Liu §§Pacific Northwest National Laboratory, Richland, WA
Jian-Ying Zhou ¶¶Johns Hopkins University, Baltimore, MD
Shisheng Sun ¶¶Johns Hopkins University, Baltimore, MD
Katherine A Hoadley ‡‡Universtiy of North Carolina School of Medicine, Chapel Hill, NC
Charles M Perou ‡‡Universtiy of North Carolina School of Medicine, Chapel Hill, NC
Xian Chen ‡‡Universtiy of North Carolina School of Medicine, Chapel Hill, NC
Sherri R Davies ¶Washington University in St. Louis, St. Louis, MO
Christopher A Maher ¶Washington University in St. Louis, St. Louis, MO
Christopher R Kinsinger ‖‖Office of Cancer Clinical Proteomics Research, National Cancer Institute, Bethesda, MD
Karen D Rodland §§Pacific Northwest National Laboratory, Richland, WA
Hui Zhang ¶¶Johns Hopkins University, Baltimore, MD
Zhen Zhang ¶¶Johns Hopkins University, Baltimore, MD
Li Ding ¶Washington University in St. Louis, St. Louis, MO
R Reid Townsend ¶Washington University in St. Louis, St. Louis, MO
Henry Rodriguez ‖‖Office of Cancer Clinical Proteomics Research, National Cancer Institute, Bethesda, MD
Daniel Chan ¶¶Johns Hopkins University, Baltimore, MD
Richard D Smith §§Pacific Northwest National Laboratory, Richland, WA
Daniel C Liebler **Vanderbilt University School of Medicine, Nashville, TN
Steven A Carr ‖Broad Institute of Harvard and MIT, Cambridge, MA
Samuel Payne §§Pacific Northwest National Laboratory, Richland, WA;
Matthew J Ellis ¶Washington University in St. Louis, St. Louis, MO;
David Fenyő From the ‡New York University School of Medicine, New York, NY;

Collapse

Shukla HD, Mahmood J, Vujaskovic Z. Integrated proteo-genomic approach for early diagnosis and prognosis of cancer. Cancer Lett 2015;369:28-36. [DOI: 10.1016/j.canlet.2015.08.003] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2015] [Revised: 08/05/2015] [Accepted: 08/05/2015] [Indexed: 12/28/2022]

Choong WK, Chang HY, Chen CT, Tsai CF, Hsu WL, Chen YJ, Sung TY. Informatics View on the Challenges of Identifying Missing Proteins from Shotgun Proteomics. J Proteome Res 2015;14:5396-407. [DOI: 10.1021/acs.jproteome.5b00482] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]

Stewart PA, Parapatics K, Welsh EA, Müller AC, Cao H, Fang B, Koomen JM, Eschrich SA, Bennett KL, Haura EB. A Pilot Proteogenomic Study with Data Integration Identifies MCT1 and GLUT1 as Prognostic Markers in Lung Adenocarcinoma. PLoS One 2015;10:e0142162. [PMID: 26539827 PMCID: PMC4634858 DOI: 10.1371/journal.pone.0142162] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2015] [Accepted: 10/19/2015] [Indexed: 11/19/2022] Open

Song Y, Laskay ÜA, Vilcins IME, Barbour AG, Wysocki VH. Top-down-assisted bottom-up method for homologous protein sequencing: hemoglobin from 33 bird species. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2015;26:1875-84. [PMID: 26111519 PMCID: PMC6467653 DOI: 10.1007/s13361-015-1185-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/23/2015] [Revised: 05/08/2015] [Accepted: 05/08/2015] [Indexed: 05/12/2023]

Woo S, Cha SW, Bonissone S, Na S, Tabb DL, Pevzner PA, Bafna V. Advanced Proteogenomic Analysis Reveals Multiple Peptide Mutations and Complex Immunoglobulin Peptides in Colon Cancer. J Proteome Res 2015;14:3555-67. [PMID: 26139413 DOI: 10.1021/acs.jproteome.5b00264] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]