Reference Citation Analysis

For: Serang O, Käll L. Solution to Statistical Challenges in Proteomics Is More Statistics, Not Less. J Proteome Res 2015;14:4099-103. [PMID: 26257019 DOI: 10.1021/acs.jproteome.5b00568] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.3] [Indexed: 11/29/2022]

Cited by Other Article(s)

Henke AN, Chilukuri S, Langan LM, Brooks BW. Reporting and reproducibility: Proteomics of fish models in environmental toxicology and ecotoxicology. Sci Total Environ 2024;912:168455. [PMID: 37979845 DOI: 10.1016/j.scitotenv.2023.168455] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/05/2023] [Revised: 11/06/2023] [Accepted: 11/07/2023] [Indexed: 11/20/2023]

Reanalysis of ProteomicsDB Using an Accurate, Sensitive, and Scalable False Discovery Rate Estimation Approach for Protein Groups. Mol Cell Proteomics 2022;21:100437. [PMID: 36328188 PMCID: PMC9718969 DOI: 10.1016/j.mcpro.2022.100437] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Received: 06/01/2022] [Revised: 10/16/2022] [Accepted: 10/28/2022] [Indexed: 11/07/2022]

Abstract

Estimating false discovery rates (FDRs) of protein identification continues to be an important topic in mass spectrometry-based proteomics, particularly when analyzing very large datasets. One performant method for this purpose is the Picked Protein FDR approach which is based on a target-decoy competition strategy on the protein level that ensures that FDRs scale to large datasets. Here, we present an extension to this method that can also deal with protein groups, that is, proteins that share common peptides such as protein isoforms of the same gene. To obtain well-calibrated FDR estimates that preserve protein identification sensitivity, we introduce two novel ideas. First, the picked group target-decoy and second, the rescued subset grouping strategies. Using entrapment searches and simulated data for validation, we demonstrate that the new Picked Protein Group FDR method produces accurate protein group-level FDR estimates regardless of the size of the data set. The validation analysis also uncovered that applying the commonly used Occam's razor principle leads to anticonservative FDR estimates for large datasets. This is not the case for the Picked Protein Group FDR method. Reanalysis of deep proteomes of 29 human tissues showed that the new method identified up to 4% more protein groups than MaxQuant. Applying the method to the reanalysis of the entire human section of ProteomicsDB led to the identification of 18,000 protein groups at 1% protein group-level FDR. The analysis also showed that about 1250 genes were represented by ≥2 identified protein groups. To make the method accessible to the proteomics community, we provide a software tool including a graphical user interface that enables merging results from multiple MaxQuant searches into a single list of identified and quantified protein groups.
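
The protein-level target-decoy competition that this abstract builds on can be illustrated with a short sketch. This is not the authors' Picked Protein Group FDR software and it does not handle protein groups. It only shows the general idea of keeping the better-scoring member of each target-decoy pair and then estimating FDR from the surviving decoys. The function name `picked_fdr`, the pairing of targets and decoys by a shared key, the score inputs, and the tie-breaking rules are all assumptions made for illustration.

```python
from typing import Dict, List, Tuple


def picked_fdr(target_scores: Dict[str, float],
               decoy_scores: Dict[str, float]) -> List[Tuple[str, float, float]]:
    """Return (protein, score, q_value) for targets that win their pair.

    Assumption: each decoy is stored under the same key as the target
    it was derived from, and higher scores are better.
    """
    winners = []  # entries are (score, is_decoy, name)
    for name in set(target_scores) | set(decoy_scores):
        t = target_scores.get(name, float("-inf"))
        d = decoy_scores.get(name, float("-inf"))
        # Pairwise competition: only the better-scoring member survives.
        # Ties go to the decoy, an arbitrary but conservative choice.
        if t > d:
            winners.append((t, False, name))
        else:
            winners.append((d, True, name))

    # Best score first; on equal scores decoys are ranked ahead of targets.
    winners.sort(key=lambda w: (-w[0], not w[1], w[2]))

    # Estimated FDR at each rank = decoys so far / targets so far.
    fdrs = []
    n_targets = n_decoys = 0
    for _, is_decoy, _ in winners:
        if is_decoy:
            n_decoys += 1
        else:
            n_targets += 1
        fdrs.append(min(1.0, n_decoys / max(n_targets, 1)))

    # q-value = smallest FDR at this rank or any worse rank.
    results = []
    running_min = float("inf")
    for (score, is_decoy, name), fdr in zip(reversed(winners), reversed(fdrs)):
        running_min = min(running_min, fdr)
        if not is_decoy:
            results.append((name, score, running_min))
    results.reverse()
    return results


if __name__ == "__main__":
    targets = {"P1": 10.0, "P2": 8.0, "P3": 3.0, "P4": 1.0}
    decoys = {"P1": 2.0, "P2": 1.0, "P3": 5.0, "P4": 0.5}
    for row in picked_fdr(targets, decoys):
        print(row)
```

In the toy input, P3 loses to its decoy and is dropped, so the decoy count reflects only pairs in which the decoy actually outscored its target.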

Perez-Riverol Y. Proteomic repository data submission, dissemination, and reuse: key messages. Expert Rev Proteomics 2022;19:297-310. [PMID: 36529941 PMCID: PMC7614296 DOI: 10.1080/14789450.2022.2160324] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 12/23/2022]

Aggarwal S, Raj A, Kumar D, Dash D, Yadav AK. False discovery rate: the Achilles' heel of proteogenomics. Brief Bioinform 2022;23:6582880. [PMID: 35534181 DOI: 10.1093/bib/bbac163] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Received: 12/30/2021] [Revised: 03/14/2022] [Accepted: 04/12/2022] [Indexed: 12/25/2022]

Proteome Discoverer-A Community Enhanced Data Processing Suite for Protein Informatics. Proteomes 2021;9:proteomes9010015. [PMID: 33806881 PMCID: PMC8006021 DOI: 10.3390/proteomes9010015] [Citation(s) in RCA: 93] [Impact Index Per Article: 31.0] [Received: 01/25/2021] [Revised: 03/18/2021] [Accepted: 03/20/2021] [Indexed: 01/01/2023]

Sperk M, van Domselaar R, Rodriguez JE, Mikaeloff F, Sá Vinhas B, Saccon E, Sönnerborg A, Singh K, Gupta S, Végvári Á, Neogi U. Utility of Proteomics in Emerging and Re-Emerging Infectious Diseases Caused by RNA Viruses. J Proteome Res 2020;19:4259-4274. [PMID: 33095583 PMCID: PMC7640957 DOI: 10.1021/acs.jproteome.0c00380] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Received: 06/01/2020] [Indexed: 12/21/2022]

Affiliation(s)

Maike Sperk Division of Clinical Microbiology, Department of Laboratory Medicine, Karolinska Institute, ANA Futura, Campus Flemingsberg, Stockholm 14152, Sweden
Robert van Domselaar Division of Infectious Diseases, Department of Medicine Huddinge, Karolinska Institute, ANA Futura, Campus Flemingsberg, Stockholm 14152, Sweden
Jimmy Esneider Rodriguez Division of Chemistry I, Department of Medical Biochemistry and Biophysics, Karolinska Institute, Stockholm 14152, Sweden
Flora Mikaeloff Division of Clinical Microbiology, Department of Laboratory Medicine, Karolinska Institute, ANA Futura, Campus Flemingsberg, Stockholm 14152, Sweden
Beatriz Sá Vinhas Division of Clinical Microbiology, Department of Laboratory Medicine, Karolinska Institute, ANA Futura, Campus Flemingsberg, Stockholm 14152, Sweden
Elisa Saccon Division of Clinical Microbiology, Department of Laboratory Medicine, Karolinska Institute, ANA Futura, Campus Flemingsberg, Stockholm 14152, Sweden
Anders Sönnerborg Division of Clinical Microbiology, Department of Laboratory Medicine, Karolinska Institute, ANA Futura, Campus Flemingsberg, Stockholm 14152, Sweden; Division of Infectious Diseases, Department of Medicine Huddinge, Karolinska Institute, ANA Futura, Campus Flemingsberg, Stockholm 14152, Sweden
Kamal Singh Department of Molecular Microbiology and Immunology and the Bond Life Science Center, University of Missouri, Columbia, Missouri 65211, United States
Soham Gupta Division of Clinical Microbiology, Department of Laboratory Medicine, Karolinska Institute, ANA Futura, Campus Flemingsberg, Stockholm 14152, Sweden
Ákos Végvári Division of Chemistry I, Department of Medical Biochemistry and Biophysics, Karolinska Institute, Stockholm 14152, Sweden
Ujjwal Neogi Division of Clinical Microbiology, Department of Laboratory Medicine, Karolinska Institute, ANA Futura, Campus Flemingsberg, Stockholm 14152, Sweden; Department of Molecular Microbiology and Immunology and the Bond Life Science Center, University of Missouri, Columbia, Missouri 65211, United States

Agten A, Van Houtven J, Askenazi M, Burzykowski T, Laukens K, Valkenborg D. Visualizing the agreement of peptide assignments between different search engines. J Mass Spectrom 2020;55:e4471. [PMID: 31713933 DOI: 10.1002/jms.4471] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Received: 07/04/2019] [Revised: 10/23/2019] [Accepted: 10/28/2019] [Indexed: 06/10/2023]

Handler DCL, Haynes PA. Statistics in Proteomics: A Meta-analysis of 100 Proteomics Papers Published in 2019. J Am Soc Mass Spectrom 2020;31:1337-1343. [PMID: 32324388 DOI: 10.1021/jasms.9b00142] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Indexed: 06/11/2023]

Thomas SP, Haws SA, Borth LE, Denu JM. A practical guide for analysis of histone post-translational modifications by mass spectrometry: Best practices and pitfalls. Methods 2019;184:53-60. [PMID: 31816396 DOI: 10.1016/j.ymeth.2019.12.001] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Received: 09/14/2019] [Revised: 11/23/2019] [Accepted: 12/02/2019] [Indexed: 02/06/2023]

LeDuc RD, Fellers RT, Early BP, Greer JB, Shams DP, Thomas PM, Kelleher NL. Accurate Estimation of Context-Dependent False Discovery Rates in Top-Down Proteomics. Mol Cell Proteomics 2019;18:796-805. [PMID: 30647073 PMCID: PMC6442365 DOI: 10.1074/mcp.ra118.000993] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Received: 07/27/2018] [Revised: 01/04/2019] [Indexed: 11/06/2022]

Łącki MK, Lermyte F, Miasojedow B, Startek MP, Sobott F, Valkenborg D, Gambin A. masstodon: A Tool for Assigning Peaks and Modeling Electron Transfer Reactions in Top-Down Mass Spectrometry. Anal Chem 2019;91:1801-1807. [DOI: 10.1021/acs.analchem.8b01479] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Indexed: 11/28/2022]

Henning J, Tostengard A, Smith R. A Peptide-Level Fully Annotated Data Set for Quantitative Evaluation of Precursor-Aware Mass Spectrometry Data Processing Algorithms. J Proteome Res 2018;18:392-398. [PMID: 30394759 DOI: 10.1021/acs.jproteome.8b00659] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Indexed: 12/31/2022]

Borges H, Guibert R, Permiakova O, Burger T. Distinguishing between Spectral Clustering and Cluster Analysis of Mass Spectra. J Proteome Res 2018;18:571-573. [PMID: 30394750 DOI: 10.1021/acs.jproteome.8b00516] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Indexed: 11/29/2022]

Bittremieux W, Tabb DL, Impens F, Staes A, Timmerman E, Martens L, Laukens K. Quality control in mass spectrometry-based proteomics. Mass Spectrom Rev 2018;37:697-711. [PMID: 28802010 DOI: 10.1002/mas.21544] [Citation(s) in RCA: 67] [Impact Index Per Article: 11.2] [Received: 03/04/2017] [Revised: 07/24/2017] [Accepted: 07/24/2017] [Indexed: 05/21/2023]

The M, Edfors F, Perez-Riverol Y, Payne SH, Hoopmann MR, Palmblad M, Forsström B, Käll L. A Protein Standard That Emulates Homology for the Characterization of Protein Inference Algorithms. J Proteome Res 2018;17:1879-1886. [PMID: 29631402 DOI: 10.1021/acs.jproteome.7b00899] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Indexed: 11/28/2022]

Burger T. Gentle Introduction to the Statistical Foundations of False Discovery Rate in Quantitative Proteomics. J Proteome Res 2017;17:12-22. [DOI: 10.1021/acs.jproteome.7b00170] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.6] [Indexed: 11/29/2022]

Dowsey AW. The need for statistical contributions to bioinformatics at scale, with illustration to mass spectrometry. Stat Model 2017. [DOI: 10.1177/1471082x17708519] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/16/2022]

Rosenberger G, Bludau I, Schmitt U, Heusel M, Hunter CL, Liu Y, MacCoss MJ, MacLean BX, Nesvizhskii AI, Pedrioli PGA, Reiter L, Röst HL, Tate S, Ting YS, Collins BC, Aebersold R. Statistical control of peptide and protein error rates in large-scale targeted data-independent acquisition analyses. Nat Methods 2017;14:921-927. [PMID: 28825704 PMCID: PMC5581544 DOI: 10.1038/nmeth.4398] [Citation(s) in RCA: 145] [Impact Index Per Article: 20.7] [Received: 09/12/2016] [Accepted: 07/07/2017] [Indexed: 12/18/2022]

Affiliation(s)

George Rosenberger Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland; PhD Program in Systems Biology, University of Zurich and ETH Zurich, Zurich, Switzerland
Isabell Bludau Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland; PhD Program in Systems Biology, University of Zurich and ETH Zurich, Zurich, Switzerland
Uwe Schmitt ID Scientific IT Services, ETH Zurich, Zurich, Switzerland
Moritz Heusel Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland; PhD program in Molecular and Translational Biomedicine, Competence Center Personalized Medicine (CC-PM), ETH Zurich and University of Zurich, Zurich, Switzerland
Christie L Hunter SCIEX, Redwood City, California, USA
Yansheng Liu Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
Michael J MacCoss Department of Genome Sciences, University of Washington, Seattle, Washington, USA
Brendan X MacLean Department of Genome Sciences, University of Washington, Seattle, Washington, USA
Alexey I Nesvizhskii Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, USA; Department of Pathology, University of Michigan, Ann Arbor, Michigan, USA
Patrick G A Pedrioli Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
Lukas Reiter Biognosys, Schlieren, Switzerland
Hannes L Röst Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
Stephen Tate SCIEX, Concord, Ontario, Canada
Ying S Ting Department of Genome Sciences, University of Washington, Seattle, Washington, USA
Ben C Collins Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
Ruedi Aebersold Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland; Faculty of Science, University of Zurich, Zurich, Switzerland

Liu X, Guo Z, Sun H, Li W, Sun W. Comprehensive Map and Functional Annotation of Human Pituitary and Thyroid Proteome. J Proteome Res 2017;16:2680-2691. [PMID: 28678506 DOI: 10.1021/acs.jproteome.6b00914] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Indexed: 01/31/2023]

Creation of Reusable Bioinformatics Workflows for Reproducible Analysis of LC-MS Proteomics Data. 2017. [DOI: 10.1007/978-1-4939-7119-0_19] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1]

van Ooijen MP, Jong VL, Eijkemans MJC, Heck AJR, Andeweg AC, Binai NA, van den Ham HJ. Identification of differentially expressed peptides in high-throughput proteomics data. Brief Bioinform 2017;19:971-981. [DOI: 10.1093/bib/bbx031] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Received: 12/06/2016] [Indexed: 12/25/2022]

Zhang B, Pirmoradian M, Zubarev R, Käll L. Covariation of Peptide Abundances Accurately Reflects Protein Concentration Differences. Mol Cell Proteomics 2017;16:936-948. [PMID: 28302922 PMCID: PMC5417831 DOI: 10.1074/mcp.o117.067728] [Citation(s) in RCA: 50] [Impact Index Per Article: 7.1] [Received: 02/06/2017] [Revised: 03/13/2017] [Indexed: 12/29/2022]

Abstract

Most implementations of mass spectrometry-based proteomics involve enzymatic digestion of proteins, expanding the analysis to multiple proteolytic peptides for each protein. Currently, there is no consensus of how to summarize peptides' abundances to protein concentrations, and such efforts are complicated by the fact that error control normally is applied to the identification process, and do not directly control errors linking peptide abundance measures to protein concentration. Peptides resulting from suboptimal digestion or being partially modified are not representative of the protein concentration. Without a mechanism to remove such unrepresentative peptides, their abundance adversely impacts the estimation of their protein's concentration. Here, we present a relative quantification approach, Diffacto, that applies factor analysis to extract the covariation of peptides' abundances. The method enables a weighted geometrical average summarization and automatic elimination of incoherent peptides. We demonstrate, based on a set of controlled label-free experiments using standard mixtures of proteins, that the covariation structure extracted by the factor analysis accurately reflects protein concentrations. In the 1% peptide-spectrum match-level FDR data set, as many as 11% of the peptides have abundance differences incoherent with the other peptides attributed to the same protein. If not controlled, such contradicting peptide abundance have a severe impact on protein quantifications. When adding the quantities of each protein's three most abundant peptides, we note as many as 14% of the proteins being estimated as having a negative correlation with their actual concentration differences between samples. Diffacto reduced the amount of such obviously incorrectly quantified proteins to 1.6%. Furthermore, by analyzing clinical data sets from two breast cancer studies, our method revealed the persistent proteomic signatures linked to three subtypes of breast cancer. We conclude that Diffacto can facilitate the interpretation and enhance the utility of most types of proteomics data.

Audain E, Uszkoreit J, Sachsenberg T, Pfeuffer J, Liang X, Hermjakob H, Sanchez A, Eisenacher M, Reinert K, Tabb DL, Kohlbacher O, Perez-Riverol Y. In-depth analysis of protein inference algorithms using multiple search engines and well-defined metrics. J Proteomics 2017;150:170-182. [DOI: 10.1016/j.jprot.2016.08.002] [Citation(s) in RCA: 47] [Impact Index Per Article: 6.7] [Received: 05/20/2016] [Revised: 07/30/2016] [Accepted: 08/02/2016] [Indexed: 12/24/2022]

The M, MacCoss MJ, Noble WS, Käll L. Fast and Accurate Protein False Discovery Rates on Large-Scale Proteomics Data Sets with Percolator 3.0. J Am Soc Mass Spectrom 2016;27:1719-1727. [PMID: 27572102 PMCID: PMC5059416 DOI: 10.1007/s13361-016-1460-7] [Citation(s) in RCA: 240] [Impact Index Per Article: 30.0] [Received: 04/01/2016] [Revised: 06/15/2016] [Accepted: 07/20/2016] [Indexed: 05/21/2023]

The M, Tasnim A, Käll L. How to talk about protein-level false discovery rates in shotgun proteomics. Proteomics 2016;16:2461-9. [PMID: 27503675 PMCID: PMC5096025 DOI: 10.1002/pmic.201500431] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.5] [Received: 11/12/2015] [Revised: 05/12/2016] [Accepted: 07/20/2016] [Indexed: 12/04/2022]

Kumar D, Bansal G, Narang A, Basak T, Abbas T, Dash D. Integrating transcriptome and proteome profiling: Strategies and applications. Proteomics 2016;16:2533-2544. [PMID: 27343053 DOI: 10.1002/pmic.201600140] [Citation(s) in RCA: 106] [Impact Index Per Article: 13.3] [Received: 03/16/2016] [Revised: 06/12/2016] [Accepted: 06/23/2016] [Indexed: 12/17/2022]

Wright JC, Choudhary JS. DecoyPyrat: Fast Non-redundant Hybrid Decoy Sequence Generation for Large Scale Proteomics. J Proteomics Bioinform 2016;9:176-180. [PMID: 27418748 PMCID: PMC4941923 DOI: 10.4172/jpb.1000404] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Indexed: 11/09/2022]

Abstract

Accurate statistical evaluation of sequence database peptide identifications from tandem mass spectra is essential in mass spectrometry based proteomics experiments. These statistics are dependent on accurately modelling random identifications. The target-decoy approach has risen to become the de facto approach to calculating FDR in proteomic datasets. The main principle of this approach is to search a set of decoy protein sequences that emulate the size and composition of the target protein sequences searched whilst not matching real proteins in the sample. To do this, it is commonplace to reverse or shuffle the proteins and peptides in the target database. However, these approaches have their drawbacks and limitations. A key confounding issue is the peptide redundancy between target and decoy databases leading to inaccurate FDR estimation. This inaccuracy is further amplified at the protein level and when searching large sequence databases such as those used for proteogenomics. Here, we present a unifying hybrid method to quickly and efficiently generate decoy sequences with minimal overlap between target and decoy peptides. We show that applying a reversed decoy approach can produce up to 5% peptide redundancy and many more additional peptides will have the exact same precursor mass as a target peptide. Our hybrid method addresses both these issues by first switching proteolytic cleavage sites with preceding amino acid, reversing the database and then shuffling any redundant sequences. This flexible hybrid method reduces the peptide overlap between target and decoy peptides to about 1% of peptides, making a more robust decoy model suitable for large search spaces. We also demonstrate the anti-conservative effect of redundant peptides on the calculation of q-values in mouse brain tissue data.
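
The three steps named in this abstract (switch cleavage sites with the preceding residue, reverse, then shuffle decoy peptides that still collide with a target peptide) can be sketched as follows. This is an illustrative reading of the abstract, not the DecoyPyrat code. Trypsin-like cleavage after K or R with no proline rule and no missed cleavages, the `DECOY_` prefix, the fixed random seed, and the retry limit are all assumptions, as are the function names.

```python
import random
from typing import Dict, List, Set

CLEAVAGE_SITES = "KR"  # assumption: trypsin-like cleavage after K or R


def digest(sequence: str) -> List[str]:
    """Split after every cleavage residue (no missed cleavages, no proline rule)."""
    peptides, start = [], 0
    for i, aa in enumerate(sequence):
        if aa in CLEAVAGE_SITES:
            peptides.append(sequence[start:i + 1])
            start = i + 1
    if start < len(sequence):
        peptides.append(sequence[start:])
    return peptides


def switch_and_reverse(sequence: str) -> str:
    """Swap each cleavage residue with the residue before it, then reverse."""
    residues = list(sequence)
    for i in range(1, len(residues)):
        if residues[i] in CLEAVAGE_SITES:
            residues[i - 1], residues[i] = residues[i], residues[i - 1]
    return "".join(reversed(residues))


def shuffle_peptide(peptide: str, rng: random.Random) -> str:
    """Shuffle all residues except the last, so the C-terminal residue stays put."""
    body = list(peptide[:-1])
    rng.shuffle(body)
    return "".join(body) + peptide[-1]


def make_decoys(targets: Dict[str, str], seed: int = 0,
                max_tries: int = 20) -> Dict[str, str]:
    """Build one decoy per target and shuffle decoy peptides that match a target peptide."""
    rng = random.Random(seed)
    target_peptides: Set[str] = {p for s in targets.values() for p in digest(s)}
    decoys = {}
    for name, seq in targets.items():
        fixed = []
        for pep in digest(switch_and_reverse(seq)):
            tries = 0
            # Very short peptides cannot be changed by shuffling; skip them.
            while pep in target_peptides and len(pep) > 2 and tries < max_tries:
                pep = shuffle_peptide(pep, rng)
                tries += 1
            fixed.append(pep)
        decoys["DECOY_" + name] = "".join(fixed)
    return decoys


if __name__ == "__main__":
    print(switch_and_reverse("ACDKEFGR"))  # GRFEDKCA
    print(make_decoys({"P1": "ACDKEFGR"}))
```

Each decoy keeps the length and amino acid composition of its target, because both the switch-and-reverse step and the shuffle only permute residues.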

Bogdanow B, Zauber H, Selbach M. Systematic Errors in Peptide and Protein Identification and Quantification by Modified Peptides. Mol Cell Proteomics 2016;15:2791-801. [PMID: 27215553 DOI: 10.1074/mcp.m115.055103] [Citation(s) in RCA: 49] [Impact Index Per Article: 6.1] [Received: 08/28/2015] [Indexed: 01/17/2023]

Maes E, Kelchtermans P, Bittremieux W, De Grave K, Degroeve S, Hooyberghs J, Mertens I, Baggerman G, Ramon J, Laukens K, Martens L, Valkenborg D. Designing biomedical proteomics experiments: state-of-the-art and future perspectives. Expert Rev Proteomics 2016;13:495-511. [PMID: 27031651 DOI: 10.1586/14789450.2016.1172967] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Indexed: 11/08/2022]

Affiliation(s)

Evelyne Maes Applied Bio & molecular systems, VITO, Mol, Belgium; CFP, University of Antwerp, Antwerp, Belgium
Pieter Kelchtermans CFP, University of Antwerp, Antwerp, Belgium; Medical Biotechnology Center, VIB, Ghent, Belgium; Department of Biochemistry, Ghent University, Ghent, Belgium; Bioinformatics Institute Ghent, Ghent University, Ghent, Belgium
Wout Bittremieux Department of Mathematics and Computer Science, University of Antwerp, Antwerp, Belgium; Biomedical Informatics Research Center Antwerp (biomina), University of Antwerp/Antwerp University Hospital, Antwerp, Belgium
Kurt De Grave Department of Computer Science, KU Leuven, Leuven, Belgium
Sven Degroeve Medical Biotechnology Center, VIB, Ghent, Belgium; Department of Biochemistry, Ghent University, Ghent, Belgium; Bioinformatics Institute Ghent, Ghent University, Ghent, Belgium
Jef Hooyberghs Applied Bio & molecular systems, VITO, Mol, Belgium
Inge Mertens Applied Bio & molecular systems, VITO, Mol, Belgium; CFP, University of Antwerp, Antwerp, Belgium
Geert Baggerman Applied Bio & molecular systems, VITO, Mol, Belgium; CFP, University of Antwerp, Antwerp, Belgium
Jan Ramon Department of Computer Science, KU Leuven, Leuven, Belgium; INRIA, Lille, France
Kris Laukens Department of Mathematics and Computer Science, University of Antwerp, Antwerp, Belgium; Biomedical Informatics Research Center Antwerp (biomina), University of Antwerp/Antwerp University Hospital, Antwerp, Belgium
Lennart Martens Medical Biotechnology Center, VIB, Ghent, Belgium; Department of Biochemistry, Ghent University, Ghent, Belgium; Bioinformatics Institute Ghent, Ghent University, Ghent, Belgium
Dirk Valkenborg Applied Bio & molecular systems, VITO, Mol, Belgium; CFP, University of Antwerp, Antwerp, Belgium; Interuniversity Institute for Biostatistics and statistical Bioinformatics, Hasselt University, Hasselt, Belgium

Blattmann P, Heusel M, Aebersold R. SWATH2stats: An R/Bioconductor Package to Process and Convert Quantitative SWATH-MS Proteomics Data for Downstream Analysis Tools. PLoS One 2016;11:e0153160. [PMID: 27054327 PMCID: PMC4824525 DOI: 10.1371/journal.pone.0153160] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.0] [Received: 02/07/2016] [Accepted: 03/24/2016] [Indexed: 11/19/2022]

The M, Käll L. MaRaCluster: A Fragment Rarity Metric for Clustering Fragment Spectra in Shotgun Proteomics. J Proteome Res 2016;15:713-20. [PMID: 26653874 DOI: 10.1021/acs.jproteome.5b00749] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Indexed: 11/28/2022]

Zhang B, Käll L, Zubarev RA. DeMix-Q: Quantification-Centered Data Processing Workflow. Mol Cell Proteomics 2016;15:1467-78. [PMID: 26729709 DOI: 10.1074/mcp.o115.055475] [Citation(s) in RCA: 55] [Impact Index Per Article: 6.9] [Received: 09/16/2015] [Indexed: 12/31/2022]

Gatto L, Hansen KD, Hoopmann MR, Hermjakob H, Kohlbacher O, Beyer A. Testing and Validation of Computational Methods for Mass Spectrometry. J Proteome Res 2015;15:809-14. [PMID: 26549429 DOI: 10.1021/acs.jproteome.5b00852] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Indexed: 12/15/2022]

Ezkurdia I, Calvo E, Del Pozo A, Vázquez J, Valencia A, Tress ML. The potential clinical impact of the release of two drafts of the human proteome. Expert Rev Proteomics 2015;12:579-93. [PMID: 26496066 PMCID: PMC4732427 DOI: 10.1586/14789450.2015.1103186] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Indexed: 12/20/2022]