Reference Citation Analysis

For: Kim S, Gupta N, Bandeira N, Pevzner PA. Spectral dictionaries: Integrating de novo peptide sequencing with database search of tandem mass spectra. Mol Cell Proteomics 2008;8:53-69. [PMID: 18703573 DOI: 10.1074/mcp.m800103-mcp200] [Citation(s) in RCA: 68] [Impact Index Per Article: 4.3] [Indexed: 11/06/2022]

Cited by Other Article(s)

Mao J, Zhu H, Liu L, Fang Z, Dong M, Qin H, Ye M. MS-Decipher: a user-friendly proteome database search software with an emphasis on deciphering the spectra of O-linked glycopeptides. Bioinformatics 2022;38:1911-1919. [PMID: 35020790 DOI: 10.1093/bioinformatics/btac014] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 07/03/2021] [Revised: 12/29/2021] [Accepted: 01/08/2022] [Indexed: 02/03/2023]

Abstract

MOTIVATION

The interpretation of mass spectrometry (MS) data is a crucial step in proteomics analysis, and the identification of post-translational modifications (PTMs) is vital for understanding the regulatory mechanisms of living systems. Among the various PTMs, glycosylation is one of the most diverse. Though many search engines have been developed to decipher proteomic data, some of them are difficult to operate and perform poorly on glycoproteomic datasets compared with advanced glycoproteomic software.

RESULTS

To simplify the analysis of proteomic datasets, especially O-glycoproteomic datasets, we present a user-friendly proteomic database search platform, MS-Decipher, for the identification of peptides from MS data. Two scoring schemes can be chosen for peptide-spectrum matching. MS-Decipher was found to have the same sensitivity and confidence in peptide identification as traditional database search software. In addition, a special search mode, O-Search, is integrated into MS-Decipher to identify O-glycopeptides for O-glycoproteomic analysis. Compared with Mascot, MetaMorpheus and MSFragger, MS-Decipher obtains about 139.9%, 48.8% and 6.9% more O-glycopeptide-spectrum matches, respectively. MS-Decipher also provides a tool for visualizing O-glycopeptide-spectrum matches. It has a graphical user interface that makes it easier to operate, and several file formats are supported in the searching and validation steps. MS-Decipher is implemented in Java and can be used cross-platform.

AVAILABILITY AND IMPLEMENTATION

MS-Decipher is freely available at https://github.com/DICP-1809/MS-Decipher for academic use. For detailed implementation steps, please see the user guide.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

The impact of noise and missing fragmentation cleavages on de novo peptide identification algorithms. Comput Struct Biotechnol J 2022;20:1402-1412. [PMID: 35386104 PMCID: PMC8956878 DOI: 10.1016/j.csbj.2022.03.008] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Received: 12/17/2021] [Revised: 03/09/2022] [Accepted: 03/09/2022] [Indexed: 01/24/2023]

Abstract

• Most correct de novo peptides have ≤1 missing fragmentation cleavages.
• DeepNovo outperforms Novor for peptide accuracy for both data types.
• Novor excels at amino acid recall when many fragmentation cleavages are missing.
• Deep learning allows DeepNovo to predict amino acids without adjacent peaks.

Proteomics aims to characterise system-wide protein expression and typically relies on mass spectrometry and peptide fragmentation, followed by a database search for protein identification. It has wide-ranging applications from clinical to environmental settings and impacts virtually every area of biology. In that context, de novo peptide sequencing is becoming increasingly popular. Historically its performance lagged behind database search methods, but with the integration of machine learning this field of research is gaining momentum. To enable de novo peptide sequencing to realise its full potential, it is critical to explore the mass spectrometry data underpinning peptide identification. In this research we investigate the characteristics of tandem mass spectra using 8 published datasets. We then evaluate two state-of-the-art de novo peptide sequencing algorithms, Novor and DeepNovo, with a particular focus on their performance with regard to missing fragmentation cleavage sites and noise. DeepNovo was found to perform better than Novor overall. However, Novor recalled more correct amino acids when 6 or more cleavage sites were missing. Furthermore, less than 11% of each algorithm's correct peptide predictions emanate from data with more than one missing cleavage site, highlighting the issues that missing cleavages pose. We further investigate how the algorithms manage to correctly identify peptides with many of these missing fragmentation cleavages. We show how noise negatively impacts the performance of both algorithms when high-intensity peaks are considered. Finally, we provide recommendations for further algorithm improvements and offer potential avenues to overcome current inherent data limitations.
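The notion of a missing fragmentation cleavage can be illustrated with a short sketch. The abstract does not give the authors' exact definition, so the code below assumes one common reading: a cleavage site counts as missing when neither its singly charged b ion nor its singly charged y ion has a peak within a mass tolerance. The function names, the 0.02 m/z tolerance and the restriction to unmodified peptides are illustrative assumptions, not details taken from the paper.

```python
# Monoisotopic residue masses (Da) for the 20 standard amino acids.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
PROTON = 1.007276   # mass of a proton (Da)
WATER = 18.010565   # monoisotopic mass of H2O (Da)


def fragment_mz(peptide):
    """Return one (b_mz, y_mz) pair per cleavage site, for charge 1+ ions."""
    masses = [RESIDUE_MASS[aa] for aa in peptide]
    total = sum(masses)
    pairs = []
    prefix = 0.0
    # A peptide of length n has n - 1 backbone cleavage sites.
    for mass in masses[:-1]:
        prefix += mass
        b_mz = prefix + PROTON                    # N-terminal fragment
        y_mz = (total - prefix) + WATER + PROTON  # C-terminal fragment
        pairs.append((b_mz, y_mz))
    return pairs


def count_missing_cleavages(peptide, peaks_mz, tol=0.02):
    """Count sites with neither b nor y ion observed (assumed definition)."""
    def observed(mz):
        return any(abs(mz - peak) <= tol for peak in peaks_mz)

    return sum(
        1
        for b_mz, y_mz in fragment_mz(peptide)
        if not observed(b_mz) and not observed(y_mz)
    )


if __name__ == "__main__":
    # Toy spectrum: every b ion of PEPTIDE except the one at the third site.
    b_ions = [b for b, _ in fragment_mz("PEPTIDE")]
    toy_peaks = b_ions[:2] + b_ions[3:]
    print(count_missing_cleavages("PEPTIDE", toy_peaks))
```

Real spectra also contain multiply charged ions, neutral losses and modified residues, all of which this sketch ignores; it is meant only to make the counting idea concrete.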

Dai J, Yu F, Zhou C, Yu W. Understanding the Limit of Open Search in the Identification of Peptides With Post-translational Modifications - A Simulation-Based Study. IEEE/ACM Transactions on Computational Biology and Bioinformatics 2021;18:2884-2890. [PMID: 32356758 DOI: 10.1109/tcbb.2020.2991207] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/11/2023]

Aksenov AA, Laponogov I, Zhang Z, Doran SLF, Belluomo I, Veselkov D, Bittremieux W, Nothias LF, Nothias-Esposito M, Maloney KN, Misra BB, Melnik AV, Smirnov A, Du X, Jones KL, Dorrestein K, Panitchpakdi M, Ernst M, van der Hooft JJJ, Gonzalez M, Carazzone C, Amézquita A, Callewaert C, Morton JT, Quinn RA, Bouslimani A, Orio AA, Petras D, Smania AM, Couvillion SP, Burnet MC, Nicora CD, Zink E, Metz TO, Artaev V, Humston-Fulmer E, Gregor R, Meijler MM, Mizrahi I, Eyal S, Anderson B, Dutton R, Lugan R, Boulch PL, Guitton Y, Prevost S, Poirier A, Dervilly G, Le Bizec B, Fait A, Persi NS, Song C, Gashu K, Coras R, Guma M, Manasson J, Scher JU, Barupal DK, Alseekh S, Fernie AR, Mirnezami R, Vasiliou V, Schmid R, Borisov RS, Kulikova LN, Knight R, Wang M, Hanna GB, Dorrestein PC, Veselkov K. Auto-deconvolution and molecular networking of gas chromatography-mass spectrometry data. Nat Biotechnol 2021;39:169-173. [PMID: 33169034 PMCID: PMC7971188 DOI: 10.1038/s41587-020-0700-3] [Citation(s) in RCA: 61] [Impact Index Per Article: 20.3] [Received: 01/13/2020] [Revised: 08/26/2020] [Accepted: 09/09/2020] [Indexed: 12/23/2022]

Affiliation(s)

Alexander A Aksenov Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA, USA Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA, USA
Ivan Laponogov Department of Surgery and Cancer, Imperial College London, South Kensington Campus, London, UK
Zheng Zhang Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA, USA
Sophie L F Doran Department of Surgery and Cancer, Imperial College London, South Kensington Campus, London, UK
Ilaria Belluomo Department of Surgery and Cancer, Imperial College London, South Kensington Campus, London, UK
Dennis Veselkov Intelligify Limited, London, UK Department of Computing, Imperial College, South Kensington Campus, London, UK
Wout Bittremieux Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA, USA Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA, USA Department of Computer Science, University of Antwerp, Antwerp, Belgium
Louis Felix Nothias Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA, USA Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA, USA
Mélissa Nothias-Esposito Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA, USA Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA, USA
Katherine N Maloney Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA, USA Department of Chemistry, Point Loma Nazarene University, San Diego, CA, USA
Biswapriya B Misra Center for Precision Medicine, Department of Internal Medicine, Section of Molecular Medicine, Wake Forest School of Medicine, Winston-Salem, NC, USA
Alexey V Melnik Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA, USA
Aleksandr Smirnov Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC, USA
Xiuxia Du Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC, USA
Kenneth L Jones Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA, USA
Kathleen Dorrestein Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA, USA Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA, USA
Morgan Panitchpakdi Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA, USA
Madeleine Ernst Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA, USA Section for Clinical Mass Spectrometry, Department of Congenital Disorders, Danish Center for Neonatal Screening, Statens Serum Institut, Copenhagen, Denmark
Justin J J van der Hooft Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA, USA Bioinformatics Group, Wageningen University, Wageningen, the Netherlands
Mabel Gonzalez Department of Chemistry, Universidad de los Andes, Bogotá, Colombia
Chiara Carazzone Department of Chemistry, Universidad de los Andes, Bogotá, Colombia
Adolfo Amézquita Department of Biological Sciences, Universidad de los Andes, Bogotá, Colombia
Chris Callewaert Center for Microbial Ecology and Technology, Ghent, Belgium Department of Pediatrics, University of California, San Diego, La Jolla, CA, USA
James T Morton Department of Pediatrics, University of California, San Diego, La Jolla, CA, USA Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, USA
Robert A Quinn Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI, USA
Amina Bouslimani Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA, USA Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA, USA
Andrea Albarracín Orio IRNASUS, Universidad Católica de Córdoba, CONICET, Facultad de Ciencias Agropecuarias, Córdoba, Argentina
Daniel Petras Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA, USA Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA, USA
Andrea M Smania Universidad Nacional de Córdoba, Facultad de Ciencias Químicas, Departamento de Química Biológica Ranwel Caputto, Córdoba, Argentina CONICET, Universidad Nacional de Córdoba, Centro de Investigaciones en Química Biológica de Córdoba (CIQUIBIC), Córdoba, Argentina
Sneha P Couvillion Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA, USA
Meagan C Burnet Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA, USA
Carrie D Nicora Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA, USA
Erika Zink Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA, USA
Thomas O Metz Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA, USA
Viatcheslav Artaev LECO Corporation, St. Joseph, MI, USA
Elizabeth Humston-Fulmer LECO Corporation, St. Joseph, MI, USA
Rachel Gregor Department of Chemistry and the National Institute for Biotechnology in the Negev, Ben-Gurion University of the Negev, Beer-Sheva, Israel
Michael M Meijler Department of Chemistry and the National Institute for Biotechnology in the Negev, Ben-Gurion University of the Negev, Beer-Sheva, Israel
Itzhak Mizrahi Department of Life Sciences and the National Institute for Biotechnology in the Negev, Ben-Gurion University of the Negev, Beer-Sheva, Israel
Stav Eyal Department of Life Sciences and the National Institute for Biotechnology in the Negev, Ben-Gurion University of the Negev, Beer-Sheva, Israel
Brooke Anderson Division of Biological Sciences, University of California, San Diego, La Jolla, CA, USA
Rachel Dutton Division of Biological Sciences, University of California, San Diego, La Jolla, CA, USA
Raphaël Lugan UMR Qualisud, Université d'Avignon et des Pays du Vaucluse, Agrosciences, Avignon, France
Pauline Le Boulch UMR Qualisud, Université d'Avignon et des Pays du Vaucluse, Agrosciences, Avignon, France
Yann Guitton Laboratoire d'Etude des Résidus et Contaminants dans les Aliments (LABERCA), Oniris, INRAe, Nantes, France
Stephanie Prevost Laboratoire d'Etude des Résidus et Contaminants dans les Aliments (LABERCA), Oniris, INRAe, Nantes, France
Audrey Poirier Laboratoire d'Etude des Résidus et Contaminants dans les Aliments (LABERCA), Oniris, INRAe, Nantes, France
Gaud Dervilly Laboratoire d'Etude des Résidus et Contaminants dans les Aliments (LABERCA), Oniris, INRAe, Nantes, France
Bruno Le Bizec Laboratoire d'Etude des Résidus et Contaminants dans les Aliments (LABERCA), Oniris, INRAe, Nantes, France
Aaron Fait The French Associates Institute for Agriculture and Biotechnology of Dryland, The Jacob Blaustein Institutes for Desert Research, Ben Gurion University of the Negev, Sede Boqer Campus, Beer Sheva, Israel
Noga Sikron Persi The French Associates Institute for Agriculture and Biotechnology of Dryland, The Jacob Blaustein Institutes for Desert Research, Ben Gurion University of the Negev, Sede Boqer Campus, Beer Sheva, Israel
Chao Song The French Associates Institute for Agriculture and Biotechnology of Dryland, The Jacob Blaustein Institutes for Desert Research, Ben Gurion University of the Negev, Sede Boqer Campus, Beer Sheva, Israel
Kelem Gashu The French Associates Institute for Agriculture and Biotechnology of Dryland, The Jacob Blaustein Institutes for Desert Research, Ben Gurion University of the Negev, Sede Boqer Campus, Beer Sheva, Israel
Roxana Coras Division of Rheumatology, Department of Medicine, University of California, San Diego, La Jolla, CA, USA
Monica Guma Division of Rheumatology, Department of Medicine, University of California, San Diego, La Jolla, CA, USA
Julia Manasson Division of Rheumatology, Department of Medicine, New York University School of Medicine, New York, NY, USA
Jose U Scher Division of Rheumatology, Department of Medicine, New York University School of Medicine, New York, NY, USA
Dinesh Kumar Barupal Department of Environmental Medicine and Public Health, Icahn School of Medicine at Mount Sinai, New York, NY, USA
Saleh Alseekh Max Planck Institute for Molecular Plant Physiology, Potsdam-Golm, Germany Center of Plant Systems Biology and Biotechnology (CPSBB), Plovdiv, Bulgaria
Alisdair R Fernie Max Planck Institute for Molecular Plant Physiology, Potsdam-Golm, Germany Center of Plant Systems Biology and Biotechnology (CPSBB), Plovdiv, Bulgaria
Reza Mirnezami Department of Colorectal Surgery, Royal Free Hospital NHS Foundation Trust, Hampstead, London, UK
Vasilis Vasiliou Department of Environmental Health Sciences, Yale School of Public Health, Yale University, New Haven, CT, USA
Robin Schmid Institute of Inorganic and Analytical Chemistry, University of Münster, Münster, Germany
Roman S Borisov A.V. Topchiev Institute of Petrochemical Synthesis RAS, Moscow, Russian Federation
Larisa N Kulikova Peoples' Friendship University of Russia (RUDN University), Moscow, Russian Federation
Rob Knight Department of Pediatrics, University of California, San Diego, La Jolla, CA, USA UCSD Center for Microbiome Innovation, University of California, San Diego, La Jolla, CA, USA Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA Department of Computer Science & Engineering, University of California, San Diego, La Jolla, CA, USA
Mingxun Wang Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA, USA Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA, USA
George B Hanna Department of Surgery and Cancer, Imperial College London, South Kensington Campus, London, UK
Pieter C Dorrestein Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA, USA. Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA, USA. Department of Pediatrics, University of California, San Diego, La Jolla, CA, USA. UCSD Center for Microbiome Innovation, University of California, San Diego, La Jolla, CA, USA.
Kirill Veselkov Department of Surgery and Cancer, Imperial College London, South Kensington Campus, London, UK.

Vitorino R, Guedes S, Trindade F, Correia I, Moura G, Carvalho P, Santos MAS, Amado F. De novo sequencing of proteins by mass spectrometry. Expert Rev Proteomics 2020;17:595-607. [PMID: 33016158 DOI: 10.1080/14789450.2020.1831387] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Indexed: 12/24/2022]

DeLaney K, Cao W, Ma Y, Ma M, Zhang Y, Li L. PRESnovo: Prescreening Prior to de novo Sequencing to Improve Accuracy and Sensitivity of Neuropeptide Identification. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2020;31:1358-1371. [PMID: 32266812 PMCID: PMC7332408 DOI: 10.1021/jasms.0c00013] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]

Abstract

Identification of peptides in species lacking fully sequenced genomes is challenging due to the lack of prior knowledge. De novo sequencing is the method of choice, but its performance is less than satisfactory due to algorithmic bias and interference in complex MS/MS spectra. The task becomes even more challenging for endogenous peptides that do not involve an enzymatic digestion step, such as neuropeptides. However, many neuropeptides possess common sequence motifs that are conserved across members of the same family. Taking advantage of this feature to improve de novo sequencing of neuropeptides, we have developed a method named PRESnovo (prescreening precursors prior to de novo sequencing) to predict the motif from a MS/MS spectrum. A neuropeptide sequence is broken into a motif with conserved amino acid residues and the remaining partial sequence. By searching against a predefined motif database constructed from known homologous sequences, PRESnovo assigns the most probable motif to each precursor via a sophisticated scoring function. Performance analysis was conducted with 15 neuropeptide standards, and 11 neuropeptides were correctly identified with PRESnovo compared to 1 identification by PEAKS only. We applied PRESnovo to assign motifs to peptide sequences in conjunction with PEAKS for assigning the rest of the peptide sequence in order to discover neuropeptides in tissue samples of green crab, C. maenas, and Jonah crab, C. borealis. Collectively, a large number of neuropeptides were identified, including 13 putative neuropeptides identified in green crab brain, 77 in Jonah crab brain, and 47 in Jonah crab sinus glands for the first time. This PRESnovo strategy greatly simplifies de novo sequencing and enhances the accuracy and sensitivity of neuropeptide identification when common motifs are present.

Tagirdzhanov AM, Shlemov A, Gurevich A. NPS: scoring and evaluating the statistical significance of peptidic natural product-spectrum matches. Bioinformatics 2020;35:i315-i323. [PMID: 31510666 PMCID: PMC6612854 DOI: 10.1093/bioinformatics/btz374] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open

Zhong J, Sun Y, Xie M, Peng W, Zhang C, Wu FX, Wang J. Proteoform characterization based on top-down mass spectrometry. Brief Bioinform 2020;22:1729-1750. [PMID: 32118252 DOI: 10.1093/bib/bbaa015] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2019] [Revised: 01/23/2020] [Indexed: 12/16/2022] Open

Mao Y, Daly TJ, Li N. Lys-Sequencer: An algorithm for de novo sequencing of peptides by paired single residue transposed Lys-C and Lys-N digestion coupled with high-resolution mass spectrometry. RAPID COMMUNICATIONS IN MASS SPECTROMETRY : RCM 2020;34:e8574. [PMID: 31499586 DOI: 10.1002/rcm.8574] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/12/2019] [Revised: 08/27/2019] [Accepted: 09/02/2019] [Indexed: 06/10/2023]

Abstract

RATIONALE

Database-dependent identification of proteins by mass spectrometry is well established, but has limitations when there are novel proteins, mutations, splice variants, and post-translational modifications (PTMs) not available in the established reference database. De novo sequencing as a database-independent approach could address these limitations by deducing peptide sequences directly from experimental tandem mass spectrometry spectra, while concomitantly yielding residue-by-residue confidence metrics.

METHODS

Equal amounts of bovine serum albumin (BSA) sample aliquots were digested separately with Lys-C and Lys-N complementary peptidases, separated by reversed-phase ultra-high-performance liquid chromatography (UPLC), and analyzed by collision-induced dissociation (CID)-based mass spectrometry on an Orbitrap mass spectrometer. In the Lys-Sequencer algorithm, matched tandem mass spectra with equal precursor ion mass from complementary digestions were paired, and fragment ion types were identified based on the unique mass relationship between fragment ions extracted from a spectrum pair followed by de novo sequencing of peptides with identification confidence assigned at the residue level.

RESULTS

In all the matched spectrum pairs, 34 top-ranked BSA peptides were identified, from which 391 amino acid residues were identified correctly, covering ~67% of the full sequence of BSA (583 residues) with only ~6% (35 residues) exhibiting ambiguity in the sequence order (although amino acid compositions were still correctly assigned). Of note, this approach identified peptide sequences up to 17 amino acids in length without ambiguity, with the exception of the N-terminal or C-terminal peptides containing lysine (18-mer).

CONCLUSIONS

The algorithm ("Lys-Sequencer") developed in this work achieves high precision for de novo sequencing of peptides. This method facilitates the identification of point mutation and new PTMs in the protein characterization and discovery of new peptides and proteins with varying levels of confidence.
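
The pairing step described under METHODS above (matching spectra of equal precursor mass across the Lys-C and Lys-N digests) can be illustrated with a minimal sketch. This is not the authors' implementation: the `Spectrum` class, the function name `pair_by_precursor_mass`, the 10 ppm tolerance, and the example masses are all assumptions made for illustration, and the abstract does not state how mass equality is tested.

```python
import bisect
from dataclasses import dataclass


@dataclass(frozen=True)
class Spectrum:
    scan_id: str           # hypothetical identifier for a tandem mass spectrum
    precursor_mass: float  # neutral monoisotopic precursor mass in Da


def pair_by_precursor_mass(lysc, lysn, tol_ppm=10.0):
    """Pair each Lys-C spectrum with every Lys-N spectrum whose precursor
    mass agrees within tol_ppm (the tolerance value is an assumption)."""
    lysn_sorted = sorted(lysn, key=lambda s: s.precursor_mass)
    masses = [s.precursor_mass for s in lysn_sorted]
    pairs = []
    for c in lysc:
        tol = c.precursor_mass * tol_ppm * 1e-6  # ppm converted to Da
        lo = bisect.bisect_left(masses, c.precursor_mass - tol)
        hi = bisect.bisect_right(masses, c.precursor_mass + tol)
        pairs.extend((c, n) for n in lysn_sorted[lo:hi])
    return pairs


# Made-up example masses, not data from the paper.
lysc_spectra = [Spectrum("C1", 1000.000), Spectrum("C2", 1500.000)]
lysn_spectra = [Spectrum("N1", 1000.005), Spectrum("N2", 1200.000)]
pairs = pair_by_precursor_mass(lysc_spectra, lysn_spectra)
print([(c.scan_id, n.scan_id) for c, n in pairs])
```

Sorting one digest once and using binary search keeps the pairing close to linear in the number of spectra. Fragment ion typing and residue-level confidence, which the abstract also mentions, are outside this sketch.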

Chi H, Liu C, Yang H, Zeng WF, Wu L, Zhou WJ, Wang RM, Niu XN, Ding YH, Zhang Y, Wang ZW, Chen ZL, Sun RX, Liu T, Tan GM, Dong MQ, Xu P, Zhang PH, He SM. Comprehensive identification of peptides in tandem mass spectra using an efficient open search engine. Nat Biotechnol 2018;36:nbt.4236. [PMID: 30295672 DOI: 10.1038/nbt.4236] [Citation(s) in RCA: 219] [Impact Index Per Article: 36.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2017] [Accepted: 08/03/2018] [Indexed: 12/27/2022]

Affiliation(s)

Hao Chi Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, China University of Chinese Academy of Sciences, Beijing, China
Chao Liu Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, China University of Chinese Academy of Sciences, Beijing, China
Hao Yang Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, China University of Chinese Academy of Sciences, Beijing, China
Wen-Feng Zeng Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, China University of Chinese Academy of Sciences, Beijing, China
Long Wu Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, China University of Chinese Academy of Sciences, Beijing, China
Wen-Jing Zhou Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, China University of Chinese Academy of Sciences, Beijing, China
Rui-Min Wang Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, China University of Chinese Academy of Sciences, Beijing, China
Xiu-Nan Niu Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, China University of Chinese Academy of Sciences, Beijing, China
Yue-He Ding National Institute of Biological Sciences, Beijing, Beijing, China
Yao Zhang State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing, China State Key Laboratory of Biocontrol and Guangdong Provincial Key Laboratory of Plant Resources, College of Ecology and Evolution, Sun Yat-Sen University, Guangzhou, China
Zhao-Wei Wang Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, China University of Chinese Academy of Sciences, Beijing, China
Zhen-Lin Chen Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, China University of Chinese Academy of Sciences, Beijing, China
Rui-Xiang Sun Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, China University of Chinese Academy of Sciences, Beijing, China
Tao Liu Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, China
Guang-Ming Tan Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, China
Meng-Qiu Dong National Institute of Biological Sciences, Beijing, Beijing, China
Ping Xu State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing, China
Pei-Heng Zhang Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, China
Si-Min He Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, China University of Chinese Academy of Sciences, Beijing, China

Kou Q, Wu S, Liu X. Systematic Evaluation of Protein Sequence Filtering Algorithms for Proteoform Identification Using Top-Down Mass Spectrometry. Proteomics 2018;18. [PMID: 29327814 DOI: 10.1002/pmic.201700306] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2017] [Revised: 11/20/2017] [Indexed: 01/19/2023]

Dimitrakopoulos L, Prassas I, Diamandis EP, Charames GS. Onco-proteogenomics: Multi-omics level data integration for accurate phenotype prediction. Crit Rev Clin Lab Sci 2017;54:414-432. [DOI: 10.1080/10408363.2017.1384446] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]

Yu F, Li N, Yu W. PIPI: PTM-Invariant Peptide Identification Using Coding Method. J Proteome Res 2016;15:4423-4435. [PMID: 27748123 DOI: 10.1021/acs.jproteome.6b00485] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]

Abstract

In computational proteomics, the identification of peptides with an unlimited number of post-translational modification (PTM) types is a challenging task. The computational cost associated with database search increases exponentially with respect to the number of modified amino acids and linearly with respect to the number of potential PTM types at each amino acid. The problem becomes intractable very quickly if we want to enumerate all possible PTM patterns. To address this issue, one group of methods named restricted tools (including Mascot, Comet, and MS-GF+) only allow a small number of PTM types in database search process. Alternatively, the other group of methods named unrestricted tools (including MS-Alignment, ProteinProspector, and MODa) avoids enumerating PTM patterns with an alignment-based approach to localizing and characterizing modified amino acids. However, because of the large search space and PTM localization issue, the sensitivity of these unrestricted tools is low. This paper proposes a novel method named PIPI to achieve PTM-invariant peptide identification. PIPI belongs to the category of unrestricted tools. It first codes peptide sequences into Boolean vectors and codes experimental spectra into real-valued vectors. For each coded spectrum, it then searches the coded sequence database to find the top scored peptide sequences as candidates. After that, PIPI uses dynamic programming to localize and characterize modified amino acids in each candidate. We used simulation experiments and real data experiments to evaluate the performance in comparison with restricted tools (i.e., Mascot, Comet, and MS-GF+) and unrestricted tools (i.e., Mascot with error tolerant search, MS-Alignment, ProteinProspector, and MODa). Comparison with restricted tools shows that PIPI has a close sensitivity and running speed. Comparison with unrestricted tools shows that PIPI has the highest sensitivity except for Mascot with error tolerant search and ProteinProspector. These two tools simplify the task by only considering up to one modified amino acid in each peptide, which results in a higher sensitivity but has difficulty in dealing with multiple modified amino acids. The simulation experiments also show that PIPI has the lowest false discovery proportion, the highest PTM characterization accuracy, and the shortest running time among the unrestricted tools.

Gorshkov V, Hotta SYK, Verano-Braga T, Kjeldsen F. Peptide de novo sequencing of mixture tandem mass spectra. Proteomics 2016;16:2470-9. [PMID: 27329701 PMCID: PMC5297990 DOI: 10.1002/pmic.201500549] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2015] [Revised: 04/27/2016] [Accepted: 06/17/2016] [Indexed: 02/02/2023]

Gillet LC, Leitner A, Aebersold R. Mass Spectrometry Applied to Bottom-Up Proteomics: Entering the High-Throughput Era for Hypothesis Testing. ANNUAL REVIEW OF ANALYTICAL CHEMISTRY (PALO ALTO, CALIF.) 2016;9:449-72. [PMID: 27049628 DOI: 10.1146/annurev-anchem-071015-041535] [Citation(s) in RCA: 218] [Impact Index Per Article: 27.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]

Sheynkman GM, Shortreed MR, Cesnik AJ, Smith LM. Proteogenomics: Integrating Next-Generation Sequencing and Mass Spectrometry to Characterize Human Proteomic Variation. ANNUAL REVIEW OF ANALYTICAL CHEMISTRY (PALO ALTO, CALIF.) 2016;9:521-45. [PMID: 27049631 PMCID: PMC4991544 DOI: 10.1146/annurev-anchem-071015-041722] [Citation(s) in RCA: 73] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/09/2023]

Xiong Y, Guo Y, Xiao W, Cao Q, Li S, Qi X, Zhang Z, Wang Q, Shui W. An NGS-Independent Strategy for Proteome-Wide Identification of Single Amino Acid Polymorphisms by Mass Spectrometry. Anal Chem 2016;88:2784-91. [PMID: 26810586 DOI: 10.1021/acs.analchem.5b04417] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Askenazi M, Ruggles KV, Fenyö D. PGx: Putting Peptides to BED. J Proteome Res 2015;15:795-9. [PMID: 26638927 PMCID: PMC4782174 DOI: 10.1021/acs.jproteome.5b00870] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Sadygov RG. Using SEQUEST with theoretically complete sequence databases. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2015;26:1858-1864. [PMID: 26238326 PMCID: PMC4607654 DOI: 10.1007/s13361-015-1228-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/02/2015] [Revised: 05/08/2015] [Accepted: 06/17/2015] [Indexed: 06/04/2023]

Chi H, He K, Yang B, Chen Z, Sun RX, Fan SB, Zhang K, Liu C, Yuan ZF, Wang QH, Liu SQ, Dong MQ, He SM. Reprint of "pFind-Alioth: A novel unrestricted database search algorithm to improve the interpretation of high-resolution MS/MS data". J Proteomics 2015;129:33-41. [PMID: 26232248 DOI: 10.1016/j.jprot.2015.07.019] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2015] [Revised: 05/04/2015] [Accepted: 05/10/2015] [Indexed: 01/23/2023]

Chick JM, Kolippakkam D, Nusinow DP, Zhai B, Rad R, Huttlin EL, Gygi SP. A mass-tolerant database search identifies a large proportion of unassigned spectra in shotgun proteomics as modified peptides. Nat Biotechnol 2015;33:743-9. [PMID: 26076430 PMCID: PMC4515955 DOI: 10.1038/nbt.3267] [Citation(s) in RCA: 284] [Impact Index Per Article: 31.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2014] [Accepted: 05/11/2015] [Indexed: 12/17/2022]

Chi H, He K, Yang B, Chen Z, Sun RX, Fan SB, Zhang K, Liu C, Yuan ZF, Wang QH, Liu SQ, Dong MQ, He SM. pFind-Alioth: A novel unrestricted database search algorithm to improve the interpretation of high-resolution MS/MS data. J Proteomics 2015;125:89-97. [PMID: 25979774 DOI: 10.1016/j.jprot.2015.05.009] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2015] [Revised: 05/04/2015] [Accepted: 05/10/2015] [Indexed: 10/23/2022]

Medzihradszky KF, Chalkley RJ. Lessons in de novo peptide sequencing by tandem mass spectrometry. MASS SPECTROMETRY REVIEWS 2015;34:43-63. [PMID: 25667941 PMCID: PMC4367481 DOI: 10.1002/mas.21406] [Citation(s) in RCA: 137] [Impact Index Per Article: 15.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]

De Haes W, Van Sinay E, Detienne G, Temmerman L, Schoofs L, Boonen K. Functional neuropeptidomics in invertebrates. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2014;1854:812-26. [PMID: 25528324 DOI: 10.1016/j.bbapap.2014.12.011] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/21/2014] [Revised: 11/27/2014] [Accepted: 12/10/2014] [Indexed: 10/24/2022]

MS-GF+ makes progress towards a universal database search tool for proteomics. Nat Commun 2014;5:5277. [PMID: 25358478 PMCID: PMC5036525 DOI: 10.1038/ncomms6277] [Citation(s) in RCA: 764] [Impact Index Per Article: 76.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2014] [Accepted: 09/16/2014] [Indexed: 02/06/2023] Open

Deng F, Wang L, Liu X. An efficient algorithm for the blocked pattern matching problem. Bioinformatics 2014;31:532-8. [DOI: 10.1093/bioinformatics/btu678] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Wang J, Bourne PE, Bandeira N. MixGF: spectral probabilities for mixture spectra from more than one peptide. Mol Cell Proteomics 2014;13:3688-97. [PMID: 25225354 DOI: 10.1074/mcp.o113.037218] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open

Su ZD, Sheng QH, Li QR, Chi H, Jiang X, Yan Z, Fu N, He SM, Khaitovich P, Wu JR, Zeng R. De novo identification and quantification of single amino-acid variants in human brain. J Mol Cell Biol 2014;6:421-33. [PMID: 25007923 DOI: 10.1093/jmcb/mju031] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open

Wang J, Anania VG, Knott J, Rush J, Lill JR, Bourne PE, Bandeira N. Combinatorial approach for large-scale identification of linked peptides from tandem mass spectrometry spectra. Mol Cell Proteomics 2014;13:1128-36. [PMID: 24493012 DOI: 10.1074/mcp.m113.035758] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open

Liu X, Hengel S, Wu S, Tolić N, Pasa-Tolić L, Pevzner PA. Identification of ultramodified proteins using top-down tandem mass spectra. J Proteome Res 2013;12:5830-8. [PMID: 24188097 DOI: 10.1021/pr400849y] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]

Mazin P, Xiong J, Liu X, Yan Z, Zhang X, Li M, He L, Somel M, Yuan Y, Phoebe Chen YP, Li N, Hu Y, Fu N, Ning Z, Zeng R, Yang H, Chen W, Gelfand M, Khaitovich P. Widespread splicing changes in human brain development and aging. Mol Syst Biol 2013;9:633. [PMID: 23340839 PMCID: PMC3564255 DOI: 10.1038/msb.2012.67] [Citation(s) in RCA: 147] [Impact Index Per Article: 13.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2012] [Revised: 11/14/2012] [Accepted: 12/16/2012] [Indexed: 02/07/2023] Open

Jeong K, Kim S, Pevzner PA. UniNovo: a universal tool for de novo peptide sequencing. Bioinformatics 2013;29:1953-62. [PMID: 23766417 PMCID: PMC3722526 DOI: 10.1093/bioinformatics/btt338] [Citation(s) in RCA: 58] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]

Abstract

Motivation: Mass spectrometry (MS) instruments and experimental protocols are rapidly advancing, but de novo peptide sequencing algorithms to analyze tandem mass (MS/MS) spectra are lagging behind. Although existing de novo sequencing tools perform well on certain types of spectra [e.g. Collision Induced Dissociation (CID) spectra of tryptic peptides], their performance often deteriorates on other types of spectra, such as Electron Transfer Dissociation (ETD), Higher-energy Collisional Dissociation (HCD) spectra or spectra of non-tryptic digests. Thus, rather than developing a new algorithm for each type of spectra, we develop a universal de novo sequencing algorithm called UniNovo that works well for all types of spectra or even for spectral pairs (e.g. CID/ETD spectral pairs). UniNovo uses an improved scoring function that captures the dependences between different ion types, where such dependencies are learned automatically using a modified offset frequency function.

Results: The performance of UniNovo is compared with PepNovo+, PEAKS and pNovo using various types of spectra. The results show that the performance of UniNovo is superior to other tools for ETD spectra and superior or comparable with others for CID and HCD spectra. UniNovo also estimates the probability that each reported reconstruction is correct, using simple statistics that are readily obtained from a small training dataset. We demonstrate that the estimation is accurate for all tested types of spectra (including CID, HCD, ETD, CID/ETD and HCD/ETD spectra of trypsin, LysC or AspN digested peptides).

Availability: UniNovo is implemented in JAVA and tested on Windows, Ubuntu and OS X machines. UniNovo is available at http://proteomics.ucsd.edu/Software/UniNovo.html along with the manual.

Contact: kwj@ucsd.edu or ppevzner@ucsd.edu

Supplementary information: Supplementary data are available at Bioinformatics online.

Costa EP, Menschaert G, Luyten W, De Grave K, Ramon J. PIUS: peptide identification by unbiased search. Bioinformatics 2013;29:1913-4. [DOI: 10.1093/bioinformatics/btt298] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

Faccin M, Bruscolini P. MS/MS Spectra Interpretation as a Statistical–Mechanics Problem. Anal Chem 2013;85:4884-92. [DOI: 10.1021/ac4005666] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Zhang Y, Fonslow BR, Shan B, Baek MC, Yates JR. Protein analysis by shotgun/bottom-up proteomics. Chem Rev 2013;113:2343-94. [PMID: 23438204 PMCID: PMC3751594 DOI: 10.1021/cr3003533] [Citation(s) in RCA: 986] [Impact Index Per Article: 89.6] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]

Plant proteogenomics: from protein extraction to improved gene predictions. Methods Mol Biol 2013;1002:267-94. [PMID: 23625410 DOI: 10.1007/978-1-62703-360-2_21] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]

Identification of Ultramodified Proteins Using Top-Down Spectra. LECTURE NOTES IN COMPUTER SCIENCE 2013. [DOI: 10.1007/978-3-642-37195-0_11] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]

Hoopmann MR, Moritz RL. Current algorithmic solutions for peptide-based proteomics data generation and identification. Curr Opin Biotechnol 2012;24:31-8. [PMID: 23142544 DOI: 10.1016/j.copbio.2012.10.013] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2012] [Revised: 10/08/2012] [Accepted: 10/18/2012] [Indexed: 12/28/2022]

Guthals A, Bandeira N. Peptide identification by tandem mass spectrometry with alternate fragmentation modes. Mol Cell Proteomics 2012;11:550-7. [PMID: 22595789 PMCID: PMC3434779 DOI: 10.1074/mcp.r112.018556] [Citation(s) in RCA: 55] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2012] [Revised: 05/04/2012] [Indexed: 11/06/2022] Open

Key issues in the acquisition and analysis of qualitative and quantitative mass spectrometry data for peptide-centric proteomic experiments. Amino Acids 2012;43:1075-85. [PMID: 22821266 DOI: 10.1007/s00726-012-1287-x] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2010] [Accepted: 04/03/2012] [Indexed: 01/05/2023]

Abstract

Proteomic technologies have matured to a level enabling accurate and reproducible quantitation of peptides and proteins from complex biological matrices. Analysis of samples as diverse as assembled protein complexes, whole cell lysates or sub-cellular proteomes from cell cultures, and direct analysis of animal and human tissues and fluids demonstrate the incredible versatility of the fundamental nature of the technique that forms the basis of most proteomic applications today (mass spectrometry). Determining the mass of biomolecules and their fragments or related products with high accuracy can convey a highly specific assay for detection and identification. Importantly, ion currents representative of these specifically identified analytes can be accurately quantified with the correct application of smart isobaric tagging chemistries, heavy and light isotopically derivatised samples or standards, or by careful application of workflows to compare unlabelled samples in so-called 'label-free' and targeted selected reaction monitoring experiments. In terms of exploring biology, a myriad of protein changes and modifications are being increasingly probed and quantified, including diverse chemical changes from relatively decisive modifications such as protein splicing and truncation, to more transient dynamic modifications such as phosphorylation, acetylation and ubiquitination. Proteomic workflows can be complex beasts and several key considerations to ensure effective applications have been outlined in the recent literature. The past year has witnessed the publication of several excellent reviews that thoroughly describe the fundamental principles underlying the state of the art. This review further elaborates on specific critical issues introduced by these publications and raises other important unaddressed considerations and new developments that directly impact on the effectiveness of proteomic technologies, in particular for, but not necessarily exclusive to peptide-centric experiments. These factors are discussed both in terms of qualitative analyses, including dynamic range and sampling issues, and developments to improve the translation of peptide fragmentation data into peptide and protein identities, as well as quantitative analyses, including data normalisation and the utility of ontology or functional annotation, the effects of modified peptides, and considered experimental design to facilitate the use of robust statistical methods.

Medzihradszky KF, Bohlen CJ. Partial de novo sequencing and unusual CID fragmentation of a 7 kDa, disulfide-bridged toxin. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2012;23:923-34. [PMID: 22351294 PMCID: PMC4367482 DOI: 10.1007/s13361-012-0350-x] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/06/2011] [Revised: 01/12/2012] [Accepted: 01/22/2012] [Indexed: 05/12/2023]

Allmer J. Algorithms for the de novo sequencing of peptides from tandem mass spectra. Expert Rev Proteomics 2012;8:645-57. [PMID: 21999834 DOI: 10.1586/epr.11.54] [Citation(s) in RCA: 91] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]

Cantarel BL, Erickson AR, VerBerkmoes NC, Erickson BK, Carey PA, Pan C, Shah M, Mongodin EF, Jansson JK, Fraser-Liggett CM, Hettich RL. Strategies for metagenomic-guided whole-community proteomics of complex microbial environments. PLoS One 2011;6:e27173. [PMID: 22132090 PMCID: PMC3223167 DOI: 10.1371/journal.pone.0027173] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2011] [Accepted: 10/11/2011] [Indexed: 11/05/2022] Open

Proteomics in molecular diagnosis: typing of amyloidosis. J Biomed Biotechnol 2011;2011:754109. [PMID: 22131817 PMCID: PMC3205904 DOI: 10.1155/2011/754109] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2011] [Revised: 07/01/2011] [Accepted: 07/11/2011] [Indexed: 12/21/2022] Open

Mohimani H, Liu WT, Yang YL, Gaudêncio SP, Fenical W, Dorrestein PC, Pevzner PA. Multiplex de novo sequencing of peptide antibiotics. J Comput Biol 2011;18:1371-81. [PMID: 22035290 DOI: 10.1089/cmb.2011.0158] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.8] [Indexed: 11/13/2022] Open

Mohimani H, Liu WT, Mylne JS, Poth AG, Colgrave ML, Tran D, Selsted ME, Dorrestein PC, Pevzner PA. Cycloquest: identification of cyclopeptides via database search of their mass spectra against genome databases. J Proteome Res 2011;10:4505-12. [PMID: 21851130 PMCID: PMC3242011 DOI: 10.1021/pr200323a] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.7] [Indexed: 11/29/2022]

Wang J, Bourne PE, Bandeira N. Peptide identification by database search of mixture tandem mass spectra. Mol Cell Proteomics 2011;10:M111.010017. [PMID: 21862760 DOI: 10.1074/mcp.m111.010017] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.8] [Indexed: 11/06/2022] Open

Gupta N, Bandeira N, Keich U, Pevzner PA. Target-decoy approach and false discovery rate: when things may go wrong. J Am Soc Mass Spectrom 2011;22:1111-20. [PMID: 21953092 PMCID: PMC3220955 DOI: 10.1007/s13361-011-0139-3] [Citation(s) in RCA: 116] [Impact Index Per Article: 8.9] [Received: 11/01/2010] [Revised: 02/19/2011] [Accepted: 02/22/2011] [Indexed: 05/12/2023]

Jefferys SR, Giddings MC. Baking a mass-spectrometry data PIE with McMC and simulated annealing: predicting protein post-translational modifications from integrated top-down and bottom-up data. Bioinformatics 2011;27:844-52. [PMID: 21389073 DOI: 10.1093/bioinformatics/btr027] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Indexed: 12/30/2022]