1
|
Liu W, Zhao M, Gan L, Sun B, He S, Liu Y, Liu L, Li W, Chen J, Liu Y, Zhang J, Xu J. PeposX-Exhaust: A lightweight and efficient tool for identification of short peptides. Food Chem X 2024; 22:101249. [PMID: 38440058 PMCID: PMC10910222 DOI: 10.1016/j.fochx.2024.101249] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2023] [Revised: 02/16/2024] [Accepted: 02/18/2024] [Indexed: 03/06/2024] Open
Abstract
Short peptides have become the focus of recent research due to their variable bioactivities, good digestibility and wide existences in food-derived protein hydrolysates. However, due to the high complexity of the samples, identifying short peptides still remains a challenge. In this work, a tool, named PeposX-Exhaust, was developed for short peptide identification. Through validation with known peptides, PeposX-Exhaust identified all the submitted spectra and the accuracy rate reached 75.36%, and the adjusted accuracy rate further reached 98.55% when with top 5 candidates considered. Compared with other tools, the accuracy rate by PeposX-Exhaust was at least 70% higher than two database-search tools and 15% higher than the other two de novo-sequencing tools, respectively. For further application, the numbers of short peptides identified from soybean, walnut, collagen and bonito protein hydrolysates reached 1145, 628, 746 and 681, respectively. This fully demonstrated the superiority of the tool in short peptide identification.
Collapse
Affiliation(s)
- Wanshun Liu
- Guangdong Provincial Key Laboratory of Large Animal Models for Biomedicine, School of Pharmacy and Food Engineering, Wuyi University, Jiangmen 529020, China
| | - Mouming Zhao
- Guangdong Provincial Key Laboratory of Large Animal Models for Biomedicine, School of Pharmacy and Food Engineering, Wuyi University, Jiangmen 529020, China
- School of Food Science and Engineering, South China University of Technology, Guangzhou 510640, China
| | - Lishe Gan
- Guangdong Provincial Key Laboratory of Large Animal Models for Biomedicine, School of Pharmacy and Food Engineering, Wuyi University, Jiangmen 529020, China
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Baoguo Sun
- Beijing Advanced Innovation Center for Food Nutrition and Human Health, Beijing Technology & Business University, Beijing 100048, China
| | - Shiqi He
- Guangdong Provincial Key Laboratory of Large Animal Models for Biomedicine, School of Pharmacy and Food Engineering, Wuyi University, Jiangmen 529020, China
| | - Yang Liu
- Guangdong Provincial Key Laboratory of Large Animal Models for Biomedicine, School of Pharmacy and Food Engineering, Wuyi University, Jiangmen 529020, China
- College of Food Science and Technology, Hunan Agricultural University, Changsha 410128, China
| | - Lei Liu
- Guangdong Provincial Key Laboratory of Large Animal Models for Biomedicine, School of Pharmacy and Food Engineering, Wuyi University, Jiangmen 529020, China
| | - Wu Li
- Guangdong Provincial Key Laboratory of Large Animal Models for Biomedicine, School of Pharmacy and Food Engineering, Wuyi University, Jiangmen 529020, China
| | - Jing Chen
- Guangdong Provincial Key Laboratory of Large Animal Models for Biomedicine, School of Pharmacy and Food Engineering, Wuyi University, Jiangmen 529020, China
| | - Yang Liu
- Guangdong Provincial Key Laboratory of Large Animal Models for Biomedicine, School of Pharmacy and Food Engineering, Wuyi University, Jiangmen 529020, China
| | - Jianan Zhang
- School of Food Science and Engineering, South China University of Technology, Guangzhou 510640, China
| | - Jucai Xu
- Guangdong Provincial Key Laboratory of Large Animal Models for Biomedicine, School of Pharmacy and Food Engineering, Wuyi University, Jiangmen 529020, China
| |
Collapse
|
2
|
Skeffington A, Fischer A, Sviben S, Brzezinka M, Górka M, Bertinetti L, Woehle C, Huettel B, Graf A, Scheffel A. A joint proteomic and genomic investigation provides insights into the mechanism of calcification in coccolithophores. Nat Commun 2023; 14:3749. [PMID: 37353496 PMCID: PMC10290126 DOI: 10.1038/s41467-023-39336-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2022] [Accepted: 06/05/2023] [Indexed: 06/25/2023] Open
Abstract
Coccolithophores are globally abundant, calcifying microalgae that have profound effects on marine biogeochemical cycles, the climate, and life in the oceans. They are characterized by a cell wall of CaCO3 scales called coccoliths, which may contribute to their ecological success. The intricate morphologies of coccoliths are of interest for biomimetic materials synthesis. Despite the global impact of coccolithophore calcification, we know little about the molecular machinery underpinning coccolithophore biology. Working on the model Emiliania huxleyi, a globally distributed bloom-former, we deploy a range of proteomic strategies to identify coccolithogenesis-related proteins. These analyses are supported by a new genome, with gene models derived from long-read transcriptome sequencing, which revealed many novel proteins specific to the calcifying haptophytes. Our experiments provide insights into proteins involved in various aspects of coccolithogenesis. Our improved genome, complemented with transcriptomic and proteomic data, constitutes a new resource for investigating fundamental aspects of coccolithophore biology.
Collapse
Affiliation(s)
- Alastair Skeffington
- Max-Planck Institute of Molecular Plant Physiology, Potsdam-Golm, 14476, Germany
- Biological and Environmental Sciences, University of Stirling, Stirling, FK9 4LA, UK
| | - Axel Fischer
- Max-Planck Institute of Molecular Plant Physiology, Potsdam-Golm, 14476, Germany
| | - Sanja Sviben
- Max-Planck Institute of Molecular Plant Physiology, Potsdam-Golm, 14476, Germany
| | - Magdalena Brzezinka
- Max-Planck Institute of Molecular Plant Physiology, Potsdam-Golm, 14476, Germany
| | - Michał Górka
- Max-Planck Institute of Molecular Plant Physiology, Potsdam-Golm, 14476, Germany
| | - Luca Bertinetti
- Max Planck Institute of Colloids and Interfaces, Potsdam-Golm, 14476, Germany
| | - Christian Woehle
- Max Planck Institute for Plant Breeding Research, Max Planck-Genome-Centre Cologne, Cologne, 50829, Germany
| | - Bruno Huettel
- Max Planck Institute for Plant Breeding Research, Max Planck-Genome-Centre Cologne, Cologne, 50829, Germany
| | - Alexander Graf
- Max-Planck Institute of Molecular Plant Physiology, Potsdam-Golm, 14476, Germany
| | - André Scheffel
- Technische Universität Dresden, Faculty of Biology, 01307, Dresden, Germany.
- Max-Planck Institute of Molecular Plant Physiology, Potsdam-Golm, 14476, Germany.
| |
Collapse
|
3
|
Tabb DL, Jeong K, Druart K, Gant MS, Brown KA, Nicora C, Zhou M, Couvillion S, Nakayasu E, Williams JE, Peterson HK, McGuire MK, McGuire MA, Metz TO, Chamot-Rooke J. Comparing Top-Down Proteoform Identification: Deconvolution, PrSM Overlap, and PTM Detection. J Proteome Res 2023. [PMID: 37235544 DOI: 10.1021/acs.jproteome.2c00673] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/28/2023]
Abstract
Generating top-down tandem mass spectra (MS/MS) from complex mixtures of proteoforms benefits from improvements in fractionation, separation, fragmentation, and mass analysis. The algorithms to match MS/MS to sequences have undergone a parallel evolution, with both spectral alignment and match-counting approaches producing high-quality proteoform-spectrum matches (PrSMs). This study assesses state-of-the-art algorithms for top-down identification (ProSight PD, TopPIC, MSPathFinderT, and pTop) in their yield of PrSMs while controlling false discovery rate. We evaluated deconvolution engines (ThermoFisher Xtract, Bruker AutoMSn, Matrix Science Mascot Distiller, TopFD, and FLASHDeconv) in both ThermoFisher Orbitrap-class and Bruker maXis Q-TOF data (PXD033208) to produce consistent precursor charges and mass determinations. Finally, we sought post-translational modifications (PTMs) in proteoforms from bovine milk (PXD031744) and human ovarian tissue. Contemporary identification workflows produce excellent PrSM yields, although approximately half of all identified proteoforms from these four pipelines were specific to only one workflow. Deconvolution algorithms disagree on precursor masses and charges, contributing to identification variability. Detection of PTMs is inconsistent among algorithms. In bovine milk, 18% of PrSMs produced by pTop and TopMG were singly phosphorylated, but this percentage fell to 1% for one algorithm. Applying multiple search engines produces more comprehensive assessments of experiments. Top-down algorithms would benefit from greater interoperability.
Collapse
Affiliation(s)
- David L Tabb
- Université Paris Cité, Institut Pasteur, CNRS UAR 2024, Mass Spectrometry for Biology Unit, Paris 75015, France
| | - Kyowon Jeong
- Applied Bioinformatics, Computer Science Department, University of Tübingen, Tübingen 72076, Germany
| | - Karen Druart
- Université Paris Cité, Institut Pasteur, CNRS UAR 2024, Mass Spectrometry for Biology Unit, Paris 75015, France
| | - Megan S Gant
- Université Paris Cité, Institut Pasteur, CNRS UAR 2024, Mass Spectrometry for Biology Unit, Paris 75015, France
| | - Kyle A Brown
- School of Medicine and Public Health, University of Wisconsin, Madison, Wisconsin 53705, United States
| | - Carrie Nicora
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Mowei Zhou
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - Sneha Couvillion
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Ernesto Nakayasu
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Janet E Williams
- Department of Animal, Veterinary, and Food Sciences, University of Idaho, Moscow, Idaho 83844, United States
| | - Haley K Peterson
- Department of Animal, Veterinary, and Food Sciences, University of Idaho, Moscow, Idaho 83844, United States
| | - Michelle K McGuire
- Margaret Ritchie School of Family and Consumer Sciences, University of Idaho, Moscow, Idaho 83844, United States
| | - Mark A McGuire
- Department of Animal, Veterinary, and Food Sciences, University of Idaho, Moscow, Idaho 83844, United States
| | - Thomas O Metz
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Julia Chamot-Rooke
- Université Paris Cité, Institut Pasteur, CNRS UAR 2024, Mass Spectrometry for Biology Unit, Paris 75015, France
| |
Collapse
|
4
|
Caira S, Picariello G, Renzone G, Arena S, Troise AD, De Pascale S, Ciaravolo V, Pinto G, Addeo F, Scaloni A. Recent developments in peptidomics for the quali-quantitative analysis of food-derived peptides in human body fluids and tissues. Trends Food Sci Technol 2022. [DOI: 10.1016/j.tifs.2022.06.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
|
5
|
Moser-Katz T, Gavile CM, Barwick BG, Lee KP, Boise LH. PDZ Proteins SCRIB and DLG1 Regulate Myeloma Cell Surface CD86 Expression, Growth, and Survival. Mol Cancer Res 2022; 20:1122-1136. [PMID: 35380688 PMCID: PMC9262820 DOI: 10.1158/1541-7786.mcr-21-0681] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2021] [Revised: 01/28/2022] [Accepted: 04/01/2022] [Indexed: 01/09/2023]
Abstract
Despite advances in the treatment of multiple myeloma in the past decades, the disease remains incurable, and understanding signals and molecules that can control myeloma growth and survival are important for the development of novel therapeutic strategies. One such molecule, CD86, regulates multiple myeloma cell survival via its interaction with CD28 and signaling through its cytoplasmic tail. Although the CD86 cytoplasmic tail has been shown to be involved in drug resistance and can induce molecular changes in multiple myeloma cells, its function has been largely unexplored. Here, we show that CD86 cytoplasmic tail has a role in trafficking CD86 to the cell surface. This is due in part to a PDZ-binding motif at its C-terminus which is important for proper trafficking from the Golgi apparatus. BioID analysis revealed 10 PDZ domain-containing proteins proximal to CD86 cytoplasmic tail in myeloma cells. Among them, we found the planar cell polarity proteins, SCRIB and DLG1, are important for proper CD86 surface expression and the growth and survival of myeloma cells. These findings indicate a mechanism by which myeloma cells confer cellular survival and drug resistance and indicate a possible motif to target for therapeutic gain. IMPLICATIONS These findings demonstrate the importance of proper trafficking of CD86 to the cell surface in myeloma cell survival and may provide a new therapeutic target in this disease.
Collapse
Affiliation(s)
- Tyler Moser-Katz
- Department of Hematology and Medical Oncology, Winship Cancer Institute, Emory University, Atlanta, GA
| | - Catherine M. Gavile
- Department of Hematology and Medical Oncology, Winship Cancer Institute, Emory University, Atlanta, GA
| | - Benjamin G. Barwick
- Department of Hematology and Medical Oncology, Winship Cancer Institute, Emory University, Atlanta, GA
| | - Kelvin P. Lee
- Melvin and Bren Simon Comprehensive Cancer Center, Indiana University School of Medicine, Indianapolis, Indiana
| | - Lawrence H. Boise
- Department of Hematology and Medical Oncology, Winship Cancer Institute, Emory University, Atlanta, GA
| |
Collapse
|
6
|
Validation of De Novo Peptide Sequences with Bottom-Up Tag Convolution. Proteomes 2021; 10:proteomes10010001. [PMID: 35076636 PMCID: PMC8788492 DOI: 10.3390/proteomes10010001] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2021] [Revised: 12/22/2021] [Accepted: 12/23/2021] [Indexed: 11/16/2022] Open
Abstract
De novo sequencing is indispensable for the analysis of proteins from organisms with unknown genomes, novel splice variants, and antibodies. However, despite a variety of methods developed to this end, distinguishing between the correct interpretation of a mass spectrum and a number of incorrect alternatives often remains a challenge. Tag convolution is computed for a set of peptide sequence tags of a fixed length k generated from the input tandem mass spectra and can be viewed as a generalization of the well-known spectral convolution. We demonstrate its utility for validating de novo peptide sequences by using a set of those generated by the algorithm PepNovo+ from high-resolution bottom-up data sets for carbonic anhydrase 2 and the Fab region of alemtuzumab and indicate its further potential applications.
Collapse
|
7
|
Abstract
Peptides play a crucial role in many vitally important functions of living organisms. The goal of peptidomics is the identification of the "peptidome," the whole peptide content of a cell, organ, tissue, body fluid, or organism. In peptidomic or proteomic studies, capillary electrophoresis (CE) is an alternative technique for liquid chromatography. It is a highly efficient and fast separation method requiring extremely low amounts of sample. In peptidomic approaches, CE is commonly combined with mass spectrometric (MS) detection. Most often, CE is coupled with electrospray ionization MS and less frequently with matrix-assisted laser desorption/ionization MS. CE-MS has been employed in numerous studies dealing with determination of peptide biomarkers in different body fluids for various diseases, or in food peptidomic research for the analysis and identification of peptides with special biological activities. In addition to the above topics, sample preparation techniques commonly applied in peptidomics before CE separation and possibilities for peptide identification and quantification by CE-MS or CE-MS/MS methods are discussed in this chapter.
Collapse
|
8
|
Proteomics of Multiple Sclerosis: Inherent Issues in Defining the Pathoetiology and Identifying (Early) Biomarkers. Int J Mol Sci 2021; 22:ijms22147377. [PMID: 34298997 PMCID: PMC8306353 DOI: 10.3390/ijms22147377] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2021] [Revised: 06/25/2021] [Accepted: 06/29/2021] [Indexed: 02/06/2023] Open
Abstract
Multiple Sclerosis (MS) is a demyelinating disease of the human central nervous system having an unconfirmed pathoetiology. Although animal models are used to mimic the pathology and clinical symptoms, no single model successfully replicates the full complexity of MS from its initial clinical identification through disease progression. Most importantly, a lack of preclinical biomarkers is hampering the earliest possible diagnosis and treatment. Notably, the development of rationally targeted therapeutics enabling pre-emptive treatment to halt the disease is also delayed without such biomarkers. Using literature mining and bioinformatic analyses, this review assessed the available proteomic studies of MS patients and animal models to discern (1) whether the models effectively mimic MS; and (2) whether reasonable biomarker candidates have been identified. The implication and necessity of assessing proteoforms and the critical importance of this to identifying rational biomarkers are discussed. Moreover, the challenges of using different proteomic analytical approaches and biological samples are also addressed.
Collapse
|
9
|
Porosnicu I, Butnaru CM, Tiseanu I, Stancu E, Munteanu CVA, Bita BI, Duliu OG, Sima F. Y 2O 3 Nanoparticles and X-ray Radiation-Induced Effects in Melanoma Cells. Molecules 2021; 26:molecules26113403. [PMID: 34199757 PMCID: PMC8200002 DOI: 10.3390/molecules26113403] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2021] [Revised: 05/28/2021] [Accepted: 05/31/2021] [Indexed: 12/20/2022] Open
Abstract
The innovative strategy of using nanoparticles in radiotherapy has become an exciting topic due to the possibility of simultaneously improving local efficiency of radiation in tumors and real-time monitoring of the delivered doses. Yttrium oxide (Y2O3) nanoparticles (NPs) are used in material science to prepare phosphors for various applications including X-ray induced photodynamic therapy and in situ nano-dosimetry, but few available reports only addressed the effect induced in cells by combined exposure to different doses of superficial X-ray radiation and nanoparticles. Herein, we analyzed changes induced in melanoma cells by exposure to different doses of X-ray radiation and various concentrations of Y2O3 NPs. By evaluation of cell mitochondrial activity and production of intracellular reactive oxygen species (ROS), we estimated that 2, 4, and 6 Gy X-ray radiation doses are visibly altering the cells by inducing ROS production with increasing the dose while at 6 Gy the mitochondrial activity is also affected. Separately, high-concentrated solutions of 25, 50, and 100 µg/mL Y2O3 NPs were also found to affect the cells by inducing ROS production with the increase of concentration. Additionally, the colony-forming units assay evidenced a rather synergic effect of NPs and radiation. By adding the NPs to cells before irradiation, a decrease of the number of proliferating cell colonies was observed with increase of X-ray dose. DNA damage was evidenced by quantifying the γ-H2AX foci for cells treated with Y2O3 NPs and exposed to superficial X-ray radiation. Proteomic profile confirmed that a combined effect of 50 µg/mL Y2O3 NPs and 6 Gy X-ray dose induced mitochondria alterations and DNA changes in melanoma cells.
Collapse
Affiliation(s)
- Ioana Porosnicu
- National Institute of Laser Plasma and Radiation Physics, P.O. Box MG-36, 76900 Bucharest-Magurele, Romania; (I.P.); (I.T.); (E.S.); (B.I.B.)
- Faculty of Physics, Doctoral School on Physics, University of Bucharest, 405 Atomistilor Street, 077125 Magurele-Ilfov, Romania;
| | - Cristian M. Butnaru
- National Institute of Laser Plasma and Radiation Physics, P.O. Box MG-36, 76900 Bucharest-Magurele, Romania; (I.P.); (I.T.); (E.S.); (B.I.B.)
- Correspondence: (C.M.B.); (F.S.)
| | - Ion Tiseanu
- National Institute of Laser Plasma and Radiation Physics, P.O. Box MG-36, 76900 Bucharest-Magurele, Romania; (I.P.); (I.T.); (E.S.); (B.I.B.)
| | - Elena Stancu
- National Institute of Laser Plasma and Radiation Physics, P.O. Box MG-36, 76900 Bucharest-Magurele, Romania; (I.P.); (I.T.); (E.S.); (B.I.B.)
| | - Cristian V. A. Munteanu
- Institute of Biochemistry, Romanian Academy, 296 Splaiul Independentei, 060031 Bucharest, Romania;
| | - Bogdan I. Bita
- National Institute of Laser Plasma and Radiation Physics, P.O. Box MG-36, 76900 Bucharest-Magurele, Romania; (I.P.); (I.T.); (E.S.); (B.I.B.)
| | - Octavian G. Duliu
- Faculty of Physics, Doctoral School on Physics, University of Bucharest, 405 Atomistilor Street, 077125 Magurele-Ilfov, Romania;
| | - Felix Sima
- National Institute of Laser Plasma and Radiation Physics, P.O. Box MG-36, 76900 Bucharest-Magurele, Romania; (I.P.); (I.T.); (E.S.); (B.I.B.)
- Correspondence: (C.M.B.); (F.S.)
| |
Collapse
|
10
|
Yang H, Butler ER, Monier SA, Teubl J, Fenyö D, Ueberheide B, Siegel D. A predictive model for vertebrate bone identification from collagen using proteomic mass spectrometry. Sci Rep 2021; 11:10900. [PMID: 34035355 PMCID: PMC8149876 DOI: 10.1038/s41598-021-90231-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2019] [Accepted: 05/06/2021] [Indexed: 11/17/2022] Open
Abstract
Proteogenomics is an increasingly common method for species identification as it allows for rapid and inexpensive interrogation of an unknown organism’s proteome—even when the proteome is partially degraded. The proteomic method typically uses tandem mass spectrometry to survey all peptides detectable in a sample that frequently contains hundreds or thousands of proteins. Species identification is based on detection of a small numbers of species-specific peptides. Genetic analysis of proteins by mass spectrometry, however, is a developing field, and the bone proteome, typically consisting of only two proteins, pushes the limits of this technology. Nearly 20% of highly confident spectra from modern human bone samples identify non-human species when searched against a vertebrate database—as would be necessary with a fragment of unknown bone. These non-human peptides are often the result of current limitations in mass spectrometry or algorithm interpretation errors. Consequently, it is difficult to know if a “species-specific” peptide used to identify a sample is actually present in that sample. Here we evaluate the causes of peptide sequence errors and propose an unbiased, probabilistic approach to determine the likelihood that a species is correctly identified from bone without relying on species-specific peptides.
Collapse
Affiliation(s)
- Heyi Yang
- Office of Chief Medical Examiner, 421 East 26th Street, New York, NY, 10016, USA
| | - Erin R Butler
- Office of Chief Medical Examiner, 421 East 26th Street, New York, NY, 10016, USA
| | - Samantha A Monier
- Office of Chief Medical Examiner, 421 East 26th Street, New York, NY, 10016, USA
| | - Jennifer Teubl
- Institute for Systems Genetics, Department of Biochemistry and Molecular Pharmacology, NYU Grossman School of Medicine, New York, NY, 10016, USA
| | - David Fenyö
- Institute for Systems Genetics, Department of Biochemistry and Molecular Pharmacology, NYU Grossman School of Medicine, New York, NY, 10016, USA
| | - Beatrix Ueberheide
- Institute for Systems Genetics, Department of Biochemistry and Molecular Pharmacology, NYU Grossman School of Medicine, New York, NY, 10016, USA.,Department of Biochemistry and Molecular Pharmacology, Department of Neurology, Director Proteomics Laboratory, Division of Advanced Research Technologies, NYU Grossman School of Medicine, New York, NY, 10016, USA
| | - Donald Siegel
- Office of Chief Medical Examiner, 421 East 26th Street, New York, NY, 10016, USA.
| |
Collapse
|
11
|
Mohsin AZ, Sukor R, Selamat J, Meor Hussin AS, Ismail IH, Jambari NN, Jonet A. A highly selective two-way purification method using liquid chromatography for isolating α S2-casein from goat milk of five different breeds. J Chromatogr B Analyt Technol Biomed Life Sci 2020; 1160:122380. [PMID: 32971369 DOI: 10.1016/j.jchromb.2020.122380] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2020] [Revised: 07/27/2020] [Accepted: 09/07/2020] [Indexed: 12/22/2022]
Abstract
The main challenges in the purification of αS2-casein are due to the low quantity in milk and high homology with other casein subunits, i.e., αS1-casein, β-casein, and κ-casein. To overcome these challenges, the aim of this study was to develop a two-step purification to isolate native αS2-casein in goat milk from five different breeds; British Alpine, Jamnapari, Saanen, Shami, and Toggenburg. The first step of the purification was executed by anion-exchange chromatography under optimal elution conditions followed by size exclusion chromatography. Tryptic peptides from in-gel digestion of purified αS2-casein were sequenced and analyzed by LC-ESI-MS/MS. From 1.05 g of whole casein, the highest yield of αS2-casein (6.7 mg/mL) was obtained from Jamnapari and the lowest yield (2.2 mg/mL) was from Saanen. A single band of pure αS2-casein was observed on SDS-PAGE for all breeds. The αS2-casein showed coverage percentage of amino acid sequence from 76.68 to 92.83%. The two-step purification process developed herein was successfully applied for isolating native αS2-casein from goat milk with high purity, which will allow for future in vitro studies to be conducted on this protein.
Collapse
Affiliation(s)
- Aliah Zannierah Mohsin
- Laboratory of Food Safety and Food Integrity, Institute of Tropical Agriculture and Food Security, Universiti Putra Malaysia, Serdang 43400, Selangor, Malaysia
| | - Rashidah Sukor
- Laboratory of Food Safety and Food Integrity, Institute of Tropical Agriculture and Food Security, Universiti Putra Malaysia, Serdang 43400, Selangor, Malaysia; Faculty of Food Science and Technology, Universiti Putra Malaysia, Serdang 43400, Selangor, Malaysia.
| | - Jinap Selamat
- Laboratory of Food Safety and Food Integrity, Institute of Tropical Agriculture and Food Security, Universiti Putra Malaysia, Serdang 43400, Selangor, Malaysia; Faculty of Food Science and Technology, Universiti Putra Malaysia, Serdang 43400, Selangor, Malaysia
| | - Anis Shobirin Meor Hussin
- Faculty of Food Science and Technology, Universiti Putra Malaysia, Serdang 43400, Selangor, Malaysia
| | - Intan Hakimah Ismail
- Faculty of Medicine, Universiti Putra Malaysia, Serdang 43400, Selangor, Malaysia
| | - Nuzul Noorahya Jambari
- Laboratory of Food Safety and Food Integrity, Institute of Tropical Agriculture and Food Security, Universiti Putra Malaysia, Serdang 43400, Selangor, Malaysia; Faculty of Food Science and Technology, Universiti Putra Malaysia, Serdang 43400, Selangor, Malaysia
| | - Anuar Jonet
- Department of Structural Biology and Biophysics, Malaysia Genome Institute, Kajang 43000, Selangor, Malaysia
| |
Collapse
|
12
|
Takan S, Allmer J. DNMSO; an ontology for representing de novo sequencing results from Tandem-MS data. PeerJ 2020; 8:e10216. [PMID: 33150092 PMCID: PMC7585381 DOI: 10.7717/peerj.10216] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2020] [Accepted: 09/28/2020] [Indexed: 11/20/2022] Open
Abstract
For the identification and sequencing of proteins, mass spectrometry (MS) has become the tool of choice and, as such, drives proteomics. MS/MS spectra need to be assigned a peptide sequence for which two strategies exist. Either database search or de novo sequencing can be employed to establish peptide spectrum matches. For database search, mzIdentML is the current community standard for data representation. There is no community standard for representing de novo sequencing results, but we previously proposed the de novo markup language (DNML). At the moment, each de novo sequencing solution uses different data representation, complicating downstream data integration, which is crucial since ensemble predictions may be more useful than predictions of a single tool. We here propose the de novo MS Ontology (DNMSO), which can, for example, provide many-to-many mappings between spectra and peptide predictions. Additionally, an application programming interface (API) that supports any file operation necessary for de novo sequencing from spectra input to reading, writing, creating, of the DNMSO format, as well as conversion from many other file formats, has been implemented. This API removes all overhead from the production of de novo sequencing tools and allows developers to concentrate on algorithm development completely. We make the API and formal descriptions of the format freely available at https://github.com/savastakan/dnmso.
Collapse
Affiliation(s)
- Savaş Takan
- Department of Computer Engineering, Faculty of Engineering, Izmir Institute of Technology, Izmir, Turkey
| | - Jens Allmer
- Hochschule Ruhr West, University of Applied Sciences, Medical Informatics and Bioinformatics, Institute for Measurement Engineering and Sensor Technology, Mülheim an der Ruhr, Germany
| |
Collapse
|
13
|
Fei Z, Wang K, Chi H. GameTag: A New Sequence Tag Generation Algorithm Based on Cooperative Game Theory. Proteomics 2020; 20:e2000021. [PMID: 32927502 DOI: 10.1002/pmic.202000021] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2020] [Revised: 08/06/2020] [Indexed: 02/02/2023]
Abstract
Sequence tag-based peptide search is a critical technology in proteomics for the characterization of proteins from tandem mass spectrometry data. However, the main reason for hindering the full application of such an approach lies that accurately extracting sequence tags responsible for each experimental spectrum. Toward that end, GameTag, a novel cooperative game framework for sequence tag generation is proposed, which includes a tag generator and a tag discriminator to collaboratively generate sequence tags. Specifically, the tag generator works to extract as many correct tag candidates as possible and the tag discriminator serves to determine the correctness of tag candidates and reduce the total number of output tags simultaneously. Through the dynamic two-player game, the number of extracted tags is decreased while the number of correct tags gets boosted. The performance of the proposed method is also investigated under various hyperparameter and structure settings. Extensive experiments on a wide variety of data sets from different species demonstrate that GameTag outperforms previous state-of-the-art methods, InsPecT, PepNovo+, DirecTag, and the existing tag-extraction method in Open-pFind, increasing by at least 10% the number of spectra extracted more than one correct tag.
Collapse
Affiliation(s)
- Zhengcong Fei
- Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, No. 6 Zhongguancun South Road, Beijing, 100190, China.,University of Chinese Academy of Sciences, No.19(A) Yuquan Road, Shijingshan District, Beijing, 100049, China
| | - Kaifei Wang
- Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, No. 6 Zhongguancun South Road, Beijing, 100190, China.,University of Chinese Academy of Sciences, No.19(A) Yuquan Road, Shijingshan District, Beijing, 100049, China
| | - Hao Chi
- Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, No. 6 Zhongguancun South Road, Beijing, 100190, China.,University of Chinese Academy of Sciences, No.19(A) Yuquan Road, Shijingshan District, Beijing, 100049, China
| |
Collapse
|
14
|
Matyushin DD, Sholokhova AY, Buryak AK. Deep Learning Driven GC-MS Library Search and Its Application for Metabolomics. Anal Chem 2020; 92:11818-11825. [PMID: 32867500 DOI: 10.1021/acs.analchem.0c02082] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
Preliminary compound identification and peak annotation in gas chromatography-mass spectrometry is usually made using mass spectral databases. There are a few algorithms that enable performing a search of a spectrum in a large mass spectral library. In many cases, a library search procedure returns a wrong answer even if a correct compound is contained in a library. In this work, we present a deep learning driven approach to a library search in order to reduce the probability of such cases. Machine learning ranking (learning to rank) is a class of machine learning and deep learning algorithms that perform a comparison (ranking) of objects. This work introduces the usage of deep learning ranking for small molecules identification using low-resolution electron ionization mass spectrometry. Instead of simple similarity measures for two spectra, such as the dot product or the Euclidean distance between vectors that represent spectra, a deep convolutional neural network is used. The deep learning ranking model outperforms other approaches and enables reducing a fraction of wrong answers (at rank-1) by 9-23% depending on the used data set. Spectra from the Golm Metabolome Database, Human Metabolome Database, and FiehnLib were used for testing the model.
Collapse
Affiliation(s)
- Dmitriy D Matyushin
- A.N. Frumkin Institute of Physical Chemistry and Electrochemistry, Russian Academy of Sciences, 31 Leninsky Prospect, Moscow, GSP-1, 119071, Russia
| | - Anastasia Yu Sholokhova
- A.N. Frumkin Institute of Physical Chemistry and Electrochemistry, Russian Academy of Sciences, 31 Leninsky Prospect, Moscow, GSP-1, 119071, Russia
| | - Aleksey K Buryak
- A.N. Frumkin Institute of Physical Chemistry and Electrochemistry, Russian Academy of Sciences, 31 Leninsky Prospect, Moscow, GSP-1, 119071, Russia
| |
Collapse
|
15
|
Ramesh P, Nagarajan V, Khanchandani V, Desai VK, Niranjan V. Proteomic variations of esophageal squamous cell carcinoma revealed by combining RNA-seq proteogenomics and G-PTM search strategy. Heliyon 2020; 6:e04813. [PMID: 32913912 PMCID: PMC7472856 DOI: 10.1016/j.heliyon.2020.e04813] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2020] [Revised: 07/10/2020] [Accepted: 08/25/2020] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND Cancer that arises from epithelial cells of the esophagus is called esophagus squamous cell carcinoma (ESCC) and is mostly observed in developing nations. Evaluation of cancer genomes and its regulation into proteins plays a predominant role in understanding the cancer progressions. Mass-spectrometry-based proteomics is a consequential tool to estimate proteomic variation and posttranslational modifications (PTMs) from standard protein databases. Post-translational modifications play a crucial role in protein folding and PTMs can be accounted for as a biological signal to interpret the structural changes and transition order of proteins. Functional validation of cancer-related mutations can explain the effects of mutations on genes and the identification of Oncogenes and tumor suppressor genes. Therefore, we present a study on protein variations to interpret the structural changes and transition order of proteins in ESCC carcinogenesis. METHODOLOGY We are using a bottom-up proteomics approach with Galaxy-P framework and RNA sequence data analysis to generate the sample-specific databases containing details of RNA splicing and variant peptides. Once the database generated with information on variable modification, only the curated PTMs at specific positions are considered to perform spectral matching. Proteogenomics mapping was performed to identify protein variations in ESCC. RESULTS RNA-sequence proteogenomics with G-PTM (Global Post-Translational Modification) searching strategy has revealed proteomic events including several peptides that contain single amino acid variations, novel splice junction peptides and posttranslationally modified peptides. Proteogenomic mapping exhibited the splice junction peptides mapped predominantly for Malic enzyme exon type (ME-3) and MCM7 protein-coding genes that promote cancer progression, found to be exhibited in ESCC samples. Approximately 25 ± types of PTM modifications were recorded, and Protein Phosphorylation was largely noted. CONCLUSION ESCC cancer prognosis at the molecular level enables a better understanding of cancer carcinogenesis and protein modifications can be used as potential biomarkers.
Collapse
Affiliation(s)
- Pooja Ramesh
- Department of Biotechnology, RV College of Engineering, Bangalore, Karnataka, India
| | | | - Vartika Khanchandani
- Department of Biotechnology, RV College of Engineering, Bangalore, Karnataka, India
| | - Vasanth Kumar Desai
- Department of Biotechnology, RV College of Engineering, Bangalore, Karnataka, India
| | - Vidya Niranjan
- Department of Biotechnology, RV College of Engineering, Bangalore, Karnataka, India
| |
Collapse
|
16
|
Dong N, Spencer DM, Quan Q, Le Blanc JCY, Feng J, Li M, Siu KWM, Chu IK. rPTMDetermine: A Fully Automated Methodology for Endogenous Tyrosine Nitration Validation, Site-Localization, and Beyond. Anal Chem 2020; 92:10768-10776. [DOI: 10.1021/acs.analchem.0c02148] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Affiliation(s)
- Naiping Dong
- Department of Chemistry, The University of Hong Kong, Pokfulam, Hong Kong, China
| | - Daniel M. Spencer
- Department of Chemistry, The University of Hong Kong, Pokfulam, Hong Kong, China
| | - Quan Quan
- Department of Chemistry, The University of Hong Kong, Pokfulam, Hong Kong, China
| | | | - Jinwen Feng
- Department of Chemistry, The University of Hong Kong, Pokfulam, Hong Kong, China
| | - Mengzhu Li
- Department of Chemistry, The University of Hong Kong, Pokfulam, Hong Kong, China
| | - K. W. Michael Siu
- Department of Chemistry, The University of Hong Kong, Pokfulam, Hong Kong, China
- Department of Chemistry and Centre for Research in Mass Spectrometry, York University, Toronto, Ontario M3J 1P3, Canada
- Department of Chemistry and Biochemistry, University of Windsor, Windsor, Ontario N9B 3P4, Canada
| | - Ivan K. Chu
- Department of Chemistry, The University of Hong Kong, Pokfulam, Hong Kong, China
| |
Collapse
|
17
|
Araujo NA, Rincón M, Vonasek E, Calabokis M, Bubis J. Biochemical characterization of the cAMP-dependent protein kinase regulatory subunit-like protein from Trypanosoma equiperdum, detection of its inhibitory activity, and identification of potential interacting proteins. Biochimie 2020; 168:110-123. [DOI: 10.1016/j.biochi.2019.10.020] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2019] [Accepted: 10/31/2019] [Indexed: 11/26/2022]
|
18
|
Awad D, Brueck T. Optimization of protein isolation by proteomic qualification from Cutaneotrichosporon oleaginosus. Anal Bioanal Chem 2020; 412:449-462. [PMID: 31797019 PMCID: PMC6992551 DOI: 10.1007/s00216-019-02254-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2019] [Revised: 10/23/2019] [Accepted: 10/30/2019] [Indexed: 11/03/2022]
Abstract
In the last decades, microbial oils have been extensively investigated as a renewable platform for biofuel and oleochemical production. Offering a potent alternative to plant-based oils, oleaginous microorganisms have been the target of ongoing metabolic engineering aimed at increasing growth and lipid yields, in addition to specialty fatty acids. Discovery proteomics is an attractive tool for elucidating lipogenesis and identifying metabolic bottlenecks, feedback regulation, and competing biosynthetic pathways. One prominent microbial oil producer is Cutaneotrichosporon oleaginosus, due to its broad feedstock catabolism and high lipid yield. However, this yeast has a recalcitrant cell wall and high cell lipid content, which complicates efficient and unbiased protein extraction for downstream proteomic analysis. Optimization efforts of protein sample preparation from C. oleaginosus in the present study encompasses the comparison of 8 lysis methods, 13 extraction buffers, and 17 purification methods with respect to protein abundance, proteome coverage, applicability, and physiochemical properties (pI, MW, hydrophobicity in addition to COG, and GO analysis). The optimized protocol presented in this work entails a one-step extraction method utilizing an optimal lysis method (liquid homogenization), which is augmented with a superior extraction buffer (50 mM Tris, 8/2 M Urea/Thiourea, and 1% C7BzO), followed by either of 2 advantageous purification methods (hexane/ethanol or TCA/acetone), depending on subsequent applications and target studies. This work presents a significant step forward towards implementation of efficient C. oleaginosus proteome mining for the identification of potential targets for genetic optimization of this yeast to improve lipogenesis and production of specialty lipids. Graphical abstract.
Collapse
Affiliation(s)
- Dania Awad
- Werner Siemens-Lehrstuhl für Synthetische Biotechnologie, Technische Universität München, Lichtenbergstrasse 4, 85748, Garching, Germany
| | - Thomas Brueck
- Werner Siemens-Lehrstuhl für Synthetische Biotechnologie, Technische Universität München, Lichtenbergstrasse 4, 85748, Garching, Germany.
| |
Collapse
|
19
|
Li C, Li K, Li K, Xie X, Lin F. SWPepNovo: An Efficient De Novo Peptide Sequencing Tool for Large-scale MS/MS Spectra Analysis. Int J Biol Sci 2019; 15:1787-1801. [PMID: 31523183 PMCID: PMC6743289 DOI: 10.7150/ijbs.32142] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2018] [Accepted: 04/09/2019] [Indexed: 12/17/2022] Open
Abstract
Tandem mass spectrometry (MS/MS)-based de novo peptide sequencing is a powerful method for high-throughput protein analysis. However, the explosively increasing size of MS/MS spectra dataset inevitably and exponentially raises the computational demand of existing de novo peptide sequencing methods, which is an issue urgently to be solved in computational biology. This paper introduces an efficient tool based on SW26010 many-core processor, namely SWPepNovo, to process the large-scale peptide MS/MS spectra using a parallel peptide spectrum matches (PSMs) algorithm. Our design employs a two-level parallelization mechanism: (1) the task-level parallelism between MPEs using MPI based on a data transformation method and a dynamic feedback task scheduling algorithm, (2) the thread-level parallelism across CPEs using asynchronous task transfer and multithreading. Moreover, three optimization strategies, including vectorization, double buffering and memory access optimizations, have been employed to overcome both the compute-bound and the memory-bound bottlenecks in the parallel PSMs algorithm. The results of experiments conducted on multiple spectra datasets demonstrate the performance of SWPepNovo against three state-of-the-art tools for peptide sequencing, including PepNovo+, PEAKS and DeepNovo-DIA. The SWPepNovo also shows high scalability in experiments on extremely large datasets sized up to 11.22 GB. The software and the parameter settings are available at https://github.com/ChuangLi99/SWPepNovo.
Collapse
Affiliation(s)
- Chuang Li
- College of Information Science and Engineering, Hunan University, Changsha, China
| | - Kenli Li
- College of Information Science and Engineering, Hunan University, National Supercomputing Center in Changsha, Changsha, China
| | - Keqin Li
- College of Information Science and Engineering, Hunan University, Department of Computer Science, State University of New York, NY, USA
| | - Xianghui Xie
- State Key Laboratory of Mathematic Engineering and Advance Computing, Wuxi Jiangnan Institute of Computing Technology, Jiangsu, China
| | - Feng Lin
- School of Computer Science and Engineering, Nanyang Technological University, Singapore
| |
Collapse
|
20
|
Ser Z, Cifani P, Kentsis A. Optimized Cross-Linking Mass Spectrometry for in Situ Interaction Proteomics. J Proteome Res 2019; 18:2545-2558. [PMID: 31083951 DOI: 10.1021/acs.jproteome.9b00085] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Recent development of mass spectrometer cleavable protein cross-linkers and algorithms for their spectral identification now permits large-scale cross-linking mass spectrometry (XL-MS). Here, we optimized the use of cleavable disuccinimidyl sulfoxide (DSSO) cross-linker for labeling native protein complexes in live human cells. We applied a generalized linear mixture model to calibrate cross-link peptide-spectra matching (CSM) scores to control the sensitivity and specificity of large-scale XL-MS. Using specific CSM score thresholds to control the false discovery rate, we found that higher-energy collisional dissociation (HCD) and electron transfer dissociation (ETD) can both be effective for large-scale XL-MS protein interaction mapping. We found that the coverage of protein-protein interaction maps is significantly improved through the use of multiple proteases. In addition, the use of focused sample-specific search databases can be used to improve the specificity of cross-linked peptide spectral matching. Application of this approach to human chromatin labeled in live cells recapitulated known and revealed new protein interactions of nucleosomes and other chromatin-associated complexes in situ. This optimized approach for mapping native protein interactions should be useful for a wide range of biological problems.
Collapse
Affiliation(s)
| | | | - Alex Kentsis
- Department of Pediatrics, Pharmacology, and Physiology & Biophysics, Weill Cornell Medical College , Cornell University , New York , New York 10065 , United States
| |
Collapse
|
21
|
Maes E, Oeyen E, Boonen K, Schildermans K, Mertens I, Pauwels P, Valkenborg D, Baggerman G. The challenges of peptidomics in complementing proteomics in a clinical context. MASS SPECTROMETRY REVIEWS 2019; 38:253-264. [PMID: 30372792 DOI: 10.1002/mas.21581] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/14/2016] [Accepted: 10/01/2018] [Indexed: 06/08/2023]
Abstract
Naturally occurring peptides, including growth factors, hormones, and neurotransmitters, represent an important class of biomolecules and have crucial roles in human physiology. The study of these peptides in clinical samples is therefore as relevant as ever. Compared to more routine proteomics applications in clinical research, peptidomics research questions are more challenging and have special requirements with regard to sample handling, experimental design, and bioinformatics. In this review, we describe the issues that confront peptidomics in a clinical context. After these hurdles are (partially) overcome, peptidomics will be ready for a successful translation into medical practice.
Collapse
Affiliation(s)
- Evelyne Maes
- Flemish Institute for Technological Research (VITO), Mol, Belgium
- Centre for Proteomics, University of Antwerp, Antwerp, Belgium
- Food and Bio-Based Products, AgResearch Ltd., Lincoln, New Zealand
| | - Eline Oeyen
- Flemish Institute for Technological Research (VITO), Mol, Belgium
- Centre for Proteomics, University of Antwerp, Antwerp, Belgium
| | - Kurt Boonen
- Flemish Institute for Technological Research (VITO), Mol, Belgium
- Centre for Proteomics, University of Antwerp, Antwerp, Belgium
| | - Karin Schildermans
- Flemish Institute for Technological Research (VITO), Mol, Belgium
- Centre for Proteomics, University of Antwerp, Antwerp, Belgium
| | - Inge Mertens
- Flemish Institute for Technological Research (VITO), Mol, Belgium
- Centre for Proteomics, University of Antwerp, Antwerp, Belgium
| | - Patrick Pauwels
- Molecular Pathology Unit, Department of Pathology, Antwerp University Hospital, Edegem, Belgium
| | - Dirk Valkenborg
- Flemish Institute for Technological Research (VITO), Mol, Belgium
- Centre for Proteomics, University of Antwerp, Antwerp, Belgium
- Center for Statistics, Hasselt University, Diepenbeek, Belgium
| | - Geert Baggerman
- Flemish Institute for Technological Research (VITO), Mol, Belgium
- Centre for Proteomics, University of Antwerp, Antwerp, Belgium
| |
Collapse
|
22
|
Diharce J, Cueto M, Beltramo M, Aucagne V, Bonnet P. In Silico Peptide Ligation: Iterative Residue Docking and Linking as a New Approach to Predict Protein-Peptide Interactions. Molecules 2019; 24:E1351. [PMID: 30959812 PMCID: PMC6480567 DOI: 10.3390/molecules24071351] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2019] [Revised: 04/02/2019] [Accepted: 04/03/2019] [Indexed: 11/16/2022] Open
Abstract
Peptide⁻protein interactions are corner-stones of living functions involved in essential mechanisms, such as cell signaling. Given the difficulty of obtaining direct experimental structural biology data, prediction of those interactions is of crucial interest for the rational development of new drugs, notably to fight diseases, such as cancer or Alzheimer's disease. Because of the high flexibility of natural unconstrained linear peptides, prediction of their binding mode in a protein cavity remains challenging. Several theoretical approaches have been developed in the last decade to address this issue. Nevertheless, improvements are needed, such as the conformation prediction of peptide side-chains, which are dependent on peptide length and flexibility. Here, we present a novel in silico method, Iterative Residue Docking and Linking (IRDL), to efficiently predict peptide⁻protein interactions. In order to reduce the conformational space, this innovative method splits peptides into several short segments. Then, it uses the performance of intramolecular covalent docking to rebuild, sequentially, the complete peptide in the active site of its protein target. Once the peptide is constructed, a rescoring step is applied in order to correctly rank all IRDL solutions. Applied on a set of 11 crystallized peptide⁻protein complexes, the IRDL method shows promising results, since it is able to retrieve experimental binding conformations with a Root Mean Square Deviation (RMSD) below 2 Å in the top five ranked solutions. For some complexes, IRDL method outperforms two other docking protocols evaluated in this study. Hence, IRDL is a new tool that could be used in drug design projects to predict peptide⁻protein interactions.
Collapse
Affiliation(s)
- Julien Diharce
- Institut de Chimie Organique et Analytique (ICOA), UMR CNRS-Université d'Orléans 7311, Université d'Orléans BP 6759, 45067, Orléans CEDEX 2, France.
| | - Mickaël Cueto
- Institut de Chimie Organique et Analytique (ICOA), UMR CNRS-Université d'Orléans 7311, Université d'Orléans BP 6759, 45067, Orléans CEDEX 2, France.
| | - Massimiliano Beltramo
- UMR Physiologie de la Reproduction et des Comportements (INRA, UMR85; CNRS, UMR7247; Universitéde Tours; IFCE), F-37380 Nouzilly, France.
| | - Vincent Aucagne
- Centre de Biophysique Moléculaire (CNRS UPR4301), Rue Charles Sadron, F-45071 Orléans cedex 2, France.
| | - Pascal Bonnet
- Institut de Chimie Organique et Analytique (ICOA), UMR CNRS-Université d'Orléans 7311, Université d'Orléans BP 6759, 45067, Orléans CEDEX 2, France.
| |
Collapse
|
23
|
Miller SE, Rizzo AI, Waldbauer JR. Postnovo: Postprocessing Enables Accurate and FDR-Controlled de Novo Peptide Sequencing. J Proteome Res 2018; 17:3671-3680. [PMID: 30277077 DOI: 10.1021/acs.jproteome.8b00278] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
De novo sequencing offers an alternative to database search methods for peptide identification from mass spectra. Since it does not rely on a predetermined database of expected or potential sequences in the sample, de novo sequencing is particularly appropriate for samples lacking a well-defined or comprehensive reference database. However, the low accuracy of many de novo sequence predictions has prevented the widespread use of the variety of sequencing tools currently available. Here, we present a new open-source tool, Postnovo, that postprocesses de novo sequence predictions to find high-accuracy results. Postnovo uses a predictive model to rescore and rerank candidate sequences in a manner akin to database search postprocessing tools such as Percolator. Postnovo leverages the output from multiple de novo sequencing tools in its own analyses, producing many times the length of amino acid sequence information (including both full- and partial-length peptide sequences) at an equivalent false discovery rate (FDR) compared to any individual tool. We present a methodology to reliably screen the sequence predictions to a desired FDR given the Postnovo sequence score. We validate Postnovo with multiple data sets and demonstrate its ability to identify proteins that are missed by database search even in samples with paired reference databases.
Collapse
Affiliation(s)
- Samuel E Miller
- Department of the Geophysical Sciences , University of Chicago , 5734 South Ellis Avenue , Chicago , Illinois 60637 , United States
| | - Adriana I Rizzo
- Department of the Geophysical Sciences , University of Chicago , 5734 South Ellis Avenue , Chicago , Illinois 60637 , United States
| | - Jacob R Waldbauer
- Department of the Geophysical Sciences , University of Chicago , 5734 South Ellis Avenue , Chicago , Illinois 60637 , United States
| |
Collapse
|
24
|
RNA editing derived epitopes function as cancer antigens to elicit immune responses. Nat Commun 2018; 9:3919. [PMID: 30254248 PMCID: PMC6156571 DOI: 10.1038/s41467-018-06405-9] [Citation(s) in RCA: 97] [Impact Index Per Article: 16.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2018] [Accepted: 07/30/2018] [Indexed: 02/08/2023] Open
Abstract
In addition to genomic mutations, RNA editing is another major mechanism creating sequence variations in proteins by introducing nucleotide changes in mRNA sequences. Deregulated RNA editing contributes to different types of human diseases, including cancers. Here we report that peptides generated as a consequence of RNA editing are indeed naturally presented by human leukocyte antigen (HLA) molecules. We provide evidence that effector CD8+ T cells specific for edited peptides derived from cyclin I are present in human tumours and attack tumour cells that are presenting these epitopes. We show that subpopulations of cancer patients have increased peptide levels and that levels of edited RNA correlate with peptide copy numbers. These findings demonstrate that RNA editing extends the classes of HLA presented self-antigens and that these antigens can be recognised by the immune system. RNA editing is a biological process that creates sequence variation. Here the authors show that peptides generated as a consequence of RNA editing are naturally presented by human leukocyte antigen (HLA) and serve as antigens to elicit anti-tumour immune responses.
Collapse
|
25
|
Su Q, Hu F, Ge X, Lei J, Yu S, Wang T, Zhou Q, Mei C, Shi Y. Structure of the human PKD1-PKD2 complex. Science 2018; 361:science.aat9819. [DOI: 10.1126/science.aat9819] [Citation(s) in RCA: 120] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2018] [Accepted: 07/30/2018] [Indexed: 12/12/2022]
Abstract
Mutations in two genes, PKD1 and PKD2, account for most cases of autosomal dominant polycystic kidney disease, one of the most common monogenetic disorders. Here we report the 3.6-angstrom cryo–electron microscopy structure of truncated human PKD1-PKD2 complex assembled in a 1:3 ratio. PKD1 contains a voltage-gated ion channel (VGIC) fold that interacts with PKD2 to form the domain-swapped, yet noncanonical, transient receptor potential (TRP) channel architecture. The S6 helix in PKD1 is broken in the middle, with the extracellular half, S6a, resembling pore helix 1 in a typical TRP channel. Three positively charged, cavity-facing residues on S6b may block cation permeation. In addition to the VGIC, a five–transmembrane helix domain and a cytosolic PLAT domain were resolved in PKD1. The PKD1-PKD2 complex structure establishes a framework for dissecting the function and disease mechanisms of the PKD proteins.
Collapse
|
26
|
Lewis NH, Hitchcock DB, Dryden IL, Rose JR. Peptide refinement by using a stochastic search. J R Stat Soc Ser C Appl Stat 2018. [DOI: 10.1111/rssc.12280] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
|
27
|
Adeola HA, Van Wyk JC, Arowolo A, Ngwanya RM, Mkentane K, Khumalo NP. Emerging Diagnostic and Therapeutic Potentials of Human Hair Proteomics. Proteomics Clin Appl 2017; 12. [PMID: 28960873 DOI: 10.1002/prca.201700048] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2017] [Revised: 06/09/2017] [Indexed: 01/22/2023]
Abstract
The use of noninvasive human substrates to interrogate pathophysiological conditions has become essential in the post- Human Genome Project era. Due to its high turnover rate, and its long term capability to incorporate exogenous and endogenous substances from the circulation, hair testing is emerging as a key player in monitoring long term drug compliance, chronic alcohol abuse, forensic toxicology, and biomarker discovery, among other things. Novel high-throughput 'omics based approaches like proteomics have been underutilized globally in comprehending human hair morphology and its evolving use as a diagnostic testing substrate in the era of precision medicine. There is paucity of scientific evidence that evaluates the difference in drug incorporation into hair based on lipid content, and very few studies have addressed hair growth rates, hair forms, and the biological consequences of hair grooming or bleaching. It is apparent that protein-based identification using the human hair proteome would play a major role in understanding these parameters akin to DNA single nucleotide polymorphism profiling, up to single amino acid polymorphism resolution. Hence, this work seeks to identify and discuss the progress made thus far in the field of molecular hair testing using proteomic approaches, and identify ways in which proteomics would improve the field of hair research, considering that the human hair is mostly composed of proteins. Gaps in hair proteomics research are identified and the potential of hair proteomics in establishing a historic medical repository of normal and disease-specific proteome is also discussed.
Collapse
Affiliation(s)
- Henry A Adeola
- Division of Dermatology, Department of Medicine, Faculty of Health Sciences and Groote Schuur Hospital, University of Cape Town, Cape Town, South Africa.,Hair and Skin Research Laboratory, Groote Schuur Hospital, Cape Town, South Africa
| | - Jennifer C Van Wyk
- Division of Dermatology, Department of Medicine, Faculty of Health Sciences and Groote Schuur Hospital, University of Cape Town, Cape Town, South Africa.,Hair and Skin Research Laboratory, Groote Schuur Hospital, Cape Town, South Africa
| | - Afolake Arowolo
- Division of Dermatology, Department of Medicine, Faculty of Health Sciences and Groote Schuur Hospital, University of Cape Town, Cape Town, South Africa.,Hair and Skin Research Laboratory, Groote Schuur Hospital, Cape Town, South Africa
| | - Reginald M Ngwanya
- Division of Dermatology, Department of Medicine, Faculty of Health Sciences and Groote Schuur Hospital, University of Cape Town, Cape Town, South Africa
| | - Khwezikazi Mkentane
- Division of Dermatology, Department of Medicine, Faculty of Health Sciences and Groote Schuur Hospital, University of Cape Town, Cape Town, South Africa.,Hair and Skin Research Laboratory, Groote Schuur Hospital, Cape Town, South Africa
| | - Nonhlanhla P Khumalo
- Division of Dermatology, Department of Medicine, Faculty of Health Sciences and Groote Schuur Hospital, University of Cape Town, Cape Town, South Africa.,Hair and Skin Research Laboratory, Groote Schuur Hospital, Cape Town, South Africa
| |
Collapse
|
28
|
Shao W, Lam H. Tandem mass spectral libraries of peptides and their roles in proteomics research. MASS SPECTROMETRY REVIEWS 2017; 36:634-648. [PMID: 27403644 DOI: 10.1002/mas.21512] [Citation(s) in RCA: 38] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/30/2015] [Accepted: 05/21/2016] [Indexed: 05/15/2023]
Abstract
Proteomics is a rapidly maturing field aimed at the high-throughput identification and quantification of all proteins in a biological system. The cornerstone of proteomic technology is tandem mass spectrometry of peptides resulting from the digestion of protein mixtures. The fragmentation pattern of each peptide ion is captured in its tandem mass spectrum, which enables its identification and acts as a fingerprint for the peptide. Spectral libraries are simply searchable collections of these fingerprints, which have taken on an increasingly prominent role in proteomic data analysis. This review describes the historical development of spectral libraries in proteomics, details the computational procedures behind library building and searching, surveys the current applications of spectral libraries, and discusses the outstanding challenges. © 2016 Wiley Periodicals, Inc. Mass Spec Rev 36:634-648, 2017.
Collapse
Affiliation(s)
- Wenguang Shao
- Department of Biology, Institute of Molecular Systems Biology, Eidgenössische Technische Hochschule (ETH) Zurich, Zurich, Switzerland
- Division of Biomedical Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong
| | - Henry Lam
- Division of Biomedical Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong
- Department of Chemical and Biomolecular Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong
| |
Collapse
|
29
|
|
30
|
Hu H, Khatri K, Zaia J. Algorithms and design strategies towards automated glycoproteomics analysis. MASS SPECTROMETRY REVIEWS 2017; 36:475-498. [PMID: 26728195 PMCID: PMC4931994 DOI: 10.1002/mas.21487] [Citation(s) in RCA: 71] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/10/2015] [Accepted: 11/30/2015] [Indexed: 05/09/2023]
Abstract
Glycoproteomics involves the study of glycosylation events on protein sequences ranging from purified proteins to whole proteome scales. Understanding these complex post-translational modification (PTM) events requires elucidation of the glycan moieties (monosaccharide sequences and glycosidic linkages between residues), protein sequences, as well as site-specific attachment of glycan moieties onto protein sequences, in a spatial and temporal manner in a variety of biological contexts. Compared with proteomics, bioinformatics for glycoproteomics is immature and many researchers still rely on tedious manual interpretation of glycoproteomics data. As sample preparation protocols and analysis techniques have matured, the number of publications on glycoproteomics and bioinformatics has increased substantially; however, the lack of consensus on tool development and code reuse limits the dissemination of bioinformatics tools because it requires significant effort to migrate a computational tool tailored for one method design to alternative methods. This review discusses algorithms and methods in glycoproteomics, and refers to the general proteomics field for potential solutions. It also introduces general strategies for tool integration and pipeline construction in order to better serve the glycoproteomics community. © 2016 Wiley Periodicals, Inc. Mass Spec Rev 36:475-498, 2017.
Collapse
Affiliation(s)
- Han Hu
- Bioinformatics Program, Boston University, Boston, Massachusetts 02215, USA
- Center for Biomedical Mass Spectrometry, Department of Biochemistry, Boston University School of Medicine, Boston University, Boston, Massachusetts 02118, USA
| | - Kshitij Khatri
- Center for Biomedical Mass Spectrometry, Department of Biochemistry, Boston University School of Medicine, Boston University, Boston, Massachusetts 02118, USA
| | - Joseph Zaia
- Center for Biomedical Mass Spectrometry, Department of Biochemistry, Boston University School of Medicine, Boston University, Boston, Massachusetts 02118, USA
| |
Collapse
|
31
|
Horton AP, Robotham SA, Cannon JR, Holden DD, Marcotte EM, Brodbelt JS. Comprehensive de Novo Peptide Sequencing from MS/MS Pairs Generated through Complementary Collision Induced Dissociation and 351 nm Ultraviolet Photodissociation. Anal Chem 2017; 89:3747-3753. [PMID: 28234449 PMCID: PMC5480239 DOI: 10.1021/acs.analchem.7b00130] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
We describe a strategy for de novo peptide sequencing based on matched pairs of tandem mass spectra (MS/MS) obtained by collision induced dissociation (CID) and 351 nm ultraviolet photodissociation (UVPD). Each precursor ion is isolated twice with the mass spectrometer switching between CID and UVPD activation modes to obtain a complementary MS/MS pair. To interpret these paired spectra, we modified the UVnovo de novo sequencing software to automatically learn from and interpret fragmentation spectra, provided a representative set of training data. This machine learning procedure, using random forests, synthesizes information from one or multiple complementary spectra, such as the CID/UVPD pairs, into peptide fragmentation site predictions. In doing so, the burden of fragmentation model definition shifts from programmer to machine and opens up the model parameter space for inclusion of nonobvious features and interactions. This spectral synthesis also serves to transform distinct types of spectra into a common representation for subsequent activation-independent processing steps. Then, independent from precursor activation constraints, UVnovo's de novo sequencing procedure generates and scores sequence candidates for each precursor. We demonstrate the combined experimental and computational approach for de novo sequencing using whole cell E. coli lysate. In benchmarks on the CID/UVPD data, UVnovo assigned correct full-length sequences to 83% of the spectral pairs of doubly charged ions with high-confidence database identifications. Considering only top-ranked de novo predictions, 70% of the pairs were deciphered correctly. This de novo sequencing performance exceeds that of PEAKS and PepNovo on the CID spectra and that of UVnovo on CID or UVPD spectra alone. As presented here, the methods for paired CID/UVPD spectral acquisition and interpretation constitute a powerful workflow for high-throughput and accurate de novo peptide sequencing.
Collapse
Affiliation(s)
- Andrew P Horton
- Center for Systems and Synthetic Biology, Department of Molecular Biosciences, University of Texas , Austin, Texas 78712, United States
| | - Scott A Robotham
- Department of Chemistry, University of Texas , Austin, Texas 78712, United States
| | - Joe R Cannon
- Department of Chemistry, University of Texas , Austin, Texas 78712, United States
| | - Dustin D Holden
- Department of Chemistry, University of Texas , Austin, Texas 78712, United States
| | - Edward M Marcotte
- Center for Systems and Synthetic Biology, Department of Molecular Biosciences, University of Texas , Austin, Texas 78712, United States
| | - Jennifer S Brodbelt
- Department of Chemistry, University of Texas , Austin, Texas 78712, United States
| |
Collapse
|
32
|
Dereplication of peptidic natural products through database search of mass spectra. Nat Chem Biol 2016; 13:30-37. [PMID: 27820803 PMCID: PMC5409158 DOI: 10.1038/nchembio.2219] [Citation(s) in RCA: 157] [Impact Index Per Article: 19.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2015] [Accepted: 08/17/2016] [Indexed: 11/08/2022]
Abstract
Peptidic Natural Products (PNPs) are widely used compounds that include many antibiotics and a variety of other bioactive peptides. While recent breakthroughs in PNP discovery raised the challenge of developing new algorithms for their analysis, identification of PNPs via database search of tandem mass spectra remains an open problem. To address this problem, natural product researchers utilize dereplication strategies that identify known PNPs and lead to the discovery of new ones even in cases when the reference spectra are not present in existing spectral libraries. DEREPLICATOR is a new dereplication algorithm that enabled high-throughput PNP identification and that is compatible with large-scale mass spectrometry-based screening platforms for natural product discovery. After searching nearly one hundred million tandem mass spectra in the Global Natural Products Social (GNPS) molecular networking infrastructure, DEREPLICATOR identified an order of magnitude more PNPs (and their new variants) than any previous dereplication efforts.
Collapse
|
33
|
Gorshkov V, Hotta SYK, Verano-Braga T, Kjeldsen F. Peptide de novo sequencing of mixture tandem mass spectra. Proteomics 2016; 16:2470-9. [PMID: 27329701 PMCID: PMC5297990 DOI: 10.1002/pmic.201500549] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2015] [Revised: 04/27/2016] [Accepted: 06/17/2016] [Indexed: 02/02/2023]
Abstract
The impact of mixture spectra deconvolution on the performance of four popular de novo sequencing programs was tested using artificially constructed mixture spectra as well as experimental proteomics data. Mixture fragmentation spectra are recognized as a limitation in proteomics because they decrease the identification performance using database search engines. De novo sequencing approaches are expected to be even more sensitive to the reduction in mass spectrum quality resulting from peptide precursor co‐isolation and thus prone to false identifications. The deconvolution approach matched complementary b‐, y‐ions to each precursor peptide mass, which allowed the creation of virtual spectra containing sequence specific fragment ions of each co‐isolated peptide. Deconvolution processing resulted in equally efficient identification rates but increased the absolute number of correctly sequenced peptides. The improvement was in the range of 20–35% additional peptide identifications for a HeLa lysate sample. Some correct sequences were identified only using unprocessed spectra; however, the number of these was lower than those where improvement was obtained by mass spectral deconvolution. Tight candidate peptide score distribution and high sensitivity to small changes in the mass spectrum introduced by the employed deconvolution method could explain some of the missing peptide identifications.
Collapse
Affiliation(s)
- Vladimir Gorshkov
- Department of Biochemistry and Molecular Biology, University of Southern Denmark Odense M, Odense, Denmark.
| | | | - Thiago Verano-Braga
- Department of Biochemistry and Molecular Biology, University of Southern Denmark Odense M, Odense, Denmark.,Department of Physiology and Biophysics, Federal University of Minas Gerais Belo Horizonte - MG, Belo Horizonte, Brazil
| | - Frank Kjeldsen
- Department of Biochemistry and Molecular Biology, University of Southern Denmark Odense M, Odense, Denmark
| |
Collapse
|
34
|
Detection of acid and hop shock induced responses in beer spoiling Lactobacillus brevis by MALDI-TOF MS. Food Microbiol 2015; 46:501-506. [DOI: 10.1016/j.fm.2014.09.018] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2014] [Revised: 08/27/2014] [Accepted: 09/28/2014] [Indexed: 11/24/2022]
|
35
|
Chapman B, Bellgard M. High-throughput parallel proteogenomics: A bacterial case study. Proteomics 2014; 14:2780-9. [DOI: 10.1002/pmic.201400185] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2014] [Revised: 10/11/2014] [Accepted: 10/22/2014] [Indexed: 11/08/2022]
Affiliation(s)
- Brett Chapman
- Centre for Comparative Genomics; Murdoch University; Western Australia Australia
| | - Matthew Bellgard
- Centre for Comparative Genomics; Murdoch University; Western Australia Australia
| |
Collapse
|
36
|
Lu L, Wang J, Xu Y, Wang K, Hu Y, Tian R, Yang B, Lai Q, Li Y, Zhang W, Shao Z, Lam H, Qian PY. A high-resolution LC-MS-based secondary metabolite fingerprint database of marine bacteria. Sci Rep 2014; 4:6537. [PMID: 25298017 PMCID: PMC5377448 DOI: 10.1038/srep06537] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2014] [Accepted: 09/04/2014] [Indexed: 01/01/2023] Open
Abstract
Marine bacteria are the most widely distributed organisms in the ocean environment and produce a wide variety of secondary metabolites. However, traditional screening for bioactive natural compounds is greatly hindered by the lack of a systematic way of cataloguing the chemical profiles of bacterial strains found in nature. Here we present a chemical fingerprint database of marine bacteria based on their secondary metabolite profiles, acquired by high-resolution LC-MS. Till now, 1,430 bacterial strains spanning 168 known species collected from different marine environments were cultured and profiled. Using this database, we demonstrated that secondary metabolite profile similarity is approximately, but not always, correlated with taxonomical similarity. We also validated the ability of this database to find species-specific metabolites, as well as to discover known bioactive compounds from previously unknown sources. An online interface to this database, as well as the accompanying software, is provided freely for the community to use.
Collapse
Affiliation(s)
- Liang Lu
- 1] Environmental Science Program, School of Science, Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong 999077, China [2]
| | - Jijie Wang
- 1] Division of Biomedical Engineering, Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong 999077, China [2]
| | - Ying Xu
- Division of Life Science, Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong 999077, China
| | - Kailing Wang
- School of Medicine and Pharmacy, Ocean University of China, Qingdao 266003, China
| | - Yingwei Hu
- Department of Chemical and Biomolecular Engineering, Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong 999077, China
| | - Renmao Tian
- Environmental Science Program, School of Science, Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong 999077, China
| | - Bo Yang
- Division of Life Science, Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong 999077, China
| | - Qiliang Lai
- Third Institute of Oceanography, State Oceanic Administration, Xiamen 361005, China
| | - Yongxin Li
- Division of Life Science, Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong 999077, China
| | - Weipeng Zhang
- Environmental Science Program, School of Science, Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong 999077, China
| | - Zongze Shao
- Third Institute of Oceanography, State Oceanic Administration, Xiamen 361005, China
| | - Henry Lam
- 1] Division of Biomedical Engineering, Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong 999077, China [2] Department of Chemical and Biomolecular Engineering, Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong 999077, China
| | - Pei-Yuan Qian
- 1] Environmental Science Program, School of Science, Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong 999077, China [2] Division of Life Science, Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong 999077, China
| |
Collapse
|
37
|
Zelanis A, Keiji Tashima A. Unraveling snake venom complexity with ‘omics’ approaches: Challenges and perspectives. Toxicon 2014; 87:131-4. [DOI: 10.1016/j.toxicon.2014.05.011] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2014] [Revised: 04/14/2014] [Accepted: 05/07/2014] [Indexed: 11/29/2022]
|
38
|
Dong NP, Liang YZ, Xu QS, Mok DKW, Yi LZ, Lu HM, He M, Fan W. Prediction of Peptide Fragment Ion Mass Spectra by Data Mining Techniques. Anal Chem 2014; 86:7446-54. [DOI: 10.1021/ac501094m] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Affiliation(s)
| | | | | | - Daniel K. W. Mok
- Department
of Applied Biology and Chemical Technology, The Hong Kong Polytechnic University, Hung Hom, Hong Kong
- State Key Laboratory of Chinese Medicine and Molecular Pharmacology (Incubation), Shenzhen, 518000, P. R. China
| | - Lun-zhao Yi
- Yunnan
Food Safety Research Institute, Kunming University of Science and Technology, Kunming, 650500, P. R. China
| | | | - Min He
- Department of
Pharmaceutical Engineering,
School of Chemical Engineering, Xiangtan University, Xiangtan, 411105, P.R. China
| | - Wei Fan
- College of
Bioscience and Biotechnology, Hunan Agricultural University, Changsha, 410083, P. R. China
| |
Collapse
|
39
|
Wang J, Anania VG, Knott J, Rush J, Lill JR, Bourne PE, Bandeira N. Combinatorial approach for large-scale identification of linked peptides from tandem mass spectrometry spectra. Mol Cell Proteomics 2014; 13:1128-36. [PMID: 24493012 DOI: 10.1074/mcp.m113.035758] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
The combination of chemical cross-linking and mass spectrometry has recently been shown to constitute a powerful tool for studying protein-protein interactions and elucidating the structure of large protein complexes. However, computational methods for interpreting the complex MS/MS spectra from linked peptides are still in their infancy, making the high-throughput application of this approach largely impractical. Because of the lack of large annotated datasets, most current approaches do not capture the specific fragmentation patterns of linked peptides and therefore are not optimal for the identification of cross-linked peptides. Here we propose a generic approach to address this problem and demonstrate it using disulfide-bridged peptide libraries to (i) efficiently generate large mass spectral reference data for linked peptides at a low cost and (ii) automatically train an algorithm that can efficiently and accurately identify linked peptides from MS/MS spectra. We show that using this approach we were able to identify thousands of MS/MS spectra from disulfide-bridged peptides through comparison with proteome-scale sequence databases and significantly improve the sensitivity of cross-linked peptide identification. This allowed us to identify 60% more direct pairwise interactions between the protein subunits in the 20S proteasome complex than existing tools on cross-linking studies of the proteasome complexes. The basic framework of this approach and the MS/MS reference dataset generated should be valuable resources for the future development of new tools for the identification of linked peptides.
Collapse
Affiliation(s)
- Jian Wang
- Bioinformatics Program, University of California, San Diego, La Jolla, California
| | | | | | | | | | | | | |
Collapse
|
40
|
Shao W, Zhu K, Lam H. Refining similarity scoring to enable decoy-free validation in spectral library searching. Proteomics 2013; 13:3273-83. [DOI: 10.1002/pmic.201300232] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2013] [Revised: 08/06/2013] [Accepted: 09/10/2013] [Indexed: 12/30/2022]
Affiliation(s)
- Wenguang Shao
- Division of Biomedical Engineering; The Hong Kong University of Science and Technology; Clear Water Bay Hong Kong China
| | - Kan Zhu
- Department of Chemical and Biomolecular Engineering; The Hong Kong University of Science and Technology; Clear Water Bay Hong Kong China
| | - Henry Lam
- Division of Biomedical Engineering; The Hong Kong University of Science and Technology; Clear Water Bay Hong Kong China
- Department of Chemical and Biomolecular Engineering; The Hong Kong University of Science and Technology; Clear Water Bay Hong Kong China
| |
Collapse
|
41
|
Richards AL, Vincent CE, Guthals A, Rose CM, Westphall MS, Bandeira N, Coon JJ. Neutron-encoded signatures enable product ion annotation from tandem mass spectra. Mol Cell Proteomics 2013; 12:3812-23. [PMID: 24043425 DOI: 10.1074/mcp.m113.028951] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023] Open
Abstract
We report the use of neutron-encoded (NeuCode) stable isotope labeling of amino acids in cell culture for the purpose of C-terminal product ion annotation. Two NeuCode labeling isotopologues of lysine, (13)C6(15)N2 and (2)H8, which differ by 36 mDa, were metabolically embedded in a sample proteome, and the resultant labeled proteins were combined, digested, and analyzed via liquid chromatography and mass spectrometry. With MS/MS scan resolving powers of ~50,000 or higher, product ions containing the C terminus (i.e. lysine) appear as a doublet spaced by exactly 36 mDa, whereas N-terminal fragments exist as a single m/z peak. Through theory and experiment, we demonstrate that over 90% of all y-type product ions have detectable doublets. We report on an algorithm that can extract these neutron signatures with high sensitivity and specificity. In other words, of 15,503 y-type product ion peaks, the y-type ion identification algorithm correctly identified 14,552 (93.2%) based on detection of the NeuCode doublet; 6.8% were misclassified (i.e. other ion types that were assigned as y-type products). Searching NeuCode labeled yeast with PepNovo(+) resulted in a 34% increase in correct de novo identifications relative to searching through MS/MS only. We use this tool to simplify spectra prior to database searching, to sort unmatched tandem mass spectra for spectral richness, for correlation of co-fragmented ions to their parent precursor, and for de novo sequence identification.
Collapse
Affiliation(s)
- Alicia L Richards
- Department of Chemistry, University of Wisconsin, Madison, Wisconsin 53706
| | | | | | | | | | | | | |
Collapse
|
42
|
Abstract
Motivation: Mass spectrometry (MS) instruments and experimental protocols are rapidly advancing, but de novo peptide sequencing algorithms to analyze tandem mass (MS/MS) spectra are lagging behind. Although existing de novo sequencing tools perform well on certain types of spectra [e.g. Collision Induced Dissociation (CID) spectra of tryptic peptides], their performance often deteriorates on other types of spectra, such as Electron Transfer Dissociation (ETD), Higher-energy Collisional Dissociation (HCD) spectra or spectra of non-tryptic digests. Thus, rather than developing a new algorithm for each type of spectra, we develop a universal de novo sequencing algorithm called UniNovo that works well for all types of spectra or even for spectral pairs (e.g. CID/ETD spectral pairs). UniNovo uses an improved scoring function that captures the dependences between different ion types, where such dependencies are learned automatically using a modified offset frequency function. Results: The performance of UniNovo is compared with PepNovo+, PEAKS and pNovo using various types of spectra. The results show that the performance of UniNovo is superior to other tools for ETD spectra and superior or comparable with others for CID and HCD spectra. UniNovo also estimates the probability that each reported reconstruction is correct, using simple statistics that are readily obtained from a small training dataset. We demonstrate that the estimation is accurate for all tested types of spectra (including CID, HCD, ETD, CID/ETD and HCD/ETD spectra of trypsin, LysC or AspN digested peptides). Availability: UniNovo is implemented in JAVA and tested on Windows, Ubuntu and OS X machines. UniNovo is available at http://proteomics.ucsd.edu/Software/UniNovo.html along with the manual. Contact:kwj@ucsd.edu or ppevzner@ucsd.edu Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Kyowon Jeong
- Department of Electrical and Computer Engineering and Department of Computer Science and Engineering, University of California-San Diego, CA 92093, USA.
| | | | | |
Collapse
|
43
|
Guthals A, Clauser KR, Frank AM, Bandeira N. Sequencing-grade de novo analysis of MS/MS triplets (CID/HCD/ETD) from overlapping peptides. J Proteome Res 2013; 12:2846-57. [PMID: 23679345 DOI: 10.1021/pr400173d] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
Full-length de novo sequencing of unknown proteins remains a challenging open problem. Traditional methods that sequence spectra individually are limited by short peptide length, incomplete peptide fragmentation, and ambiguous de novo interpretations. We address these issues by determining consensus sequences for assembled tandem mass (MS/MS) spectra from overlapping peptides (e.g., by using multiple enzymatic digests). We have combined electron-transfer dissociation (ETD) with collision-induced dissociation (CID) and higher-energy collision-induced dissociation (HCD) fragmentation methods to boost interpretation of long, highly charged peptides and take advantage of corroborating b/y/c/z ions in CID/HCD/ETD. Using these strategies, we show that triplet CID/HCD/ETD MS/MS spectra from overlapping peptides yield de novo sequences of average length 70 AA and as long as 200 AA at up to 99% sequencing accuracy.
Collapse
Affiliation(s)
- Adrian Guthals
- Department of Computer Science and Engineering, University of California San Diego, La Jolla, California 92093, United States
| | | | | | | |
Collapse
|
44
|
Costa EP, Menschaert G, Luyten W, De Grave K, Ramon J. PIUS: peptide identification by unbiased search. Bioinformatics 2013; 29:1913-4. [DOI: 10.1093/bioinformatics/btt298] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
45
|
Ivankov DN, Payne SH, Galperin MY, Bonissone S, Pevzner PA, Frishman D. How many signal peptides are there in bacteria? Environ Microbiol 2013; 15:983-90. [PMID: 23556536 PMCID: PMC3621014 DOI: 10.1111/1462-2920.12105] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Over the last 5 years proteogenomics (using mass spectroscopy to identify proteins predicted from genomic sequences) has emerged as a promising approach to the high-throughput identification of protein N-termini, which remains a problem in genome annotation. Comparison of the experimentally determined N-termini with those predicted by sequence analysis tools allows identification of the signal peptides and therefore conclusions on the cytoplasmic or extracytoplasmic (periplasmic or extracellular) localization of the respective proteins. We present here the results of a proteogenomic study of the signal peptides in Escherichia coli K-12 and compare its results with the available experimental data and predictions by such software tools as SignalP and Phobius. A single proteogenomics experiment recovered more than a third of all signal peptides that had been experimentally determined during the past three decades and confirmed at least 31 additional signal peptides, mostly in the known exported proteins, which had been previously predicted but not validated. The filtering of putative signal peptides for the peptide length and the presence of an eight-residue hydrophobic patch and a typical signal peptidase cleavage site proved sufficient to eliminate the false-positive hits. Surprisingly, the results of this proteogenomics study, as well as a re-analysis of the E. coli genome with the latest version of SignalP program, show that the fraction of proteins containing signal peptides is only about 10%, or half of previous estimates.
Collapse
Affiliation(s)
- Dmitry N. Ivankov
- Technische Universität München, Department of Genome-Oriented Bioinformatics, 85354 Freising, Germany
| | - Samuel H. Payne
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352, USA
| | - Michael Y. Galperin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | | | | | - Dmitrij Frishman
- Technische Universität München, Department of Genome-Oriented Bioinformatics, 85354 Freising, Germany
- Helmholtz Zentrum Munich, National Research Center for Environment and Health, Institute for Bioinformatics, 85764 Neuherberg, Germany
| |
Collapse
|
46
|
Abstract
Signal peptides are a cornerstone mechanism for cellular protein localization, yet until now experimental determination of signal peptides has come from only a narrow taxonomic sampling. As a result, the dominant view is that Sec-cleaved signal peptides in prokaryotes are defined by a canonical AxA motif. Although other residues are permitted in the motif, alanine is by far the most common. Here we broadly examine proteomics data to reveal the signal peptide sequences for 32 bacterial and archaeal organisms from nine phyla and demonstrate that this alanine preference is not universal. Discoveries include fundamentally distinct signal peptide motifs from Alphaproteobacteria, Spirochaetes, Thermotogae and Euryarchaeota. In these novel motifs, alanine is no longer the dominant residue but has been replaced in a different way for each taxon. Surprisingly, divergent motifs correlate with a proteome-wide reduction in alanine. Computational analyses of ~1,500 genomes reveal numerous major evolutionary clades which have replaced the canonical signal peptide sequence with novel motifs. This article replaces a widely held general model with a more detailed model describing phylogenetically correlated variation in motifs for Sec secretion.
Collapse
|
47
|
Key issues in the acquisition and analysis of qualitative and quantitative mass spectrometry data for peptide-centric proteomic experiments. Amino Acids 2012; 43:1075-85. [PMID: 22821266 DOI: 10.1007/s00726-012-1287-x] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2010] [Accepted: 04/03/2012] [Indexed: 01/05/2023]
Abstract
Proteomic technologies have matured to a level enabling accurate and reproducible quantitation of peptides and proteins from complex biological matrices. Analysis of samples as diverse as assembled protein complexes, whole cell lysates or sub-cellular proteomes from cell cultures, and direct analysis of animal and human tissues and fluids demonstrate the incredible versatility of the fundamental nature of the technique that forms the basis of most proteomic applications today (mass spectrometry). Determining the mass of biomolecules and their fragments or related products with high accuracy can convey a highly specific assay for detection and identification. Importantly, ion currents representative of these specifically identified analytes can be accurately quantified with the correct application of smart isobaric tagging chemistries, heavy and light isotopically derivatised samples or standards, or by careful application of workflows to compare unlabelled samples in so-called 'label-free' and targeted selected reaction monitoring experiments. In terms of exploring biology, a myriad of protein changes and modifications are being increasingly probed and quantified, including diverse chemical changes from relatively decisive modifications such as protein splicing and truncation, to more transient dynamic modifications such as phosphorylation, acetylation and ubiquitination. Proteomic workflows can be complex beasts and several key considerations to ensure effective applications have been outlined in the recent literature. The past year has witnessed the publication of several excellent reviews that thoroughly describe the fundamental principles underlying the state of the art. This review further elaborates on specific critical issues introduced by these publications and raises other important unaddressed considerations and new developments that directly impact on the effectiveness of proteomic technologies, in particular for, but not necessarily exclusive to peptide-centric experiments. These factors are discussed both in terms of qualitative analyses, including dynamic range and sampling issues, and developments to improve the translation of peptide fragmentation data into peptide and protein identities, as well as quantitative analyses, including data normalisation and the utility of ontology or functional annotation, the effects of modified peptides, and considered experimental design to facilitate the use of robust statistical methods.
Collapse
|
48
|
Yang C, He Z, Yang C, Yu W. Peptide reranking with protein-peptide correspondence and precursor peak intensity information. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2012; 9:1212-1219. [PMID: 22350209 DOI: 10.1109/tcbb.2012.29] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Searching tandem mass spectra against a protein database has been a mainstream method for peptide identification. Improving peptide identification results by ranking true Peptide-Spectrum Matches (PSMs) over their false counterparts leads to the development of various reranking algorithms. In peptide reranking, discriminative information is essential to distinguish true PSMs from false PSMs. Generally, most peptide reranking methods obtain discriminative information directly from database search scores or by training machine learning models. Information in the protein database and MS1 spectra (i.e., single stage MS spectra) is ignored. In this paper, we propose to use information in the protein database and MS1 spectra to rerank peptide identification results. To quantitatively analyze their effects to peptide reranking results, three peptide reranking methods are proposed: PPMRanker, PPIRanker, and MIRanker. PPMRanker only uses Protein-Peptide Map (PPM) information from the protein database, PPIRanker only uses Precursor Peak Intensity (PPI) information, and MIRanker employs both PPM information and PPI information. According to our experiments on a standard protein mixture data set, a human data set and a mouse data set, PPMRanker and MIRanker achieve better peptide reranking results than PetideProphet, PeptideProphet+NSP (number of sibling peptides) and a score regularization method SRPI. The source codes of PPMRanker, PPIRanker, and MIRanker, and all supplementary documents are available at our website: http://bioinformatics.ust.hk/pepreranking/. Alternatively, these documents can also be downloaded from: http://sourceforge.net/projects/pepreranking/.
Collapse
Affiliation(s)
- Chao Yang
- The Hong Kong University of Science and Technology, RM B007D, University Apartment Tower B, Clear Water Bay, Kowloon, Hong Kong.
| | | | | | | |
Collapse
|
49
|
Mohimani H, Liu WT, Mylne JS, Poth AG, Colgrave ML, Tran D, Selsted ME, Dorrestein PC, Pevzner PA. Cycloquest: identification of cyclopeptides via database search of their mass spectra against genome databases. J Proteome Res 2011; 10:4505-12. [PMID: 21851130 PMCID: PMC3242011 DOI: 10.1021/pr200323a] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Hundreds of ribosomally synthesized cyclopeptides have been isolated from all domains of life, the vast majority having been reported in the last 15 years. Studies of cyclic peptides have highlighted their exceptional potential both as stable drug scaffolds and as biomedicines in their own right. Despite this, computational techniques for cyclopeptide identification are still in their infancy, with many such peptides remaining uncharacterized. Tandem mass spectrometry has occupied a niche role in cyclopeptide identification, taking over from traditional techniques such as nuclear magnetic resonance spectroscopy (NMR). MS/MS studies require only picogram quantities of peptide (compared to milligrams for NMR studies) and are applicable to complex samples, abolishing the requirement for time-consuming chromatographic purification. While database search tools such as Sequest and Mascot have become standard tools for the MS/MS identification of linear peptides, they are not applicable to cyclopeptides, due to the parent mass shift resulting from cyclization and different fragmentation patterns of cyclic peptides. In this paper, we describe the development of a novel database search methodology to aid in the identification of cyclopeptides by mass spectrometry and evaluate its utility in identifying two peptide rings from Helianthus annuus, a bacterial cannibalism factor from Bacillus subtilis, and a θ-defensin from Rhesus macaque.
Collapse
Affiliation(s)
- Hosein Mohimani
- Department of Electrical and Computer Engineering, UC San Diego
| | - Wei-Ting Liu
- Department of Chemistry and Biochemistry, UC San Diego
| | - Joshua S. Mylne
- Institute for Molecular Bioscience, The University of Queensland, Brisbane
| | - Aaron G. Poth
- Institute for Molecular Bioscience, The University of Queensland, Brisbane
- Division of Livestock Industries, CSIRO, Brisbane
| | | | - Dat Tran
- Department of Pathology and Laboratory Medicine, School of Medicine, UC Irvine
- Center for Immunology, UC Irvine
- Department of Pathology and Laboratory Medicine, Keck School of Medicine, USC
| | - Michael E. Selsted
- Department of Pathology and Laboratory Medicine, School of Medicine, UC Irvine
- Center for Immunology, UC Irvine
- Department of Pathology and Laboratory Medicine, Keck School of Medicine, USC
| | - Pieter C. Dorrestein
- Department of Chemistry and Biochemistry, UC San Diego
- Skaggs School of Pharmacy and Pharmaceutical Sciences, UC San Diego
| | | |
Collapse
|
50
|
Mohimani H, Yang YL, Liu WT, Hsieh PW, Dorrestein PC, Pevzner PA. Sequencing cyclic peptides by multistage mass spectrometry. Proteomics 2011; 11:3642-50. [PMID: 21751357 DOI: 10.1002/pmic.201000697] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2010] [Revised: 05/16/2011] [Accepted: 06/09/2011] [Indexed: 11/08/2022]
Abstract
Some of the most effective antibiotics (e.g. Vancomycin and Daptomycin) are cyclic peptides produced by non-ribosomal biosynthetic pathways. While hundreds of biomedically important cyclic peptides have been sequenced, the computational techniques for sequencing cyclic peptides are still in their infancy. Previous methods for sequencing peptide antibiotics and other cyclic peptides are based on Nuclear Magnetic Resonance spectroscopy, and require large amount (miligrams) of purified materials that, for most compounds, are not possible to obtain. Recently, development of MS-based methods has provided some hope for accurate sequencing of cyclic peptides using picograms of materials. In this paper we develop a method for sequencing of cyclic peptides by multistage MS, and show its advantages over single-stage MS. The method is tested on known and new cyclic peptides from Bacillus brevis, Dianthus superbus and Streptomyces griseus, as well as a new family of cyclic peptides produced by marine bacteria.
Collapse
Affiliation(s)
- Hosein Mohimani
- Department of Electrical and Computer Engineering, UC San Diego, CA, USA
| | | | | | | | | | | |
Collapse
|