1
|
Sierocki R, Jneid B, Orsini Delgado ML, Plaisance M, Maillère B, Nozach H, Simon S. An antibody targeting type III secretion system induces broad protection against Salmonella and Shigella infections. PLoS Negl Trop Dis 2021; 15:e0009231. [PMID: 33711056 PMCID: PMC7990167 DOI: 10.1371/journal.pntd.0009231] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2020] [Revised: 03/24/2021] [Accepted: 02/11/2021] [Indexed: 11/18/2022] Open
Abstract
Salmonella and Shigella bacteria are food- and waterborne pathogens that are responsible for enteric infections in humans and are still the major cause of morbidity and mortality in the emerging countries. The existence of multiple Salmonella and Shigella serotypes as well as the emergence of strains resistant to antibiotics requires the development of broadly protective therapies. Recently, the needle tip proteins of the type III secretion system of these bacteria were successfully utilized (SipD for Salmonella and IpaD for Shigella) as vaccine immunogens to provide good prophylactic cross-protection in murine models of infections. From these experiments, we have isolated a cross-protective monoclonal antibody directed against a conserved region of both proteins. Its conformational epitope determined by Deep Mutational Scanning is conserved among needle tip proteins of all pathogenic Shigella species and Salmonella serovars, and are well recognized by this antibody. Our study provides the first in vivo experimental evidence of the importance of this common region in the mechanism of virulence of Salmonella and Shigella and opens the way to the development of cross-protective therapeutic agents. Salmonella and Shigella are responsible for gastrointestinal diseases and continue to remain a serious health hazard in South and South-East Asia and African countries, even more with the new emergence of multi drug resistances. Developed vaccines are either not commercialized (for Shigella) or cover only a limited number of serotypes (for Salmonella). There is thus a crucial need to develop cross-protective therapies. By targeting proteins SipD and IpaD belonging respectively to the injectisome of Salmonella and Shigella and necessary to their virulence, we have shown that a monoclonal antibody (mAb) directed against a conserved common region of their apical part provides good cross-protection prophylactic efficacy. We have determined the region targeted by this mAb which could explain why it is conserved among Salmonella and Shigella bacteria.
Collapse
Affiliation(s)
- Raphaël Sierocki
- Université Paris Saclay, CEA, INRAE, Département Médicaments et Technologies pour la Santé (DMTS), SIMoS, Gif-sur-Yvette, France
| | - Bakhos Jneid
- Université Paris Saclay, CEA, INRAE, Département Médicaments et Technologies pour la Santé (DMTS), SPI, Gif-sur-Yvette, France
| | - Maria Lucia Orsini Delgado
- Université Paris Saclay, CEA, INRAE, Département Médicaments et Technologies pour la Santé (DMTS), SPI, Gif-sur-Yvette, France
| | - Marc Plaisance
- Université Paris Saclay, CEA, INRAE, Département Médicaments et Technologies pour la Santé (DMTS), SPI, Gif-sur-Yvette, France
| | - Bernard Maillère
- Université Paris Saclay, CEA, INRAE, Département Médicaments et Technologies pour la Santé (DMTS), SIMoS, Gif-sur-Yvette, France
| | - Hervé Nozach
- Université Paris Saclay, CEA, INRAE, Département Médicaments et Technologies pour la Santé (DMTS), SIMoS, Gif-sur-Yvette, France
| | - Stéphanie Simon
- Université Paris Saclay, CEA, INRAE, Département Médicaments et Technologies pour la Santé (DMTS), SPI, Gif-sur-Yvette, France
- * E-mail:
| |
Collapse
|
2
|
Fijalkowska D, Fijalkowski I, Willems P, Van Damme P. Bacterial riboproteogenomics: the era of N-terminal proteoform existence revealed. FEMS Microbiol Rev 2021; 44:418-431. [PMID: 32386204 DOI: 10.1093/femsre/fuaa013] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2019] [Accepted: 05/07/2020] [Indexed: 12/17/2022] Open
Abstract
With the rapid increase in the number of sequenced prokaryotic genomes, relying on automated gene annotation became a necessity. Multiple lines of evidence, however, suggest that current bacterial genome annotations may contain inconsistencies and are incomplete, even for so-called well-annotated genomes. We here discuss underexplored sources of protein diversity and new methodologies for high-throughput genome reannotation. The expression of multiple molecular forms of proteins (proteoforms) from a single gene, particularly driven by alternative translation initiation, is gaining interest as a prominent contributor to bacterial protein diversity. In consequence, riboproteogenomic pipelines were proposed to comprehensively capture proteoform expression in prokaryotes by the complementary use of (positional) proteomics and the direct readout of translated genomic regions using ribosome profiling. To complement these discoveries, tailored strategies are required for the functional characterization of newly discovered bacterial proteoforms.
Collapse
Affiliation(s)
- Daria Fijalkowska
- Department of Biochemistry and Microbiology, Ghent University, K. L. Ledeganckstraat 35, B-9000 Ghent, Belgium
| | - Igor Fijalkowski
- Department of Biochemistry and Microbiology, Ghent University, K. L. Ledeganckstraat 35, B-9000 Ghent, Belgium
| | - Patrick Willems
- Department of Biochemistry and Microbiology, Ghent University, K. L. Ledeganckstraat 35, B-9000 Ghent, Belgium
| | - Petra Van Damme
- Department of Biochemistry and Microbiology, Ghent University, K. L. Ledeganckstraat 35, B-9000 Ghent, Belgium
| |
Collapse
|
3
|
Jneid B, Rouaix A, Féraudet-Tarisse C, Simon S. SipD and IpaD induce a cross-protection against Shigella and Salmonella infections. PLoS Negl Trop Dis 2020; 14:e0008326. [PMID: 32463817 PMCID: PMC7282677 DOI: 10.1371/journal.pntd.0008326] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2020] [Revised: 06/09/2020] [Accepted: 04/26/2020] [Indexed: 01/05/2023] Open
Abstract
Salmonella and Shigella species are food- and water-borne pathogens that are responsible for enteric infections in both humans and animals and are still the major cause of morbidity and mortality in the emerging countries. The existence of multiple Salmonella and Shigella serotypes as well as the emergence of strains resistant to antibiotics require the development of broadly protective therapies. Those bacteria utilize a Type III Secretion System (T3SS), necessary for their pathogenicity. The structural proteins composing the T3SS are common to all virulent Salmonella and Shigella spp., particularly the needle-tip proteins SipD (Salmonella) and IpaD (Shigella). We investigated the immunogenicity and protective efficacy of SipD and IpaD administered by intranasal and intragastric routes, in a mouse model of Salmonella enterica serotype Typhimurium (S. Typhimurium) intestinal challenge. Robust IgG (in all immunization routes) and IgA (in intranasal and oral immunization routes) antibody responses were induced against both proteins. Mice immunized with SipD or IpaD were protected against lethal intestinal challenge with S. Typhimurium or Shigella flexneri (100 Lethal Dose 50%). We have shown that SipD and IpaD are able to induce a cross-protection in a murine model of infection by Salmonella and Shigella. We provide the first demonstration that Salmonella and Shigella T3SS SipD and IpaD are promising antigens for the development of a cross-protective Salmonella-Shigella vaccine. These results open the way to the development of cross-protective therapeutic molecules.
Collapse
Affiliation(s)
- Bakhos Jneid
- Université Paris Saclay, CEA, INRAE, Département Médicaments et Technologies pour la Santé (DMTS), SPI, Gif-sur-Yvette, France
| | - Audrey Rouaix
- Université Paris Saclay, CEA, INRAE, Département Médicaments et Technologies pour la Santé (DMTS), SPI, Gif-sur-Yvette, France
| | - Cécile Féraudet-Tarisse
- Université Paris Saclay, CEA, INRAE, Département Médicaments et Technologies pour la Santé (DMTS), SPI, Gif-sur-Yvette, France
| | - Stéphanie Simon
- Université Paris Saclay, CEA, INRAE, Département Médicaments et Technologies pour la Santé (DMTS), SPI, Gif-sur-Yvette, France
- * E-mail:
| |
Collapse
|
4
|
Kaspar JR, Walker AR. Expanding the Vocabulary of Peptide Signals in Streptococcus mutans. Front Cell Infect Microbiol 2019; 9:194. [PMID: 31245303 PMCID: PMC6563777 DOI: 10.3389/fcimb.2019.00194] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2019] [Accepted: 05/21/2019] [Indexed: 12/18/2022] Open
Abstract
Streptococci, including the dental pathogen Streptococcus mutans, undergo cell-to-cell signaling that is mediated by small peptides to control critical physiological functions such as adaptation to the environment, control of subpopulation behaviors and regulation of virulence factors. One such model pathway is the regulation of genetic competence, controlled by the ComRS signaling system and the peptide XIP. However, recent research in the characterization of this pathway has uncovered novel operons and peptides that are intertwined into its regulation. These discoveries, such as cell lysis playing a critical role in XIP release and importance of bacterial self-sensing during the signaling process, have caused us to reevaluate previous paradigms and shift our views on the true purpose of these signaling systems. The finding of new peptides such as the ComRS inhibitor XrpA and the peptides of the RcrRPQ operon also suggests there may be more peptides hidden in the genomes of streptococci that could play critical roles in the physiology of these organisms. In this review, we summarize the recent findings in S. mutans regarding the integration of other circuits into the ComRS signaling pathway, the true mode of XIP export, and how the RcrRPQ operon controls competence activation. We also look at how new technologies can be used to re-annotate the genome to find new open reading frames that encode peptide signals. Together, this summary of research will allow us to reconsider how we perceive these systems to behave and lead us to expand our vocabulary of peptide signals within the genus Streptococcus.
Collapse
Affiliation(s)
- Justin R. Kaspar
- Department of Oral Biology, University of Florida, Gainesville, FL, United States
| | | |
Collapse
|
5
|
Lewis NH, Hitchcock DB, Dryden IL, Rose JR. Peptide refinement by using a stochastic search. J R Stat Soc Ser C Appl Stat 2018. [DOI: 10.1111/rssc.12280] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
|
6
|
|
7
|
Zhou M, Paša-Tolić L, Stenoien DL. Profiling of Histone Post-Translational Modifications in Mouse Brain with High-Resolution Top-Down Mass Spectrometry. J Proteome Res 2016; 16:599-608. [DOI: 10.1021/acs.jproteome.6b00694] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Mowei Zhou
- Pacific Northwest National
Laboratory, Earth and Biological Sciences Directorate, P.O. Box 999, Richland, Washington 99352, United States
| | - Ljiljana Paša-Tolić
- Pacific Northwest National
Laboratory, Earth and Biological Sciences Directorate, P.O. Box 999, Richland, Washington 99352, United States
| | - David L. Stenoien
- Pacific Northwest National
Laboratory, Earth and Biological Sciences Directorate, P.O. Box 999, Richland, Washington 99352, United States
| |
Collapse
|
8
|
Zhu X, Xie S, Armengaud J, Xie W, Guo Z, Kang S, Wu Q, Wang S, Xia J, He R, Zhang Y. Tissue-specific Proteogenomic Analysis of Plutella xylostella Larval Midgut Using a Multialgorithm Pipeline. Mol Cell Proteomics 2016; 15:1791-807. [PMID: 26902207 PMCID: PMC5083088 DOI: 10.1074/mcp.m115.050989] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2015] [Revised: 02/04/2016] [Indexed: 11/06/2022] Open
Abstract
The diamondback moth, Plutella xylostella (L.), is the major cosmopolitan pest of brassica and other cruciferous crops. Its larval midgut is a dynamic tissue that interfaces with a wide variety of toxicological and physiological processes. The draft sequence of the P. xylostella genome was recently released, but its annotation remains challenging because of the low sequence coverage of this branch of life and the poor description of exon/intron splicing rules for these insects. Peptide sequencing by computational assignment of tandem mass spectra to genome sequence information provides an experimental independent approach for confirming or refuting protein predictions, a concept that has been termed proteogenomics. In this study, we carried out an in-depth proteogenomic analysis to complement genome annotation of P. xylostella larval midgut based on shotgun HPLC-ESI-MS/MS data by means of a multialgorithm pipeline. A total of 876,341 tandem mass spectra were searched against the predicted P. xylostella protein sequences and a whole-genome six-frame translation database. Based on a data set comprising 2694 novel genome search specific peptides, we discovered 439 novel protein-coding genes and corrected 128 existing gene models. To get the most accurate data to seed further insect genome annotation, more than half of the novel protein-coding genes, i.e. 235 over 439, were further validated after RT-PCR amplification and sequencing of the corresponding transcripts. Furthermore, we validated 53 novel alternative splicings. Finally, a total of 6764 proteins were identified, resulting in one of the most comprehensive proteogenomic study of a nonmodel animal. As the first tissue-specific proteogenomics analysis of P. xylostella, this study provides the fundamental basis for high-throughput proteomics and functional genomics approaches aimed at deciphering the molecular mechanisms of resistance and controlling this pest.
Collapse
Affiliation(s)
- Xun Zhu
- From the ‡Department of Plant Protection, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | | | - Jean Armengaud
- ¶CEA-Marcoule, DSV/IBITEC-S/SPI/Li2D, Laboratory, BP 17171, F-30200, Bagnols-sur-Cèze, F-30207, France
| | - Wen Xie
- From the ‡Department of Plant Protection, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Zhaojiang Guo
- From the ‡Department of Plant Protection, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Shi Kang
- From the ‡Department of Plant Protection, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Qingjun Wu
- From the ‡Department of Plant Protection, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Shaoli Wang
- From the ‡Department of Plant Protection, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Jixing Xia
- From the ‡Department of Plant Protection, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Rongjun He
- From the ‡Department of Plant Protection, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Youjun Zhang
- From the ‡Department of Plant Protection, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, 100081, China;
| |
Collapse
|
9
|
Berry IJ, Steele JR, Padula MP, Djordjevic SP. The application of terminomics for the identification of protein start sites and proteoforms in bacteria. Proteomics 2015; 16:257-72. [DOI: 10.1002/pmic.201500319] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2015] [Revised: 09/21/2015] [Accepted: 09/30/2015] [Indexed: 01/11/2023]
Affiliation(s)
- Iain J. Berry
- The ithree Institute; University of Technology Sydney; Broadway NSW Australia
- Proteomics Core Facility; University of Technology Sydney; Broadway NSW Australia
| | - Joel R. Steele
- Proteomics Core Facility; University of Technology Sydney; Broadway NSW Australia
| | - Matthew P. Padula
- The ithree Institute; University of Technology Sydney; Broadway NSW Australia
- Proteomics Core Facility; University of Technology Sydney; Broadway NSW Australia
| | - Steven P. Djordjevic
- The ithree Institute; University of Technology Sydney; Broadway NSW Australia
- Proteomics Core Facility; University of Technology Sydney; Broadway NSW Australia
| |
Collapse
|
10
|
Kumar D, Mondal AK, Kutum R, Dash D. Proteogenomics of rare taxonomic phyla: A prospective treasure trove of protein coding genes. Proteomics 2015; 16:226-40. [PMID: 26773550 DOI: 10.1002/pmic.201500263] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2015] [Revised: 09/18/2015] [Accepted: 09/28/2015] [Indexed: 01/04/2023]
Abstract
Sustainable innovations in sequencing technologies have resulted in a torrent of microbial genome sequencing projects. However, the prokaryotic genomes sequenced so far are unequally distributed along their phylogenetic tree; few phyla contain the majority, the rest only a few representatives. Accurate genome annotation lags far behind genome sequencing. While automated computational prediction, aided by comparative genomics, remains a popular choice for genome annotation, substantial fraction of these annotations are erroneous. Proteogenomics utilizes protein level experimental observations to annotate protein coding genes on a genome wide scale. Benefits of proteogenomics include discovery and correction of gene annotations regardless of their phylogenetic conservation. This not only allows detection of common, conserved proteins but also the discovery of protein products of rare genes that may be horizontally transferred or taxonomy specific. Chances of encountering such genes are more in rare phyla that comprise a small number of complete genome sequences. We collated all bacterial and archaeal proteogenomic studies carried out to date and reviewed them in the context of genome sequencing projects. Here, we present a comprehensive list of microbial proteogenomic studies, their taxonomic distribution, and also urge for targeted proteogenomics of underexplored taxa to build an extensive reference of protein coding genes.
Collapse
Affiliation(s)
- Dhirendra Kumar
- G. N. Ramachandran Knowledge Center of Genome Informatics, CSIR-Institute of Genomics and Integrative Biology, South Campus, Sukhdev Vihar, Delhi, India
| | - Anupam Kumar Mondal
- G. N. Ramachandran Knowledge Center of Genome Informatics, CSIR-Institute of Genomics and Integrative Biology, South Campus, Sukhdev Vihar, Delhi, India
| | - Rintu Kutum
- G. N. Ramachandran Knowledge Center of Genome Informatics, CSIR-Institute of Genomics and Integrative Biology, South Campus, Sukhdev Vihar, Delhi, India
| | - Debasis Dash
- G. N. Ramachandran Knowledge Center of Genome Informatics, CSIR-Institute of Genomics and Integrative Biology, South Campus, Sukhdev Vihar, Delhi, India
| |
Collapse
|
11
|
Pettersen VK, Steinsland H, Wiker HG. Improving genome annotation of enterotoxigenicEscherichia coliTW10598 by a label-free quantitative MS/MS approach. Proteomics 2015; 15:3826-34. [DOI: 10.1002/pmic.201500278] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2015] [Revised: 08/18/2015] [Accepted: 09/04/2015] [Indexed: 12/14/2022]
Affiliation(s)
- Veronika Kuchařová Pettersen
- The Gade Research Group for Infection and Immunity; Department of Clinical Science; University of Bergen; Bergen Norway
| | - Hans Steinsland
- Centre for International Health; Department of Global Public Health and Primary Care; University of Bergen; Bergen Norway
- Department of Biomedicine; University of Bergen; Bergen Norway
| | - Harald G. Wiker
- The Gade Research Group for Infection and Immunity; Department of Clinical Science; University of Bergen; Bergen Norway
| |
Collapse
|
12
|
Experimental Validation of Bacillus anthracis A16R Proteogenomics. Sci Rep 2015; 5:14608. [PMID: 26423727 PMCID: PMC4589699 DOI: 10.1038/srep14608] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2015] [Accepted: 08/26/2015] [Indexed: 11/09/2022] Open
Abstract
Anthrax, caused by the pathogenic bacterium Bacillus anthracis, is a zoonosis that causes serious disease and is of significant concern as a biological warfare agent. Validating annotated genes and reannotating misannotated genes are important to understand its biology and mechanisms of pathogenicity. Proteomics studies are, to date, the best method for verifying and improving current annotations. To this end, the proteome of B. anthracis A16R was analyzed via one-dimensional gel electrophoresis followed by liquid chromatography coupled with tandem mass spectrometry (LC-MS/MS). In total, we identified 3,712 proteins, including many regulatory and key functional proteins at relatively low abundance, representing the most complete proteome of B. anthracis to date. Interestingly, eight sequencing errors were detected by proteogenomic analysis and corrected by resequencing. More importantly, three unannotated peptide fragments were identified in this study and validated by synthetic peptide mass spectrum mapping and green fluorescent protein fusion experiments. These data not only give a more comprehensive understanding of B. anthracis A16R but also demonstrate the power of proteomics to improve genome annotations and determine true translational elements.
Collapse
|
13
|
Matthews TD, Schmieder R, Silva GGZ, Busch J, Cassman N, Dutilh BE, Green D, Matlock B, Heffernan B, Olsen GJ, Farris Hanna L, Schifferli DM, Maloy S, Dinsdale EA, Edwards RA. Genomic Comparison of the Closely-Related Salmonella enterica Serovars Enteritidis, Dublin and Gallinarum. PLoS One 2015; 10:e0126883. [PMID: 26039056 PMCID: PMC4454671 DOI: 10.1371/journal.pone.0126883] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2014] [Accepted: 04/08/2015] [Indexed: 11/18/2022] Open
Abstract
The Salmonella enterica serovars Enteritidis, Dublin, and Gallinarum are closely related but differ in virulence and host range. To identify the genetic elements responsible for these differences and to better understand how these serovars are evolving, we sequenced the genomes of Enteritidis strain LK5 and Dublin strain SARB12 and compared these genomes to the publicly available Enteritidis P125109, Dublin CT 02021853 and Dublin SD3246 genome sequences. We also compared the publicly available Gallinarum genome sequences from biotype Gallinarum 287/91 and Pullorum RKS5078. Using bioinformatic approaches, we identified single nucleotide polymorphisms, insertions, deletions, and differences in prophage and pseudogene content between strains belonging to the same serovar. Through our analysis we also identified several prophage cargo genes and pseudogenes that affect virulence and may contribute to a host-specific, systemic lifestyle. These results strongly argue that the Enteritidis, Dublin and Gallinarum serovars of Salmonella enterica evolve by acquiring new genes through horizontal gene transfer, followed by the formation of pseudogenes. The loss of genes necessary for a gastrointestinal lifestyle ultimately leads to a systemic lifestyle and niche exclusion in the host-specific serovars.
Collapse
Affiliation(s)
- T. David Matthews
- Department of Biology, San Diego State University, San Diego, California, 92182, United States of America
| | - Robert Schmieder
- Department of Computer Science, San Diego State University, San Diego, California, 92182, United States of America
| | - Genivaldo G. Z. Silva
- Computational Science Research Center, San Diego State University, San Diego, California, 92182, United States of America
| | - Julia Busch
- Department of Biology, San Diego State University, San Diego, California, 92182, United States of America
| | - Noriko Cassman
- Department of Biology, San Diego State University, San Diego, California, 92182, United States of America
| | - Bas E. Dutilh
- Theoretical Biology and Bioinformatics, Utrecht University, Utrecht, The Netherlands
- Centre for Molecular and Biomolecular Informatics, Radboud Institute for Molecular Life Sciences, Radboud University Medical Centre, Nijmegen, The Netherlands
| | - Dawn Green
- Department of Microbiology, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - Brian Matlock
- Department of Microbiology, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - Brian Heffernan
- Department of Microbiology, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - Gary J. Olsen
- Department of Microbiology, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - Leigh Farris Hanna
- Molecular Sciences Department, University of Tennessee Health Sciences Center, 858 Madison Ave, Memphis, Tennessee, United States of America
| | - Dieter M. Schifferli
- University of Pennsylvania School of Veterinary Medicine, 3800 Spruce St, Philadelphia, Pennsylvania, 19104, United States of America
| | - Stanley Maloy
- Department of Biology, San Diego State University, San Diego, California, 92182, United States of America
| | - Elizabeth A. Dinsdale
- Department of Biology, San Diego State University, San Diego, California, 92182, United States of America
| | - Robert A. Edwards
- Department of Biology, San Diego State University, San Diego, California, 92182, United States of America
- Department of Computer Science, San Diego State University, San Diego, California, 92182, United States of America
- Department of Marine Biology, Institute of Biology, Federal University of Rio de Janeiro, Rio de Janeiro, Brazil
- Argonne National Laboratory, 9700 S. Cass Ave, Argonne, Illinois, 60349, United States of America
- * E-mail:
| |
Collapse
|
14
|
Proteogenomic analysis and global discovery of posttranslational modifications in prokaryotes. Proc Natl Acad Sci U S A 2014; 111:E5633-42. [PMID: 25512518 DOI: 10.1073/pnas.1412722111] [Citation(s) in RCA: 51] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
We describe an integrated workflow for proteogenomic analysis and global profiling of posttranslational modifications (PTMs) in prokaryotes and use the model cyanobacterium Synechococcus sp. PCC 7002 (hereafter Synechococcus 7002) as a test case. We found more than 20 different kinds of PTMs, and a holistic view of PTM events in this organism grown under different conditions was obtained without specific enrichment strategies. Among 3,186 predicted protein-coding genes, 2,938 gene products (>92%) were identified. We also identified 118 previously unidentified proteins and corrected 38 predicted gene-coding regions in the Synechococcus 7002 genome. This systematic analysis not only provides comprehensive information on protein profiles and the diversity of PTMs in Synechococcus 7002 but also provides some insights into photosynthetic pathways in cyanobacteria. The entire proteogenomics pipeline is applicable to any sequenced prokaryotic organism, and we suggest that it should become a standard part of genome annotation projects.
Collapse
|
15
|
Kucharova V, Wiker HG. Proteogenomics in microbiology: taking the right turn at the junction of genomics and proteomics. Proteomics 2014; 14:2360-675. [PMID: 25263021 DOI: 10.1002/pmic.201400168] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2014] [Revised: 08/18/2014] [Accepted: 09/23/2014] [Indexed: 12/14/2022]
Abstract
High-accuracy and high-throughput proteomic methods have completely changed the way we can identify and characterize proteins. MS-based proteomics can now provide a unique supplement to genomic data and add a new level of information to the interpretation of genomic sequences. Proteomics-driven genome annotation has become especially relevant in microbiology where genomes are sequenced on a daily basis and limitations of an in silico driven annotation process are well recognized. In this review paper, we outline different strategies on how one can design a proteogenomic experiment, for example on genome-sequenced (synonymous proteogenomics) versus unsequenced organisms (ortho-proteogenomics) or with the aid of other "omic" data such as RNA-seq. We touch upon many challenges that are encountered during a typical proteogenomic study, mostly concerning bioinformatics methods and downstream data analysis, but also related to creation and use of sequence databases. A large list of proteogenomic case studies of different microorganisms is provided to illustrate the mapping of MS/MS-derived peptide spectra to genomic DNA sequences. These investigations have led to accurate determination of translational initiation sites, pointed out eventual read-throughs or programmed frameshifts, detected signal peptide processing or other protein maturation events, removed questionable annotation assignments, and provided evidence for predicted hypothetical proteins.
Collapse
Affiliation(s)
- Veronika Kucharova
- Department of Clinical Science, The Gade Research Group for Infection and Immunity, University of Bergen, Norway
| | | |
Collapse
|
16
|
Tacchi JL, Raymond BBA, Jarocki VM, Berry IJ, Padula MP, Djordjevic SP. Cilium Adhesin P216 (MHJ_0493) Is a Target of Ectodomain Shedding and Aminopeptidase Activity on the Surface of Mycoplasma hyopneumoniae. J Proteome Res 2014; 13:2920-30. [DOI: 10.1021/pr500087c] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Affiliation(s)
- Jessica L. Tacchi
- The ithree institute and ‡Proteomics Core Facility, University of Technology Sydney, P.O.
Box 123, Broadway, Sydney, NSW 2007, Australia
| | - Benjamin B. A. Raymond
- The ithree institute and ‡Proteomics Core Facility, University of Technology Sydney, P.O.
Box 123, Broadway, Sydney, NSW 2007, Australia
| | - Veronica M. Jarocki
- The ithree institute and ‡Proteomics Core Facility, University of Technology Sydney, P.O.
Box 123, Broadway, Sydney, NSW 2007, Australia
| | - Iain J. Berry
- The ithree institute and ‡Proteomics Core Facility, University of Technology Sydney, P.O.
Box 123, Broadway, Sydney, NSW 2007, Australia
| | - Matthew P. Padula
- The ithree institute and ‡Proteomics Core Facility, University of Technology Sydney, P.O.
Box 123, Broadway, Sydney, NSW 2007, Australia
| | - Steven P. Djordjevic
- The ithree institute and ‡Proteomics Core Facility, University of Technology Sydney, P.O.
Box 123, Broadway, Sydney, NSW 2007, Australia
| |
Collapse
|
17
|
Bland C, Hartmann EM, Christie-Oleza JA, Fernandez B, Armengaud J. N-Terminal-oriented proteogenomics of the marine bacterium roseobacter denitrificans Och114 using N-Succinimidyloxycarbonylmethyl)tris(2,4,6-trimethoxyphenyl)phosphonium bromide (TMPP) labeling and diagonal chromatography. Mol Cell Proteomics 2014; 13:1369-81. [PMID: 24536027 DOI: 10.1074/mcp.o113.032854] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Given the ease of whole genome sequencing with next-generation sequencers, structural and functional gene annotation is now purely based on automated prediction. However, errors in gene structure are frequent, the correct determination of start codons being one of the main concerns. Here, we combine protein N termini derivatization using (N-Succinimidyloxycarbonylmethyl)tris(2,4,6-trimethoxyphenyl)phosphonium bromide (TMPP Ac-OSu) as a labeling reagent with the COmbined FRActional DIagonal Chromatography (COFRADIC) sorting method to enrich labeled N-terminal peptides for mass spectrometry detection. Protein digestion was performed in parallel with three proteases to obtain a reliable automatic validation of protein N termini. The analysis of these N-terminal enriched fractions by high-resolution tandem mass spectrometry allowed the annotation refinement of 534 proteins of the model marine bacterium Roseobacter denitrificans OCh114. This study is especially efficient regarding mass spectrometry analytical time. From the 534 validated N termini, 480 confirmed existing gene annotations, 41 highlighted erroneous start codon annotations, five revealed totally new mis-annotated genes; the mass spectrometry data also suggested the existence of multiple start sites for eight different genes, a result that challenges the current view of protein translation initiation. Finally, we identified several proteins for which classical genome homology-driven annotation was inconsistent, questioning the validity of automatic annotation pipelines and emphasizing the need for complementary proteomic data. All data have been deposited to the ProteomeXchange with identifier PXD000337.
Collapse
Affiliation(s)
- Céline Bland
- CEA, DSV, IBEB, Lab Biochim System Perturb, Bagnols-sur-Cèze, F-30207, France
| | | | | | | | | |
Collapse
|
18
|
Kumari H, Murugapiran SK, Balasubramanian D, Schneper L, Merighi M, Sarracino D, Lory S, Mathee K. LTQ-XL mass spectrometry proteome analysis expands the Pseudomonas aeruginosa AmpR regulon to include cyclic di-GMP phosphodiesterases and phosphoproteins, and identifies novel open reading frames. J Proteomics 2013; 96:328-342. [PMID: 24291602 DOI: 10.1016/j.jprot.2013.11.018] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2013] [Revised: 11/12/2013] [Accepted: 11/18/2013] [Indexed: 12/23/2022]
Abstract
UNLABELLED Pseudomonas aeruginosa is well known for its antibiotic resistance and intricate regulatory network, contributing to its success as an opportunistic pathogen. This study is an extension of our transcriptomic analyses (microarray and RNA-Seq) to understand the global changes in PAO1 upon deleting a gene encoding a transcriptional regulator AmpR, in the presence and absence of β-lactam antibiotic. This study was performed under identical conditions to explore the proteome profile of the ampR deletion mutant (PAOΔampR) using LTQ-XL mass spectrometry. The proteomic data identified ~53% of total PAO1 proteins and expanded the master regulatory role of AmpR in determining antibiotic resistance and multiple virulence phenotypes in P. aeruginosa. AmpR proteome analysis identified 853 AmpR-dependent proteins, which include 102 transcriptional regulators and 21 two-component system proteins. AmpR also regulates cyclic di-GMP phosphodiesterases (PA4367, PA4969, PA4781) possibly affecting major virulence systems. Phosphoproteome analysis also suggests a significant role for AmpR in Ser, Thr and Tyr phosphorylation. These novel mechanisms of gene regulation were previously not associated with AmpR. The proteome analysis also identified many unannotated and misannotated ORFs in the P. aeruginosa genome. Thus, our data sheds light on important virulence regulatory pathways that can potentially be exploited to deal with P. aeruginosa infections. BIOLOGICAL SIGNIFICANCE The AmpR proteome data not only confirmed the role of AmpR in virulence and resistance to multiple antibiotics, but also expanded the perimeter of AmpR regulon. The data presented here points to the role of AmpR in regulating cyclic di-GMP levels and phosphorylation of Ser, Thr and Tyr, adding another dimension to the regulatory functions of AmpR. We also identify some previously unannotated/misannotated ORFs in the P. aeruginosa genome, indicating the limitations of existing ORF analyses software. This study will contribute towards understanding complex genetic organization of P. aeruginosa. Whole genome proteomic picture of regulators at higher nodal positions in the regulatory network will not only help us link various virulence phenotypes but also design novel therapeutic strategies.
Collapse
Affiliation(s)
- Hansi Kumari
- Department of Molecular Microbiology and Infectious Diseases, Herbert Wertheim College of Medicine, Florida International University, Miami, FL
| | - Senthil K Murugapiran
- Department of Molecular Microbiology and Infectious Diseases, Herbert Wertheim College of Medicine, Florida International University, Miami, FL
| | - Deepak Balasubramanian
- Department of Biological Sciences, College of Arts and Sciences, Florida International University, Miami, FL United States
| | - Lisa Schneper
- Department of Molecular Microbiology and Infectious Diseases, Herbert Wertheim College of Medicine, Florida International University, Miami, FL
| | - Massimo Merighi
- Department of Microbiology and Molecular Genetics, Harvard Medical School, Boston, MA
| | - David Sarracino
- Department of Microbiology and Molecular Genetics, Harvard Medical School, Boston, MA
| | - Stephen Lory
- Department of Microbiology and Molecular Genetics, Harvard Medical School, Boston, MA
| | - Kalai Mathee
- Department of Molecular Microbiology and Infectious Diseases, Herbert Wertheim College of Medicine, Florida International University, Miami, FL
| |
Collapse
|
19
|
Carlier AL, Omasits U, Ahrens CH, Eberl L. Proteomics analysis of Psychotria leaf nodule symbiosis: improved genome annotation and metabolic predictions. MOLECULAR PLANT-MICROBE INTERACTIONS : MPMI 2013; 26:1325-1333. [PMID: 23902262 DOI: 10.1094/mpmi-05-13-0152-r] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]
Abstract
Several plant species of the genus Psychotria (Rubiaceae) harbor Burkholderia sp. bacteria within specialized leaf nodules. The bacteria are transmitted vertically between plant generations and have not yet been cultured outside of their host. This symbiosis is considered to be obligatory because plants devoid of symbionts fail to develop into mature individuals. The genome of 'Candidatus Burkholderia kirkii' has been sequenced recently and has revealed evidence of reductive genome evolution, as shown by the proliferation of insertion sequences and the presence of numerous pseudogenes. We employed shotgun proteomics to investigate the expression of 'Ca. B. kirkii' proteins in the leaf nodule. Drawing from this dataset and refined comparative genomics analyses, we designed a new pseudogene prediction algorithm and improved the genome annotation. We also found conclusive evidence that nodule bacteria allocate vast resources to synthesis of secondary metabolites, possibly of the C7N aminocyclitol family. Expression of a putative 2-epi-5-valiolone synthase, a key enzyme of the C7N aminocyclitol synthesis, is high in the nodule population but downregulated in bacteria residing in the shoot apex, suggesting that production of secondary metabolites is particularly important in the leaf nodule.
Collapse
|
20
|
Armengaud J, Hartmann EM, Bland C. Proteogenomics for environmental microbiology. Proteomics 2013; 13:2731-42. [PMID: 23636904 DOI: 10.1002/pmic.201200576] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2012] [Revised: 03/06/2013] [Accepted: 04/09/2013] [Indexed: 11/09/2022]
Abstract
Proteogenomics sensu stricto refers to the use of proteomic data to refine the annotation of genomes from model organisms. Because of the limitations of automatic annotation pipelines, a relatively high number of errors occur during the structural annotation of genes coding for proteins. Whether putative orphan sequences or short genes encoding low-molecular-weight proteins really exist is still frequently a mystery. Whether start codons are well defined is also an open debate. These problems are exacerbated for genomes of microorganisms belonging to poorly documented genera, as related sequences are not always available for homology-guided annotation. The functional annotation of a significant proportion of genes is also another well-known issue when annotating environmental microorganisms. High-throughput shotgun proteomics has recently greatly evolved, allowing the exploration of the proteome from any microorganism at an unprecedented depth. The structural and functional annotation process may be usefully complemented with experimental data. Indeed, proteogenomic mapping has been successfully performed for a wide variety of organisms. Specific approaches devoted to systematically establishing the N-termini of a large set of proteins are being developed. N-terminomics is giving rise to datasets of experimentally proven translational start codons as well as validated peptide signals for secreted proteins. By extension, combining genomic and proteomic data is becoming routine in many research projects. The proteomic analysis of organisms with unfinished genome sequences, the so-called composite proteomics, and the search for microbial biomarkers by bottom-up and top-down combined approaches are some examples of proteogenomic-flavored studies. They illustrate the advent of a new era of environmental microbiology where proteomics and genomics are intimately integrated to answer key biological questions.
Collapse
Affiliation(s)
- Jean Armengaud
- CEA, DSV, IBEB, Lab Biochim System Perturb, Bagnols-sur-Cèze, France
| | | | | |
Collapse
|
21
|
Ansong C, Deatherage BL, Hyduke D, Schmidt B, McDermott JE, Jones MB, Chauhan S, Charusanti P, Kim YM, Nakayasu ES, Li J, Kidwai A, Niemann G, Brown RN, Metz TO, McAteer K, Heffron F, Peterson SN, Motin V, Palsson BO, Smith RD, Adkins JN. Studying Salmonellae and Yersiniae host-pathogen interactions using integrated 'omics and modeling. Curr Top Microbiol Immunol 2013; 363:21-41. [PMID: 22886542 DOI: 10.1007/82_2012_247] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
Abstract
Salmonella and Yersinia are two distantly related genera containing species with wide host-range specificity and pathogenic capacity. The metabolic complexity of these organisms facilitates robust lifestyles both outside of and within animal hosts. Using a pathogen-centric systems biology approach, we are combining a multi-omics (transcriptomics, proteomics, metabolomics) strategy to define properties of these pathogens under a variety of conditions including those that mimic the environments encountered during pathogenesis. These high-dimensional omics datasets are being integrated in selected ways to improve genome annotations, discover novel virulence-related factors, and model growth under infectious states. We will review the evolving technological approaches toward understanding complex microbial life through multi-omic measurements and integration, while highlighting some of our most recent successes in this area.
Collapse
Affiliation(s)
- Charles Ansong
- Biological Separations and Mass Spectroscopy Group, Pacific Northwest National Laboratory, PO Box 999, MSIN: K8-98, Richland, WA, 99352, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
22
|
Donati C, Rappuoli R. Reverse vaccinology in the 21st century: improvements over the original design. Ann N Y Acad Sci 2013; 1285:115-32. [PMID: 23527566 DOI: 10.1111/nyas.12046] [Citation(s) in RCA: 64] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
Reverse vaccinology (RV), the first application of genomic technologies in vaccine research, represented a major revolution in the process of discovering novel vaccines. By determining their entire antigenic repertoire, researchers could identify protective targets and design efficacious vaccines for pathogens where conventional approaches had failed. Bexsero, the first vaccine developed using RV, has recently received positive opinion from the European Medicines Agency. The use of RV initiated a cascade of changes that affected the entire vaccine development process, shifting the focus from the identification of a list of vaccine candidates to the definition of a set of high throughput screens to reduce the need for costly and labor intensive tests in animal models. It is now clear that a deep understanding of the epidemiology of vaccine candidates, and their regulation and role in host-pathogen interactions, must become an integral component of the screening workflow. Far from being outdated by technological advancements, RV still represents a paradigm of how high-throughput technologies and scientific insight can be integrated into biotechnology research.
Collapse
|
23
|
Abstract
Signal peptides are a cornerstone mechanism for cellular protein localization, yet until now experimental determination of signal peptides has come from only a narrow taxonomic sampling. As a result, the dominant view is that Sec-cleaved signal peptides in prokaryotes are defined by a canonical AxA motif. Although other residues are permitted in the motif, alanine is by far the most common. Here we broadly examine proteomics data to reveal the signal peptide sequences for 32 bacterial and archaeal organisms from nine phyla and demonstrate that this alanine preference is not universal. Discoveries include fundamentally distinct signal peptide motifs from Alphaproteobacteria, Spirochaetes, Thermotogae and Euryarchaeota. In these novel motifs, alanine is no longer the dominant residue but has been replaced in a different way for each taxon. Surprisingly, divergent motifs correlate with a proteome-wide reduction in alanine. Computational analyses of ~1,500 genomes reveal numerous major evolutionary clades which have replaced the canonical signal peptide sequence with novel motifs. This article replaces a widely held general model with a more detailed model describing phylogenetically correlated variation in motifs for Sec secretion.
Collapse
|
24
|
Abstract
High-throughput identification of proteins with the latest generation of hybrid high-resolution mass spectrometers is opening new perspectives in microbiology. I present, here, an overview of tandem mass spectrometry technology and bioinformatics for shotgun proteomics that make 2D-PAGE approaches obsolete. Non-labelling quantitative approaches have become more popular than labelling techniques on most proteomic platforms because they are easier to carry out while their quantitative outcome is rather robust. Parameters for recording mass spectrometry data, however, need to be chosen carefully and statistics to assess the confidence of the results should not be neglected. Interestingly, next-generation sequencing methodologies make any microbial model quickly amenable to proteomics, leading to the documentation of a wide range of organisms from diverse environments. Some recent discoveries made using microbial proteomics have challenged some biological dogma, such as: (i) initiation of the translation does not occur predominantly from ATG codons in some microorganisms, (ii) non-canonical initiation codons are used to regulate the production of specific but important proteins and (iii) a gene may code for multiple polypeptide species, heterogeneous in terms of sequences. Microbial diversity and microbial physiology can now be revisited by means of exhaustive comparative proteomic surveys where thousands of proteins are detected and quantified. Proteogenomics, consisting of better annotating of genomes with the help of proteomic evidence, is paving the way for integrated multi-omic approaches in microbiology. Finally, meta-proteomic tools and approaches are emerging for tackling the high complexity of the microbial world as a whole, opening new perspectives for assessing how microbial communities function.
Collapse
Affiliation(s)
- Jean Armengaud
- CEA, DSV, IBEB, Lab Biochim System Perturb, F-30207 Bagnols-sur-Cèze, France.
| |
Collapse
|
25
|
George Priya Doss C, Rajith B. Computational refinement of functional single nucleotide polymorphisms associated with ATM gene. PLoS One 2012; 7:e34573. [PMID: 22529920 PMCID: PMC3326031 DOI: 10.1371/journal.pone.0034573] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2012] [Accepted: 03/07/2012] [Indexed: 12/22/2022] Open
Abstract
BACKGROUND Understanding and predicting molecular basis of disease is one of the major challenges in modern biology and medicine. SNPs associated with complex disorders can create, destroy, or modify protein coding sites. Single amino acid substitutions in the ATM gene are the most common forms of genetic variations that account for various forms of cancer. However, the extent to which SNPs interferes with the gene regulation and affects cancer susceptibility remains largely unknown. PRINCIPAL FINDINGS We analyzed the deleterious nsSNPs associated with ATM gene based on different computational methods. An integrative scoring system and sequence conservation of amino acid residues was adapted for a priori nsSNP analysis of variants associated with cancer. We further extended our approach on SNPs that could potentially influence protein Post Translational Modifications in ATM gene. SIGNIFICANCE In the lack of adequate prior reports on the possible deleterious effects of nsSNPs, we have systematically analyzed and characterized the functional variants in both coding and non coding region that can alter the expression and function of ATM gene. In silico characterization of nsSNPs affecting ATM gene function can aid in better understanding of genetic differences in disease susceptibility.
Collapse
Affiliation(s)
- C George Priya Doss
- Centre for Nanobiotechnology, Medical Biotechnology Division, School of Biosciences and Technology, Vellore Institute of Technology University, Vellore, Tamil Nadu, India.
| | | |
Collapse
|
26
|
Schrimpe-Rutledge AC, Jones MB, Chauhan S, Purvine SO, Sanford JA, Monroe ME, Brewer HM, Payne SH, Ansong C, Frank BC, Smith RD, Peterson SN, Motin VL, Adkins JN. Comparative omics-driven genome annotation refinement: application across Yersiniae. PLoS One 2012; 7:e33903. [PMID: 22479471 PMCID: PMC3313959 DOI: 10.1371/journal.pone.0033903] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2011] [Accepted: 02/19/2012] [Indexed: 02/03/2023] Open
Abstract
Genome sequencing continues to be a rapidly evolving technology, yet most downstream aspects of genome annotation pipelines remain relatively stable or are even being abandoned. The annotation process is now performed almost exclusively in an automated fashion to balance the large number of sequences generated. One possible way of reducing errors inherent to automated computational annotations is to apply data from omics measurements (i.e. transcriptional and proteomic) to the un-annotated genome with a proteogenomic-based approach. Here, the concept of annotation refinement has been extended to include a comparative assessment of genomes across closely related species. Transcriptomic and proteomic data derived from highly similar pathogenic Yersiniae (Y. pestis CO92, Y. pestis Pestoides F, and Y. pseudotuberculosis PB1/+) was used to demonstrate a comprehensive comparative omic-based annotation methodology. Peptide and oligo measurements experimentally validated the expression of nearly 40% of each strain's predicted proteome and revealed the identification of 28 novel and 68 incorrect (i.e., observed frameshifts, extended start sites, and translated pseudogenes) protein-coding sequences within the three current genome annotations. Gene loss is presumed to play a major role in Y. pestis acquiring its niche as a virulent pathogen, thus the discovery of many translated pseudogenes, including the insertion-ablated argD, underscores a need for functional analyses to investigate hypotheses related to divergence. Refinements included the discovery of a seemingly essential ribosomal protein, several virulence-associated factors, a transcriptional regulator, and many hypothetical proteins that were missed during annotation.
Collapse
Affiliation(s)
| | - Marcus B. Jones
- J. Craig Venter Institute, Rockville, Maryland, United States of America
| | - Sadhana Chauhan
- University of Texas Medical Branch, Galveston, Texas, United States of America
| | - Samuel O. Purvine
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - James A. Sanford
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - Matthew E. Monroe
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - Heather M. Brewer
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - Samuel H. Payne
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - Charles Ansong
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - Bryan C. Frank
- J. Craig Venter Institute, Rockville, Maryland, United States of America
| | - Richard D. Smith
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - Scott N. Peterson
- J. Craig Venter Institute, Rockville, Maryland, United States of America
| | - Vladimir L. Motin
- University of Texas Medical Branch, Galveston, Texas, United States of America
| | - Joshua N. Adkins
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, United States of America
- * E-mail:
| |
Collapse
|