1
|
Discovery of Post-Translational Modifications in Emiliania huxleyi. Molecules 2021; 26:molecules26072027. [PMID: 33918234 PMCID: PMC8038017 DOI: 10.3390/molecules26072027] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2021] [Revised: 03/31/2021] [Accepted: 04/01/2021] [Indexed: 11/17/2022] Open
Abstract
Emiliania huxleyi is a cosmopolitan coccolithophore that plays an essential role in global carbon and sulfur cycling, and contributes to marine cloud formation and climate regulation. Previously, the proteomic profile of Emiliania huxleyi was investigated using a three-dimensional separation strategy combined with liquid chromatography-tandem mass spectrometry (LC-MS/MS). The current study reuses the MS/MS spectra obtained, for the global discovery of post-translational modifications (PTMs) in this species without specific enrichment methods. Twenty-five different PTM types were examined using Trans-Proteomic Pipeline (Comet and PeptideProphet). Overall, 13,483 PTMs were identified in 7421 proteins. Methylation was the most frequent PTM with more than 2800 modified sites, and lysine was the most frequently modified amino acid with more than 4000 PTMs. The number of proteins identified increased by 22.5% to 18,780 after performing the PTM search. Compared to intact peptides, the intensities of some modified peptides were superior or equivalent. The intensities of some proteins increased dramatically after the PTM search. Gene ontology analysis revealed that protein persulfidation was related to photosynthesis in Emiliania huxleyi. Additionally, various membrane proteins were found to be phosphorylated. Thus, our global PTM discovery platform provides an overview of PTMs in the species and prompts further studies to uncover their biological functions. The combination of a three-dimensional separation method with global PTM search is a promising approach for the identification and discovery of PTMs in other species.
Collapse
|
2
|
Sanchiz Á, Morato E, Rastrojo A, Camacho E, González-de la Fuente S, Marina A, Aguado B, Requena JM. The Experimental Proteome of Leishmania infantum Promastigote and Its Usefulness for Improving Gene Annotations. Genes (Basel) 2020; 11:E1036. [PMID: 32887454 PMCID: PMC7563732 DOI: 10.3390/genes11091036] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2020] [Revised: 08/27/2020] [Accepted: 08/28/2020] [Indexed: 12/02/2022] Open
Abstract
Leishmania infantum causes visceral leishmaniasis (kala-azar), the most severe form of leishmaniasis, which is lethal if untreated. A few years ago, the re-sequencing and de novo assembling of the L. infantum (JPCM5 strain) genome was accomplished, and now we aimed to describe and characterize the experimental proteome of this species. In this work, we performed a proteomic analysis from axenic cultured promastigotes and carried out a detailed comparison with other Leishmania experimental proteomes published to date. We identified 2352 proteins based on a search of mass spectrometry data against a database built from the six-frame translated genome sequence of L. infantum. We detected many proteins belonging to organelles such as glycosomes, mitochondria, or flagellum, as well as many metabolic enzymes and many putative RNA binding proteins and molecular chaperones. Moreover, we listed some proteins presenting post-translational modifications, such as phosphorylations, acetylations, and methylations. On the other hand, the identification of peptides mapping to genomic regions previously annotated as non-coding allowed for the correction of annotations, leading to the N-terminal extension of protein sequences and the uncovering of eight novel protein-coding genes. The alliance of proteomics, genomics, and transcriptomics has resulted in a powerful combination for improving the annotation of the L. infantum reference genome.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Jose M. Requena
- Centro de Biología Molecular “Severo Ochoa” (CBMSO, CSIC-UAM) Campus de Excelencia Internacional (CEI) UAM+CSIC, Universidad Autónoma de Madrid, 28049 Madrid, Spain; (Á.S.); (E.M.); (A.R.); (E.C.); (S.G.-d.l.F.); (A.M.); (B.A.)
| |
Collapse
|
3
|
Proteomic analysis and translational perspective of hepatocellular carcinoma: Identification of diagnostic protein biomarkers by an onco-proteogenomics approach. Kaohsiung J Med Sci 2016; 32:535-544. [DOI: 10.1016/j.kjms.2016.09.002] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2016] [Revised: 09/07/2016] [Accepted: 09/08/2016] [Indexed: 02/07/2023] Open
|
4
|
|
5
|
Locard-Paulet M, Pible O, Gonzalez de Peredo A, Alpha-Bazin B, Almunia C, Burlet-Schiltz O, Armengaud J. Clinical implications of recent advances in proteogenomics. Expert Rev Proteomics 2016; 13:185-99. [DOI: 10.1586/14789450.2016.1132169] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
|
6
|
Licona-Cassani C, Cruz-Morales P, Manteca A, Barona-Gomez F, Nielsen LK, Marcellin E. Systems Biology Approaches to Understand Natural Products Biosynthesis. Front Bioeng Biotechnol 2015; 3:199. [PMID: 26697425 PMCID: PMC4673338 DOI: 10.3389/fbioe.2015.00199] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2015] [Accepted: 11/24/2015] [Indexed: 11/24/2022] Open
Abstract
Actinomycetes populate soils and aquatic sediments that impose biotic and abiotic challenges for their survival. As a result, actinomycetes metabolism and genomes have evolved to produce an overwhelming diversity of specialized molecules. Polyketides, non-ribosomal peptides, post-translationally modified peptides, lactams, and terpenes are well-known bioactive natural products with enormous industrial potential. Accessing such biological diversity has proven difficult due to the complex regulation of cellular metabolism in actinomycetes and to the sparse knowledge of their physiology. The past decade, however, has seen the development of omics technologies that have significantly contributed to our better understanding of their biology. Key observations have contributed toward a shift in the exploitation of actinomycete’s biology, such as using their full genomic potential, activating entire pathways through key metabolic elicitors and pathway engineering to improve biosynthesis. Here, we review recent efforts devoted to achieving enhanced discovery, activation, and manipulation of natural product biosynthetic pathways in model actinomycetes using genome-scale biological datasets.
Collapse
Affiliation(s)
- Cuauhtemoc Licona-Cassani
- Australian Institute for Bioengineering and Nanotechnology (AIBN), The University of Queensland , Brisbane, QLD , Australia ; National Laboratory of Genomics for Biodiversity (LANGEBIO), Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional (Cinvestav-IPN) , Irapuato , México
| | - Pablo Cruz-Morales
- National Laboratory of Genomics for Biodiversity (LANGEBIO), Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional (Cinvestav-IPN) , Irapuato , México
| | - Angel Manteca
- Departamento de Biología Funcional and Instituto Universitario de Oncología del Principado de Asturias (IUOPA), Facultad de Medicina, Universidad de Oviedo , Oviedo , Spain
| | - Francisco Barona-Gomez
- National Laboratory of Genomics for Biodiversity (LANGEBIO), Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional (Cinvestav-IPN) , Irapuato , México
| | - Lars K Nielsen
- Australian Institute for Bioengineering and Nanotechnology (AIBN), The University of Queensland , Brisbane, QLD , Australia
| | - Esteban Marcellin
- Australian Institute for Bioengineering and Nanotechnology (AIBN), The University of Queensland , Brisbane, QLD , Australia
| |
Collapse
|
7
|
Kumar D, Mondal AK, Kutum R, Dash D. Proteogenomics of rare taxonomic phyla: A prospective treasure trove of protein coding genes. Proteomics 2015; 16:226-40. [PMID: 26773550 DOI: 10.1002/pmic.201500263] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2015] [Revised: 09/18/2015] [Accepted: 09/28/2015] [Indexed: 01/04/2023]
Abstract
Sustainable innovations in sequencing technologies have resulted in a torrent of microbial genome sequencing projects. However, the prokaryotic genomes sequenced so far are unequally distributed along their phylogenetic tree; few phyla contain the majority, the rest only a few representatives. Accurate genome annotation lags far behind genome sequencing. While automated computational prediction, aided by comparative genomics, remains a popular choice for genome annotation, substantial fraction of these annotations are erroneous. Proteogenomics utilizes protein level experimental observations to annotate protein coding genes on a genome wide scale. Benefits of proteogenomics include discovery and correction of gene annotations regardless of their phylogenetic conservation. This not only allows detection of common, conserved proteins but also the discovery of protein products of rare genes that may be horizontally transferred or taxonomy specific. Chances of encountering such genes are more in rare phyla that comprise a small number of complete genome sequences. We collated all bacterial and archaeal proteogenomic studies carried out to date and reviewed them in the context of genome sequencing projects. Here, we present a comprehensive list of microbial proteogenomic studies, their taxonomic distribution, and also urge for targeted proteogenomics of underexplored taxa to build an extensive reference of protein coding genes.
Collapse
Affiliation(s)
- Dhirendra Kumar
- G. N. Ramachandran Knowledge Center of Genome Informatics, CSIR-Institute of Genomics and Integrative Biology, South Campus, Sukhdev Vihar, Delhi, India
| | - Anupam Kumar Mondal
- G. N. Ramachandran Knowledge Center of Genome Informatics, CSIR-Institute of Genomics and Integrative Biology, South Campus, Sukhdev Vihar, Delhi, India
| | - Rintu Kutum
- G. N. Ramachandran Knowledge Center of Genome Informatics, CSIR-Institute of Genomics and Integrative Biology, South Campus, Sukhdev Vihar, Delhi, India
| | - Debasis Dash
- G. N. Ramachandran Knowledge Center of Genome Informatics, CSIR-Institute of Genomics and Integrative Biology, South Campus, Sukhdev Vihar, Delhi, India
| |
Collapse
|
8
|
Christie-Oleza JA, Armengaud J. Proteomics of theRoseobacterclade, a window to the marine microbiology landscape. Proteomics 2015; 15:3928-42. [DOI: 10.1002/pmic.201500222] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2015] [Revised: 08/24/2015] [Accepted: 09/22/2015] [Indexed: 11/07/2022]
Affiliation(s)
| | - Jean Armengaud
- CEA; DSV; IBiTec-S; SPI; Li2D; Laboratory “Innovative Technologies for Detection and Diagnostics”; Bagnols-sur-Cèze France
| |
Collapse
|
9
|
Omer H, Alpha-Bazin B, Brunet JL, Armengaud J, Duport C. Proteomics identifies Bacillus cereus EntD as a pivotal protein for the production of numerous virulence factors. Front Microbiol 2015; 6:1004. [PMID: 26500610 PMCID: PMC4595770 DOI: 10.3389/fmicb.2015.01004] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2015] [Accepted: 09/07/2015] [Indexed: 11/13/2022] Open
Abstract
Bacillus cereus is a Gram-positive pathogen that causes a wide variety of diseases in humans. It secretes into the extracellular milieu proteins that may contribute directly or indirectly to its virulence. EntD is a novel exoprotein identified by proteogenomics of B. cereus ATCC 14579. We constructed a ΔentD mutant and analyzed the impact of entD disruption on the cellular proteome and exoproteome isolated from early, late, and stationary-phase cultures. We identified 308 and 79 proteins regulated by EntD in the cellular proteome and the exoproteome, respectively. The contribution of these proteins to important virulence-associated functions, including central metabolism, cell structure, antioxidative ability, cell motility, and toxin production, are presented. The proteomic data were correlated with the growth defect, cell morphology change, reduced motility, and reduced cytotoxicity of the ΔentD mutant strain. We conclude that EntD is an important player in B. cereus virulence. The function of EntD and the putative EntD-dependent regulatory network are discussed. To our knowledge, this study is the first characterization of an Ent family protein in a species of the B. cereus group.
Collapse
Affiliation(s)
- Hélène Omer
- Université d'Avignon et des Pays de Vaucluse, UMR408 Sécurité et Qualité des Produits d'Origine Végétale Avignon, France ; INRA, UMR408 Sécurité et Qualité des Produits d'Origine Végétale Avignon, France ; CEA-Marcoule, DSV/IBITEC-S/SPI/Li2D, Laboratory "Innovative technologies for Detection and Diagnostic" Bagnols-sur-Cèze, France
| | - Béatrice Alpha-Bazin
- CEA-Marcoule, DSV/IBITEC-S/SPI/Li2D, Laboratory "Innovative technologies for Detection and Diagnostic" Bagnols-sur-Cèze, France
| | | | - Jean Armengaud
- CEA-Marcoule, DSV/IBITEC-S/SPI/Li2D, Laboratory "Innovative technologies for Detection and Diagnostic" Bagnols-sur-Cèze, France
| | - Catherine Duport
- Université d'Avignon et des Pays de Vaucluse, UMR408 Sécurité et Qualité des Produits d'Origine Végétale Avignon, France ; INRA, UMR408 Sécurité et Qualité des Produits d'Origine Végétale Avignon, France
| |
Collapse
|
10
|
Proteogenomic analysis and global discovery of posttranslational modifications in prokaryotes. Proc Natl Acad Sci U S A 2014; 111:E5633-42. [PMID: 25512518 DOI: 10.1073/pnas.1412722111] [Citation(s) in RCA: 51] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
We describe an integrated workflow for proteogenomic analysis and global profiling of posttranslational modifications (PTMs) in prokaryotes and use the model cyanobacterium Synechococcus sp. PCC 7002 (hereafter Synechococcus 7002) as a test case. We found more than 20 different kinds of PTMs, and a holistic view of PTM events in this organism grown under different conditions was obtained without specific enrichment strategies. Among 3,186 predicted protein-coding genes, 2,938 gene products (>92%) were identified. We also identified 118 previously unidentified proteins and corrected 38 predicted gene-coding regions in the Synechococcus 7002 genome. This systematic analysis not only provides comprehensive information on protein profiles and the diversity of PTMs in Synechococcus 7002 but also provides some insights into photosynthetic pathways in cyanobacteria. The entire proteogenomics pipeline is applicable to any sequenced prokaryotic organism, and we suggest that it should become a standard part of genome annotation projects.
Collapse
|
11
|
Boja ES, Rodriguez H. Proteogenomic convergence for understanding cancer pathways and networks. Clin Proteomics 2014; 11:22. [PMID: 24994965 PMCID: PMC4067069 DOI: 10.1186/1559-0275-11-22] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2014] [Accepted: 03/31/2014] [Indexed: 11/21/2022] Open
Abstract
During the past several decades, the understanding of cancer at the molecular level has been primarily focused on mechanisms on how signaling molecules transform homeostatically balanced cells into malignant ones within an individual pathway. However, it is becoming more apparent that pathways are dynamic and crosstalk at different control points of the signaling cascades, making the traditional linear signaling models inadequate to interpret complex biological systems. Recent technological advances in high throughput, deep sequencing for the human genomes and proteomic technologies to comprehensively characterize the human proteomes in conjunction with multiplexed targeted proteomic assays to measure panels of proteins involved in biologically relevant pathways have made significant progress in understanding cancer at the molecular level. It is undeniable that proteomic profiling of differentially expressed proteins under many perturbation conditions, or between normal and "diseased" states is important to capture a first glance at the overall proteomic landscape, which has been a main focus of proteomics research during the past 15-20 years. However, the research community is gradually shifting its heavy focus from that initial discovery step to protein target verification using multiplexed quantitative proteomic assays, capable of measuring changes in proteins and their interacting partners, isoforms, and post-translational modifications (PTMs) in response to stimuli in the context of signaling pathways and protein networks. With a critical link to genotypes (i.e., high throughput genomics and transcriptomics data), new and complementary information can be gleaned from multi-dimensional omics data to (1) assess the effect of genomic and transcriptomic aberrations on such complex molecular machinery in the context of cell signaling architectures associated with pathological diseases such as cancer (i.e., from genotype to proteotype to phenotype); and (2) target pathway- and network-driven changes and map the fluctuations of these functional units (proteins) responsible for cellular activities in response to perturbation in a spatiotemporal fashion to better understand cancer biology as a whole system.
Collapse
Affiliation(s)
- Emily S Boja
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, National Institutes of Health, 31 Center Drive, MSC 2580, 20892 Bethesda, MD, USA
| | - Henry Rodriguez
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, National Institutes of Health, 31 Center Drive, MSC 2580, 20892 Bethesda, MD, USA
| |
Collapse
|
12
|
Bland C, Hartmann EM, Christie-Oleza JA, Fernandez B, Armengaud J. N-Terminal-oriented proteogenomics of the marine bacterium roseobacter denitrificans Och114 using N-Succinimidyloxycarbonylmethyl)tris(2,4,6-trimethoxyphenyl)phosphonium bromide (TMPP) labeling and diagonal chromatography. Mol Cell Proteomics 2014; 13:1369-81. [PMID: 24536027 DOI: 10.1074/mcp.o113.032854] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Given the ease of whole genome sequencing with next-generation sequencers, structural and functional gene annotation is now purely based on automated prediction. However, errors in gene structure are frequent, the correct determination of start codons being one of the main concerns. Here, we combine protein N termini derivatization using (N-Succinimidyloxycarbonylmethyl)tris(2,4,6-trimethoxyphenyl)phosphonium bromide (TMPP Ac-OSu) as a labeling reagent with the COmbined FRActional DIagonal Chromatography (COFRADIC) sorting method to enrich labeled N-terminal peptides for mass spectrometry detection. Protein digestion was performed in parallel with three proteases to obtain a reliable automatic validation of protein N termini. The analysis of these N-terminal enriched fractions by high-resolution tandem mass spectrometry allowed the annotation refinement of 534 proteins of the model marine bacterium Roseobacter denitrificans OCh114. This study is especially efficient regarding mass spectrometry analytical time. From the 534 validated N termini, 480 confirmed existing gene annotations, 41 highlighted erroneous start codon annotations, five revealed totally new mis-annotated genes; the mass spectrometry data also suggested the existence of multiple start sites for eight different genes, a result that challenges the current view of protein translation initiation. Finally, we identified several proteins for which classical genome homology-driven annotation was inconsistent, questioning the validity of automatic annotation pipelines and emphasizing the need for complementary proteomic data. All data have been deposited to the ProteomeXchange with identifier PXD000337.
Collapse
Affiliation(s)
- Céline Bland
- CEA, DSV, IBEB, Lab Biochim System Perturb, Bagnols-sur-Cèze, F-30207, France
| | | | | | | | | |
Collapse
|
13
|
Armengaud J, Trapp J, Pible O, Geffard O, Chaumot A, Hartmann EM. Non-model organisms, a species endangered by proteogenomics. J Proteomics 2014; 105:5-18. [PMID: 24440519 DOI: 10.1016/j.jprot.2014.01.007] [Citation(s) in RCA: 100] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2013] [Revised: 12/24/2013] [Accepted: 01/07/2014] [Indexed: 10/25/2022]
Abstract
UNLABELLED Previously, large-scale proteomics was possible only for organisms whose genomes were sequenced, meaning the most common model organisms. The use of next-generation sequencers is now changing the deal. With "proteogenomics", the use of experimental proteomics data to refine genome annotations, a higher integration of omics data is gaining ground. By extension, combining genomic and proteomic data is becoming routine in many research projects. "Proteogenomic"-flavored approaches are currently expanding, enabling the molecular studies of non-model organisms at an unprecedented depth. Today draft genomes can be obtained using next-generation sequencers in a rather straightforward way and at a reasonable cost for any organism. Unfinished genome sequences can be used to interpret tandem mass spectrometry proteomics data without the need for time-consuming genome annotation, and the use of RNA-seq to establish nucleotide sequences that are directly translated into protein sequences appears promising. There are, however, certain drawbacks that deserve further attention for RNA-seq to become more efficient. Here, we discuss the opportunities of working with non-model organisms, the proteomic methods that have been used until now, and the dramatic improvements proffered by proteogenomics. These put the distinction between model and non-model organisms in great danger, at least in terms of proteomics! BIOLOGICAL SIGNIFICANCE Model organisms have been crucial for in-depth analysis of cellular and molecular processes of life. Focusing the efforts of thousands of researchers on the Escherichia coli bacterium, Saccharomyces cerevisiae yeast, Arabidopsis thaliana plant, Danio rerio fish and other models for which genetic manipulation was possible was certainly worthwhile in terms of fundamental and invaluable biological insights. Until recently, proteomics of non-model organisms was limited to tedious, homology-based techniques, but today draft genomes or RNA-seq data can be straightforwardly obtained using next-generation sequencers, allowing the establishment of a draft protein database for any organism. Thus, proteogenomics opens new perspectives for molecular studies of non-model organisms, although they are still difficult experimental organisms. This article is part of a Special Issue entitled: Proteomics of non-model organisms.
Collapse
Affiliation(s)
- Jean Armengaud
- CEA, DSV, IBEB, Lab Biochim System Perturb, Bagnols-sur-Cèze F-30207, France.
| | - Judith Trapp
- CEA, DSV, IBEB, Lab Biochim System Perturb, Bagnols-sur-Cèze F-30207, France; Irstea, UR MALY, F-69626 Villeurbanne, France
| | - Olivier Pible
- CEA, DSV, IBEB, Lab Biochim System Perturb, Bagnols-sur-Cèze F-30207, France
| | | | | | - Erica M Hartmann
- CEA, DSV, IBEB, Lab Biochim System Perturb, Bagnols-sur-Cèze F-30207, France
| |
Collapse
|
14
|
Marcellin E, Licona-Cassani C, Mercer TR, Palfreyman RW, Nielsen LK. Re-annotation of the Saccharopolyspora erythraea genome using a systems biology approach. BMC Genomics 2013; 14:699. [PMID: 24118942 PMCID: PMC4008361 DOI: 10.1186/1471-2164-14-699] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2013] [Accepted: 09/24/2013] [Indexed: 12/01/2022] Open
Abstract
Background Accurate bacterial genome annotations provide a framework to understanding cellular functions, behavior and pathogenicity and are essential for metabolic engineering. Annotations based only on in silico predictions are inaccurate, particularly for large, high G + C content genomes due to the lack of similarities in gene length and gene organization to model organisms. Results Here we describe a 2D systems biology driven re-annotation of the Saccharopolyspora erythraea genome using proteogenomics, a genome-scale metabolic reconstruction, RNA-sequencing and small-RNA-sequencing. We observed transcription of more than 300 intergenic regions, detected 59 peptides in intergenic regions, confirmed 164 open reading frames previously annotated as hypothetical proteins and reassigned function to open reading frames using the genome-scale metabolic reconstruction. Finally, we present a novel way of mapping ribosomal binding sites across the genome by sequencing small RNAs. Conclusions The work presented here describes a novel framework for annotation of the Saccharopolyspora erythraea genome. Based on experimental observations, the 2D annotation framework greatly reduces errors that are commonly made when annotating large-high G + C content genomes using computational prediction algorithms.
Collapse
Affiliation(s)
- Esteban Marcellin
- Australian Institute for Bioengineering and Nanotechnology (AIBN), The University of Queensland, Brisbane, Qld 4072, Australia.
| | | | | | | | | |
Collapse
|
15
|
Agrawal GK, Sarkar A, Righetti PG, Pedreschi R, Carpentier S, Wang T, Barkla BJ, Kohli A, Ndimba BK, Bykova NV, Rampitsch C, Zolla L, Rafudeen MS, Cramer R, Bindschedler LV, Tsakirpaloglou N, Ndimba RJ, Farrant JM, Renaut J, Job D, Kikuchi S, Rakwal R. A decade of plant proteomics and mass spectrometry: translation of technical advancements to food security and safety issues. MASS SPECTROMETRY REVIEWS 2013; 32:335-65. [PMID: 23315723 DOI: 10.1002/mas.21365] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/20/2012] [Revised: 09/10/2012] [Accepted: 09/10/2012] [Indexed: 05/21/2023]
Abstract
Tremendous progress in plant proteomics driven by mass spectrometry (MS) techniques has been made since 2000 when few proteomics reports were published and plant proteomics was in its infancy. These achievements include the refinement of existing techniques and the search for new techniques to address food security, safety, and health issues. It is projected that in 2050, the world's population will reach 9-12 billion people demanding a food production increase of 34-70% (FAO, 2009) from today's food production. Provision of food in a sustainable and environmentally committed manner for such a demand without threatening natural resources, requires that agricultural production increases significantly and that postharvest handling and food manufacturing systems become more efficient requiring lower energy expenditure, a decrease in postharvest losses, less waste generation and food with longer shelf life. There is also a need to look for alternative protein sources to animal based (i.e., plant based) to be able to fulfill the increase in protein demands by 2050. Thus, plant biology has a critical role to play as a science capable of addressing such challenges. In this review, we discuss proteomics especially MS, as a platform, being utilized in plant biology research for the past 10 years having the potential to expedite the process of understanding plant biology for human benefits. The increasing application of proteomics technologies in food security, analysis, and safety is emphasized in this review. But, we are aware that no unique approach/technology is capable to address the global food issues. Proteomics-generated information/resources must be integrated and correlated with other omics-based approaches, information, and conventional programs to ensure sufficient food and resources for human development now and in the future.
Collapse
Affiliation(s)
- Ganesh Kumar Agrawal
- Research Laboratory for Biotechnology and Biochemistry, PO Box 13265, Kathmandu, Nepal.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
16
|
Krug K, Carpy A, Behrends G, Matic K, Soares NC, Macek B. Deep coverage of the Escherichia coli proteome enables the assessment of false discovery rates in simple proteogenomic experiments. Mol Cell Proteomics 2013; 12:3420-30. [PMID: 23908556 DOI: 10.1074/mcp.m113.029165] [Citation(s) in RCA: 66] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Recent advances in mass spectrometry (MS) have led to increased applications of shotgun proteomics to the refinement of genome annotation. The typical "proteo-genomic" workflows rely on the mapping of peptide MS/MS spectra onto databases derived via six-frame translation of the genome sequence. These databases contain a large proportion of spurious protein sequences which make the statistical confidence of the resulting peptide spectrum matches difficult to assess. Here we performed a comprehensive analysis of the Escherichia coli proteome using LTQ-Orbitrap MS and mapped the corresponding MS/MS spectra onto a six-frame translation of the E. coli genome. We hypothesized that the protein-coding part of the E. coli genome approaches complete annotation and that the majority of six frame-specific (novel) peptide spectrum matches can be considered as false positive identifications. We confirm our hypothesis by showing that the posterior error probability distribution of novel hits is almost identical to that of reversed (decoy) hits; this enables us to estimate the sensitivity, specificity, accuracy, and false discovery rate in a typical bacterial proteo-genomic dataset. We use two complementary computational frameworks for processing and statistical assessment of MS/MS data: MaxQuant and Trans-Proteomic Pipeline. We show that MaxQuant achieves a more sensitive six-frame database search with an acceptable false discovery rate and is therefore well suited for global genome reannotation applications, whereas the Trans-Proteomic Pipeline achieves higher specificity and is well suited for high-confidence validation. The use of a small and well-annotated bacterial genome enables us to address genome coverage achieved in state-of-the-art bacterial proteomics: identified peptide sequences mapped to all expressed E. coli proteins but covered 31.7% of the protein-coding genome sequence. Our results show that false discovery rates can be substantially underestimated even in "simple" proteo-genomic experiments obtained by means of high-accuracy MS and point to the necessity of further improvements concerning the coverage of peptide sequences by MS-based methods.
Collapse
Affiliation(s)
- Karsten Krug
- Proteome Center Tuebingen, University of Tuebingen, 72076 Tuebingen, Germany
| | | | | | | | | | | |
Collapse
|
17
|
Liu D, Hoynes-O'Connor A, Zhang F. Bridging the gap between systems biology and synthetic biology. Front Microbiol 2013; 4:211. [PMID: 23898328 PMCID: PMC3722476 DOI: 10.3389/fmicb.2013.00211] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2013] [Accepted: 07/07/2013] [Indexed: 12/24/2022] Open
Abstract
Systems biology is an inter-disciplinary science that studies the complex interactions and the collective behavior of a cell or an organism. Synthetic biology, as a technological subject, combines biological science and engineering, allowing the design and manipulation of a system for certain applications. Both systems and synthetic biology have played important roles in the recent development of microbial platforms for energy, materials, and environmental applications. More importantly, systems biology provides the knowledge necessary for the development of synthetic biology tools, which in turn facilitates the manipulation and understanding of complex biological systems. Thus, the combination of systems and synthetic biology has huge potential for studying and engineering microbes, especially to perform advanced tasks, such as producing biofuels. Although there have been very few studies in integrating systems and synthetic biology, existing examples have demonstrated great power in extending microbiological capabilities. This review focuses on recent efforts in microbiological genomics, transcriptomics, proteomics, and metabolomics, aiming to fill the gap between systems and synthetic biology.
Collapse
Affiliation(s)
- Di Liu
- Department of Energy, Environmental and Chemical Engineering, Washington University St. Louis, MO, USA
| | | | | |
Collapse
|
18
|
Rubiano-Labrador C, Bland C, Miotello G, Guérin P, Pible O, Baena S, Armengaud J. Proteogenomic insights into salt tolerance by a halotolerant alpha-proteobacterium isolated from an Andean saline spring. J Proteomics 2013; 97:36-47. [PMID: 23727365 DOI: 10.1016/j.jprot.2013.05.020] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2012] [Revised: 04/30/2013] [Accepted: 05/16/2013] [Indexed: 01/18/2023]
Abstract
UNLABELLED Tistlia consotensis is a halotolerant Rhodospirillaceae that was isolated from a saline spring located in the Colombian Andes with a salt concentration close to seawater (4.5%w/vol). We cultivated this microorganism in three NaCl concentrations, i.e. optimal (0.5%), without (0.0%) and high (4.0%) salt concentration, and analyzed its cellular proteome. For assigning tandem mass spectrometry data, we first sequenced its genome and constructed a six reading frame ORF database from the draft sequence. We annotated only the genes whose products (872) were detected. We compared the quantitative proteome data sets recorded for the three different growth conditions. At low salinity general stress proteins (chaperons, proteases and proteins associated with oxidative stress protection), were detected in higher amounts, probably linked to difficulties for proper protein folding and metabolism. Proteogenomics and comparative genomics pointed at the CrgA transcriptional regulator as a key-factor for the proteome remodeling upon low osmolarity. In hyper-osmotic condition, T. consotensis produced in larger amounts proteins involved in the sensing of changes in salt concentration, as well as a wide panel of transport systems for the transport of organic compatible solutes such as glutamate. We have described here a straightforward procedure in making a new environmental isolate quickly amenable to proteomics. BIOLOGICAL SIGNIFICANCE The bacterium Tistlia consotensis was isolated from a saline spring in the Colombian Andes and represents an interesting environmental model to be compared with extremophiles or other moderate organisms. To explore the halotolerance molecular mechanisms of the bacterium T. consotensis, we developed an innovative proteogenomic strategy consisting of i) genome sequencing, ii) quick annotation of the genes whose products were detected by mass spectrometry, and iii) comparative proteomics of cells grown in three salt conditions. We highlighted in this manuscript how efficient such an approach can be compared to time-consuming genome annotation when pointing at the key proteins of a given biological question. We documented a large number of proteins found produced in greater amounts when cells are cultivated in either hypo-osmotic or hyper-osmotic conditions. This article is part of a Special Issue entitled: Trends in Microbial Proteomics.
Collapse
Affiliation(s)
- Carolina Rubiano-Labrador
- Unidad de Saneamiento y Biotecnología Ambiental, Departamento de Biología, Pontificia Universidad Javeriana, POB 56710, Bogotá D.C., Colombia; Colombian Center for Genomics and Bioinformatics of Extreme Environments, GeBiX, Colombia
| | - Céline Bland
- CEA, DSV, iBEB, SBTN, Lab Biochim System Perturb, Bagnols-sur-Cèze, F-30207, France
| | - Guylaine Miotello
- CEA, DSV, iBEB, SBTN, Lab Biochim System Perturb, Bagnols-sur-Cèze, F-30207, France
| | - Philippe Guérin
- CEA, DSV, iBEB, SBTN, Lab Biochim System Perturb, Bagnols-sur-Cèze, F-30207, France
| | - Olivier Pible
- CEA, DSV, iBEB, SBTN, Lab Biochim System Perturb, Bagnols-sur-Cèze, F-30207, France
| | - Sandra Baena
- Unidad de Saneamiento y Biotecnología Ambiental, Departamento de Biología, Pontificia Universidad Javeriana, POB 56710, Bogotá D.C., Colombia; Colombian Center for Genomics and Bioinformatics of Extreme Environments, GeBiX, Colombia
| | - Jean Armengaud
- CEA, DSV, iBEB, SBTN, Lab Biochim System Perturb, Bagnols-sur-Cèze, F-30207, France.
| |
Collapse
|
19
|
Armengaud J, Christie-Oleza JA, Clair G, Malard V, Duport C. Exoproteomics: exploring the world around biological systems. Expert Rev Proteomics 2013. [PMID: 23194272 DOI: 10.1586/epr.12.52] [Citation(s) in RCA: 73] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
The term 'exoproteome' describes the protein content that can be found in the extracellular proximity of a given biological system. These proteins arise from cellular secretion, other protein export mechanisms or cell lysis, but only the most stable proteins in this environment will remain in abundance. It has been shown that these proteins reflect the physiological state of the cells in a given condition and are indicators of how living systems interact with their environments. High-throughput proteomic approaches based on a shotgun strategy, and high-resolution mass spectrometers, have modified the authors' view of exoproteomes. In the present review, the authors describe how these new approaches should be exploited to obtain the maximum useful information from a sample, whatever its origin. The methodologies used for studying secretion from model cell lines derived from eukaryotic, multicellular organisms, virulence determinants of pathogens and environmental bacteria and their relationships with their habitats are illustrated with several examples. The implication of such data, in terms of proteogenomics and the discovery of novel protein functions, is discussed.
Collapse
Affiliation(s)
- Jean Armengaud
- CEA, DSV, IBEB, Lab Biochim System Perturb, Bagnols-sur-Cèze, F-30207, France.
| | | | | | | | | |
Collapse
|
20
|
Dedieu A, Sahinovic E, Guérin P, Blanchard L, Fochesato S, Meunier B, de Groot A, Armengaud J. Major soluble proteome changes in Deinococcus deserti over the earliest stages following gamma-ray irradiation. Proteome Sci 2013; 11:3. [PMID: 23320389 PMCID: PMC3564903 DOI: 10.1186/1477-5956-11-3] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2012] [Accepted: 12/23/2012] [Indexed: 11/10/2022] Open
Abstract
UNLABELLED BACKGROUND Deinococcus deserti VCD115 has been isolated from Sahara surface sand. This radiotolerant bacterium represents an experimental model of choice to understand adaptation to harsh conditions encountered in hot arid deserts. We analysed the soluble proteome dynamics in this environmentally relevant model after exposure to 3 kGy gamma radiation, a non-lethal dose that generates massive DNA damages. For this, cells were harvested at different time lapses after irradiation and their soluble proteome contents have been analysed by 2-DE and mass spectrometry. RESULTS In the first stage of the time course we observed accumulation of DNA damage response protein DdrB (that shows the highest fold change ~11), SSB, and two different RecA proteins (RecAP and RecAC). Induction of DNA repair protein PprA, DNA damage response protein DdrD and the two gyrase subunits (GyrA and GyrB) was also detected. A response regulator of the SarP family, a type II site-specific deoxyribonuclease and a putative N-acetyltransferase are three new proteins found to be induced. In a more delayed stage, we observed accumulation of several proteins related to central metabolism and protein turn-over, as well as helicase UvrD and novel forms of both gyrase subunits differing in terms of isoelectric point and molecular weight. CONCLUSIONS Post-translational modifications of GyrA (N-terminal methionine removal and acetylation) have been evidenced and their significance discussed. We found that the Deide_02842 restriction enzyme, which is specifically found in D. deserti, is a new potential member of the radiation/desiccation response regulon, highlighting the specificities of D. deserti compared to the D. radiodurans model.
Collapse
Affiliation(s)
- Alain Dedieu
- Laboratoire de Biochimie des Systèmes Perturbés, CEA Marcoule, DSV, iBEB, SBTN, LBSP, BAGNOLS-SUR-CEZE, F-30207, France.
| | | | | | | | | | | | | | | |
Collapse
|
21
|
Abstract
A newcomer to the -omics era, proteomics, is a broad instrument-intensive research area that has advanced rapidly since its inception less than 20 years ago. Although the 'wet-bench' aspects of proteomics have undergone a renaissance with the improvement in protein and peptide separation techniques, including various improvements in two-dimensional gel electrophoresis and gel-free or off-gel protein focusing, it has been the seminal advances in MS that have led to the ascension of this field. Recent improvements in sensitivity, mass accuracy and fragmentation have led to achievements previously only dreamed of, including whole-proteome identification, and quantification and extensive mapping of specific PTMs (post-translational modifications). With such capabilities at present, one might conclude that proteomics has already reached its zenith; however, 'capability' indicates that the envisioned goals have not yet been achieved. In the present review we focus on what we perceive as the areas requiring more attention to achieve the improvements in workflow and instrumentation that will bridge the gap between capability and achievement for at least most proteomes and PTMs. Additionally, it is essential that we extend our ability to understand protein structures, interactions and localizations. Towards these ends, we briefly focus on selected methods and research areas where we anticipate the next wave of proteomic advances.
Collapse
|
22
|
Christie-Oleza JA, Piña-Villalonga JM, Guerin P, Miotello G, Bosch R, Nogales B, Armengaud J. Shotgun nanoLC-MS/MS proteogenomics to document MALDI-TOF biomarkers for screening new members of theRuegeriagenus. Environ Microbiol 2012; 15:133-47. [DOI: 10.1111/j.1462-2920.2012.02812.x] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
23
|
Abstract
High-throughput identification of proteins with the latest generation of hybrid high-resolution mass spectrometers is opening new perspectives in microbiology. I present, here, an overview of tandem mass spectrometry technology and bioinformatics for shotgun proteomics that make 2D-PAGE approaches obsolete. Non-labelling quantitative approaches have become more popular than labelling techniques on most proteomic platforms because they are easier to carry out while their quantitative outcome is rather robust. Parameters for recording mass spectrometry data, however, need to be chosen carefully and statistics to assess the confidence of the results should not be neglected. Interestingly, next-generation sequencing methodologies make any microbial model quickly amenable to proteomics, leading to the documentation of a wide range of organisms from diverse environments. Some recent discoveries made using microbial proteomics have challenged some biological dogma, such as: (i) initiation of the translation does not occur predominantly from ATG codons in some microorganisms, (ii) non-canonical initiation codons are used to regulate the production of specific but important proteins and (iii) a gene may code for multiple polypeptide species, heterogeneous in terms of sequences. Microbial diversity and microbial physiology can now be revisited by means of exhaustive comparative proteomic surveys where thousands of proteins are detected and quantified. Proteogenomics, consisting of better annotating of genomes with the help of proteomic evidence, is paving the way for integrated multi-omic approaches in microbiology. Finally, meta-proteomic tools and approaches are emerging for tackling the high complexity of the microbial world as a whole, opening new perspectives for assessing how microbial communities function.
Collapse
Affiliation(s)
- Jean Armengaud
- CEA, DSV, IBEB, Lab Biochim System Perturb, F-30207 Bagnols-sur-Cèze, France.
| |
Collapse
|
24
|
Toueille M, Mirabella B, Guérin P, Bouthier de la Tour C, Boisnard S, Nguyen HH, Blanchard L, Servant P, de Groot A, Sommer S, Armengaud J. A comparative proteomic approach to better define Deinococcus nucleoid specificities. J Proteomics 2012; 75:2588-600. [DOI: 10.1016/j.jprot.2012.03.002] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2012] [Revised: 02/23/2012] [Accepted: 03/01/2012] [Indexed: 10/28/2022]
|
25
|
Translational plant proteomics: a perspective. J Proteomics 2012; 75:4588-601. [PMID: 22516432 DOI: 10.1016/j.jprot.2012.03.055] [Citation(s) in RCA: 59] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2011] [Revised: 02/25/2012] [Accepted: 03/25/2012] [Indexed: 11/21/2022]
Abstract
Translational proteomics is an emerging sub-discipline of the proteomics field in the biological sciences. Translational plant proteomics aims to integrate knowledge from basic sciences to translate it into field applications to solve issues related but not limited to the recreational and economic values of plants, food security and safety, and energy sustainability. In this review, we highlight the substantial progress reached in plant proteomics during the past decade which has paved the way for translational plant proteomics. Increasing proteomics knowledge in plants is not limited to model and non-model plants, proteogenomics, crop improvement, and food analysis, safety, and nutrition but to many more potential applications. Given the wealth of information generated and to some extent applied, there is the need for more efficient and broader channels to freely disseminate the information to the scientific community. This article is part of a Special Issue entitled: Translational Proteomics.
Collapse
|
26
|
Peterson ES, McCue LA, Schrimpe-Rutledge AC, Jensen JL, Walker H, Kobold MA, Webb SR, Payne SH, Ansong C, Adkins JN, Cannon WR, Webb-Robertson BJM. VESPA: software to facilitate genomic annotation of prokaryotic organisms through integration of proteomic and transcriptomic data. BMC Genomics 2012; 13:131. [PMID: 22480257 PMCID: PMC3364912 DOI: 10.1186/1471-2164-13-131] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2011] [Accepted: 04/05/2012] [Indexed: 11/10/2022] Open
Abstract
Background The procedural aspects of genome sequencing and assembly have become relatively inexpensive, yet the full, accurate structural annotation of these genomes remains a challenge. Next-generation sequencing transcriptomics (RNA-Seq), global microarrays, and tandem mass spectrometry (MS/MS)-based proteomics have demonstrated immense value to genome curators as individual sources of information, however, integrating these data types to validate and improve structural annotation remains a major challenge. Current visual and statistical analytic tools are focused on a single data type, or existing software tools are retrofitted to analyze new data forms. We present Visual Exploration and Statistics to Promote Annotation (VESPA) is a new interactive visual analysis software tool focused on assisting scientists with the annotation of prokaryotic genomes though the integration of proteomics and transcriptomics data with current genome location coordinates. Results VESPA is a desktop Java™ application that integrates high-throughput proteomics data (peptide-centric) and transcriptomics (probe or RNA-Seq) data into a genomic context, all of which can be visualized at three levels of genomic resolution. Data is interrogated via searches linked to the genome visualizations to find regions with high likelihood of mis-annotation. Search results are linked to exports for further validation outside of VESPA or potential coding-regions can be analyzed concurrently with the software through interaction with BLAST. VESPA is demonstrated on two use cases (Yersinia pestis Pestoides F and Synechococcus sp. PCC 7002) to demonstrate the rapid manner in which mis-annotations can be found and explored in VESPA using either proteomics data alone, or in combination with transcriptomic data. Conclusions VESPA is an interactive visual analytics tool that integrates high-throughput data into a genomic context to facilitate the discovery of structural mis-annotations in prokaryotic genomes. Data is evaluated via visual analysis across multiple levels of genomic resolution, linked searches and interaction with existing bioinformatics tools. We highlight the novel functionality of VESPA and core programming requirements for visualization of these large heterogeneous datasets for a client-side application. The software is freely available at https://www.biopilot.org/docs/Software/Vespa.php.
Collapse
Affiliation(s)
- Elena S Peterson
- Scientific Data Management, Pacific Northwest National Laboratory, Richland, WA, USA
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
27
|
Christie-Oleza JA, Miotello G, Armengaud J. High-throughput proteogenomics of Ruegeria pomeroyi: seeding a better genomic annotation for the whole marine Roseobacter clade. BMC Genomics 2012; 13:73. [PMID: 22336032 PMCID: PMC3305630 DOI: 10.1186/1471-2164-13-73] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2011] [Accepted: 02/15/2012] [Indexed: 11/10/2022] Open
Abstract
Background The structural and functional annotation of genomes is now heavily based on data obtained using automated pipeline systems. The key for an accurate structural annotation consists of blending similarities between closely related genomes with biochemical evidence of the genome interpretation. In this work we applied high-throughput proteogenomics to Ruegeria pomeroyi, a member of the Roseobacter clade, an abundant group of marine bacteria, as a seed for the annotation of the whole clade. Results A large dataset of peptides from R. pomeroyi was obtained after searching over 1.1 million MS/MS spectra against a six-frame translated genome database. We identified 2006 polypeptides, of which thirty-four were encoded by open reading frames (ORFs) that had not previously been annotated. From the pool of 'one-hit-wonders', i.e. those ORFs specified by only one peptide detected by tandem mass spectrometry, we could confirm the probable existence of five additional new genes after proving that the corresponding RNAs were transcribed. We also identified the most-N-terminal peptide of 486 polypeptides, of which sixty-four had originally been wrongly annotated. Conclusions By extending these re-annotations to the other thirty-six Roseobacter isolates sequenced to date (twenty different genera), we propose the correction of the assigned start codons of 1082 homologous genes in the clade. In addition, we also report the presence of novel genes within operons encoding determinants of the important tricarboxylic acid cycle, a feature that seems to be characteristic of some Roseobacter genomes. The detection of their corresponding products in large amounts raises the question of their function. Their discoveries point to a possible theory for protein evolution that will rely on high expression of orphans in bacteria: their putative poor efficiency could be counterbalanced by a higher level of expression. Our proteogenomic analysis will increase the reliability of the future annotation of marine bacterial genomes.
Collapse
|
28
|
Chao TC, Hansmeier N. The current state of microbial proteomics: Where we are and where we want to go. Proteomics 2012; 12:638-50. [DOI: 10.1002/pmic.201100381] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2011] [Revised: 08/15/2011] [Accepted: 08/22/2011] [Indexed: 11/11/2022]
|
29
|
Clair G, Armengaud J, Duport C. Restricting fermentative potential by proteome remodeling: an adaptive strategy evidenced in Bacillus cereus. Mol Cell Proteomics 2012; 11:M111.013102. [PMID: 22232490 DOI: 10.1074/mcp.m111.013102] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
Pathogenesis hinges on successful colonization of the gastrointestinal (GI) tract by pathogenic facultative anaerobes. The GI tract is a carbohydrate-limited environment with varying oxygen availability and oxidoreduction potential (ORP). How pathogenic bacteria are able to adapt and grow in these varying conditions remains a key fundamental question. Here, we designed a system biology-inspired approach to pinpoint the key regulators allowing Bacillus cereus to survive and grow efficiently under low ORP anoxic conditions mimicking those encountered in the intestinal lumen. We assessed the proteome components using high throughput nanoLC-MS/MS techniques, reconstituted the main metabolic circuits, constructed ΔohrA and ΔohrR mutants, and analyzed the impacts of ohrA and ohrR disruptions by a novel round of shotgun proteomics. Our study revealed that OhrR and OhrA are crucial to the successful adaptation of B. cereus to the GI tract environment. Specifically, we showed that B. cereus restricts its fermentative growth under low ORP anaerobiosis and sustains efficient aerobic respiratory metabolism, motility, and stress response via OhrRA-dependent proteome remodeling. Finally, our results introduced a new adaptive strategy where facultative anaerobes prefer to restrict their fermentative potential for a long term benefit.
Collapse
Affiliation(s)
- Gérémy Clair
- Université d'Avignon et des Pays de Vaucluse, UMR408, Sécurité et Qualité des Produits d'Origine Végétale, F-84000 Avignon, France
| | | | | |
Collapse
|
30
|
Christie-Oleza JA, Piña-Villalonga JM, Bosch R, Nogales B, Armengaud J. Comparative proteogenomics of twelve Roseobacter exoproteomes reveals different adaptive strategies among these marine bacteria. Mol Cell Proteomics 2011; 11:M111.013110. [PMID: 22122883 DOI: 10.1074/mcp.m111.013110] [Citation(s) in RCA: 72] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Roseobacters are generalist bacteria abundantly found in the oceans. Because little is known on how marine microorganisms interact in association or competition, we focused our attention on the microbial exoproteome, a key component in their interaction with extracellular milieu. Here we present a comparative analysis of the theoretically encoded exoproteome of twelve members of the Roseobacter group validated by extensive comparative proteogenomics. In silico analysis revealed that 30% of the encoded proteome of these microorganisms could be exported. The ratio of the different protein categories varied in accordance to the ecological distinctness of each strain, a trait reinforced by quantitative proteomics data. Despite the interspecies variations found, the most abundantly detected proteins by shotgun proteomics were from transporter, adhesion, motility, and toxin-like protein categories, defining four different plausible adaptive strategies within the Roseobacter group. In some strains the toxin-secretion strategy was over-represented with repeats-in-toxin-like proteins. Our results show that exoproteomes strongly depend on bacterial trophic strategy and can slightly change because of culture conditions. Simulated natural conditions and the effect of the indigenous microbial community on the exoproteome of Ruegeria pomeroyi DSS-3 were also assayed. Interestingly, we observed a significant depletion of the toxin-like proteins usually secreted by R. pomeroyi DSS-3 when grown in presence of a natural community sampled from a Mediterranean Sea port. The significance of this specific fraction of the exoproteome is discussed.
Collapse
|
31
|
Archakov A, Aseev A, Bykov V, Grigoriev A, Govorun V, Ivanov V, Khlunov A, Lisitsa A, Mazurenko S, Makarov AA, Ponomarenko E, Sagdeev R, Skryabin K. Gene-centric view on the human proteome project: the example of the Russian roadmap for chromosome 18. Proteomics 2011; 11:1853-6. [PMID: 21563312 DOI: 10.1002/pmic.201000540] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
During the 2010 Human Proteome Organization Congress in Sydney, a gene-centric approach emerged as a feasible and tractable scaffold for assemblage of the Human Proteome Project. Bringing the gene-centric principle into practice, a roadmap for the 18th chromosome was drafted, postulating the limited sensitivity of analytical methods, as a serious bottleneck in proteomics. In the context of the sensitivity problem, we refer to the "copy number of protein molecules" as a measurable assessment of protein abundance. The roadmap is focused on the development of technology to attain the low- and ultralow -"copied" portion of the proteome. Roadmap merges the genomic, transcriptomic and proteomic levels to identify the majority of 285 proteins from 18th chromosome - master proteins. Master protein is the primary translation of the coding sequence and resembling at least one of the known isoforms, coded by the gene. The executive phase of the roadmap includes the expansion of the study of the master proteins with alternate splicing, single amino acid polymorphisms (SAPs) and post-translational modifications. In implementing the roadmap, Russian scientists are expecting to establish proteomic technologies for integrating MS and atomic force microscopy (AFM). These technologies are anticipated to unlock the value of new biomarkers at a detection limit of 10(-18) M, i.e. 1 protein copy per 1 μL of plasma. The roadmap plan is posted at www.proteome.ru/en/roadmap/ and a forum for discussion of the document is supported.
Collapse
Affiliation(s)
- Alexander Archakov
- Orekhovich Institute of Biomedical Chemistry, Russian Academy of Medical Sciences (RAMS), Moscow, Russia.
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
32
|
Proteogenomics and in silico structural and functional annotation of the barley powdery mildew Blumeria graminis f. sp. hordei. Methods 2011; 54:432-41. [DOI: 10.1016/j.ymeth.2011.03.006] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2011] [Revised: 03/08/2011] [Accepted: 03/16/2011] [Indexed: 02/05/2023] Open
|
33
|
Helmy M, Tomita M, Ishihama Y. OryzaPG-DB: rice proteome database based on shotgun proteogenomics. BMC PLANT BIOLOGY 2011; 11:63. [PMID: 21486466 PMCID: PMC3094275 DOI: 10.1186/1471-2229-11-63] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/18/2010] [Accepted: 04/12/2011] [Indexed: 05/21/2023]
Abstract
BACKGROUND Proteogenomics aims to utilize experimental proteome information for refinement of genome annotation. Since mass spectrometry-based shotgun proteomics approaches provide large-scale peptide sequencing data with high throughput, a data repository for shotgun proteogenomics would represent a valuable source of gene expression evidence at the translational level for genome re-annotation. DESCRIPTION Here, we present OryzaPG-DB, a rice proteome database based on shotgun proteogenomics, which incorporates the genomic features of experimental shotgun proteomics data. This version of the database was created from the results of 27 nanoLC-MS/MS runs on a hybrid ion trap-orbitrap mass spectrometer, which offers high accuracy for analyzing tryptic digests from undifferentiated cultured rice cells. Peptides were identified by searching the product ion spectra against the protein, cDNA, transcript and genome databases from Michigan State University, and were mapped to the rice genome. Approximately 3200 genes were covered by these peptides and 40 of them contained novel genomic features. Users can search, download or navigate the database per chromosome, gene, protein, cDNA or transcript and download the updated annotations in standard GFF3 format, with visualization in PNG format. In addition, the database scheme of OryzaPG was designed to be generic and can be reused to host similar proteogenomic information for other species. OryzaPG is the first proteogenomics-based database of the rice proteome, providing peptide-based expression profiles, together with the corresponding genomic origin, including the annotation of novelty for each peptide. CONCLUSIONS The OryzaPG database was constructed and is freely available at http://oryzapg.iab.keio.ac.jp/.
Collapse
Affiliation(s)
- Mohamed Helmy
- Institute for Advanced Biosciences, Keio University, 403-1 Daihoji, Tsuruoka, Yamagata 997-0017, Japan
- Systems Biology Program, Graduate School of Media and Governance, Keio University, 5322 Endo, Fujisawa, Kanagawa 252-0882, Japan
| | - Masaru Tomita
- Institute for Advanced Biosciences, Keio University, 403-1 Daihoji, Tsuruoka, Yamagata 997-0017, Japan
| | - Yasushi Ishihama
- Institute for Advanced Biosciences, Keio University, 403-1 Daihoji, Tsuruoka, Yamagata 997-0017, Japan
- Graduate School of Pharmaceutical Sciences, Kyoto University, Sakyo-ku, Kyoto 606-8501, Japan
| |
Collapse
|
34
|
Reddy PJ, Jain R, Paik YK, Downey R, Ptolemy AS, Ozdemir V, Srivastava S. Personalized Medicine in the Age of Pharmacoproteomics: A Close up on India and Need for Social Science Engagement for Responsible Innovation in Post-Proteomic Biology. CURRENT PHARMACOGENOMICS AND PERSONALIZED MEDICINE 2011; 9:67-75. [PMID: 22279515 PMCID: PMC3264661 DOI: 10.2174/187569211794728850] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Affiliation(s)
- Panga Jaipal Reddy
- Wadhwani Research Center for Biosciences and Bioengineering, Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay, Powai, Mumbai, India
| | - Rekha Jain
- Wadhwani Research Center for Biosciences and Bioengineering, Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay, Powai, Mumbai, India
| | - Young-Ki Paik
- Yonsei Proteome Research Center, Biomedical Proteome Research Center, and Department of Biomedical Sciences, World Class University Program, Yonsei University, Seoul, Korea
| | | | - Adam S. Ptolemy
- Department of Pathology, Brigham and Women’s Hospital, Boston, MA, USA
| | - Vural Ozdemir
- Centre of Genomics and Policy, Department of Human Genetics, Faculty of Medicine, McGill University, Montreal, QC, Canada
| | - Sanjeeva Srivastava
- Wadhwani Research Center for Biosciences and Bioengineering, Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay, Powai, Mumbai, India
| |
Collapse
|
35
|
Ozdemir V, Armengaud J, Dubé L, Aziz RK, Knoppers BM. Nutriproteomics and Proteogenomics: Cultivating Two Novel Hybrid Fields of Personalized Medicine with Added Societal Value. ACTA ACUST UNITED AC 2010; 8:240-244. [PMID: 21399751 DOI: 10.2174/187569210793368230] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Affiliation(s)
- Vural Ozdemir
- Centre of Genomics and Policy, Department of Human Genetics, Faculty of Medicine, McGill University, Montreal, QC, Canada
| | | | | | | | | |
Collapse
|
36
|
Dedieu A, Gaillard JC, Pourcher T, Darrouzet E, Armengaud J. Revisiting iodination sites in thyroglobulin with an organ-oriented shotgun strategy. J Biol Chem 2010; 286:259-69. [PMID: 20978121 DOI: 10.1074/jbc.m110.159483] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Thyroglobulin (Tg) is secreted by thyroid epithelial cells. It is essential for thyroid hormonogenesis and iodine storage. Although studied for many years, only indirect and partial surveys of its post-translational modifications were reported. Here, we present a direct proteomic approach, used to study the degree of iodination of mouse Tg without any preliminary purification. A comprehensive coverage of Tg was obtained using a combination of different proteases, MS/MS fragmentation procedures with inclusion lists and a hybrid mass high-resolution LTQ-Orbitrap XL mass spectrometer. Although only 16 iodinated sites are currently known for human Tg, we uncovered 37 iodinated tyrosine residues, most of them being mono- or diiodinated. We report the specific isotopic pattern of thyroxine modification, not recognized as a normal peptide pattern. Four hormonogenic sites were detected. Two donor sites were identified through the detection of a pyruvic acid residue in place of the initial tyrosine. Evidence for polypeptide cleavages sites due to the action of cathepsins and dipeptidyl proteases in the thyroid were also detected. This work shows that semi-quantitation of Tg iodination states is feasible for human biopsies and should be of significant medical interest for further characterization of human thyroid pathologies.
Collapse
Affiliation(s)
- Alain Dedieu
- Commissariat à l'Energie Atomique, DSV, iBEB, Laboratoire des Transporters en Imagerie et Radiothérapie en Oncologie, Bagnols-sur-Cèze F-30207, France.
| | | | | | | | | |
Collapse
|
37
|
Klubicová K, Danchenko M, Skultety L, Miernyk JA, Rashydov NM, Berezhna VV, Pret'ová A, Hajduch M. Proteomics analysis of flax grown in Chernobyl area suggests limited effect of contaminated environment on seed proteome. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2010; 44:6940-6. [PMID: 20715763 DOI: 10.1021/es100895s] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
The accident at the Chernobyl Nuclear Power Plant (CNPP) on April 26, 1986 is the most serious nuclear disaster in human history. Surprisingly, while the area proximal to the CNPP remains substantially contaminated with long-lived radioisotopes including (90)Sr and (137)Cs, the local ecosystem has been able to adapt. To evaluate plant adaptation, seeds of a local flax (Linum usitatissimum) variety Kyivskyi were sown in radio-contaminated and control fields of the Chernobyl region. A total protein fraction was isolated from mature seeds, and analyzed using 2-dimensional electrophoresis combined with tandem-mass spectrometry. Interestingly, growth of the plants in the radio-contaminated environment had little effect on proteome and only 35 protein spots differed in abundance (p-value of ≤0.05) out of 720 protein spots that were quantified for seeds harvested from both radio-contaminated and control fields. Of the 35 differentially abundant spots, 28 proteins were identified using state-of-the-art MS(E) method. Based on the observed changes, the proteome of seeds from plants grown in radio-contaminated soil display minor adjustments to multiple signaling pathways.
Collapse
Affiliation(s)
- Katarína Klubicová
- Institute of Plant Genetics and Biotechnology, Slovak Academy of Sciences, Nitra, Slovakia
| | | | | | | | | | | | | | | |
Collapse
|
38
|
In-depth analysis of exoproteomes from marine bacteria by shotgun liquid chromatography-tandem mass spectrometry: the Ruegeria pomeroyi DSS-3 case-study. Mar Drugs 2010; 8:2223-39. [PMID: 20948905 PMCID: PMC2953401 DOI: 10.3390/md8082223] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2010] [Revised: 07/27/2010] [Accepted: 07/28/2010] [Indexed: 12/16/2022] Open
Abstract
Microorganisms secrete into their extracellular environment numerous compounds that are required for their survival. Many of these compounds could be of great interest for biotechnology applications and their genes used in synthetic biology design. The secreted proteins and the components of the translocation systems themselves can be scrutinized in-depth by the most recent proteomic tools. While the secretomes of pathogens are well-documented, those of non-pathogens remain largely to be established. Here, we present the analysis of the exoproteome from the marine bacterium Ruegeria pomeroyi DSS-3 grown in standard laboratory conditions. We used a shotgun approach consisting of trypsin digestion of the exoproteome, and identification of the resulting peptides by liquid chromatography coupled to tandem mass spectrometry. Three different proteins that have domains homologous to those observed in RTX toxins were uncovered and were semi-quantified as the most abundantly secreted proteins. One of these proteins clearly stands out from the catalogue, representing over half of the total exoproteome. We also listed many soluble proteins related to ABC and TRAP transporters implied in the uptake of nutrients. The Ruegeria pomeroyi DSS-3 case-study illustrates the power of the shotgun nano-LC-MS/MS strategy to decipher the exoproteome from marine bacteria and to contribute to environmental proteomics.
Collapse
|