151
|
Warren AS, Archuleta J, Feng WC, Setubal JC. Missing genes in the annotation of prokaryotic genomes. BMC Bioinformatics 2010; 11:131. [PMID: 20230630 PMCID: PMC3098052 DOI: 10.1186/1471-2105-11-131] [Citation(s) in RCA: 74] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2009] [Accepted: 03/15/2010] [Indexed: 12/04/2022] Open
Abstract
Background Protein-coding gene detection in prokaryotic genomes is considered a much simpler problem than in intron-containing eukaryotic genomes. However there have been reports that prokaryotic gene finder programs have problems with small genes (either over-predicting or under-predicting). Therefore the question arises as to whether current genome annotations have systematically missing, small genes. Results We have developed a high-performance computing methodology to investigate this problem. In this methodology we compare all ORFs larger than or equal to 33 aa from all fully-sequenced prokaryotic replicons. Based on that comparison, and using conservative criteria requiring a minimum taxonomic diversity between conserved ORFs in different genomes, we have discovered 1,153 candidate genes that are missing from current genome annotations. These missing genes are similar only to each other and do not have any strong similarity to gene sequences in public databases, with the implication that these ORFs belong to missing gene families. We also uncovered 38,895 intergenic ORFs, readily identified as putative genes by similarity to currently annotated genes (we call these absent annotations). The vast majority of the missing genes found are small (less than 100 aa). A comparison of select examples with GeneMark, EasyGene and Glimmer predictions yields evidence that some of these genes are escaping detection by these programs. Conclusions Prokaryotic gene finders and prokaryotic genome annotations require improvement for accurate prediction of small genes. The number of missing gene families found is likely a lower bound on the actual number, due to the conservative criteria used to determine whether an ORF corresponds to a real gene.
Collapse
Affiliation(s)
- Andrew S Warren
- Virginia Bioinformatics Institute, Virginia Tech, Blacksburg, VA, USA.
| | | | | | | |
Collapse
|
152
|
Guccione E, Hitchcock A, Hall SJ, Mulholland F, Shearer N, van Vliet AHM, Kelly DJ. Reduction of fumarate, mesaconate and crotonate by Mfr, a novel oxygen-regulated periplasmic reductase inCampylobacter jejuni. Environ Microbiol 2010; 12:576-91. [DOI: 10.1111/j.1462-2920.2009.02096.x] [Citation(s) in RCA: 56] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
153
|
Small stress response proteins in Escherichia coli: proteins missed by classical proteomic studies. J Bacteriol 2010; 192:46-58. [PMID: 19734316 DOI: 10.1128/jb.00872-09] [Citation(s) in RCA: 123] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Proteins of 50 or fewer amino acids are poorly characterized in all organisms. The corresponding genes are challenging to reliably annotate, and it is difficult to purify and characterize the small protein products. Due to these technical limitations, little is known about the abundance of small proteins, not to mention their biological functions. To begin to characterize these small proteins in Escherichia coli, we assayed their accumulation under a variety of growth conditions and after exposure to stress. We found that many small proteins accumulate under specific growth conditions or are stress induced. For some genes, the observed changes in protein levels were consistent with known transcriptional regulation, such as ArcA activation of the operons encoding yccB and ybgT. However, we also identified novel regulation, such as Zur repression of ykgMO, cyclic AMP response protein (CRP) repression of azuC, and CRP activation of ykgR. The levels of 11 small proteins increase after heat shock, and induction of at least 1 of these, YobF, occurs at a posttranscriptional level. These results show that small proteins are an overlooked subset of stress response proteins in E. coli and provide information that will be valuable for determining the functions of these proteins.
Collapse
|
154
|
Small RNAs and small proteins involved in resistance to cell envelope stress and acid shock in Escherichia coli: analysis of a bar-coded mutant collection. J Bacteriol 2010; 192:59-67. [PMID: 19734312 DOI: 10.1128/jb.00873-09] [Citation(s) in RCA: 81] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
More than 80 small regulatory RNAs (sRNAs) and 60 proteins of 16 to 50 amino acids (small proteins) are encoded in the Escherichia coli genome. The vast majority of the corresponding genes have no known function. We screened 125 DNA bar-coded mutants to identify novel cell envelope stress and acute acid shock phenotypes associated with deletions of genes coding for sRNAs and small proteins. Nine deletion mutants (ssrA, micA, ybaM, ryeF, yqcG, sroH, ybhT, yobF, and glmY) were sensitive to cell envelope stress and two were resistant (rybB and blr). Deletion mutants of genes coding for four small proteins (yqgB, mgrB, yobF, and yceO) were sensitive to acute acid stress. We confirmed each of these phenotypes in one-on-one competition assays against otherwise-wild-type lacZ mutant cells. A more detailed investigation of the SsrA phenotype suggests that ribosome release is critical for resistance to cell envelope stress. The bar-coded deletion collection we generated can be screened for sensitivity or resistance to virtually any stress condition.
Collapse
|
155
|
Global approaches for finding small RNA and small open reading frame functions. J Bacteriol 2010; 192:26-8. [PMID: 19854892 DOI: 10.1128/jb.01316-09] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
|
156
|
Kim W, Silby MW, Purvine SO, Nicoll JS, Hixson KK, Monroe M, Nicora CD, Lipton MS, Levy SB. Proteomic detection of non-annotated protein-coding genes in Pseudomonas fluorescens Pf0-1. PLoS One 2009; 4:e8455. [PMID: 20041161 PMCID: PMC2794547 DOI: 10.1371/journal.pone.0008455] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2009] [Accepted: 12/02/2009] [Indexed: 11/18/2022] Open
Abstract
Genome sequences are annotated by computational prediction of coding sequences, followed by similarity searches such as BLAST, which provide a layer of possible functional information. While the existence of processes such as alternative splicing complicates matters for eukaryote genomes, the view of bacterial genomes as a linear series of closely spaced genes leads to the assumption that computational annotations that predict such arrangements completely describe the coding capacity of bacterial genomes. We undertook a proteomic study to identify proteins expressed by Pseudomonas fluorescens Pf0-1 from genes that were not predicted during the genome annotation. Mapping peptides to the Pf0-1 genome sequence identified sixteen non-annotated protein-coding regions, of which nine were antisense to predicted genes, six were intergenic, and one read in the same direction as an annotated gene but in a different frame. The expression of all but one of the newly discovered genes was verified by RT-PCR. Few clues as to the function of the new genes were gleaned from informatic analyses, but potential orthologs in other Pseudomonas genomes were identified for eight of the new genes. The 16 newly identified genes improve the quality of the Pf0-1 genome annotation, and the detection of antisense protein-coding genes indicates the under-appreciated complexity of bacterial genome organization.
Collapse
Affiliation(s)
- Wook Kim
- Center for Adaptation Genetics and Drug Resistance and Department of Molecular Biology and Microbiology, Tufts University School of Medicine, Boston, Massachusetts, United States of America
| | - Mark W. Silby
- Center for Adaptation Genetics and Drug Resistance and Department of Molecular Biology and Microbiology, Tufts University School of Medicine, Boston, Massachusetts, United States of America
| | - Sam O. Purvine
- Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - Julie S. Nicoll
- Center for Adaptation Genetics and Drug Resistance and Department of Molecular Biology and Microbiology, Tufts University School of Medicine, Boston, Massachusetts, United States of America
| | - Kim K. Hixson
- Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - Matt Monroe
- Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - Carrie D. Nicora
- Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - Mary S. Lipton
- Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - Stuart B. Levy
- Center for Adaptation Genetics and Drug Resistance and Department of Molecular Biology and Microbiology, Tufts University School of Medicine, Boston, Massachusetts, United States of America
- * E-mail:
| |
Collapse
|
157
|
Lippa AM, Goulian M. Feedback inhibition in the PhoQ/PhoP signaling system by a membrane peptide. PLoS Genet 2009; 5:e1000788. [PMID: 20041203 PMCID: PMC2789325 DOI: 10.1371/journal.pgen.1000788] [Citation(s) in RCA: 165] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2009] [Accepted: 11/24/2009] [Indexed: 01/09/2023] Open
Abstract
The PhoQ/PhoP signaling system responds to low magnesium and the presence of certain cationic antimicrobial peptides. It regulates genes important for growth under these conditions, as well as additional genes important for virulence in many gram-negative pathogens. PhoQ is a sensor kinase that phosphorylates and activates the transcription factor PhoP. Since feedback inhibition is a common theme in stress-response circuits, we hypothesized that some members of the PhoP regulon may play such a role in the PhoQ/PhoP pathway. We therefore screened for PhoP-regulated genes that mediate feedback in this system. We found that deletion of mgrB (yobG), which encodes a 47 amino acid peptide, results in a potent increase in PhoP-regulated transcription. In addition, over-expression of mgrB decreased transcription at both high and low concentrations of magnesium. Localization and bacterial two-hybrid studies suggest that MgrB resides in the inner-membrane and interacts directly with PhoQ. We further show that MgrB homologs from Salmonella typhimurium and Yersinia pestis also repress PhoP-regulated transcription in these organisms. In cell regulatory circuits, feedback has been associated with modulating the induction kinetics and/or the cell-to-cell variability in response to stimulus. Interestingly, we found that elimination of MgrB-mediated feedback did not have a significant effect on the kinetics of reporter protein production and did not decrease the variability in expression among cells. Our results indicate MgrB is a broadly conserved membrane peptide that is a critical mediator of negative feedback in the PhoQ/PhoP circuit. This new regulator may function as a point of control that integrates additional input signals to modulate the activity of this important signaling system. The proteins PhoQ and PhoP comprise an environmental sensing system that has been extensively studied in numerous bacteria, including Salmonella typhimurium and Escherichia coli. The PhoQ/PhoP system is stimulated by conditions of low extracellular magnesium or the presence of certain cationic antimicrobial peptides; and it controls genes, whose protein products protect the cell under these conditions or play other critical roles in regulating the virulence of pathogens. The functions of many members of the PhoP regulon, however, remain uncharacterized. This leaves open the possibility that some PhoP-regulated genes may mediate feedback in this system. Regulatory circuits that allow adaptation to environmental change often make use of negative feedback to achieve the appropriate level of response. To look for negative feedback, we screened knockouts of PhoP-regulated genes in E. coli. We have identified a remarkably small membrane protein of just 47 amino acids that mediates potent negative feedback on the PhoQ/PhoP circuit in E. coli, S. typhimurium, Yersinia pestis, and likely other related bacteria. This represents a striking example of a small, easily-overlooked open reading frame that plays a critical role in regulating a broadly conserved signal transduction pathway.
Collapse
Affiliation(s)
- Andrew M. Lippa
- Cell and Molecular Biology Graduate Group, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania, United States of America
| | - Mark Goulian
- Cell and Molecular Biology Graduate Group, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania, United States of America
- Department of Biology, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
- * E-mail:
| |
Collapse
|
158
|
Deep sequencing analysis of the Methanosarcina mazei Gö1 transcriptome in response to nitrogen availability. Proc Natl Acad Sci U S A 2009; 106:21878-82. [PMID: 19996181 DOI: 10.1073/pnas.0909051106] [Citation(s) in RCA: 149] [Impact Index Per Article: 9.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
Abstract
Methanosarcina mazei and related mesophilic archaea are the only organisms fermenting acetate, methylamines, and methanol to methane and carbon dioxide, contributing significantly to greenhouse gas production. The biochemistry of these metabolic processes is well studied, and genome sequences are available, yet little is known about the overall transcriptional organization and the noncoding regions representing 25% of the 4.01-Mb genome of M. mazei. We present a genome-wide analysis of transcription start sites (TSS) in M. mazei grown under different nitrogen availabilities. Pyrosequencing-based differential analysis of primary vs. processed 5' ends of transcripts discovered 876 TSS across the M. mazei genome. Unlike in other archaea, in which leaderless mRNAs are prevalent, the majority of the detected mRNAs in M. mazei carry long untranslated 5' regions. Our experimental data predict a total of 208 small RNA (sRNA) candidates, mostly from intergenic regions but also antisense to 5' and 3' regions of mRNAs. In addition, 40 new small mRNAs with ORFs of < or = 30 aa were identified, some of which might have dual functions as mRNA and regulatory sRNA. We confirmed differential expression of several sRNA genes in response to nitrogen availability. Inspection of their promoter regions revealed a unique conserved sequence motif associated with nitrogen-responsive regulation, which might serve as a regulator binding site upstream of the common IIB recognition element. Strikingly, several sRNAs antisense to mRNAs encoding transposases indicate nitrogen-dependent transposition events. This global TSS map in archaea will facilitate a better understanding of transcriptional and posttranscriptional control in the third domain of life.
Collapse
|
159
|
Overgaard M, Kallipolitis B, Valentin-Hansen P. Modulating the bacterial surface with small RNAs: a new twist on PhoP/Q-mediated lipopolysaccharide modification. Mol Microbiol 2009; 74:1289-94. [PMID: 19906183 DOI: 10.1111/j.1365-2958.2009.06943.x] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Summary In recent years, small non-coding RNAs have emerged as important regulatory components in bacterial stress responses and in bacterial virulence. Many of these are conserved in related species and act on target mRNAs by sequence complementarity. They are tightly controlled at the transcription level, and are frequently elements of global regulatory systems. In Escherichia coli and Salmonella, almost one-third of the functional characterized small RNAs participate in control of outer membrane protein production. A subset of these genes is under the control of the sigma(E)-signalling system that monitors the folding status of the envelope and interacts with other regulatory systems to integrate multiple signals into a co-ordinated cellular response. In this issue of Molecular Microbiology, Moon and Gottesman describe a novel baseparing small RNA that participates in modulation of bacterial surface properties by regulating lipopolysaccharide modification. The small RNA is expressed as part of the PhoP/PhoQ two-component system that plays a major role in virulence of pathogenic species. This work expands the list of global regulators known to control small RNA expression in enterobacteria and reinforces the idea that one central role of bacterial small regulatory RNAs is to modulate and fine-tune cell surface composition and structure.
Collapse
Affiliation(s)
- Martin Overgaard
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, Campusvej 55, DK-5230 Odense M, Denmark
| | | | | |
Collapse
|
160
|
Moon K, Gottesman S. A PhoQ/P-regulated small RNA regulates sensitivity of Escherichia coli to antimicrobial peptides. Mol Microbiol 2009; 74:1314-30. [PMID: 19889087 DOI: 10.1111/j.1365-2958.2009.06944.x] [Citation(s) in RCA: 127] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Non-coding small RNAs (sRNAs) play a major role in post-transcriptional regulation of gene expression. Of the 80 sRNAs that have been identified in E. coli, one-third bind to the RNA chaperone Hfq. Hfq both stabilizes these sRNAs in vivo and stimulates pairing to targets in vitro. A novel Hfq-dependent RNA, called here MgrR, was identified by its ability to bind Hfq. Expression of MgrR requires the PhoQ/PhoP two-component system; the PhoP response regulator is active under low Mg2+ concentrations and is an important virulence regulator in Salmonella; mgrR is also found in Salmonella species. Negatively regulated targets of MgrR identified using microarrays include eptB, involved in lipopolysaccharide (LPS) modification, and ygdQ, encoding a hypothetical protein. Cell sensitivity to the antimicrobial polymyxin B is affected by LPS modifications, and cells carrying an mgrR deletion were approximately 10 times more resistant than wild-type cells to polymyxin B. Thus, lower Mg2+ concentrations, sensed by PhoQ/PhoP, lead to expression of MgrR, changing LPS. sRNAs have previously been shown to regulate many outer membrane proteins. This work demonstrates that LPS, a major contributor of bacterial interactions with mammalian cells, is also subject to regulation by sRNAs.
Collapse
Affiliation(s)
- Kyung Moon
- Laboratory of Molecular Biology, Center for Cancer Research, National Cancer Institute, Bethesda, MD 20892, USA
| | | |
Collapse
|
161
|
Abstract
Identification of short coding sequences is challenging, both experimentally and in silico, and functional natural peptides (< 50 amino acids) have to a large extent been overlooked in Gram-negative bacteria. Recent results have converged to highlight the role of hydrophobic peptides that form a novel class of active molecules in Escherichia coli and Salmonella enterica serovar Typhimurium. These peptides can play a regulatory role by interacting with protein partners at the inner membrane and by modulating protein partner activity or stability. Genome-wide analyses in both bacterial species have identified several conserved short open reading frames encoding a single transmembrane segment. We discuss the known and predicted membrane-associated peptides and the tools for their identification. Besides the identification of novel regulatory networks, characterization of peptides with a single transmembrane helix segment and proteins that interact with them provides a powerful opportunity to study interactions between alpha helices within biological membranes. In addition, some bioactive membrane peptides could provide a basis for engineering membrane protein antagonists.
Collapse
Affiliation(s)
- Eric Alix
- Inserm, ESPRI 26, Avenue J. F. Kennedy, 30908 Nîmes cedex 02, France
| | | |
Collapse
|