101
|
Abstract
Many viruses encode short transmembrane proteins that play vital roles in virus replication or virulence. Because many of these proteins are less than 50 amino acids long and not homologous to cellular proteins, their open reading frames were often overlooked during the initial annotation of viral genomes. Some of these proteins oligomerize in membranes and form ion channels. Other miniproteins bind to cellular transmembrane proteins and modulate their activity, whereas still others have an unknown mechanism of action. Based on the underlying principles of transmembrane miniprotein structure, it is possible to build artificial small transmembrane proteins that modulate a variety of biological processes. These findings suggest that short transmembrane proteins provide a versatile mechanism to regulate a wide range of cellular activities, and we speculate that cells also express many similar proteins that have not yet been discovered.
Collapse
Affiliation(s)
- Daniel DiMaio
- Department of Genetics, Yale School of Medicine, New Haven, Connecticut 06520;
| |
Collapse
|
102
|
SearchDOGS bacteria, software that provides automated identification of potentially missed genes in annotated bacterial genomes. J Bacteriol 2014; 196:2030-42. [PMID: 24659774 DOI: 10.1128/jb.01368-13] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023] Open
Abstract
We report the development of SearchDOGS Bacteria, software to automatically detect missing genes in annotated bacterial genomes by combining BLAST searches with comparative genomics. Having successfully applied the approach to yeast genomes, we redeveloped SearchDOGS to function as a standalone, downloadable package, requiring only a set of GenBank annotation files as input. The software automatically generates a homology structure using reciprocal BLAST and a synteny-based method; this is followed by a scan of the entire genome of each species for unannotated genes. Results are provided in a HTML interface, providing coordinates, BLAST results, syntenic location, omega values (Ka/Ks, where Ks is the number of synonymous substitutions per synonymous site and Ka is the number of nonsynonymous substitutions per nonsynonymous site) for protein conservation estimates, and other information for each candidate gene. Using SearchDOGS Bacteria, we identified 155 gene candidates in the Shigella boydii sb227 genome, including 56 candidates of length < 60 codons. SearchDOGS Bacteria has two major advantages over currently available annotation software. First, it outperforms current methods in terms of sensitivity and is highly effective at identifying small or highly diverged genes. Second, as a freely downloadable package, it can be used with unpublished or confidential data.
Collapse
|
103
|
Abstract
Small proteins, here defined as proteins of 50 amino acids or fewer in the absence of processing, have traditionally been overlooked due to challenges in their annotation and biochemical detection. In the past several years, however, increasing numbers of small proteins have been identified either through the realization that mutations in intergenic regions are actually within unannotated small protein genes or through the discovery that some small, regulatory RNAs encode small proteins. These insights, together with comparative sequence analysis, indicate that tens if not hundreds of small proteins are synthesized in a given organism. This review summarizes what has been learned about the functions of several of these bacterial small proteins, most of which act at the membrane, illustrating the astonishing range of processes in which these small proteins act and suggesting several general conclusions. Important questions for future studies of these overlooked proteins are also discussed.
Collapse
Affiliation(s)
- Gisela Storz
- Cell Biology and Metabolism Branch, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, Maryland 20892-5430;
| | | | | |
Collapse
|
104
|
Marvin DA, Symmons MF, Straus SK. Structure and assembly of filamentous bacteriophages. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2014; 114:80-122. [PMID: 24582831 DOI: 10.1016/j.pbiomolbio.2014.02.003] [Citation(s) in RCA: 84] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/07/2013] [Accepted: 02/09/2014] [Indexed: 12/24/2022]
Abstract
Filamentous bacteriophages are interesting paradigms in structural molecular biology, in part because of the unusual mechanism of filamentous phage assembly. During assembly, several thousand copies of an intracellular DNA-binding protein bind to each copy of the replicating phage DNA, and are then displaced by membrane-spanning phage coat proteins as the nascent phage is extruded through the bacterial plasma membrane. This complicated process takes place without killing the host bacterium. The bacteriophage is a semi-flexible worm-like nucleoprotein filament. The virion comprises a tube of several thousand identical major coat protein subunits around a core of single-stranded circular DNA. Each protein subunit is a polymer of about 50 amino-acid residues, largely arranged in an α-helix. The subunits assemble into a helical sheath, with each subunit oriented at a small angle to the virion axis and interdigitated with neighbouring subunits. A few copies of "minor" phage proteins necessary for infection and/or extrusion of the virion are located at each end of the completed virion. Here we review both the structure of the virion and aspects of its function, such as the way the virion enters the host, multiplies, and exits to prey on further hosts. In particular we focus on our understanding of the way the components of the virion come together during assembly at the membrane. We try to follow a basic rule of empirical science, that one should chose the simplest theoretical explanation for experiments, but be prepared to modify or even abandon this explanation as new experiments add more detail.
Collapse
Affiliation(s)
- D A Marvin
- Department of Biochemistry, University of Cambridge, Cambridge CB2 1GA, UK.
| | - M F Symmons
- Department of Biochemistry, University of Cambridge, Cambridge CB2 1GA, UK
| | - S K Straus
- Department of Chemistry, University of British Columbia, 2036 Main Mall, Vancouver, BC V6T 1Z1, Canada.
| |
Collapse
|
105
|
Genome-scale analyses of Escherichia coli and Salmonella enterica AraC reveal noncanonical targets and an expanded core regulon. J Bacteriol 2013; 196:660-71. [PMID: 24272778 DOI: 10.1128/jb.01007-13] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Escherichia coli AraC is a well-described transcription activator of genes involved in arabinose metabolism. Using complementary genomic approaches, chromatin immunoprecipitation (ChIP)-chip, and transcription profiling, we identify direct regulatory targets of AraC, including five novel target genes: ytfQ, ydeN, ydeM, ygeA, and polB. Strikingly, only ytfQ has an established connection to arabinose metabolism, suggesting that AraC has a broader function than previously described. We demonstrate arabinose-dependent repression of ydeNM by AraC, in contrast to the well-described arabinose-dependent activation of other target genes. We also demonstrate unexpected read-through of transcription at the Rho-independent terminators downstream of araD and araE, leading to significant increases in the expression of polB and ygeA, respectively. AraC is highly conserved in the related species Salmonella enterica. We use ChIP sequencing (ChIP-seq) and RNA sequencing (RNA-seq) to map the AraC regulon in S. enterica. A comparison of the E. coli and S. enterica AraC regulons, coupled with a bioinformatic analysis of other related species, reveals a conserved regulatory network across the family Enterobacteriaceae comprised of 10 genes associated with arabinose transport and metabolism.
Collapse
|
106
|
Four products from Escherichia coli pseudogenes increase hydrogen production. Biochem Biophys Res Commun 2013; 439:576-9. [DOI: 10.1016/j.bbrc.2013.09.016] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2013] [Accepted: 09/03/2013] [Indexed: 11/22/2022]
|
107
|
Chen S, Zhang CY, Song K. Recognizing short coding sequences of prokaryotic genome using a novel iteratively adaptive sparse partial least squares algorithm. Biol Direct 2013; 8:23. [PMID: 24067167 PMCID: PMC3852556 DOI: 10.1186/1745-6150-8-23] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2013] [Accepted: 09/23/2013] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Significant efforts have been made to address the problem of identifying short genes in prokaryotic genomes. However, most known methods are not effective in detecting short genes. Because of the limited information contained in short DNA sequences, it is very difficult to accurately distinguish between protein coding and non-coding sequences in prokaryotic genomes. We have developed a new Iteratively Adaptive Sparse Partial Least Squares (IASPLS) algorithm as the classifier to improve the accuracy of the identification process. RESULTS For testing, we chose the short coding and non-coding sequences from seven prokaryotic organisms. We used seven feature sets (including GC content, Z-curve, etc.) of short genes.In comparison with GeneMarkS, Metagene, Orphelia, and Heuristic Approachs methods, our model achieved the best prediction performance in identification of short prokaryotic genes. Even when we focused on the very short length group ([60-100 nt)), our model provided sensitivity as high as 83.44% and specificity as high as 92.8%. These values are two or three times higher than three of the other methods while Metagene fails to recognize genes in this length range.The experiments also proved that the IASPLS can improve the identification accuracy in comparison with other widely used classifiers, i.e. Logistic, Random Forest (RF) and K nearest neighbors (KNN). The accuracy in using IASPLS was improved 5.90% or more in comparison with the other methods. In addition to the improvements in accuracy, IASPLS required ten times less computer time than using KNN or RF. CONCLUSIONS It is conclusive that our method is preferable for application as an automated method of short gene classification. Its linearity and easily optimized parameters make it practicable for predicting short genes of newly-sequenced or under-studied species.
Collapse
Affiliation(s)
- Sun Chen
- School of Chemical Engineering and Technology, Tianjin University, Tianjin 300072, China.
| | | | | |
Collapse
|
108
|
The Escherichia coli CydX protein is a member of the CydAB cytochrome bd oxidase complex and is required for cytochrome bd oxidase activity. J Bacteriol 2013; 195:3640-50. [PMID: 23749980 DOI: 10.1128/jb.00324-13] [Citation(s) in RCA: 78] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
Cytochrome bd oxidase operons from more than 50 species of bacteria contain a short gene encoding a small protein that ranges from ∼30 to 50 amino acids and is predicted to localize to the cell membrane. Although cytochrome bd oxidases have been studied for more than 70 years, little is known about the role of this small protein, denoted CydX, in oxidase activity. Here we report that Escherichia coli mutants lacking CydX exhibit phenotypes associated with reduced oxidase activity. In addition, cell membrane extracts from ΔcydX mutant strains have reduced oxidase activity in vitro. Consistent with data showing that CydX is required for cytochrome bd oxidase activity, copurification experiments indicate that CydX interacts with the CydAB cytochrome bd oxidase complex. Together, these data support the hypothesis that CydX is a subunit of the CydAB cytochrome bd oxidase complex that is required for complex activity. The results of mutation analysis of CydX suggest that few individual amino acids in the small protein are essential for function, at least in the context of protein overexpression. In addition, the results of analysis of the paralogous small transmembrane protein AppX show that the two proteins could have some overlapping functionality in the cell and that both have the potential to interact with the CydAB complex.
Collapse
|
109
|
Müller SA, Findeiß S, Pernitzsch SR, Wissenbach DK, Stadler PF, Hofacker IL, von Bergen M, Kalkhof S. Identification of new protein coding sequences and signal peptidase cleavage sites of Helicobacter pylori strain 26695 by proteogenomics. J Proteomics 2013; 86:27-42. [PMID: 23665149 DOI: 10.1016/j.jprot.2013.04.036] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2012] [Revised: 03/29/2013] [Accepted: 04/26/2013] [Indexed: 12/16/2022]
Abstract
UNLABELLED Correct annotation of protein coding genes is the basis of conventional data analysis in proteomic studies. Nevertheless, most protein sequence databases almost exclusively rely on gene finding software and inevitably also miss protein annotations or possess errors. Proteogenomics tries to overcome these issues by matching MS data directly against a genome sequence database. Here we report an in-depth proteogenomics study of Helicobacter pylori strain 26695. MS data was searched against a combined database of the NCBI annotations and a six-frame translation of the genome. Database searches with Mascot and X! Tandem revealed 1115 proteins identified by at least two peptides with a peptide false discovery rate below 1%. This represents 71% of the predicted proteome. So far this is the most extensive proteome study of Helicobacter pylori. Our proteogenomic approach unambiguously identified four previously missed annotations and furthermore allowed us to correct sequences of six annotated proteins. Since secreted proteins are often involved in pathogenic processes we further investigated signal peptidase cleavage sites. By applying a database search that accommodates the identification of semi-specific cleaved peptides, 63 previously unknown signal peptides were detected. The motif LXA showed to be the predominant recognition sequence for signal peptidases. BIOLOGICAL SIGNIFICANCE The results of MS-based proteomic studies highly rely on correct annotation of protein coding genes which is the basis of conventional data analysis. However, the annotation of protein coding sequences in genomic data is usually based on gene finding software. These tools are limited in their prediction accuracy such as the problematic determination of exact gene boundaries. Thus, protein databases own partly erroneous or incomplete sequences. Additionally, some protein sequences might also be missing in the databases. Proteogenomics, a combination of proteomic and genomic data analyses, is well suited to detect previously not annotated proteins and to correct erroneous sequences. For this purpose, the existing database of the investigated species is typically supplemented with a six-frame translation of the genome. Here, we studied the proteome of the major human pathogen Helicobacter pylori that is responsible for many gastric diseases such as duodenal ulcers and gastric cancer. Our in-depth proteomic study highly reliably identified 1115 proteins (FDR<0.01%) by at least two peptides (FDR<1%) which represent 71% of the predicted proteome deposited at NCBI. The proteogenomic data analysis of our data set resulted in the unambiguous identification of four previously missed annotations, the correction of six annotated proteins as well as the detection of 63 previously unknown signal peptides. We have annotated proteins of particular biological interest like the ferrous iron transport protein A, the coiled-coil-rich protein HP0058 and the lipopolysaccharide biosynthesis protein HP0619. For instance, the protein HP0619 could be a drug target for the inhibition of the LPS synthesis pathway. Furthermore it has been proven that the motif "LXA" is the predominant recognition sequence for the signal peptidase I of H. pylori. Signal peptidases are essential enzymes for the viability of bacterial cells and are involved in pathogenesis. Therefore signal peptidases could be novel targets for antibiotics. The inclusion of the corrected and new annotated proteins as well as the information of signal peptide cleavage sites will help in the study of biological pathways involved in pathogenesis or drug response of H. pylori.
Collapse
Affiliation(s)
- Stephan A Müller
- Department of Proteomics, UFZ, Helmholtz-Centre for Environmental Research Leipzig, 04318 Leipzig, Germany
| | | | | | | | | | | | | | | |
Collapse
|
110
|
Gannoun-Zaki L, Alibaud L, Carrère-Kremer S, Kremer L, Blanc-Potard AB. Overexpression of the KdpF membrane peptide in Mycobacterium bovis BCG results in reduced intramacrophage growth and altered cording morphology. PLoS One 2013; 8:e60379. [PMID: 23577107 PMCID: PMC3618439 DOI: 10.1371/journal.pone.0060379] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2013] [Accepted: 02/26/2013] [Indexed: 12/19/2022] Open
Abstract
Membrane peptides appear as an emerging class of regulatory molecules in bacteria, which can interact with membrane proteins, such as sensor kinases. To date, regulatory membrane peptides have been completely overlooked in mycobacteria. The 30 amino-acid-long KdpF peptide, which is co-transcribed with kdpABC genes and regulated by the KdpDE two-component system, is supposed to stabilize the KdpABC potassium transporter complex but may also exhibit unsuspected regulatory function(s) towards the KdpD sensor kinase. Herein, we showed by quantitative RT-PCR that the Mycobacterium bovis BCG kdpAB and kdpDE genes clusters are differentially induced in potassium-deprived broth medium or within infected macrophages. We have overexpressed the kdpF gene in M. bovis BCG to investigate its possible regulatory role and effect on mycobacterial virulence. Our results indicate that KdpF does not play a critical regulatory role on kdp genes expression despite the fact that KdpF interacts with the KdpD sensor kinase in a bacterial two-hybrid assay. However, overexpression of kdpF results in a significant reduction of M. bovis BCG growth in both murine and human primary macrophages, and is associated with a strong alteration of colonial morphology and impaired cording formation. To identify novel KdpF interactants, a mycobacterial library was screened using KdpF as bait in the bacterial two-hybrid system. This allowed us to identify members of the MmpL family of membrane proteins, known to participate in the biosynthesis/transport of various cell wall lipids, thus highlighting a possible link between KdpF and cell wall lipid metabolism. Taken together, these data suggest that KdpF overexpression reduces intramacrophage growth which may result from alteration of the mycobacterial cell wall.
Collapse
Affiliation(s)
- Laila Gannoun-Zaki
- Laboratoire de Dynamique des Interactions Membranaires Normales et Pathologiques, Universités de Montpellier 2 et 1, CNRS-UMR5235, Montpellier, France
| | - Laeticia Alibaud
- Laboratoire de Dynamique des Interactions Membranaires Normales et Pathologiques, Universités de Montpellier 2 et 1, CNRS-UMR5235, Montpellier, France
| | - Séverine Carrère-Kremer
- Laboratoire de Dynamique des Interactions Membranaires Normales et Pathologiques, Universités de Montpellier 2 et 1, CNRS-UMR5235, Montpellier, France
| | - Laurent Kremer
- Laboratoire de Dynamique des Interactions Membranaires Normales et Pathologiques, Universités de Montpellier 2 et 1, CNRS-UMR5235, Montpellier, France
- INSERM, DIMNP, CNRS-UMR5235, Montpellier, France
| | - Anne-Béatrice Blanc-Potard
- Laboratoire de Dynamique des Interactions Membranaires Normales et Pathologiques, Universités de Montpellier 2 et 1, CNRS-UMR5235, Montpellier, France
- * E-mail:
| |
Collapse
|
111
|
Weel-Sneve R, Kristiansen KI, Odsbu I, Dalhus B, Booth J, Rognes T, Skarstad K, Bjørås M. Single transmembrane peptide DinQ modulates membrane-dependent activities. PLoS Genet 2013; 9:e1003260. [PMID: 23408903 PMCID: PMC3567139 DOI: 10.1371/journal.pgen.1003260] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2011] [Accepted: 12/05/2012] [Indexed: 11/18/2022] Open
Abstract
The functions of several SOS regulated genes in Escherichia coli are still unknown, including dinQ. In this work we characterize dinQ and two small RNAs, agrA and agrB, with antisense complementarity to dinQ. Northern analysis revealed five dinQ transcripts, but only one transcript (+44) is actively translated. The +44 dinQ transcript translates into a toxic single transmembrane peptide localized in the inner membrane. AgrB regulates dinQ RNA by RNA interference to counteract DinQ toxicity. Thus the dinQ-agr locus shows the classical features of a type I TA system and has many similarities to the tisB-istR locus. DinQ overexpression depolarizes the cell membrane and decreases the intracellular ATP concentration, demonstrating that DinQ can modulate membrane-dependent processes. Augmented DinQ strongly inhibits marker transfer by Hfr conjugation, indicating a role in recombination. Furthermore, DinQ affects transformation of nucleoid morphology in response to UV damage. We hypothesize that DinQ is a transmembrane peptide that modulates membrane-dependent activities such as nucleoid compaction and recombination. Exposure of the bacterium Escherichia coli to DNA damaging agents induces the SOS response, which up-regulates gene functions involved in numerous cellular processes such as DNA repair, cell division, and replication. Most of the SOS regulated genes in E. coli have been characterized, but still there are several genes of unknown function. One of these uncharacterized genes is dinQ. In this work we characterize dinQ and two novel small RNAs, agrA and agrB, that regulate expression of dinQ. The DinQ peptide is localized in the inner membrane as a single transmembrane peptide of 27 amino acids. Small proteins of less than 50 amino acids are important in cellular processes such as regulation, signalling, and antibacterial action. Here we demonstrate that DinQ modulates recombination and transformation of nucleoid morphology in response to UV damage. Our results provide new insights into small hydrophobic peptides that could regulate important DNA metabolic processes dependent on the inner membrane of the cell.
Collapse
Affiliation(s)
- Ragnhild Weel-Sneve
- Centre for Molecular Biology and Neuroscience (CMBN), University of Oslo and Oslo University Hospital, Rikshospitalet, Oslo, Norway
- Department of Microbiology, University of Oslo and Oslo University Hospital, Rikshospitalet, Oslo, Norway
| | - Knut Ivan Kristiansen
- Centre for Molecular Biology and Neuroscience (CMBN), University of Oslo and Oslo University Hospital, Rikshospitalet, Oslo, Norway
- Department of Microbiology, University of Oslo and Oslo University Hospital, Rikshospitalet, Oslo, Norway
- * E-mail: (KIK); (MB)
| | - Ingvild Odsbu
- Department of Cell Biology, Institute for Cancer Research, University of Oslo and Oslo University Hospital, Radiumhospitalet, Oslo, Norway
| | - Bjørn Dalhus
- Centre for Molecular Biology and Neuroscience (CMBN), University of Oslo and Oslo University Hospital, Rikshospitalet, Oslo, Norway
- Department of Biochemistry, University of Oslo and Oslo University Hospital, Rikshospitalet, Oslo, Norway
| | - James Booth
- Centre for Molecular Biology and Neuroscience (CMBN), University of Oslo and Oslo University Hospital, Rikshospitalet, Oslo, Norway
- Department of Microbiology, University of Oslo and Oslo University Hospital, Rikshospitalet, Oslo, Norway
| | - Torbjørn Rognes
- Centre for Molecular Biology and Neuroscience (CMBN), University of Oslo and Oslo University Hospital, Rikshospitalet, Oslo, Norway
- Department of Microbiology, University of Oslo and Oslo University Hospital, Rikshospitalet, Oslo, Norway
| | - Kirsten Skarstad
- Department of Cell Biology, Institute for Cancer Research, University of Oslo and Oslo University Hospital, Radiumhospitalet, Oslo, Norway
| | - Magnar Bjørås
- Centre for Molecular Biology and Neuroscience (CMBN), University of Oslo and Oslo University Hospital, Rikshospitalet, Oslo, Norway
- Department of Microbiology, University of Oslo and Oslo University Hospital, Rikshospitalet, Oslo, Norway
- Department of Biochemistry, University of Oslo and Oslo University Hospital, Rikshospitalet, Oslo, Norway
- * E-mail: (KIK); (MB)
| |
Collapse
|
112
|
Shimada T, Yamazaki K, Ishihama A. Novel regulator PgrR for switch control of peptidoglycan recycling in Escherichia coli. Genes Cells 2013; 18:123-34. [PMID: 23301696 DOI: 10.1111/gtc.12026] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2012] [Accepted: 11/02/2012] [Indexed: 01/06/2023]
Abstract
Peptidoglycan (PG), also designated as murein, forms a skeletal mesh within the periplasm of bacterial membrane. PG is a metabolically stable cell architecture in Escherichia coli, but under as yet ill-defined conditions, a portion of PG is degraded, of which both amino sugar and peptide moieties are either recycled or used as self-generated nutrients for cell growth. At present, the control of PG degradation remains uncharacterized. Using the Genomic SELEX screening system, we identified an uncharacterized transcription factor YcjZ is a repressor of the expression of the initial step enzymes for PG peptide degradation. Under nutrient starvation, the genes encoding the enzymes for PG peptide degradation are derepressed so as to generate amino acids but are tightly repressed at high osmotic conditions so as to maintain the rigid membrane for withstanding the turgor. Taken together, we propose to rename YcjZ as PgrR (regulator of peptide glycan recycling).
Collapse
Affiliation(s)
- Tomohiro Shimada
- Department of Frontier Bioscience, Hosei University, Koganei, Tokyo, Japan
| | | | | |
Collapse
|
113
|
Abstract
EcoGene (http://ecogene.org) is a database and website devoted to continuously improving the structural and functional annotation of Escherichia coli K-12, one of the most well understood model organisms, represented by the MG1655(Seq) genome sequence and annotations. Major improvements to EcoGene in the past decade include (i) graphic presentations of genome map features; (ii) ability to design Boolean queries and Venn diagrams from EcoArray, EcoTopics or user-provided GeneSets; (iii) the genome-wide clone and deletion primer design tool, PrimerPairs; (iv) sequence searches using a customized EcoBLAST; (v) a Cross Reference table of synonymous gene and protein identifiers; (vi) proteome-wide indexing with GO terms; (vii) EcoTools access to >2000 complete bacterial genomes in EcoGene-RefSeq; (viii) establishment of a MySql relational database; and (ix) use of web content management systems. The biomedical literature is surveyed daily to provide citation and gene function updates. As of September 2012, the review of 37 397 abstracts and articles led to creation of 98 425 PubMed-Gene links and 5415 PubMed-Topic links. Annotation updates to Genbank U00096 are transmitted from EcoGene to NCBI. Experimental verifications include confirmation of a CTG start codon, pseudogene restoration and quality assurance of the Keio strain collection.
Collapse
Affiliation(s)
- Jindan Zhou
- Department of Biochemistry and Molecular Biology, The Miller School of Medicine, University of Miami, Miami, FL 33143, USA
| | | |
Collapse
|
114
|
Slavoff SA, Mitchell AJ, Schwaid AG, Cabili MN, Ma J, Levin JZ, Karger AD, Budnik BA, Rinn JL, Saghatelian A. Peptidomic discovery of short open reading frame-encoded peptides in human cells. Nat Chem Biol 2012; 9:59-64. [PMID: 23160002 PMCID: PMC3625679 DOI: 10.1038/nchembio.1120] [Citation(s) in RCA: 437] [Impact Index Per Article: 36.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2012] [Accepted: 10/16/2012] [Indexed: 11/24/2022]
Abstract
The amount of the transcriptome that is translated into polypeptides is of fundamental importance. We developed a peptidomic strategy to detect short ORF (sORF)-encoded polypeptides (SEPs) in human cells. We identified 90 SEPs, 86 of which are novel, the largest number of human SEPs ever reported. SEP abundances range from 10-1000 molecules per cell, identical to known proteins. SEPs arise from sORFs in non-coding RNAs as well as multi-cistronic mRNAs, and many SEPs initiate with non-AUG start codons, indicating that non-canonical translation may be more widespread in mammals than previously thought. In addition, coding sORFs are present in a small fraction (8/1866) of long intergenic non-coding RNAs (lincRNAs). Together, these results provide the strongest evidence to date that the human proteome is more complex than previously appreciated.
Collapse
Affiliation(s)
- Sarah A Slavoff
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts, USA
| | | | | | | | | | | | | | | | | | | |
Collapse
|
115
|
Conserved small protein associates with the multidrug efflux pump AcrB and differentially affects antibiotic resistance. Proc Natl Acad Sci U S A 2012; 109:16696-701. [PMID: 23010927 DOI: 10.1073/pnas.1210093109] [Citation(s) in RCA: 146] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
The AcrAB-TolC multidrug efflux pump confers resistance to a wide variety of antibiotics and other compounds in Escherichia coli. Here we show that AcrZ (formerly named YbhT), a 49-amino-acid inner membrane protein, associates with the AcrAB-TolC complex. Co-purification of AcrZ with AcrB, in the absence of both AcrA and TolC, two-hybrid assays and suppressor mutations indicate that this interaction occurs through the inner membrane protein AcrB. The highly conserved acrZ gene is coregulated with acrAB through induction by the MarA, Rob, and SoxS transcription regulators. In addition, mutants lacking AcrZ are sensitive to many, but not all, of the antibiotics transported by AcrAB-TolC. This differential antibiotic sensitivity suggests that AcrZ may enhance the ability of the AcrAB-TolC pump to export certain classes of substrates.
Collapse
|
116
|
First insights into the unexplored two-component system YehU/YehT in Escherichia coli. J Bacteriol 2012; 194:4272-84. [PMID: 22685278 DOI: 10.1128/jb.00409-12] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023] Open
Abstract
Two-component systems (TCSs) consisting of a membrane-anchored histidine kinase (HK) and a response regulator (RR) with DNA-binding activity. are major players in signal transduction in prokaryotes. Whereas most TCSs in Escherichia coli are well characterized, almost nothing is known about the LytS-like HK YehU and the corresponding LytTR-like RR YehT. To identify YehT-regulated genes, we compared the transcriptomes of E. coli cells overproducing either YehT or the RR KdpE (control). The expression levels of 32 genes differed more than 8-fold between the two strains. A comprehensive evaluation of these genes identified yjiY as a target of YehT. Electrophoretic mobility shift assays with purified YehT confirmed that YehT interacts directly with the yjiY promoter. Specifically, YehT binds to two direct repeats of the motif ACC(G/A)CT(C/T)A separated by a 13-bp spacer in the yjiY promoter. The target gene yjiY encodes an inner membrane protein belonging to the CstA superfamily of transporters. In E. coli cells growing in media containing peptides or amino acids as a carbon source, yjiY is strongly induced at the onset of the stationary-growth phase. Moreover, expression was found to be dependent on cyclic AMP (cAMP)/cAMP receptor protein (CRP). It is suggested that YehU/YehT participates in the stationary-phase control network.
Collapse
|
117
|
Kröger C, Dillon SC, Cameron ADS, Papenfort K, Sivasankaran SK, Hokamp K, Chao Y, Sittka A, Hébrard M, Händler K, Colgan A, Leekitcharoenphon P, Langridge GC, Lohan AJ, Loftus B, Lucchini S, Ussery DW, Dorman CJ, Thomson NR, Vogel J, Hinton JCD. The transcriptional landscape and small RNAs of Salmonella enterica serovar Typhimurium. Proc Natl Acad Sci U S A 2012; 109:E1277-86. [PMID: 22538806 PMCID: PMC3356629 DOI: 10.1073/pnas.1201061109] [Citation(s) in RCA: 301] [Impact Index Per Article: 25.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
More than 50 y of research have provided great insight into the physiology, metabolism, and molecular biology of Salmonella enterica serovar Typhimurium (S. Typhimurium), but important gaps in our knowledge remain. It is clear that a precise choreography of gene expression is required for Salmonella infection, but basic genetic information such as the global locations of transcription start sites (TSSs) has been lacking. We combined three RNA-sequencing techniques and two sequencing platforms to generate a robust picture of transcription in S. Typhimurium. Differential RNA sequencing identified 1,873 TSSs on the chromosome of S. Typhimurium SL1344 and 13% of these TSSs initiated antisense transcripts. Unique findings include the TSSs of the virulence regulators phoP, slyA, and invF. Chromatin immunoprecipitation revealed that RNA polymerase was bound to 70% of the TSSs, and two-thirds of these TSSs were associated with σ(70) (including phoP, slyA, and invF) from which we identified the -10 and -35 motifs of σ(70)-dependent S. Typhimurium gene promoters. Overall, we corrected the location of important genes and discovered 18 times more promoters than identified previously. S. Typhimurium expresses 140 small regulatory RNAs (sRNAs) at early stationary phase, including 60 newly identified sRNAs. Almost half of the experimentally verified sRNAs were found to be unique to the Salmonella genus, and <20% were found throughout the Enterobacteriaceae. This description of the transcriptional map of SL1344 advances our understanding of S. Typhimurium, arguably the most important bacterial infection model.
Collapse
Affiliation(s)
- Carsten Kröger
- Department of Microbiology, School of Genetics and Microbiology, Moyne Institute of Preventive Medicine, and
| | - Shane C. Dillon
- Department of Microbiology, School of Genetics and Microbiology, Moyne Institute of Preventive Medicine, and
| | - Andrew D. S. Cameron
- Department of Microbiology, School of Genetics and Microbiology, Moyne Institute of Preventive Medicine, and
| | - Kai Papenfort
- Institute for Molecular Infection Biology, University of Würzburg, 97080 Würzburg, Germany
| | - Sathesh K. Sivasankaran
- Department of Microbiology, School of Genetics and Microbiology, Moyne Institute of Preventive Medicine, and
| | - Karsten Hokamp
- Department of Genetics, School of Genetics and Microbiology, Smurfit Institute of Genetics, Trinity College Dublin, Dublin 2, Ireland
| | - Yanjie Chao
- Institute for Molecular Infection Biology, University of Würzburg, 97080 Würzburg, Germany
| | - Alexandra Sittka
- Molecular Pulmonology, Universities of Giessen and Marburg Lung Center, German Center for Lung Research, Philipps University, 35043 Marburg, Germany
| | - Magali Hébrard
- Department of Microbiology, School of Genetics and Microbiology, Moyne Institute of Preventive Medicine, and
| | - Kristian Händler
- Department of Microbiology, School of Genetics and Microbiology, Moyne Institute of Preventive Medicine, and
| | - Aoife Colgan
- Department of Microbiology, School of Genetics and Microbiology, Moyne Institute of Preventive Medicine, and
| | - Pimlapas Leekitcharoenphon
- Department of Systems Biology, Center for Biological Sequence Analysis, and
- National Food Institute, Technical University of Denmark, 2800 Kongens Lyngby, Denmark
| | - Gemma C. Langridge
- The Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1SA, United Kingdom
| | - Amanda J. Lohan
- School of Medicine and Medical Science, Conway Institute, University College Dublin, Dublin 4, Ireland; and
| | - Brendan Loftus
- School of Medicine and Medical Science, Conway Institute, University College Dublin, Dublin 4, Ireland; and
| | - Sacha Lucchini
- Institute of Food Research, Colney, Norwich NR4 7UA, United Kingdom
| | - David W. Ussery
- Department of Systems Biology, Center for Biological Sequence Analysis, and
| | - Charles J. Dorman
- Department of Microbiology, School of Genetics and Microbiology, Moyne Institute of Preventive Medicine, and
| | - Nicholas R. Thomson
- The Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1SA, United Kingdom
| | - Jörg Vogel
- Institute for Molecular Infection Biology, University of Würzburg, 97080 Würzburg, Germany
| | - Jay C. D. Hinton
- Department of Microbiology, School of Genetics and Microbiology, Moyne Institute of Preventive Medicine, and
| |
Collapse
|
118
|
Cheng H, Chan WS, Li Z, Wang D, Liu S, Zhou Y. Small open reading frames: current prediction techniques and future prospect. Curr Protein Pept Sci 2012; 12:503-7. [PMID: 21787300 DOI: 10.2174/138920311796957667] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2011] [Revised: 04/01/2011] [Accepted: 05/04/2011] [Indexed: 11/22/2022]
Abstract
Evidence is accumulating that small open reading frames (sORF, <100 codons) play key roles in many important biological processes. Yet, they are generally ignored in gene annotation despite they are far more abundant than the genes with more than 100 codons. Here, we demonstrate that popular homolog search and codon-index techniques perform poorly for small genes relative to that for larger genes, while a method dedicated to sORF discovery has a similar level of accuracy as homology search. The result is largely due to the small dataset of experimentally verified sORF available for homology search and for training ab initio techniques. It highlights the urgent need for both experimental and computational studies in order to further advance the accuracy of sORF prediction.
Collapse
Affiliation(s)
- Haoyu Cheng
- Indiana University School of Informatics, Indiana University-Purdue University and Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, Indianapolis, IN 46202, USA
| | | | | | | | | | | |
Collapse
|
119
|
Goli B, Nair AS. The elusive short gene – an ensemble method for recognition for prokaryotic genome. Biochem Biophys Res Commun 2012; 422:36-41. [DOI: 10.1016/j.bbrc.2012.04.090] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2012] [Accepted: 04/17/2012] [Indexed: 10/28/2022]
|
120
|
Arnvig K, Young D. Non-coding RNA and its potential role in Mycobacterium tuberculosis pathogenesis. RNA Biol 2012; 9:427-36. [PMID: 22546938 PMCID: PMC3384566 DOI: 10.4161/rna.20105] [Citation(s) in RCA: 77] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
It is estimated that one third of the human population is infected with Mycobacterium tuberculosis. Efforts to understand the molecular basis of its gene regulation have been focused on identification of protein encoding genes and regulons implicated in pathogenesis. Recently, a number of studies have described the identification of several non-coding RNAs that are likely to contribute significantly to the regulatory networks responsible for adaptation and virulence in M. tuberculosis. We have reviewed emerging information on the presence and abundance of different types of non-coding RNA in M. tuberculosis and consider their potential contribution to the adaptive responses that underlie disease pathogenesis.
Collapse
Affiliation(s)
- Kristine Arnvig
- Division of Mycobacterial Research, MRC National Institute for Medical Research, London, UK.
| | | |
Collapse
|
121
|
Affiliation(s)
- Jos Boekhorst
- TI Food and Nutrition, 6700AN Wageningen, the Netherlands
| | | | | |
Collapse
|
122
|
Thomason MK, Fontaine F, De Lay N, Storz G. A small RNA that regulates motility and biofilm formation in response to changes in nutrient availability in Escherichia coli. Mol Microbiol 2012; 84:17-35. [PMID: 22289118 DOI: 10.1111/j.1365-2958.2012.07965.x] [Citation(s) in RCA: 158] [Impact Index Per Article: 13.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
In bacteria, many small regulatory RNAs (sRNAs) are induced in response to specific environmental signals or stresses and act by base-pairing with mRNA targets to affect protein translation or mRNA stability. In Escherichia coli, the gene for the sRNA IS061/IsrA, here renamed McaS, was predicted to reside in an intergenic region between abgR, encoding a transcription regulator and ydaL, encoding a small MutS-related protein. We show that McaS is a ∼95nt transcript whose expression increases over growth, peaking in early-to-mid stationary phase, or when glucose is limiting. McaS uses three discrete single-stranded regions to regulate mRNA targets involved in various aspects of biofilm formation. McaS represses csgD, the transcription regulator of curli biogenesis and activates flhD, the master transcription regulator of flagella synthesis leading to increased motility, a process not previously reported to be regulated by sRNAs. McaS also regulates pgaA, a porin required for the export of the polysaccharide poly β-1,6-N-acetyl-d-glucosamine. Consequently, high levels of McaS result in increased biofilm formation while a strain lacking mcaS shows reduced biofilm formation. Based on our observations, we propose that, in response to limited nutrient availability, increasing levels of McaS modulate steps in the progression to a sessile lifestyle.
Collapse
Affiliation(s)
- Maureen K Thomason
- Cell Biology and Metabolism Program, Eunice Kennedy Shriver National Institute of Child Health and Human Development, Bethesda, MD, USA
| | | | | | | |
Collapse
|
123
|
Abstract
The web application PrimerPair at ecogene.org generates large sets of paired DNA sequences surrounding- all protein and RNA genes of Escherichia coli K-12. Many DNA fragments, which these primers amplify, can be used to implement a genome reengineering strategy using complementary in vitro cloning and in vivo recombineering. The integration of a primer design tool with a model organism database increases the level of quality control. Computer-assisted design of gene primer pairs relies upon having highly accurate genomic DNA sequence information that exactly matches the DNA of the cells being used in the laboratory to ensure predictable DNA hybridizations. It is equally crucial to have confidence that the predicted start codons define the locations of genes accurately. Annotations in the EcoGene database are queried by PrimerPair to eliminate pseudogenes, IS elements, and other problematic genes before the design process starts. These projects progressively familiarize users with the EcoGene content, scope, and application interfaces that are useful for genome reengineering projects. The first protocol leads to the design of a pair of primer sequences that were used to clone and express a single gene. The N-terminal protein sequence was experimentally verified and the protein was detected in the periplasm. This is followed by instructions to design PCR primer pairs for cloning gene fragments encoding 50 periplasmic proteins without their signal peptides. The design process begins with the user simply designating one pair of forward and reverse primer endpoint positions relative to all start and stop codon positions. The gene name, genomic coordinates, and primer DNA sequences are reported to the user. When making chromosomal deletions, the integrity of the provisional primer design is checked to see whether it will generate any unwanted double deletions with adjacent genes. The bad designs are recalculated and replacement primers are provided alongside the requested primers. A list of all genes with overlaps includes those expressed from the translational coupling motifs 5'-UGAUG-3' and 5'-AUGA-3'. Rigid alignments of the 893 ribosome binding sites (RBSs) linked to the AUG codons of this coupled subset are assessed for information content using WebLogo 3.0. These specialized logos are missing the G at the prominent information peak position normally seen in the rigid alignment of all genes. This novel GHOLE motif was apparently masked by the normal RBSs in two previously published rigid alignments. We propose a model constraining the distance between the ATG and the RBS, obviating- the need for a flexible linker model to reveal a Shine-Dalgarno-like sequence.
Collapse
Affiliation(s)
- Jindan Zhou
- Department of Electrical and Computer Engineering, University of Miami, Coral Gables, FL, USA
| | | |
Collapse
|
124
|
Warnecke T, Lynch M, Lipscomb M, Gill R. Identification of a 21 amino acid peptide conferring 3-hydroxypropionic acid stress-tolerance to Escherichia coli. Biotechnol Bioeng 2012; 109:1347-52. [DOI: 10.1002/bit.24398] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2011] [Revised: 11/15/2011] [Accepted: 11/23/2011] [Indexed: 11/10/2022]
|
125
|
Abstract
In recent years, the capability of synthetic biology to design large genetic circuits has dramatically increased due to rapid advances in DNA synthesis technology and development of tools for large-scale assembly of DNA fragments. Large genetic circuits require more components (parts), especially regulators such as transcription factors, sigma factors, and viral RNA polymerases to provide increased regulatory capability, and also devices such as sensors, receivers, and signaling molecules. All these parts may have a potential impact upon the host that needs to be considered when designing and fabricating circuits. DNA microarrays are a well-established technique for global monitoring of gene expression and therefore are an ideal tool for systematically assessing the impact of expressing parts of genetic circuits in host cells. Knowledge of part impact on the host enables the user to design circuits from libraries of parts taking into account their potential impact and also to possibly modify the host to better tolerate stresses induced by the engineered circuit. In this chapter, we present the complete methodology of performing microarrays from choice of array platform, experimental design, preparing samples for array hybridization, and associated data analysis including preprocessing, normalization, clustering, identifying significantly differentially expressed genes, and interpreting the data based on known biology. With these methodologies, we also include lists of bioinformatic resources and tools for performing data analysis. The aim of this chapter is to provide the reader with the information necessary to be able to systematically catalog the impact of genetic parts on the host and also to optimize the operation of fully engineered genetic circuits.
Collapse
Affiliation(s)
- Virgil A Rhodius
- Department of Microbiology and Immunology, University of California at San Francisco, San Francisco, California, USA
| | | |
Collapse
|
126
|
The Escherichia coli MntR miniregulon includes genes encoding a small protein and an efflux pump required for manganese homeostasis. J Bacteriol 2011; 193:5887-97. [PMID: 21908668 DOI: 10.1128/jb.05872-11] [Citation(s) in RCA: 106] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Manganese is a critical micronutrient for cells, serving as an enzyme cofactor and protecting against oxidative stress. Yet, manganese is toxic in excess and little is known about its distribution in cells. Bacteria control intracellular manganese levels by the transcription regulator MntR. When this work began, the only Escherichia coli K-12 gene known to respond to manganese via MntR repression was mntH, which encodes a manganese importer. We show that mntS (formerly the small RNA gene rybA) is repressed by manganese through MntR and encodes an unannotated 42-amino-acid protein. Overproduction of MntS causes manganese sensitivity, while a lack of MntS perturbs proper manganese-dependent repression of mntH. We also provide evidence that mntP (formerly yebN), which encodes a putative efflux pump, is positively regulated by MntR. Deletion of mntP leads to profound manganese sensitivity and to elevated intracellular manganese levels. This work thus defines two new proteins involved in manganese homeostasis and suggests mechanisms for their action.
Collapse
|
127
|
Shinhara A, Matsui M, Hiraoka K, Nomura W, Hirano R, Nakahigashi K, Tomita M, Mori H, Kanai A. Deep sequencing reveals as-yet-undiscovered small RNAs in Escherichia coli. BMC Genomics 2011; 12:428. [PMID: 21864382 PMCID: PMC3175480 DOI: 10.1186/1471-2164-12-428] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2011] [Accepted: 08/24/2011] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND In Escherichia coli, approximately 100 regulatory small RNAs (sRNAs) have been identified experimentally and many more have been predicted by various methods. To provide a comprehensive overview of sRNAs, we analysed the low-molecular-weight RNAs (< 200 nt) of E. coli with deep sequencing, because the regulatory RNAs in bacteria are usually 50-200 nt in length. RESULTS We discovered 229 novel candidate sRNAs (≥ 50 nt) with computational or experimental evidence of transcription initiation. Among them, the expression of seven intergenic sRNAs and three cis-antisense sRNAs was detected by northern blot analysis. Interestingly, five novel sRNAs are expressed from prophage regions and we note that these sRNAs have several specific characteristics. Furthermore, we conducted an evolutionary conservation analysis of the candidate sRNAs and summarised the data among closely related bacterial strains. CONCLUSIONS This comprehensive screen for E. coli sRNAs using a deep sequencing approach has shown that many as-yet-undiscovered sRNAs are potentially encoded in the E. coli genome. We constructed the Escherichia coli Small RNA Browser (ECSBrowser; http://rna.iab.keio.ac.jp/), which integrates the data for previously identified sRNAs and the novel sRNAs found in this study.
Collapse
Affiliation(s)
- Atsuko Shinhara
- Institute for Advanced Biosciences, Keio University, Tsuruoka 997-0017, Japan
| | | | | | | | | | | | | | | | | |
Collapse
|
128
|
Modell JW, Hopkins AC, Laub MT. A DNA damage checkpoint in Caulobacter crescentus inhibits cell division through a direct interaction with FtsW. Genes Dev 2011; 25:1328-43. [PMID: 21685367 DOI: 10.1101/gad.2038911] [Citation(s) in RCA: 103] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Following DNA damage, cells typically delay cell cycle progression and inhibit cell division until their chromosomes have been repaired. The bacterial checkpoint systems responsible for these DNA damage responses are incompletely understood. Here, we show that Caulobacter crescentus responds to DNA damage by coordinately inducing an SOS regulon and inhibiting the master regulator CtrA. Included in the SOS regulon is sidA (SOS-induced inhibitor of cell division A), a membrane protein of only 29 amino acids that helps to delay cell division following DNA damage, but is dispensable in undamaged cells. SidA is sufficient, when overproduced, to block cell division. However, unlike many other regulators of bacterial cell division, SidA does not directly disrupt the assembly or stability of the cytokinetic ring protein FtsZ, nor does it affect the recruitment of other components of the cell division machinery. Instead, we provide evidence that SidA inhibits division by binding directly to FtsW to prevent the final constriction of the cytokinetic ring.
Collapse
Affiliation(s)
- Joshua W Modell
- Department of Biology, Massachusetts Institute of Technology, Cambridge, USA
| | | | | |
Collapse
|
129
|
Abstract
Integral membrane proteins of the cell surface and most intracellular compartments of eukaryotic cells are assembled at the endoplasmic reticulum. Two highly conserved and parallel pathways mediate membrane protein targeting to and insertion into this organelle. The classical cotranslational pathway, utilized by most membrane proteins, involves targeting by the signal recognition particle followed by insertion via the Sec61 translocon. A more specialized posttranslational pathway, employed by many tail-anchored membrane proteins, is composed of entirely different factors centered around a cytosolic ATPase termed TRC40 or Get3. Both of these pathways overcome the same biophysical challenges of ferrying hydrophobic cargo through an aqueous milieu, selectively delivering it to one among several intracellular membranes and asymmetrically integrating its transmembrane domain(s) into the lipid bilayer. Here, we review the conceptual and mechanistic themes underlying these core membrane protein insertion pathways, the complexities that challenge our understanding, and future directions to overcome these obstacles.
Collapse
Affiliation(s)
- Sichen Shao
- Cell Biology and Metabolism Program, National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, Maryland 20892, USA
| | | |
Collapse
|
130
|
Fontaine F, Fuchs RT, Storz G. Membrane localization of small proteins in Escherichia coli. J Biol Chem 2011; 286:32464-74. [PMID: 21778229 DOI: 10.1074/jbc.m111.245696] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Escherichia coli synthesize over 60 poorly understood small proteins of less than 50 amino acids. A striking feature of these proteins is that 65% contain a predicted α-helical transmembrane (TM) domain. This prompted us to examine the localization, topology, and membrane insertion of the small proteins. Biochemical fractionation showed that, consistent with the predicted TM helix, the small proteins generally are most abundant in the inner membrane fraction. Examples of both N(in)-C(out) and N(out)-C(in) orientations were found in assays of topology-reporter fusions to representative small TM proteins. Interestingly, however, three of nine tested proteins display dual topology. Positive residues close to the transmembrane domains are conserved, and mutational analysis of one small protein, YohP, showed that the positive inside rule applies for single transmembrane domain proteins as has been observed for larger proteins. Finally, fractionation analysis of small protein localization in strains depleted of the Sec or YidC membrane insertion pathways uncovered differential requirements. Some small proteins appear to be affected by both Sec and YidC depletion, others showed more dependence on one or the other insertion pathway, whereas one protein was not affected by depletion of either Sec or YidC. Thus, despite their diminutive size, small proteins display considerable diversity in topology, biochemical features, and insertion pathways.
Collapse
Affiliation(s)
- Fanette Fontaine
- Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, Maryland 20892, USA
| | | | | |
Collapse
|
131
|
Small RNAs as regulators of primary and secondary metabolism in Pseudomonas species. Appl Microbiol Biotechnol 2011; 91:63-79. [PMID: 21607656 DOI: 10.1007/s00253-011-3332-1] [Citation(s) in RCA: 102] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2011] [Revised: 04/11/2011] [Accepted: 04/11/2011] [Indexed: 10/18/2022]
Abstract
Small RNAs (sRNAs) exert important functions in pseudomonads. Classical sRNAs comprise the 4.5S, 6S, 10Sa and 10Sb RNAs, which are known in enteric bacteria as part of the signal recognition particle, a regulatory component of RNA polymerase, transfer-messenger RNA (tmRNA) and the RNA component of RNase P, respectively. Their homologues in pseudomonads are presumed to have analogous functions. Other sRNAs of pseudomonads generally have little or no sequence similarity with sRNAs of enteric bacteria. Numerous sRNAs repress or activate the translation of target mRNAs by a base-pairing mechanism. Examples of this group in Pseudomonas aeruginosa are the iron-repressible PrrF1 and PrrF2 sRNAs, which repress the translation of genes encoding iron-containing proteins, and PhrS, an anaerobically inducible sRNA, which activates the expression of PqsR, a regulator of the Pseudomonas quinolone signal. Other sRNAs sequester RNA-binding proteins that act as translational repressors. Examples of this group in P. aeruginosa include RsmY and RsmZ, which are central regulatory elements in the GacS/GacA signal transduction pathway, and CrcZ, which is a key regulator in the CbrA/CbrB signal transduction pathway. These pathways largely control the extracellular activities (including virulence traits) and the selection of the energetically most favourable carbon sources, respectively, in pseudomonads.
Collapse
|
132
|
Pilalis E, Chatziioannou AA, Grigoroudis AI, Panagiotidis CA, Kolisis FN, Kyriakidis DA. Escherichia coli genome-wide promoter analysis: identification of additional AtoC binding target elements. BMC Genomics 2011; 12:238. [PMID: 21569465 PMCID: PMC3118216 DOI: 10.1186/1471-2164-12-238] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2010] [Accepted: 05/13/2011] [Indexed: 11/16/2022] Open
Abstract
Background Studies on bacterial signal transduction systems have revealed complex networks of functional interactions, where the response regulators play a pivotal role. The AtoSC system of E. coli activates the expression of atoDAEB operon genes, and the subsequent catabolism of short-chain fatty acids, upon acetoacetate induction. Transcriptome and phenotypic analyses suggested that atoSC is also involved in several other cellular activities, although we have recently reported a palindromic repeat within the atoDAEB promoter as the single, cis-regulatory binding site of the AtoC response regulator. In this work, we used a computational approach to explore the presence of yet unidentified AtoC binding sites within other parts of the E. coli genome. Results Through the implementation of a computational de novo motif detection workflow, a set of candidate motifs was generated, representing putative AtoC binding targets within the E. coli genome. In order to assess the biological relevance of the motifs and to select for experimental validation of those sequences related robustly with distinct cellular functions, we implemented a novel approach that applies Gene Ontology Term Analysis to the motif hits and selected those that were qualified through this procedure. The computational results were validated using Chromatin Immunoprecipitation assays to assess the in vivo binding of AtoC to the predicted sites. This process verified twenty-two additional AtoC binding sites, located not only within intergenic regions, but also within gene-encoding sequences. Conclusions This study, by tracing a number of putative AtoC binding sites, has indicated an AtoC-related cross-regulatory function. This highlights the significance of computational genome-wide approaches in elucidating complex patterns of bacterial cell regulation.
Collapse
Affiliation(s)
- Eleftherios Pilalis
- Institute of Biological Research and Biotechnology, National Hellenic Research Foundation, Athens, Greece
| | | | | | | | | | | |
Collapse
|
133
|
Samayoa J, Yildiz FH, Karplus K. Identification of prokaryotic small proteins using a comparative genomic approach. ACTA ACUST UNITED AC 2011; 27:1765-71. [PMID: 21551138 DOI: 10.1093/bioinformatics/btr275] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
MOTIVATION Accurate prediction of genes encoding small proteins (on the order of 50 amino acids or less) remains an elusive open problem in bioinformatics. Some of the best methods for gene prediction use either sequence composition analysis or sequence similarity to a known protein coding sequence. These methods often fail for small proteins, however, either due to a lack of experimentally verified small protein coding genes or due to the limited statistical significance of statistics on small sequences. Our approach is based upon the hypothesis that true small proteins will be under selective pressure for encoding the particular amino acid sequence, for ease of translation by the ribosome and for structural stability. This stability can be achieved either independently or as part of a larger protein complex. Given this assumption, it follows that small proteins should display conserved local protein structure properties much like larger proteins. Our method incorporates neural-net predictions for three local structure alphabets within a comparative genomic approach using a genomic alignment of 22 closely related bacteria genomes to generate predictions for whether or not a given open reading frame (ORF) encodes for a small protein. RESULTS We have applied this method to the complete genome for Escherichia coli strain K12 and looked at how well our method performed on a set of 60 experimentally verified small proteins from this organism. Out of a total of 11 407 possible ORFs, we found that 6 of the top 10 and 27 of the top 100 predictions belonged to the set of 60 experimentally verified small proteins. We found 35 of all the true small proteins within the top 200 predictions. We compared our method to Glimmer, using a default Glimmer protocol and a modified small ORF Glimmer protocol with a lower minimum size cutoff. The default Glimmer protocol identified 16 of the true small proteins (all in the top 200 predictions), but failed to predict on 34 due to size cutoffs. The small ORF Glimmer protocol made predictions for all the experimentally verified small proteins but only contained 9 of the 60 true small proteins within the top 200 predictions. CONTACT jsamayoa@jhu.edu
Collapse
Affiliation(s)
- Josue Samayoa
- Department of Biomedical Engineering, Institute for Computational Medicine, Johns Hopkins University, Baltimore, MD 21218, USA.
| | | | | |
Collapse
|
134
|
Hobbs EC, Fontaine F, Yin X, Storz G. An expanding universe of small proteins. Curr Opin Microbiol 2011; 14:167-73. [PMID: 21342783 PMCID: PMC3079058 DOI: 10.1016/j.mib.2011.01.007] [Citation(s) in RCA: 70] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2010] [Accepted: 01/28/2011] [Indexed: 01/04/2023]
Abstract
Historically, small proteins (sproteins) of less than 50 amino acids, in their final processed forms or genetically encoded as such, have been understudied. However, both serendipity and more recent focused efforts have led to the identification of a number of new sproteins in both Gram-negative and Gram-positive bacteria. Increasing evidence demonstrates that sproteins participate in a wide array of cellular processes and exhibit great diversity in their mechanisms of action, yet general principles of sprotein function are emerging. This review highlights examples of sproteins that participate in cell signaling, act as antibiotics and toxins, and serve as structural proteins. We also describe roles for sproteins in detecting and altering membrane features, acting as chaperones, and regulating the functions of larger proteins.
Collapse
Affiliation(s)
| | | | - Xuefeng Yin
- School of Basic Medical Sciences, Peking University, 100191 Beijing, China
| | - Gisela Storz
- Corresponding author address of corresponding author:
| |
Collapse
|
135
|
Washietl S, Findeiss S, Müller SA, Kalkhof S, von Bergen M, Hofacker IL, Stadler PF, Goldman N. RNAcode: robust discrimination of coding and noncoding regions in comparative sequence data. RNA (NEW YORK, N.Y.) 2011; 17:578-94. [PMID: 21357752 PMCID: PMC3062170 DOI: 10.1261/rna.2536111] [Citation(s) in RCA: 146] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
With the availability of genome-wide transcription data and massive comparative sequencing, the discrimination of coding from noncoding RNAs and the assessment of coding potential in evolutionarily conserved regions arose as a core analysis task. Here we present RNAcode, a program to detect coding regions in multiple sequence alignments that is optimized for emerging applications not covered by current protein gene-finding software. Our algorithm combines information from nucleotide substitution and gap patterns in a unified framework and also deals with real-life issues such as alignment and sequencing errors. It uses an explicit statistical model with no machine learning component and can therefore be applied "out of the box," without any training, to data from all domains of life. We describe the RNAcode method and apply it in combination with mass spectrometry experiments to predict and confirm seven novel short peptides in Escherichia coli and to analyze the coding potential of RNAs previously annotated as "noncoding." RNAcode is open source software and available for all major platforms at http://wash.github.com/rnacode.
Collapse
Affiliation(s)
- Stefan Washietl
- EMBL-European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire CB101SD, United Kingdom.
| | | | | | | | | | | | | | | |
Collapse
|
136
|
Thomassen GOS, Weel-Sneve R, Rowe AD, Booth JA, Lindvall JM, Lagesen K, Kristiansen KI, Bjørås M, Rognes T. Tiling array analysis of UV treated Escherichia coli predicts novel differentially expressed small peptides. PLoS One 2010; 5:e15356. [PMID: 21203457 PMCID: PMC3009722 DOI: 10.1371/journal.pone.0015356] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2010] [Accepted: 11/09/2010] [Indexed: 11/19/2022] Open
Abstract
Background Despite comprehensive investigation, the Escherichia coli SOS response system is not yet fully understood. We have applied custom designed whole genome tiling arrays to measure UV invoked transcriptional changes in E. coli. This study provides a more complete insight into the transcriptome and the UV irradiation response of this microorganism. Results We detected a number of novel differentially expressed transcripts in addition to the expected SOS response genes (such as sulA, recN, uvrA, lexA, umuC and umuD) in the UV treated cells. Several of the differentially expressed transcripts might play important roles in regulation of the cellular response to UV damage. We have predicted 23 novel small peptides from our set of detected non-gene transcripts. Further, three of the predicted peptides were cloned into protein expression vectors to test the biological activity. All three constructs expressed the predicted peptides, in which two of them were highly toxic to the cell. Additionally, a remarkably high overlap with previously in-silico predicted non-coding RNAs (ncRNAs) was detected. Generally we detected a far higher transcriptional activity than the annotation suggests, and these findings correspond with previous transcription mappings from E. coli and other organisms. Conclusions Here we demonstrate that the E. coli transcriptome consists of far more transcripts than the present annotation suggests, of which many transcripts seem important to the bacterial stress response. Sequence alignment of promoter regions suggest novel regulatory consensus sequences for some of the upregulated genes. Finally, several of the novel transcripts identified in this study encode putative small peptides, which are biologically active.
Collapse
Affiliation(s)
- Gard O. S. Thomassen
- Centre for Molecular Biology and Neuroscience (CMBN) and Department of Microbiology, Rikshospitalet, Oslo University Hospital, Oslo, Norway
- Centre for Molecular Biology and Neuroscience (CMBN) and Department of Microbiology, University of Oslo, Oslo, Norway
| | - Ragnhild Weel-Sneve
- Centre for Molecular Biology and Neuroscience (CMBN) and Department of Microbiology, Rikshospitalet, Oslo University Hospital, Oslo, Norway
- Centre for Molecular Biology and Neuroscience (CMBN) and Department of Microbiology, University of Oslo, Oslo, Norway
| | - Alexander D. Rowe
- Centre for Molecular Biology and Neuroscience (CMBN) and Department of Microbiology, Rikshospitalet, Oslo University Hospital, Oslo, Norway
| | - James A. Booth
- Centre for Molecular Biology and Neuroscience (CMBN) and Department of Microbiology, Rikshospitalet, Oslo University Hospital, Oslo, Norway
| | | | - Karin Lagesen
- Centre for Molecular Biology and Neuroscience (CMBN) and Department of Microbiology, Rikshospitalet, Oslo University Hospital, Oslo, Norway
- Centre for Molecular Biology and Neuroscience (CMBN) and Department of Microbiology, University of Oslo, Oslo, Norway
| | - Knut I. Kristiansen
- Centre for Molecular Biology and Neuroscience (CMBN) and Department of Microbiology, Rikshospitalet, Oslo University Hospital, Oslo, Norway
| | - Magnar Bjørås
- Centre for Molecular Biology and Neuroscience (CMBN) and Department of Microbiology, Rikshospitalet, Oslo University Hospital, Oslo, Norway
- Centre for Molecular Biology and Neuroscience (CMBN) and Department of Microbiology, University of Oslo, Oslo, Norway
- Institute of Clinical Biochemistry, University of Oslo, Oslo, Norway
| | - Torbjørn Rognes
- Centre for Molecular Biology and Neuroscience (CMBN) and Department of Microbiology, Rikshospitalet, Oslo University Hospital, Oslo, Norway
- Department of Informatics, University of Oslo, Oslo, Norway
- * E-mail:
| |
Collapse
|
137
|
Postic G, Frapy E, Dupuis M, Dubail I, Livny J, Charbit A, Meibom KL. Identification of small RNAs in Francisella tularensis. BMC Genomics 2010; 11:625. [PMID: 21067590 PMCID: PMC3091763 DOI: 10.1186/1471-2164-11-625] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2010] [Accepted: 11/10/2010] [Indexed: 12/29/2022] Open
Abstract
Background Regulation of bacterial gene expression by small RNAs (sRNAs) have proved to be important for many biological processes. Francisella tularensis is a highly pathogenic Gram-negative bacterium that causes the disease tularaemia in humans and animals. Relatively little is known about the regulatory networks existing in this organism that allows it to survive in a wide array of environments and no sRNA regulators have been identified so far. Results We have used a combination of experimental assays and in silico prediction to identify sRNAs in F. tularensis strain LVS. Using a cDNA cloning and sequencing approach we have shown that F. tularensis expresses homologues of several sRNAs that are well-conserved among diverse bacteria. We have also discovered two abundant putative sRNAs that share no sequence similarity or conserved genomic context with any previously annotated regulatory transcripts. Deletion of either of these two loci led to significant changes in the expression of several mRNAs that likely include the cognate target(s) of these sRNAs. Deletion of these sRNAs did not, however, significantly alter F. tularensis growth under various stress conditions in vitro, its replication in murine cells, or its ability to induce disease in a mouse model of F. tularensis infection. We also conducted a genome-wide in silico search for intergenic loci that suggests F. tularensis encodes several other sRNAs in addition to the sRNAs found in our experimental screen. Conclusion Our findings suggest that F. tularensis encodes a significant number of non-coding regulatory RNAs, including members of well conserved families of structural and housekeeping RNAs and other poorly conserved transcripts that may have evolved more recently to help F. tularensis deal with the unique and diverse set of environments with which it must contend.
Collapse
|
138
|
Guan Z, Wang X, Raetz CRH. Identification of a chloroform-soluble membrane miniprotein in Escherichia coli and its homolog in Salmonella typhimurium. Anal Biochem 2010; 409:284-9. [PMID: 21050835 DOI: 10.1016/j.ab.2010.10.035] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2010] [Revised: 10/27/2010] [Accepted: 10/27/2010] [Indexed: 11/17/2022]
Abstract
Two homologous 29 amino acid-long highly hydrophobic membrane miniproteins were identified in the Bligh-Dyer lipid extracts of Escherichia coli and Salmonella typhimurium using liquid chromatography/tandem mass spectrometry (LC/MS/MS). The amino acid sequences of the proteins were determined by collision-induced dissociation tandem mass spectrometry, in conjunction with a translating BLAST (tBLASTn) search, i.e., comparing the MS/MS-determined protein query sequence against the six-frame translations of the nucleotide sequences of the E. coli and S. typhimurium genomes. Further MS characterization revealed that both proteins retain the N-terminal initiating formyl-methionines. The methodologies described here may be amendable for detecting and characterizing small hydrophobic proteins in other organisms that are difficult to annotate or analyze by conventional methods.
Collapse
Affiliation(s)
- Ziqiang Guan
- Department of Biochemistry, Duke University Medical Center, Durham, NC 27710, USA.
| | | | | |
Collapse
|
139
|
Mok WWK, Patel NH, Li Y. Decoding toxicity: deducing the sequence requirements of IbsC, a type I toxin in Escherichia coli. J Biol Chem 2010; 285:41627-36. [PMID: 20980267 DOI: 10.1074/jbc.m110.149179] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023] Open
Abstract
Bacterial genomes encode a collection of small peptides that are deleterious to their hosts when overexpressed. The physiological relevance of the majority of these peptides is unknown at present, although many of them have been implicated in regulatory processes important for cell survival and adaptability. One peptide that is of particular interest to us is a 19-amino acid proteic toxin, coined IbsC, whose production is repressed by SibC, an RNA antitoxin. Together, IbsC and SibC constitute a type I toxin-antitoxin (TA) pair. To better understand the function of IbsC and to decipher the sequence determinants for its toxic phenotype, we carried out extensive sequence analyses of the peptide. We generated a series of truncation and single amino acid deletion mutants to determine the minimal sequence required for toxicity. We further probed into functionally relevant amino acids with a comprehensive set of IbsC mutants produced using a systematic sequence randomization strategy. We found that IbsC remained toxic in the presence of multiple deletions and single amino acid substitutions, despite being well-conserved in Escherichia coli and across other Gram-negative bacteria. The toxicity of this peptide was determined to be dependent on a stretch of highly hydrophobic residues near its center. Our results defined sequence-function relationship of IbsC and offered additional insights into properties common to membrane-targeting type I toxins in E. coli and related species.
Collapse
Affiliation(s)
- Wendy W K Mok
- Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, Ontario L8N 3Z5, Canada
| | | | | |
Collapse
|
140
|
Nielsen JS, Christiansen MHG, Bonde M, Gottschalk S, Frees D, Thomsen LE, Kallipolitis BH. Searching for small σB-regulated genes in Staphylococcus aureus. Arch Microbiol 2010; 193:23-34. [DOI: 10.1007/s00203-010-0641-1] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2010] [Revised: 08/25/2010] [Accepted: 09/09/2010] [Indexed: 02/05/2023]
|
141
|
Abstract
The idea that we could build molecular communications systems can be advanced by investigating how actual molecules from living organisms function. Information theory provides tools for such an investigation. This review describes how we can compute the average information in the DNA binding sites of any genetic control protein and how this can be extended to analyze its individual sites. A formula equivalent to Claude Shannon's channel capacity can be applied to molecular systems and used to compute the efficiency of protein binding. This efficiency is often 70% and a brief explanation for that is given. The results imply that biological systems have evolved to function at channel capacity, which means that we should be able to build molecular communications that are just as robust as our macroscopic ones.
Collapse
Affiliation(s)
- Thomas D. Schneider
- National Institutes of Health, National Cancer Institute at Frederick, P.O. Box B, Frederick, MD 21702-1201, United States
| |
Collapse
|
142
|
Müller SA, Kohajda T, Findeiss S, Stadler PF, Washietl S, Kellis M, von Bergen M, Kalkhof S. Optimization of parameters for coverage of low molecular weight proteins. Anal Bioanal Chem 2010; 398:2867-81. [PMID: 20803007 PMCID: PMC2990009 DOI: 10.1007/s00216-010-4093-x] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2010] [Revised: 07/26/2010] [Accepted: 08/03/2010] [Indexed: 11/16/2022]
Abstract
Proteins with molecular weights of <25 kDa are involved in major biological processes such as ribosome formation, stress adaption (e.g., temperature reduction) and cell cycle control. Despite their importance, the coverage of smaller proteins in standard proteome studies is rather sparse. Here we investigated biochemical and mass spectrometric parameters that influence coverage and validity of identification. The underrepresentation of low molecular weight (LMW) proteins may be attributed to the low numbers of proteolytic peptides formed by tryptic digestion as well as their tendency to be lost in protein separation and concentration/desalting procedures. In a systematic investigation of the LMW proteome of Escherichia coli, a total of 455 LMW proteins (27% of the 1672 listed in the SwissProt protein database) were identified, corresponding to a coverage of 62% of the known cytosolic LMW proteins. Of these proteins, 93 had not yet been functionally classified, and five had not previously been confirmed at the protein level. In this study, the influences of protein extraction (either urea or TFA), proteolytic digestion (solely, and the combined usage of trypsin and AspN as endoproteases) and protein separation (gel- or non-gel-based) were investigated. Compared to the standard procedure based solely on the use of urea lysis buffer, in-gel separation and tryptic digestion, the complementary use of TFA for extraction or endoprotease AspN for proteolysis permits the identification of an extra 72 (32%) and 51 proteins (23%), respectively. Regarding mass spectrometry analysis with an LTQ Orbitrap mass spectrometer, collision-induced fragmentation (CID and HCD) and electron transfer dissociation using the linear ion trap (IT) or the Orbitrap as the analyzer were compared. IT-CID was found to yield the best identification rate, whereas IT-ETD provided almost comparable results in terms of LMW proteome coverage. The high overlap between the proteins identified with IT-CID and IT-ETD allowed the validation of 75% of the identified proteins using this orthogonal fragmentation technique. Furthermore, a new approach to evaluating and improving the completeness of protein databases that utilizes the program RNAcode was introduced and examined.
Collapse
Affiliation(s)
- Stephan A Müller
- Department of Proteomics, UFZ, Helmholtz-Centre for Environmental Research, Permoserstraße 15, 04318 Leipzig, Germany
| | | | | | | | | | | | | | | |
Collapse
|
143
|
Abstract
Using an oligonucleotide microarray, we searched for previously unrecognized transcription units in intergenic regions in the genome of Bacillus subtilis, with an emphasis on identifying small genes activated during spore formation. Nineteen transcription units were identified, 11 of which were shown to depend on one or more sporulation-regulatory proteins for their expression. A high proportion of the transcription units contained small, functional open reading frames (ORFs). One such newly identified ORF is a member of a family of six structurally similar genes that are transcribed under the control of sporulation transcription factor σ(E) or σ(K). A multiple mutant lacking all six genes was found to sporulate with slightly higher efficiency than the wild type, suggesting that under standard laboratory conditions the expression of these genes imposes a small cost on the production of heat-resistant spores. Finally, three of the transcription units specified small, noncoding RNAs; one of these was under the control of the sporulation transcription factor σ(E), and another was under the control of the motility sigma factor σ(D).
Collapse
|
144
|
Coornaert A, Lu A, Mandin P, Springer M, Gottesman S, Guillier M. MicA sRNA links the PhoP regulon to cell envelope stress. Mol Microbiol 2010; 76:467-79. [PMID: 20345657 DOI: 10.1111/j.1365-2958.2010.07115.x] [Citation(s) in RCA: 85] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Numerous small RNAs regulators of gene expression exist in bacteria. A large class of them binds to the RNA chaperone Hfq and act by base pairing interactions with their target mRNA, thereby affecting their translation and/or stability. They often have multiple direct targets, some of which may be regulators themselves, and production of a single sRNA can therefore affect the expression of dozens of genes. We show in this study that the synthesis of the Escherichia coli pleiotropic PhoPQ two-component system is repressed by MicA, a sigma(E)-dependent sRNA regulator of porin biogenesis. MicA directly pairs with phoPQ mRNA in the translation initiation region of phoP and presumably inhibits translation by competing with ribosome binding. Consequently, MicA downregulates several members of the PhoPQ regulon. By linking PhoPQ to sigma(E), our findings suggest that major cellular processes such as Mg(2+) transport, virulence, LPS modification or resistance to antimicrobial peptides are modulated in response to envelope stress. In addition, we found that Hfq strongly affects the expression of phoP independently of MicA, raising the possibility that even more sRNAs, which remain to be identified, could regulate PhoPQ synthesis.
Collapse
Affiliation(s)
- Audrey Coornaert
- UPR9073 du CNRS affiliated with Université de Paris 7-Denis Diderot, Institut de Biologie Physico-chimique, Paris, France
| | | | | | | | | | | |
Collapse
|
145
|
Warren AS, Archuleta J, Feng WC, Setubal JC. Missing genes in the annotation of prokaryotic genomes. BMC Bioinformatics 2010; 11:131. [PMID: 20230630 PMCID: PMC3098052 DOI: 10.1186/1471-2105-11-131] [Citation(s) in RCA: 74] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2009] [Accepted: 03/15/2010] [Indexed: 12/04/2022] Open
Abstract
Background Protein-coding gene detection in prokaryotic genomes is considered a much simpler problem than in intron-containing eukaryotic genomes. However there have been reports that prokaryotic gene finder programs have problems with small genes (either over-predicting or under-predicting). Therefore the question arises as to whether current genome annotations have systematically missing, small genes. Results We have developed a high-performance computing methodology to investigate this problem. In this methodology we compare all ORFs larger than or equal to 33 aa from all fully-sequenced prokaryotic replicons. Based on that comparison, and using conservative criteria requiring a minimum taxonomic diversity between conserved ORFs in different genomes, we have discovered 1,153 candidate genes that are missing from current genome annotations. These missing genes are similar only to each other and do not have any strong similarity to gene sequences in public databases, with the implication that these ORFs belong to missing gene families. We also uncovered 38,895 intergenic ORFs, readily identified as putative genes by similarity to currently annotated genes (we call these absent annotations). The vast majority of the missing genes found are small (less than 100 aa). A comparison of select examples with GeneMark, EasyGene and Glimmer predictions yields evidence that some of these genes are escaping detection by these programs. Conclusions Prokaryotic gene finders and prokaryotic genome annotations require improvement for accurate prediction of small genes. The number of missing gene families found is likely a lower bound on the actual number, due to the conservative criteria used to determine whether an ORF corresponds to a real gene.
Collapse
Affiliation(s)
- Andrew S Warren
- Virginia Bioinformatics Institute, Virginia Tech, Blacksburg, VA, USA.
| | | | | | | |
Collapse
|
146
|
Guccione E, Hitchcock A, Hall SJ, Mulholland F, Shearer N, van Vliet AHM, Kelly DJ. Reduction of fumarate, mesaconate and crotonate by Mfr, a novel oxygen-regulated periplasmic reductase inCampylobacter jejuni. Environ Microbiol 2010; 12:576-91. [DOI: 10.1111/j.1462-2920.2009.02096.x] [Citation(s) in RCA: 56] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
147
|
Small stress response proteins in Escherichia coli: proteins missed by classical proteomic studies. J Bacteriol 2010; 192:46-58. [PMID: 19734316 DOI: 10.1128/jb.00872-09] [Citation(s) in RCA: 123] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Proteins of 50 or fewer amino acids are poorly characterized in all organisms. The corresponding genes are challenging to reliably annotate, and it is difficult to purify and characterize the small protein products. Due to these technical limitations, little is known about the abundance of small proteins, not to mention their biological functions. To begin to characterize these small proteins in Escherichia coli, we assayed their accumulation under a variety of growth conditions and after exposure to stress. We found that many small proteins accumulate under specific growth conditions or are stress induced. For some genes, the observed changes in protein levels were consistent with known transcriptional regulation, such as ArcA activation of the operons encoding yccB and ybgT. However, we also identified novel regulation, such as Zur repression of ykgMO, cyclic AMP response protein (CRP) repression of azuC, and CRP activation of ykgR. The levels of 11 small proteins increase after heat shock, and induction of at least 1 of these, YobF, occurs at a posttranscriptional level. These results show that small proteins are an overlooked subset of stress response proteins in E. coli and provide information that will be valuable for determining the functions of these proteins.
Collapse
|
148
|
Small RNAs and small proteins involved in resistance to cell envelope stress and acid shock in Escherichia coli: analysis of a bar-coded mutant collection. J Bacteriol 2010; 192:59-67. [PMID: 19734312 DOI: 10.1128/jb.00873-09] [Citation(s) in RCA: 81] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
More than 80 small regulatory RNAs (sRNAs) and 60 proteins of 16 to 50 amino acids (small proteins) are encoded in the Escherichia coli genome. The vast majority of the corresponding genes have no known function. We screened 125 DNA bar-coded mutants to identify novel cell envelope stress and acute acid shock phenotypes associated with deletions of genes coding for sRNAs and small proteins. Nine deletion mutants (ssrA, micA, ybaM, ryeF, yqcG, sroH, ybhT, yobF, and glmY) were sensitive to cell envelope stress and two were resistant (rybB and blr). Deletion mutants of genes coding for four small proteins (yqgB, mgrB, yobF, and yceO) were sensitive to acute acid stress. We confirmed each of these phenotypes in one-on-one competition assays against otherwise-wild-type lacZ mutant cells. A more detailed investigation of the SsrA phenotype suggests that ribosome release is critical for resistance to cell envelope stress. The bar-coded deletion collection we generated can be screened for sensitivity or resistance to virtually any stress condition.
Collapse
|
149
|
Global approaches for finding small RNA and small open reading frame functions. J Bacteriol 2010; 192:26-8. [PMID: 19854892 DOI: 10.1128/jb.01316-09] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
|
150
|
Kim W, Silby MW, Purvine SO, Nicoll JS, Hixson KK, Monroe M, Nicora CD, Lipton MS, Levy SB. Proteomic detection of non-annotated protein-coding genes in Pseudomonas fluorescens Pf0-1. PLoS One 2009; 4:e8455. [PMID: 20041161 PMCID: PMC2794547 DOI: 10.1371/journal.pone.0008455] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2009] [Accepted: 12/02/2009] [Indexed: 11/18/2022] Open
Abstract
Genome sequences are annotated by computational prediction of coding sequences, followed by similarity searches such as BLAST, which provide a layer of possible functional information. While the existence of processes such as alternative splicing complicates matters for eukaryote genomes, the view of bacterial genomes as a linear series of closely spaced genes leads to the assumption that computational annotations that predict such arrangements completely describe the coding capacity of bacterial genomes. We undertook a proteomic study to identify proteins expressed by Pseudomonas fluorescens Pf0-1 from genes that were not predicted during the genome annotation. Mapping peptides to the Pf0-1 genome sequence identified sixteen non-annotated protein-coding regions, of which nine were antisense to predicted genes, six were intergenic, and one read in the same direction as an annotated gene but in a different frame. The expression of all but one of the newly discovered genes was verified by RT-PCR. Few clues as to the function of the new genes were gleaned from informatic analyses, but potential orthologs in other Pseudomonas genomes were identified for eight of the new genes. The 16 newly identified genes improve the quality of the Pf0-1 genome annotation, and the detection of antisense protein-coding genes indicates the under-appreciated complexity of bacterial genome organization.
Collapse
Affiliation(s)
- Wook Kim
- Center for Adaptation Genetics and Drug Resistance and Department of Molecular Biology and Microbiology, Tufts University School of Medicine, Boston, Massachusetts, United States of America
| | - Mark W. Silby
- Center for Adaptation Genetics and Drug Resistance and Department of Molecular Biology and Microbiology, Tufts University School of Medicine, Boston, Massachusetts, United States of America
| | - Sam O. Purvine
- Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - Julie S. Nicoll
- Center for Adaptation Genetics and Drug Resistance and Department of Molecular Biology and Microbiology, Tufts University School of Medicine, Boston, Massachusetts, United States of America
| | - Kim K. Hixson
- Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - Matt Monroe
- Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - Carrie D. Nicora
- Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - Mary S. Lipton
- Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - Stuart B. Levy
- Center for Adaptation Genetics and Drug Resistance and Department of Molecular Biology and Microbiology, Tufts University School of Medicine, Boston, Massachusetts, United States of America
- * E-mail:
| |
Collapse
|